What exactly is a translation unit in C

后端 未结 4 1152
旧巷少年郎
旧巷少年郎 2021-01-03 23:59

The commonly used definition of a translation unit is what comes after preprocessing (header files inclusions, macros, etc along with the source file). This definit

相关标签:
4条回答
  • 2021-01-04 00:42

    However, if one can makes a question out of the above statement and ask: does it mean an implementation is free to consider multiple source files as a single translation unit

    No. The definition is clear:

    A source file together with all the headers and source files included via the preprocessing directive #include is known as a preprocessing translation unit. After preprocessing, a preprocessing translation unit is called a translation unit.

    A translation unit is the result of preprocessing one source file and its includes. The fact that you might translate two translation units at the same time doesn't mean you can treat them as one translation unit.

    0 讨论(0)
  • 2021-01-04 00:49

    A translation unit means a dot C file. To all intents and purposes, including its associated dot h includes. Rarely #include directives are used to add other file types or other dot C files.

    static variables are visible only within the translation unit. It's very common to have a few public functions with external linkage and many static functions and data items t support. So a C translation unit is a bit like a singleton C++ class. If the compiler doesn't handle static correctly it is non-conforming.

    Typically one object file is created for each translation unit, and they are then linked by the linker. That's not actually mandated by the standard but is the natural and obvious way to do things in an environment where files are cheap to create and compiling is relatively slow.

    0 讨论(0)
  • 2021-01-04 00:50

    In the second line you quoted:

    The text of the program is kept in units called source files, (or preprocessing files) in this International Standard

    If there are two source files then there are two preprocessing files, and therefore two preprocessing translation units, and therefore two translation units. One corresponding to each source file.

    The standard doesn't define source file. I guess the compiler could say "I'm making up my own version of 'source file' by declaring that file1.c and file2.c are not source files after all!" and concatenate them, but this would be at odds with programmer expectations. I think you would have a hard time arguing that file1.c is not a source file.

    0 讨论(0)
  • 2021-01-04 01:02

    Compilers are free to translate several source files at the same time, but they cannot change their semantics.

    Translating several files together will likely be somewhat faster (because the compiler starts only once) and will permit better whole program optimization: Source code of called functions in other translation units is then available at the point of call from other translation units. The compiler can inspect the called code and use the information, much as it can with a single translation unit. From the gcc 6.3.0 manual:

    The compiler performs optimization based on the knowledge it has of the program. Compiling multiple files at once to a single output file mode allows the compiler to use information gained from all of the files when compiling each of them.

    Called functions can be inspected for absence of aliasing, factual const-ness of pointed-to objects etc., enabling the compiler to perform optimizations which would be wrong in the general case.

    And, of course, such functions can be inlined.

    But there are semantics of (preprocessing) translation units (which correspond to source files after preprocessing, per your standard quote) which the compiler must respect. @Malcolm mentioned one, file-static variables. My gut feeling is that there may be other, more subtle issues concerning declarations and declaration order.

    Another obvious source code scope issue concerns defines. From the n1570 draft, 6.10.3.5:

    A macro definition lasts (independent of block structure) until a corresponding #undef directive is encountered or (if none is encountered) until the end of the preprocessing translation unit.

    Both issues forbid simple C source file concatenation; the compiler must additionally apply some rudimentary logic.

    0 讨论(0)
提交回复
热议问题