3 @settitle The C Preprocessor
6 @dircategory Programming
8 * Cpp: (cpp). The GNU C preprocessor.
15 @setchapternewpage odd
17 This file documents the GNU C Preprocessor.
19 Copyright 1987, 1989, 1991, 1992, 1993, 1994, 1995 Free Software
22 Permission is granted to make and distribute verbatim copies of
23 this manual provided the copyright notice and this permission notice
24 are preserved on all copies.
27 Permission is granted to process this file through Tex and print the
28 results, provided the printed document carries copying permission
29 notice identical to this one except for the removal of this paragraph
30 (this paragraph not being relevant to the printed manual).
33 Permission is granted to copy and distribute modified versions of this
34 manual under the conditions for verbatim copying, provided also that
35 the entire resulting derived work is distributed under the terms of a
36 permission notice identical to this one.
38 Permission is granted to copy and distribute translations of this manual
39 into another language, under the above conditions for modified versions.
44 @title The C Preprocessor
45 @subtitle Last revised September 1998
46 @subtitle for GCC version 2
47 @author Richard M. Stallman
50 This booklet is eventually intended to form the first chapter of a GNU
53 @vskip 0pt plus 1filll
54 Copyright @copyright{} 1987, 1989, 1991-1998
55 Free Software Foundation, Inc.
57 Permission is granted to make and distribute verbatim copies of
58 this manual provided the copyright notice and this permission notice
59 are preserved on all copies.
61 Permission is granted to copy and distribute modified versions of this
62 manual under the conditions for verbatim copying, provided also that
63 the entire resulting derived work is distributed under the terms of a
64 permission notice identical to this one.
66 Permission is granted to copy and distribute translations of this manual
67 into another language, under the above conditions for modified versions.
71 @node Top, Global Actions,, (DIR)
72 @chapter The C Preprocessor
74 The C preprocessor is a @dfn{macro processor} that is used automatically by
75 the C compiler to transform your program before actual compilation. It is
76 called a macro processor because it allows you to define @dfn{macros},
77 which are brief abbreviations for longer constructs.
79 The C preprocessor provides four separate facilities that you can use as
84 Inclusion of header files. These are files of declarations that can be
85 substituted into your program.
88 Macro expansion. You can define @dfn{macros}, which are abbreviations
89 for arbitrary fragments of C code, and then the C preprocessor will
90 replace the macros with their definitions throughout the program.
93 Conditional compilation. Using special preprocessing directives, you
94 can include or exclude parts of the program according to various
98 Line control. If you use a program to combine or rearrange source files into
99 an intermediate file which is then compiled, you can use line control
100 to inform the compiler of where each source line originally came from.
103 C preprocessors vary in some details. This manual discusses the GNU C
104 preprocessor, the C Compatible Compiler Preprocessor. The GNU C
105 preprocessor provides a superset of the features of ANSI Standard C@.
107 ANSI Standard C requires the rejection of many harmless constructs commonly
108 used by today's C programs. Such incompatibility would be inconvenient for
109 users, so the GNU C preprocessor is configured to accept these constructs
110 by default. Strictly speaking, to get ANSI Standard C, you must use the
111 options @samp{-trigraphs}, @samp{-undef} and @samp{-pedantic}, but in
112 practice the consequences of having strict ANSI Standard C make it
113 undesirable to do this. @xref{Invocation}.
115 The C preprocessor is designed for C-like languages; you may run into
116 problems if you apply it to other kinds of languages, because it assumes
117 that it is dealing with C@. For example, the C preprocessor sometimes
118 outputs extra white space to avoid inadvertent C token concatenation,
119 and this may cause problems with other languages.
122 * Global Actions:: Actions made uniformly on all input files.
123 * Directives:: General syntax of preprocessing directives.
124 * Header Files:: How and why to use header files.
125 * Macros:: How and why to use macros.
126 * Conditionals:: How and why to use conditionals.
127 * Combining Sources:: Use of line control when you combine source files.
128 * Other Directives:: Miscellaneous preprocessing directives.
129 * Output:: Format of output from the C preprocessor.
130 * Invocation:: How to invoke the preprocessor; command options.
131 * Concept Index:: Index of concepts and terms.
132 * Index:: Index of directives, predefined macros and options.
135 @node Global Actions, Directives, Top, Top
136 @section Transformations Made Globally
138 Most C preprocessor features are inactive unless you give specific directives
139 to request their use. (Preprocessing directives are lines starting with
140 @samp{#}; @pxref{Directives}). But there are three transformations that the
141 preprocessor always makes on all the input it receives, even in the absence
146 All C comments are replaced with single spaces.
149 Backslash-Newline sequences are deleted, no matter where. This
150 feature allows you to break long lines for cosmetic purposes without
151 changing their meaning.
154 Predefined macro names are replaced with their expansions
155 (@pxref{Predefined}).
158 The first two transformations are done @emph{before} nearly all other parsing
159 and before preprocessing directives are recognized. Thus, for example, you
160 can split a line cosmetically with Backslash-Newline anywhere (except
161 when trigraphs are in use; see below).
173 is equivalent into @samp{#define FOO 1020}. You can split even an escape
174 sequence with Backslash-Newline. For example, you can split @code{"foo\bar"}
175 between the @samp{\} and the @samp{b} to get
183 This behavior is unclean: in all other contexts, a Backslash can be
184 inserted in a string constant as an ordinary character by writing a double
185 Backslash, and this creates an exception. But the ANSI C standard requires
186 it. (Strict ANSI C does not allow Newlines in string constants, so they
187 do not consider this a problem.)
189 But there are a few exceptions to all three transformations.
193 C comments and predefined macro names are not recognized inside a
194 @samp{#include} directive in which the file name is delimited with
195 @samp{<} and @samp{>}.
198 C comments and predefined macro names are never recognized within a
199 character or string constant. (Strictly speaking, this is the rule,
200 not an exception, but it is worth noting here anyway.)
203 Backslash-Newline may not safely be used within an ANSI ``trigraph''.
204 Trigraphs are converted before Backslash-Newline is deleted. If you
205 write what looks like a trigraph with a Backslash-Newline inside, the
206 Backslash-Newline is deleted as usual, but it is then too late to
207 recognize the trigraph.
209 This exception is relevant only if you use the @samp{-trigraphs}
210 option to enable trigraph processing. @xref{Invocation}.
213 @node Directives, Header Files, Global Actions, Top
214 @section Preprocessing Directives
216 @cindex preprocessing directives
218 Most preprocessor features are active only if you use preprocessing directives
219 to request their use.
221 Preprocessing directives are lines in your program that start with @samp{#}.
222 The @samp{#} is followed by an identifier that is the @dfn{directive name}.
223 For example, @samp{#define} is the directive that defines a macro.
224 Whitespace is also allowed before and after the @samp{#}.
226 The set of valid directive names is fixed. Programs cannot define new
227 preprocessing directives.
229 Some directive names require arguments; these make up the rest of the directive
230 line and must be separated from the directive name by whitespace. For example,
231 @samp{#define} must be followed by a macro name and the intended expansion
232 of the macro. @xref{Simple Macros}.
234 A preprocessing directive cannot be more than one line in normal circumstances.
235 It may be split cosmetically with Backslash-Newline, but that has no effect
236 on its meaning. Comments containing Newlines can also divide the
237 directive into multiple lines, but the comments are changed to Spaces
238 before the directive is interpreted. The only way a significant Newline
239 can occur in a preprocessing directive is within a string constant or
240 character constant. Note that
241 most C compilers that might be applied to the output from the preprocessor
242 do not accept string or character constants containing Newlines.
244 The @samp{#} and the directive name cannot come from a macro expansion. For
245 example, if @samp{foo} is defined as a macro expanding to @samp{define},
246 that does not make @samp{#foo} a valid preprocessing directive.
248 @node Header Files, Macros, Directives, Top
249 @section Header Files
252 A header file is a file containing C declarations and macro definitions
253 (@pxref{Macros}) to be shared between several source files. You request
254 the use of a header file in your program with the C preprocessing directive
258 * Header Uses:: What header files are used for.
259 * Include Syntax:: How to write @samp{#include} directives.
260 * Include Operation:: What @samp{#include} does.
261 * Once-Only:: Preventing multiple inclusion of one header file.
262 * Inheritance:: Including one header file in another header file.
265 @node Header Uses, Include Syntax, Header Files, Header Files
266 @subsection Uses of Header Files
268 Header files serve two kinds of purposes.
272 @findex system header files
273 System header files declare the interfaces to parts of the operating
274 system. You include them in your program to supply the definitions and
275 declarations you need to invoke system calls and libraries.
278 Your own header files contain declarations for interfaces between the
279 source files of your program. Each time you have a group of related
280 declarations and macro definitions all or most of which are needed in
281 several different source files, it is a good idea to create a header
285 Including a header file produces the same results in C compilation as
286 copying the header file into each source file that needs it. But such
287 copying would be time-consuming and error-prone. With a header file, the
288 related declarations appear in only one place. If they need to be changed,
289 they can be changed in one place, and programs that include the header file
290 will automatically use the new version when next recompiled. The header
291 file eliminates the labor of finding and changing all the copies as well as
292 the risk that a failure to find one copy will result in inconsistencies
295 The usual convention is to give header files names that end with
296 @file{.h}. Avoid unusual characters in header file names, as they
299 @node Include Syntax, Include Operation, Header Uses, Header Files
300 @subsection The @samp{#include} Directive
303 Both user and system header files are included using the preprocessing
304 directive @samp{#include}. It has three variants:
307 @item #include <@var{file}>
308 This variant is used for system header files. It searches for a file
309 named @var{file} in a list of directories specified by you, then in a
310 standard list of system directories. You specify directories to
311 search for header files with the command option @samp{-I}
312 (@pxref{Invocation}). The option @samp{-nostdinc} inhibits searching
313 the standard system directories; in this case only the directories
314 you specify are searched.
316 The parsing of this form of @samp{#include} is slightly special
317 because comments are not recognized within the @samp{<@dots{}>}.
318 Thus, in @samp{#include <x/*y>} the @samp{/*} does not start a comment
319 and the directive specifies inclusion of a system header file named
320 @file{x/*y}. Of course, a header file with such a name is unlikely to
321 exist on Unix, where shell wildcard features would make it hard to
324 The argument @var{file} may not contain a @samp{>} character. It may,
325 however, contain a @samp{<} character.
327 @item #include "@var{file}"
328 This variant is used for header files of your own program. It
329 searches for a file named @var{file} first in the current directory,
330 then in the same directories used for system header files. The
331 current directory is the directory of the current input file. It is
332 tried first because it is presumed to be the location of the files
333 that the current input file refers to. (If the @samp{-I-} option is
334 used, the special treatment of the current directory is inhibited.)
336 The argument @var{file} may not contain @samp{"} characters. If
337 backslashes occur within @var{file}, they are considered ordinary text
338 characters, not escape characters. None of the character escape
339 sequences appropriate to string constants in C are processed. Thus,
340 @samp{#include "x\n\\y"} specifies a filename containing three
341 backslashes. It is not clear why this behavior is ever useful, but
342 the ANSI standard specifies it.
344 @item #include @var{anything else}
345 @cindex computed @samp{#include}
346 This variant is called a @dfn{computed #include}. Any @samp{#include}
347 directive whose argument does not fit the above two forms is a computed
348 include. The text @var{anything else} is checked for macro calls,
349 which are expanded (@pxref{Macros}). When this is done, the result
350 must fit one of the above two variants---in particular, the expanded
351 text must in the end be surrounded by either quotes or angle braces.
353 This feature allows you to define a macro which controls the file name
354 to be used at a later point in the program. One application of this is
355 to allow a site-specific configuration file for your program to specify
356 the names of the system include files to be used. This can help in
357 porting the program to various operating systems in which the necessary
358 system header files are found in different places.
361 @node Include Operation, Once-Only, Include Syntax, Header Files
362 @subsection How @samp{#include} Works
364 The @samp{#include} directive works by directing the C preprocessor to scan
365 the specified file as input before continuing with the rest of the current
366 file. The output from the preprocessor contains the output already
367 generated, followed by the output resulting from the included file,
368 followed by the output that comes from the text after the @samp{#include}
369 directive. For example, given a header file @file{header.h} as follows,
376 and a main program called @file{program.c} that uses the header file,
390 the output generated by the C preprocessor for @file{program.c} as input
403 Included files are not limited to declarations and macro definitions; those
404 are merely the typical uses. Any fragment of a C program can be included
405 from another file. The include file could even contain the beginning of a
406 statement that is concluded in the containing file, or the end of a
407 statement that was started in the including file. However, a comment or a
408 string or character constant may not start in the included file and finish
409 in the including file. An unterminated comment, string constant or
410 character constant in an included file is considered to end (with an error
411 message) at the end of the file.
413 It is possible for a header file to begin or end a syntactic unit such
414 as a function definition, but that would be very confusing, so don't do
417 The line following the @samp{#include} directive is always treated as a
418 separate line by the C preprocessor even if the included file lacks a final
421 @node Once-Only, Inheritance, Include Operation, Header Files
422 @subsection Once-Only Include Files
423 @cindex repeated inclusion
424 @cindex including just once
426 Very often, one header file includes another. It can easily result that a
427 certain header file is included more than once. This may lead to errors,
428 if the header file defines structure types or typedefs, and is certainly
429 wasteful. Therefore, we often wish to prevent multiple inclusion of a
432 The standard way to do this is to enclose the entire real contents of the
433 file in a conditional, like this:
436 #ifndef FILE_FOO_SEEN
437 #define FILE_FOO_SEEN
439 @var{the entire file}
441 #endif /* FILE_FOO_SEEN */
444 The macro @code{FILE_FOO_SEEN} indicates that the file has been included
445 once already. In a user header file, the macro name should not begin
446 with @samp{_}. In a system header file, this name should begin with
447 @samp{__} to avoid conflicts with user programs. In any kind of header
448 file, the macro name should contain the name of the file and some
449 additional text, to avoid conflicts with other header files.
451 The GNU C preprocessor is programmed to notice when a header file uses
452 this particular construct and handle it efficiently. If a header file
453 is contained entirely in a @samp{#ifndef} conditional, then it records
454 that fact. If a subsequent @samp{#include} specifies the same file,
455 and the macro in the @samp{#ifndef} is already defined, then the file
456 is entirely skipped, without even reading it.
459 There is also an explicit directive to tell the preprocessor that it need
460 not include a file more than once. This is called @samp{#pragma once},
461 and was used @emph{in addition to} the @samp{#ifndef} conditional around
462 the contents of the header file. @samp{#pragma once} is now obsolete
463 and should not be used at all.
466 In the Objective C language, there is a variant of @samp{#include}
467 called @samp{#import} which includes a file, but does so at most once.
468 If you use @samp{#import} @emph{instead of} @samp{#include}, then you
469 don't need the conditionals inside the header file to prevent multiple
470 execution of the contents.
472 @samp{#import} is obsolete because it is not a well designed feature.
473 It requires the users of a header file---the applications
474 programmers---to know that a certain header file should only be included
475 once. It is much better for the header file's implementor to write the
476 file so that users don't need to know this. Using @samp{#ifndef}
477 accomplishes this goal.
479 @node Inheritance,, Once-Only, Header Files
480 @subsection Inheritance and Header Files
482 @cindex overriding a header file
484 @dfn{Inheritance} is what happens when one object or file derives some
485 of its contents by virtual copying from another object or file. In
486 the case of C header files, inheritance means that one header file
487 includes another header file and then replaces or adds something.
489 If the inheriting header file and the base header file have different
490 names, then inheritance is straightforward: simply write @samp{#include
491 "@var{base}"} in the inheriting file.
493 Sometimes it is necessary to give the inheriting file the same name as
494 the base file. This is less straightforward.
496 For example, suppose an application program uses the system header
497 @file{sys/signal.h}, but the version of @file{/usr/include/sys/signal.h}
498 on a particular system doesn't do what the application program expects.
499 It might be convenient to define a ``local'' version, perhaps under the
500 name @file{/usr/local/include/sys/signal.h}, to override or add to the
501 one supplied by the system.
503 You can do this by compiling with the option @samp{-I.}, and
504 writing a file @file{sys/signal.h} that does what the application
505 program expects. But making this file include the standard
506 @file{sys/signal.h} is not so easy---writing @samp{#include
507 <sys/signal.h>} in that file doesn't work, because it includes your own
508 version of the file, not the standard system version. Used in that file
509 itself, this leads to an infinite recursion and a fatal error in
512 @samp{#include </usr/include/sys/signal.h>} would find the proper file,
513 but that is not clean, since it makes an assumption about where the
514 system header file is found. This is bad for maintenance, since it
515 means that any change in where the system's header files are kept
516 requires a change somewhere else.
518 @findex #include_next
519 The clean way to solve this problem is to use
520 @samp{#include_next}, which means, ``Include the @emph{next} file with
521 this name.'' This directive works like @samp{#include} except in
522 searching for the specified file: it starts searching the list of header
523 file directories @emph{after} the directory in which the current file
526 Suppose you specify @samp{-I /usr/local/include}, and the list of
527 directories to search also includes @file{/usr/include}; and suppose
528 both directories contain @file{sys/signal.h}. Ordinary
529 @samp{#include <sys/signal.h>} finds the file under
530 @file{/usr/local/include}. If that file contains @samp{#include_next
531 <sys/signal.h>}, it starts searching after that directory, and finds the
532 file in @file{/usr/include}.
534 @node Macros, Conditionals, Header Files, Top
537 A macro is a sort of abbreviation which you can define once and then
538 use later. There are many complicated features associated with macros
539 in the C preprocessor.
542 * Simple Macros:: Macros that always expand the same way.
543 * Argument Macros:: Macros that accept arguments that are substituted
544 into the macro expansion.
545 * Predefined:: Predefined macros that are always available.
546 * Stringification:: Macro arguments converted into string constants.
547 * Concatenation:: Building tokens from parts taken from macro arguments.
548 * Undefining:: Cancelling a macro's definition.
549 * Redefining:: Changing a macro's definition.
550 * Macro Pitfalls:: Macros can confuse the unwary. Here we explain
551 several common problems and strange features.
554 @node Simple Macros, Argument Macros, Macros, Macros
555 @subsection Simple Macros
557 @cindex manifest constant
559 A @dfn{simple macro} is a kind of abbreviation. It is a name which
560 stands for a fragment of code. Some people refer to these as
561 @dfn{manifest constants}.
563 Before you can use a macro, you must @dfn{define} it explicitly with the
564 @samp{#define} directive. @samp{#define} is followed by the name of the
565 macro and then the code it should be an abbreviation for. For example,
568 #define BUFFER_SIZE 1020
572 defines a macro named @samp{BUFFER_SIZE} as an abbreviation for the text
573 @samp{1020}. If somewhere after this @samp{#define} directive there comes
574 a C statement of the form
577 foo = (char *) xmalloc (BUFFER_SIZE);
581 then the C preprocessor will recognize and @dfn{expand} the macro
582 @samp{BUFFER_SIZE}, resulting in
585 foo = (char *) xmalloc (1020);
588 The use of all upper case for macro names is a standard convention.
589 Programs are easier to read when it is possible to tell at a glance which
592 Normally, a macro definition must be a single line, like all C
593 preprocessing directives. (You can split a long macro definition
594 cosmetically with Backslash-Newline.) There is one exception: Newlines
595 can be included in the macro definition if within a string or character
596 constant. This is because it is not possible for a macro definition to
597 contain an unbalanced quote character; the definition automatically
598 extends to include the matching quote character that ends the string or
599 character constant. Comments within a macro definition may contain
600 Newlines, which make no difference since the comments are entirely
601 replaced with Spaces regardless of their contents.
603 Aside from the above, there is no restriction on what can go in a macro
604 body. Parentheses need not balance. The body need not resemble valid C
605 code. (But if it does not, you may get error messages from the C
606 compiler when you use the macro.)
608 The C preprocessor scans your program sequentially, so macro definitions
609 take effect at the place you write them. Therefore, the following input to
627 After the preprocessor expands a macro name, the macro's definition body is
628 appended to the front of the remaining input, and the check for macro calls
629 continues. Therefore, the macro body can contain calls to other macros.
634 #define TABLESIZE BUFSIZE
638 the name @samp{TABLESIZE} when used in the program would go through two
639 stages of expansion, resulting ultimately in @samp{1020}.
641 This is not at all the same as defining @samp{TABLESIZE} to be @samp{1020}.
642 The @samp{#define} for @samp{TABLESIZE} uses exactly the body you
643 specify---in this case, @samp{BUFSIZE}---and does not check to see whether
644 it too is the name of a macro. It's only when you @emph{use} @samp{TABLESIZE}
645 that the result of its expansion is checked for more macro names.
646 @xref{Cascaded Macros}.
648 @node Argument Macros, Predefined, Simple Macros, Macros
649 @subsection Macros with Arguments
650 @cindex macros with argument
651 @cindex arguments in macro definitions
652 @cindex function-like macro
654 A simple macro always stands for exactly the same text, each time it is
655 used. Macros can be more flexible when they accept @dfn{arguments}.
656 Arguments are fragments of code that you supply each time the macro is
657 used. These fragments are included in the expansion of the macro
658 according to the directions in the macro definition. A macro that
659 accepts arguments is called a @dfn{function-like macro} because the
660 syntax for using it looks like a function call.
663 To define a macro that uses arguments, you write a @samp{#define} directive
664 with a list of @dfn{argument names} in parentheses after the name of the
665 macro. The argument names may be any valid C identifiers, separated by
666 commas and optionally whitespace. The open-parenthesis must follow the
667 macro name immediately, with no space in between.
669 For example, here is a macro that computes the minimum of two numeric
670 values, as it is defined in many C programs:
673 #define min(X, Y) ((X) < (Y) ? (X) : (Y))
677 (This is not the best way to define a ``minimum'' macro in GNU C@.
678 @xref{Side Effects}, for more information.)
680 To use a macro that expects arguments, you write the name of the macro
681 followed by a list of @dfn{actual arguments} in parentheses, separated by
682 commas. The number of actual arguments you give must match the number of
683 arguments the macro expects. Examples of use of the macro @samp{min}
684 include @samp{min (1, 2)} and @samp{min (x + 28, *p)}.
686 The expansion text of the macro depends on the arguments you use.
687 Each of the argument names of the macro is replaced, throughout the
688 macro definition, with the corresponding actual argument. Using the
689 same macro @samp{min} defined above, @samp{min (1, 2)} expands into
692 ((1) < (2) ? (1) : (2))
696 where @samp{1} has been substituted for @samp{X} and @samp{2} for @samp{Y}.
698 Likewise, @samp{min (x + 28, *p)} expands into
701 ((x + 28) < (*p) ? (x + 28) : (*p))
704 Parentheses in the actual arguments must balance; a comma within
705 parentheses does not end an argument. However, there is no requirement
706 for brackets or braces to balance, and they do not prevent a comma from
707 separating arguments. Thus,
710 macro (array[x = y, x + 1])
714 passes two arguments to @code{macro}: @samp{array[x = y} and @samp{x +
715 1]}. If you want to supply @samp{array[x = y, x + 1]} as an argument,
716 you must write it as @samp{array[(x = y, x + 1)]}, which is equivalent C
719 After the actual arguments are substituted into the macro body, the entire
720 result is appended to the front of the remaining input, and the check for
721 macro calls continues. Therefore, the actual arguments can contain calls
722 to other macros, either with or without arguments, or even to the same
723 macro. The macro body can also contain calls to other macros. For
724 example, @samp{min (min (a, b), c)} expands into this text:
727 ((((a) < (b) ? (a) : (b))) < (c)
728 ? (((a) < (b) ? (a) : (b)))
733 (Line breaks shown here for clarity would not actually be generated.)
735 @cindex blank macro arguments
736 @cindex space as macro argument
737 If a macro @code{foo} takes one argument, and you want to supply an
738 empty argument, you must write at least some whitespace between the
739 parentheses, like this: @samp{foo ( )}. Just @samp{foo ()} is providing
740 no arguments, which is an error if @code{foo} expects an argument. But
741 @samp{foo0 ()} is the correct way to call a macro defined to take zero
742 arguments, like this:
745 #define foo0() @dots{}
748 If you use the macro name followed by something other than an
749 open-parenthesis (after ignoring any spaces, tabs and comments that
750 follow), it is not a call to the macro, and the preprocessor does not
751 change what you have written. Therefore, it is possible for the same name
752 to be a variable or function in your program as well as a macro, and you
753 can choose in each instance whether to refer to the macro (if an actual
754 argument list follows) or the variable or function (if an argument list
757 Such dual use of one name could be confusing and should be avoided
758 except when the two meanings are effectively synonymous: that is, when the
759 name is both a macro and a function and the two have similar effects. You
760 can think of the name simply as a function; use of the name for purposes
761 other than calling it (such as, to take the address) will refer to the
762 function, while calls will expand the macro and generate better but
763 equivalent code. For example, you can use a function named @samp{min} in
764 the same source file that defines the macro. If you write @samp{&min} with
765 no argument list, you refer to the function. If you write @samp{min (x,
766 bb)}, with an argument list, the macro is expanded. If you write
767 @samp{(min) (a, bb)}, where the name @samp{min} is not followed by an
768 open-parenthesis, the macro is not expanded, so you wind up with a call to
769 the function @samp{min}.
771 You may not define the same name as both a simple macro and a macro with
774 In the definition of a macro with arguments, the list of argument names
775 must follow the macro name immediately with no space in between. If there
776 is a space after the macro name, the macro is defined as taking no
777 arguments, and all the rest of the line is taken to be the expansion. The
778 reason for this is that it is often useful to define a macro that takes no
779 arguments and whose definition begins with an identifier in parentheses.
780 This rule about spaces makes it possible for you to do either this:
783 #define FOO(x) - 1 / (x)
787 (which defines @samp{FOO} to take an argument and expand into minus the
788 reciprocal of that argument) or this:
791 #define BAR (x) - 1 / (x)
795 (which defines @samp{BAR} to take no argument and always expand into
796 @samp{(x) - 1 / (x)}).
798 Note that the @emph{uses} of a macro with arguments can have spaces before
799 the left parenthesis; it's the @emph{definition} where it matters whether
802 @node Predefined, Stringification, Argument Macros, Macros
803 @subsection Predefined Macros
805 @cindex predefined macros
806 Several simple macros are predefined. You can use them without giving
807 definitions for them. They fall into two classes: standard macros and
808 system-specific macros.
811 * Standard Predefined:: Standard predefined macros.
812 * Nonstandard Predefined:: Nonstandard predefined macros.
815 @node Standard Predefined, Nonstandard Predefined, Predefined, Predefined
816 @subsubsection Standard Predefined Macros
817 @cindex standard predefined macros
819 The standard predefined macros are available with the same meanings
820 regardless of the machine or operating system on which you are using GNU C@.
821 Their names all start and end with double underscores. Those preceding
822 @code{__GNUC__} in this table are standardized by ANSI C; the rest are
828 This macro expands to the name of the current input file, in the form of
829 a C string constant. The precise name returned is the one that was
830 specified in @samp{#include} or as the input file name argument.
834 This macro expands to the current input line number, in the form of a
835 decimal integer constant. While we call it a predefined macro, it's
836 a pretty strange macro, since its ``definition'' changes with each
837 new line of source code.
839 This and @samp{__FILE__} are useful in generating an error message to
840 report an inconsistency detected by the program; the message can state
841 the source line at which the inconsistency was detected. For example,
844 fprintf (stderr, "Internal error: "
845 "negative string length "
846 "%d at %s, line %d.",
847 length, __FILE__, __LINE__);
850 A @samp{#include} directive changes the expansions of @samp{__FILE__}
851 and @samp{__LINE__} to correspond to the included file. At the end of
852 that file, when processing resumes on the input file that contained
853 the @samp{#include} directive, the expansions of @samp{__FILE__} and
854 @samp{__LINE__} revert to the values they had before the
855 @samp{#include} (but @samp{__LINE__} is then incremented by one as
856 processing moves to the line after the @samp{#include}).
858 The expansions of both @samp{__FILE__} and @samp{__LINE__} are altered
859 if a @samp{#line} directive is used. @xref{Combining Sources}.
863 This macro expands to a string constant that describes the date on
864 which the preprocessor is being run. The string constant contains
865 eleven characters and looks like @w{@samp{"Feb 1 1996"}}.
866 @c After reformatting the above, check that the date remains `Feb 1 1996',
867 @c all on one line, with two spaces between the `Feb' and the `1'.
871 This macro expands to a string constant that describes the time at
872 which the preprocessor is being run. The string constant contains
873 eight characters and looks like @samp{"23:59:01"}.
877 This macro expands to the constant 1, to signify that this is ANSI
878 Standard C@. (Whether that is actually true depends on what C compiler
879 will operate on the output from the preprocessor.)
881 On some hosts, system include files use a different convention, where
882 @samp{__STDC__} is normally 0, but is 1 if the user specifies strict
883 conformance to the C Standard. The preprocessor follows the host convention
884 when processing system include files, but when processing user files it follows
885 the usual GNU C convention.
887 This macro is not defined if the @samp{-traditional} option is used.
889 @item __STDC_VERSION__
890 @findex __STDC_VERSION__
891 This macro expands to the C Standard's version number,
892 a long integer constant of the form @samp{@var{yyyy}@var{mm}L}
893 where @var{yyyy} and @var{mm} are the year and month of the Standard version.
894 This signifies which version of the C Standard the preprocessor conforms to.
895 Like @samp{__STDC__}, whether this version number is accurate
896 for the entire implementation depends on what C compiler
897 will operate on the output from the preprocessor.
899 This macro is not defined if the @samp{-traditional} option is used.
903 This macro is defined if and only if this is GNU C@. This macro is
904 defined only when the entire GNU C compiler is in use; if you invoke the
905 preprocessor directly, @samp{__GNUC__} is undefined. The value
906 identifies the major version number of GNU CC (@samp{1} for GNU CC
907 version 1, which is now obsolete, and @samp{2} for version 2).
910 @findex __GNUC_MINOR__
911 The macro contains the minor version number of the compiler. This can
912 be used to work around differences between different releases of the
913 compiler (for example, if gcc 2.6.3 is known to support a feature, you
914 can test for @code{__GNUC__ > 2 || (__GNUC__ == 2 && __GNUC_MINOR__ >= 6)}).
915 The last number, @samp{3} in the
916 example above, denotes the bugfix level of the compiler; no macro
921 The GNU C compiler defines this when the compilation language is
922 C++; use @samp{__GNUG__} to distinguish between GNU C and GNU
927 The draft ANSI standard for C++ used to require predefining this
928 variable. Though it is no longer required, GNU C++ continues to define
929 it, as do other popular C++ compilers. You can use @samp{__cplusplus}
930 to test whether a header is compiled by a C compiler or a C++ compiler.
932 @item __STRICT_ANSI__
933 @findex __STRICT_ANSI__
934 GNU C defines this macro if and only if the @samp{-ansi} switch was
935 specified when GNU C was invoked. Its definition is the null string.
936 This macro exists primarily to direct certain GNU header files not to
937 define certain traditional Unix constructs which are incompatible with
941 @findex __BASE_FILE__
942 This macro expands to the name of the main input file, in the form
943 of a C string constant. This is the source file that was specified
944 as an argument when the C compiler was invoked.
946 @item __INCLUDE_LEVEL__
947 @findex __INCLUDE_LEVEL_
948 This macro expands to a decimal integer constant that represents the
949 depth of nesting in include files. The value of this macro is
950 incremented on every @samp{#include} directive and decremented at every
951 end of file. For input files specified by command line arguments,
952 the nesting level is zero.
956 This macro expands to a string constant which describes the version number of
957 GNU C@. The string is normally a sequence of decimal numbers separated
958 by periods, such as @samp{"2.6.0"}.
962 GNU CC defines this macro in optimizing compilations. It causes certain
963 GNU header files to define alternative macro definitions for some system
964 library functions. You should not refer to or test the definition of
965 this macro unless you make very sure that programs will execute with the
966 same effect regardless.
968 @item __CHAR_UNSIGNED__
969 @findex __CHAR_UNSIGNED__
970 GNU C defines this macro if and only if the data type @code{char} is
971 unsigned on the target machine. It exists to cause the standard header
972 file @file{limits.h} to work correctly. You should not refer to this
973 macro yourself; instead, refer to the standard macros defined in
974 @file{limits.h}. The preprocessor uses this macro to determine whether
975 or not to sign-extend large character constants written in octal; see
976 @ref{#if Directive,,The @samp{#if} Directive}.
978 @item __REGISTER_PREFIX__
979 @findex __REGISTER_PREFIX__
980 This macro expands to a string (not a string constant) describing the
981 prefix applied to CPU registers in assembler code. You can use it to
982 write assembler code that is usable in multiple environments. For
983 example, in the @samp{m68k-aout} environment it expands to the null
984 string, but in the @samp{m68k-coff} environment it expands to the string
987 @item __USER_LABEL_PREFIX__
988 @findex __USER_LABEL_PREFIX__
989 Similar to @code{__REGISTER_PREFIX__}, but describes the prefix applied
990 to user generated labels in assembler code. For example, in the
991 @samp{m68k-aout} environment it expands to the string @samp{_}, but in
992 the @samp{m68k-coff} environment it expands to the null string. This
993 does not work with the @samp{-mno-underscores} option that the i386
994 OSF/rose and m88k targets provide nor with the @samp{-mcall*} options of
995 the rs6000 System V Release 4 target.
998 @node Nonstandard Predefined,, Standard Predefined, Predefined
999 @subsubsection Nonstandard Predefined Macros
1001 The C preprocessor normally has several predefined macros that vary between
1002 machines because their purpose is to indicate what type of system and
1003 machine is in use. This manual, being for all systems and machines, cannot
1004 tell you exactly what their names are; instead, we offer a list of some
1005 typical ones. You can use @samp{cpp -dM} to see the values of
1006 predefined macros; see @ref{Invocation}.
1008 Some nonstandard predefined macros describe the operating system in use,
1009 with more or less specificity. For example,
1014 @samp{unix} is normally predefined on all Unix systems.
1018 @samp{BSD} is predefined on recent versions of Berkeley Unix
1019 (perhaps only in version 4.3).
1022 Other nonstandard predefined macros describe the kind of CPU, with more or
1023 less specificity. For example,
1028 @samp{vax} is predefined on Vax computers.
1032 @samp{mc68000} is predefined on most computers whose CPU is a Motorola
1033 68000, 68010 or 68020.
1037 @samp{m68k} is also predefined on most computers whose CPU is a 68000,
1038 68010 or 68020; however, some makers use @samp{mc68000} and some use
1039 @samp{m68k}. Some predefine both names. What happens in GNU C
1040 depends on the system you are using it on.
1044 @samp{M68020} has been observed to be predefined on some systems that
1045 use 68020 CPUs---in addition to @samp{mc68000} and @samp{m68k}, which
1052 Both @samp{_AM29K} and @samp{_AM29000} are predefined for the AMD 29000
1057 @samp{ns32000} is predefined on computers which use the National
1058 Semiconductor 32000 series CPU.
1061 Yet other nonstandard predefined macros describe the manufacturer of
1062 the system. For example,
1067 @samp{sun} is predefined on all models of Sun computers.
1071 @samp{pyr} is predefined on all models of Pyramid computers.
1075 @samp{sequent} is predefined on all models of Sequent computers.
1078 These predefined symbols are not only nonstandard, they are contrary to the
1079 ANSI standard because their names do not start with underscores.
1080 Therefore, the option @samp{-ansi} inhibits the definition of these
1083 This tends to make @samp{-ansi} useless, since many programs depend on the
1084 customary nonstandard predefined symbols. Even system header files check
1085 them and will generate incorrect declarations if they do not find the names
1086 that are expected. You might think that the header files supplied for the
1087 Uglix computer would not need to test what machine they are running on,
1088 because they can simply assume it is the Uglix; but often they do, and they
1089 do so using the customary names. As a result, very few C programs will
1090 compile with @samp{-ansi}. We intend to avoid such problems on the GNU
1093 What, then, should you do in an ANSI C program to test the type of machine
1096 GNU C offers a parallel series of symbols for this purpose, whose names
1097 are made from the customary ones by adding @samp{__} at the beginning
1098 and end. Thus, the symbol @code{__vax__} would be available on a Vax,
1101 The set of nonstandard predefined names in the GNU C preprocessor is
1102 controlled (when @code{cpp} is itself compiled) by the macro
1103 @samp{CPP_PREDEFINES}, which should be a string containing @samp{-D}
1104 options, separated by spaces. For example, on the Sun 3, we use the
1105 following definition:
1108 #define CPP_PREDEFINES "-Dmc68000 -Dsun -Dunix -Dm68k"
1112 This macro is usually specified in @file{tm.h}.
1114 @node Stringification, Concatenation, Predefined, Macros
1115 @subsection Stringification
1117 @cindex stringification
1118 @dfn{Stringification} means turning a code fragment into a string constant
1119 whose contents are the text for the code fragment. For example,
1120 stringifying @samp{foo (z)} results in @samp{"foo (z)"}.
1122 In the C preprocessor, stringification is an option available when macro
1123 arguments are substituted into the macro definition. In the body of the
1124 definition, when an argument name appears, the character @samp{#} before
1125 the name specifies stringification of the corresponding actual argument
1126 when it is substituted at that point in the definition. The same argument
1127 may be substituted in other places in the definition without
1128 stringification if the argument name appears in those places with no
1131 Here is an example of a macro definition that uses stringification:
1135 #define WARN_IF(EXP) \
1137 fprintf (stderr, "Warning: " #EXP "\n"); @} \
1143 Here the actual argument for @samp{EXP} is substituted once as given,
1144 into the @samp{if} statement, and once as stringified, into the
1145 argument to @samp{fprintf}. The @samp{do} and @samp{while (0)} are
1146 a kludge to make it possible to write @samp{WARN_IF (@var{arg});},
1147 which the resemblance of @samp{WARN_IF} to a function would make
1148 C programmers want to do; see @ref{Swallow Semicolon}.
1150 The stringification feature is limited to transforming one macro argument
1151 into one string constant: there is no way to combine the argument with
1152 other text and then stringify it all together. But the example above shows
1153 how an equivalent result can be obtained in ANSI Standard C using the
1154 feature that adjacent string constants are concatenated as one string
1155 constant. The preprocessor stringifies the actual value of @samp{EXP}
1156 into a separate string constant, resulting in text like
1161 fprintf (stderr, "Warning: " "x == 0" "\n"); @} \
1167 but the C compiler then sees three consecutive string constants and
1168 concatenates them into one, producing effectively
1172 fprintf (stderr, "Warning: x == 0\n"); @} \
1176 Stringification in C involves more than putting doublequote characters
1177 around the fragment; it is necessary to put backslashes in front of all
1178 doublequote characters, and all backslashes in string and character
1179 constants, in order to get a valid C string constant with the proper
1180 contents. Thus, stringifying @samp{p = "foo\n";} results in @samp{"p =
1181 \"foo\\n\";"}. However, backslashes that are not inside of string or
1182 character constants are not duplicated: @samp{\n} by itself stringifies to
1185 Whitespace (including comments) in the text being stringified is handled
1186 according to precise rules. All leading and trailing whitespace is ignored.
1187 Any sequence of whitespace in the middle of the text is converted to
1188 a single space in the stringified result.
1190 @node Concatenation, Undefining, Stringification, Macros
1191 @subsection Concatenation
1192 @cindex concatenation
1194 @dfn{Concatenation} means joining two strings into one. In the context
1195 of macro expansion, concatenation refers to joining two lexical units
1196 into one longer one. Specifically, an actual argument to the macro can be
1197 concatenated with another actual argument or with fixed text to produce
1198 a longer name. The longer name might be the name of a function,
1199 variable or type, or a C keyword; it might even be the name of another
1200 macro, in which case it will be expanded.
1202 When you define a macro, you request concatenation with the special
1203 operator @samp{##} in the macro body. When the macro is called,
1204 after actual arguments are substituted, all @samp{##} operators are
1205 deleted, and so is any whitespace next to them (including whitespace
1206 that was part of an actual argument). The result is to concatenate
1207 the syntactic tokens on either side of the @samp{##}.
1209 Consider a C program that interprets named commands. There probably needs
1210 to be a table of commands, perhaps an array of structures declared as
1217 void (*function) ();
1220 struct command commands[] =
1222 @{ "quit", quit_command@},
1223 @{ "help", help_command@},
1228 It would be cleaner not to have to give each command name twice, once in
1229 the string constant and once in the function name. A macro which takes the
1230 name of a command as an argument can make this unnecessary. The string
1231 constant can be created with stringification, and the function name by
1232 concatenating the argument with @samp{_command}. Here is how it is done:
1235 #define COMMAND(NAME) @{ #NAME, NAME ## _command @}
1237 struct command commands[] =
1245 The usual case of concatenation is concatenating two names (or a name and a
1246 number) into a longer name. But this isn't the only valid case. It is
1247 also possible to concatenate two numbers (or a number and a name, such as
1248 @samp{1.5} and @samp{e3}) into a number. Also, multi-character operators
1249 such as @samp{+=} can be formed by concatenation. In some cases it is even
1250 possible to piece together a string constant. However, two pieces of text
1251 that don't together form a valid lexical unit cannot be concatenated. For
1252 example, concatenation with @samp{x} on one side and @samp{+} on the other
1253 is not meaningful because those two characters can't fit together in any
1254 lexical unit of C@. The ANSI standard says that such attempts at
1255 concatenation are undefined, but in the GNU C preprocessor it is well
1256 defined: it puts the @samp{x} and @samp{+} side by side with no particular
1259 Keep in mind that the C preprocessor converts comments to whitespace before
1260 macros are even considered. Therefore, you cannot create a comment by
1261 concatenating @samp{/} and @samp{*}: the @samp{/*} sequence that starts a
1262 comment is not a lexical unit, but rather the beginning of a ``long'' space
1263 character. Also, you can freely use comments next to a @samp{##} in a
1264 macro definition, or in actual arguments that will be concatenated, because
1265 the comments will be converted to spaces at first sight, and concatenation
1266 will later discard the spaces.
1268 @node Undefining, Redefining, Concatenation, Macros
1269 @subsection Undefining Macros
1271 @cindex undefining macros
1272 To @dfn{undefine} a macro means to cancel its definition. This is done
1273 with the @samp{#undef} directive. @samp{#undef} is followed by the macro
1274 name to be undefined.
1276 Like definition, undefinition occurs at a specific point in the source
1277 file, and it applies starting from that point. The name ceases to be a
1278 macro name, and from that point on it is treated by the preprocessor as if
1279 it had never been a macro name.
1300 In this example, @samp{FOO} had better be a variable or function as well
1301 as (temporarily) a macro, in order for the result of the expansion to be
1304 The same form of @samp{#undef} directive will cancel definitions with
1305 arguments or definitions that don't expect arguments. The @samp{#undef}
1306 directive has no effect when used on a name not currently defined as a macro.
1308 @node Redefining, Macro Pitfalls, Undefining, Macros
1309 @subsection Redefining Macros
1311 @cindex redefining macros
1312 @dfn{Redefining} a macro means defining (with @samp{#define}) a name that
1313 is already defined as a macro.
1315 A redefinition is trivial if the new definition is transparently identical
1316 to the old one. You probably wouldn't deliberately write a trivial
1317 redefinition, but they can happen automatically when a header file is
1318 included more than once (@pxref{Header Files}), so they are accepted
1319 silently and without effect.
1321 Nontrivial redefinition is considered likely to be an error, so
1322 it provokes a warning message from the preprocessor. However, sometimes it
1323 is useful to change the definition of a macro in mid-compilation. You can
1324 inhibit the warning by undefining the macro with @samp{#undef} before the
1327 In order for a redefinition to be trivial, the new definition must
1328 exactly match the one already in effect, with two possible exceptions:
1332 Whitespace may be added or deleted at the beginning or the end.
1335 Whitespace may be changed in the middle (but not inside strings).
1336 However, it may not be eliminated entirely, and it may not be added
1337 where there was no whitespace at all.
1340 Recall that a comment counts as whitespace.
1342 @node Macro Pitfalls,, Redefining, Macros
1343 @subsection Pitfalls and Subtleties of Macros
1344 @cindex problems with macros
1345 @cindex pitfalls of macros
1347 In this section we describe some special rules that apply to macros and
1348 macro expansion, and point out certain cases in which the rules have
1349 counterintuitive consequences that you must watch out for.
1352 * Misnesting:: Macros can contain unmatched parentheses.
1353 * Macro Parentheses:: Why apparently superfluous parentheses
1354 may be necessary to avoid incorrect grouping.
1355 * Swallow Semicolon:: Macros that look like functions
1356 but expand into compound statements.
1357 * Side Effects:: Unsafe macros that cause trouble when
1358 arguments contain side effects.
1359 * Self-Reference:: Macros whose definitions use the macros' own names.
1360 * Argument Prescan:: Actual arguments are checked for macro calls
1361 before they are substituted.
1362 * Cascaded Macros:: Macros whose definitions use other macros.
1363 * Newlines in Args:: Sometimes line numbers get confused.
1366 @node Misnesting, Macro Parentheses, Macro Pitfalls, Macro Pitfalls
1367 @subsubsection Improperly Nested Constructs
1369 Recall that when a macro is called with arguments, the arguments are
1370 substituted into the macro body and the result is checked, together with
1371 the rest of the input file, for more macro calls.
1373 It is possible to piece together a macro call coming partially from the
1374 macro body and partially from the actual arguments. For example,
1377 #define double(x) (2*(x))
1378 #define call_with_1(x) x(1)
1382 would expand @samp{call_with_1 (double)} into @samp{(2*(1))}.
1384 Macro definitions do not have to have balanced parentheses. By writing an
1385 unbalanced open parenthesis in a macro body, it is possible to create a
1386 macro call that begins inside the macro body but ends outside of it. For
1390 #define strange(file) fprintf (file, "%s %d",
1392 strange(stderr) p, 35)
1396 This bizarre example expands to @samp{fprintf (stderr, "%s %d", p, 35)}!
1398 @node Macro Parentheses, Swallow Semicolon, Misnesting, Macro Pitfalls
1399 @subsubsection Unintended Grouping of Arithmetic
1400 @cindex parentheses in macro bodies
1402 You may have noticed that in most of the macro definition examples shown
1403 above, each occurrence of a macro argument name had parentheses around it.
1404 In addition, another pair of parentheses usually surround the entire macro
1405 definition. Here is why it is best to write macros that way.
1407 Suppose you define a macro as follows,
1410 #define ceil_div(x, y) (x + y - 1) / y
1414 whose purpose is to divide, rounding up. (One use for this operation is
1415 to compute how many @samp{int} objects are needed to hold a certain
1416 number of @samp{char} objects.) Then suppose it is used as follows:
1419 a = ceil_div (b & c, sizeof (int));
1426 a = (b & c + sizeof (int) - 1) / sizeof (int);
1430 which does not do what is intended. The operator-precedence rules of
1431 C make it equivalent to this:
1434 a = (b & (c + sizeof (int) - 1)) / sizeof (int);
1438 But what we want is this:
1441 a = ((b & c) + sizeof (int) - 1)) / sizeof (int);
1445 Defining the macro as
1448 #define ceil_div(x, y) ((x) + (y) - 1) / (y)
1452 provides the desired result.
1454 Unintended grouping can result in another way. Consider
1455 @samp{sizeof ceil_div(1, 2)}. That has the appearance of a C expression
1456 that would compute the size of the type of @samp{ceil_div (1, 2)}, but in
1457 fact it means something very different. Here is what it expands to:
1460 sizeof ((1) + (2) - 1) / (2)
1464 This would take the size of an integer and divide it by two. The precedence
1465 rules have put the division outside the @samp{sizeof} when it was intended
1468 Parentheses around the entire macro definition can prevent such problems.
1469 Here, then, is the recommended way to define @samp{ceil_div}:
1472 #define ceil_div(x, y) (((x) + (y) - 1) / (y))
1475 @node Swallow Semicolon, Side Effects, Macro Parentheses, Macro Pitfalls
1476 @subsubsection Swallowing the Semicolon
1478 @cindex semicolons (after macro calls)
1479 Often it is desirable to define a macro that expands into a compound
1480 statement. Consider, for example, the following macro, that advances a
1481 pointer (the argument @samp{p} says where to find it) across whitespace
1485 #define SKIP_SPACES (p, limit) \
1486 @{ register char *lim = (limit); \
1487 while (p != lim) @{ \
1488 if (*p++ != ' ') @{ \
1493 Here Backslash-Newline is used to split the macro definition, which must
1494 be a single line, so that it resembles the way such C code would be
1495 laid out if not part of a macro definition.
1497 A call to this macro might be @samp{SKIP_SPACES (p, lim)}. Strictly
1498 speaking, the call expands to a compound statement, which is a complete
1499 statement with no need for a semicolon to end it. But it looks like a
1500 function call. So it minimizes confusion if you can use it like a function
1501 call, writing a semicolon afterward, as in @samp{SKIP_SPACES (p, lim);}
1503 But this can cause trouble before @samp{else} statements, because the
1504 semicolon is actually a null statement. Suppose you write
1508 SKIP_SPACES (p, lim);
1513 The presence of two statements---the compound statement and a null
1514 statement---in between the @samp{if} condition and the @samp{else}
1515 makes invalid C code.
1517 The definition of the macro @samp{SKIP_SPACES} can be altered to solve
1518 this problem, using a @samp{do @dots{} while} statement. Here is how:
1521 #define SKIP_SPACES (p, limit) \
1522 do @{ register char *lim = (limit); \
1523 while (p != lim) @{ \
1524 if (*p++ != ' ') @{ \
1525 p--; break; @}@}@} \
1529 Now @samp{SKIP_SPACES (p, lim);} expands into
1532 do @{@dots{}@} while (0);
1536 which is one statement.
1538 @node Side Effects, Self-Reference, Swallow Semicolon, Macro Pitfalls
1539 @subsubsection Duplication of Side Effects
1541 @cindex side effects (in macro arguments)
1542 @cindex unsafe macros
1543 Many C programs define a macro @samp{min}, for ``minimum'', like this:
1546 #define min(X, Y) ((X) < (Y) ? (X) : (Y))
1549 When you use this macro with an argument containing a side effect,
1553 next = min (x + y, foo (z));
1557 it expands as follows:
1560 next = ((x + y) < (foo (z)) ? (x + y) : (foo (z)));
1564 where @samp{x + y} has been substituted for @samp{X} and @samp{foo (z)}
1567 The function @samp{foo} is used only once in the statement as it appears
1568 in the program, but the expression @samp{foo (z)} has been substituted
1569 twice into the macro expansion. As a result, @samp{foo} might be called
1570 two times when the statement is executed. If it has side effects or
1571 if it takes a long time to compute, the results might not be what you
1572 intended. We say that @samp{min} is an @dfn{unsafe} macro.
1574 The best solution to this problem is to define @samp{min} in a way that
1575 computes the value of @samp{foo (z)} only once. The C language offers no
1576 standard way to do this, but it can be done with GNU C extensions as
1581 (@{ typeof (X) __x = (X), __y = (Y); \
1582 (__x < __y) ? __x : __y; @})
1585 If you do not wish to use GNU C extensions, the only solution is to be
1586 careful when @emph{using} the macro @samp{min}. For example, you can
1587 calculate the value of @samp{foo (z)}, save it in a variable, and use that
1588 variable in @samp{min}:
1591 #define min(X, Y) ((X) < (Y) ? (X) : (Y))
1595 next = min (x + y, tem);
1600 (where we assume that @samp{foo} returns type @samp{int}).
1602 @node Self-Reference, Argument Prescan, Side Effects, Macro Pitfalls
1603 @subsubsection Self-Referential Macros
1605 @cindex self-reference
1606 A @dfn{self-referential} macro is one whose name appears in its definition.
1607 A special feature of ANSI Standard C is that the self-reference is not
1608 considered a macro call. It is passed into the preprocessor output
1611 Let's consider an example:
1614 #define foo (4 + foo)
1618 where @samp{foo} is also a variable in your program.
1620 Following the ordinary rules, each reference to @samp{foo} will expand into
1621 @samp{(4 + foo)}; then this will be rescanned and will expand into @samp{(4
1622 + (4 + foo))}; and so on until it causes a fatal error (memory full) in the
1625 However, the special rule about self-reference cuts this process short
1626 after one step, at @samp{(4 + foo)}. Therefore, this macro definition
1627 has the possibly useful effect of causing the program to add 4 to
1628 the value of @samp{foo} wherever @samp{foo} is referred to.
1630 In most cases, it is a bad idea to take advantage of this feature. A
1631 person reading the program who sees that @samp{foo} is a variable will
1632 not expect that it is a macro as well. The reader will come across the
1633 identifier @samp{foo} in the program and think its value should be that
1634 of the variable @samp{foo}, whereas in fact the value is four greater.
1636 The special rule for self-reference applies also to @dfn{indirect}
1637 self-reference. This is the case where a macro @var{x} expands to use a
1638 macro @samp{y}, and the expansion of @samp{y} refers to the macro
1639 @samp{x}. The resulting reference to @samp{x} comes indirectly from the
1640 expansion of @samp{x}, so it is a self-reference and is not further
1641 expanded. Thus, after
1649 @samp{x} would expand into @samp{(4 + (2 * x))}. Clear?
1651 But suppose @samp{y} is used elsewhere, not from the definition of @samp{x}.
1652 Then the use of @samp{x} in the expansion of @samp{y} is not a self-reference
1653 because @samp{x} is not ``in progress''. So it does expand. However,
1654 the expansion of @samp{x} contains a reference to @samp{y}, and that
1655 is an indirect self-reference now because @samp{y} is ``in progress''.
1656 The result is that @samp{y} expands to @samp{(2 * (4 + y))}.
1658 It is not clear that this behavior would ever be useful, but it is specified
1659 by the ANSI C standard, so you may need to understand it.
1661 @node Argument Prescan, Cascaded Macros, Self-Reference, Macro Pitfalls
1662 @subsubsection Separate Expansion of Macro Arguments
1663 @cindex expansion of arguments
1664 @cindex macro argument expansion
1665 @cindex prescan of macro arguments
1667 We have explained that the expansion of a macro, including the substituted
1668 actual arguments, is scanned over again for macro calls to be expanded.
1670 What really happens is more subtle: first each actual argument text is scanned
1671 separately for macro calls. Then the results of this are substituted into
1672 the macro body to produce the macro expansion, and the macro expansion
1673 is scanned again for macros to expand.
1675 The result is that the actual arguments are scanned @emph{twice} to expand
1676 macro calls in them.
1678 Most of the time, this has no effect. If the actual argument contained
1679 any macro calls, they are expanded during the first scan. The result
1680 therefore contains no macro calls, so the second scan does not change it.
1681 If the actual argument were substituted as given, with no prescan,
1682 the single remaining scan would find the same macro calls and produce
1685 You might expect the double scan to change the results when a
1686 self-referential macro is used in an actual argument of another macro
1687 (@pxref{Self-Reference}): the self-referential macro would be expanded once
1688 in the first scan, and a second time in the second scan. But this is not
1689 what happens. The self-references that do not expand in the first scan are
1690 marked so that they will not expand in the second scan either.
1692 The prescan is not done when an argument is stringified or concatenated.
1702 expands to @samp{"foo"}. Once more, prescan has been prevented from
1703 having any noticeable effect.
1705 More precisely, stringification and concatenation use the argument as
1706 written, in un-prescanned form. The same actual argument would be used in
1707 prescanned form if it is substituted elsewhere without stringification or
1711 #define str(s) #s lose(s)
1716 expands to @samp{"foo" lose(4)}.
1718 You might now ask, ``Why mention the prescan, if it makes no difference?
1719 And why not skip it and make the preprocessor faster?'' The answer is
1720 that the prescan does make a difference in three special cases:
1724 Nested calls to a macro.
1727 Macros that call other macros that stringify or concatenate.
1730 Macros whose expansions contain unshielded commas.
1733 We say that @dfn{nested} calls to a macro occur when a macro's actual
1734 argument contains a call to that very macro. For example, if @samp{f}
1735 is a macro that expects one argument, @samp{f (f (1))} is a nested
1736 pair of calls to @samp{f}. The desired expansion is made by
1737 expanding @samp{f (1)} and substituting that into the definition of
1738 @samp{f}. The prescan causes the expected result to happen.
1739 Without the prescan, @samp{f (1)} itself would be substituted as
1740 an actual argument, and the inner use of @samp{f} would appear
1741 during the main scan as an indirect self-reference and would not
1742 be expanded. Here, the prescan cancels an undesirable side effect
1743 (in the medical, not computational, sense of the term) of the special
1744 rule for self-referential macros.
1746 But prescan causes trouble in certain other cases of nested macro calls.
1751 #define bar(x) lose(x)
1752 #define lose(x) (1 + (x))
1758 We would like @samp{bar(foo)} to turn into @samp{(1 + (foo))}, which
1759 would then turn into @samp{(1 + (a,b))}. But instead, @samp{bar(foo)}
1760 expands into @samp{lose(a,b)}, and you get an error because @code{lose}
1761 requires a single argument. In this case, the problem is easily solved
1762 by the same parentheses that ought to be used to prevent misnesting of
1763 arithmetic operations:
1767 #define bar(x) lose((x))
1770 The problem is more serious when the operands of the macro are not
1771 expressions; for example, when they are statements. Then parentheses
1772 are unacceptable because they would make for invalid C code:
1775 #define foo @{ int a, b; @dots{} @}
1779 In GNU C you can shield the commas using the @samp{(@{@dots{}@})}
1780 construct which turns a compound statement into an expression:
1783 #define foo (@{ int a, b; @dots{} @})
1786 Or you can rewrite the macro definition to avoid such commas:
1789 #define foo @{ int a; int b; @dots{} @}
1792 There is also one case where prescan is useful. It is possible
1793 to use prescan to expand an argument and then stringify it---if you use
1794 two levels of macros. Let's add a new macro @samp{xstr} to the
1795 example shown above:
1798 #define xstr(s) str(s)
1804 This expands into @samp{"4"}, not @samp{"foo"}. The reason for the
1805 difference is that the argument of @samp{xstr} is expanded at prescan
1806 (because @samp{xstr} does not specify stringification or concatenation of
1807 the argument). The result of prescan then forms the actual argument for
1808 @samp{str}. @samp{str} uses its argument without prescan because it
1809 performs stringification; but it cannot prevent or undo the prescanning
1810 already done by @samp{xstr}.
1812 @node Cascaded Macros, Newlines in Args, Argument Prescan, Macro Pitfalls
1813 @subsubsection Cascaded Use of Macros
1815 @cindex cascaded macros
1816 @cindex macro body uses macro
1817 A @dfn{cascade} of macros is when one macro's body contains a reference
1818 to another macro. This is very common practice. For example,
1821 #define BUFSIZE 1020
1822 #define TABLESIZE BUFSIZE
1825 This is not at all the same as defining @samp{TABLESIZE} to be @samp{1020}.
1826 The @samp{#define} for @samp{TABLESIZE} uses exactly the body you
1827 specify---in this case, @samp{BUFSIZE}---and does not check to see whether
1828 it too is the name of a macro.
1830 It's only when you @emph{use} @samp{TABLESIZE} that the result of its expansion
1831 is checked for more macro names.
1833 This makes a difference if you change the definition of @samp{BUFSIZE}
1834 at some point in the source file. @samp{TABLESIZE}, defined as shown,
1835 will always expand using the definition of @samp{BUFSIZE} that is
1836 currently in effect:
1839 #define BUFSIZE 1020
1840 #define TABLESIZE BUFSIZE
1846 Now @samp{TABLESIZE} expands (in two stages) to @samp{37}. (The
1847 @samp{#undef} is to prevent any warning about the nontrivial
1848 redefinition of @code{BUFSIZE}.)
1850 @node Newlines in Args,, Cascaded Macros, Macro Pitfalls
1851 @subsection Newlines in Macro Arguments
1852 @cindex newlines in macro arguments
1854 Traditional macro processing carries forward all newlines in macro
1855 arguments into the expansion of the macro. This means that, if some of
1856 the arguments are substituted more than once, or not at all, or out of
1857 order, newlines can be duplicated, lost, or moved around within the
1858 expansion. If the expansion consists of multiple statements, then the
1859 effect is to distort the line numbers of some of these statements. The
1860 result can be incorrect line numbers, in error messages or displayed in
1863 The GNU C preprocessor operating in ANSI C mode adjusts appropriately
1864 for multiple use of an argument---the first use expands all the
1865 newlines, and subsequent uses of the same argument produce no newlines.
1866 But even in this mode, it can produce incorrect line numbering if
1867 arguments are used out of order, or not used at all.
1869 Here is an example illustrating this problem:
1872 #define ignore_second_arg(a,b,c) a; c
1874 ignore_second_arg (foo (),
1880 The syntax error triggered by the tokens @samp{syntax error} results
1881 in an error message citing line four, even though the statement text
1882 comes from line five.
1884 @node Conditionals, Combining Sources, Macros, Top
1885 @section Conditionals
1887 @cindex conditionals
1888 In a macro processor, a @dfn{conditional} is a directive that allows a part
1889 of the program to be ignored during compilation, on some conditions.
1890 In the C preprocessor, a conditional can test either an arithmetic expression
1891 or whether a name is defined as a macro.
1893 A conditional in the C preprocessor resembles in some ways an @samp{if}
1894 statement in C, but it is important to understand the difference between
1895 them. The condition in an @samp{if} statement is tested during the execution
1896 of your program. Its purpose is to allow your program to behave differently
1897 from run to run, depending on the data it is operating on. The condition
1898 in a preprocessing conditional directive is tested when your program is compiled.
1899 Its purpose is to allow different code to be included in the program depending
1900 on the situation at the time of compilation.
1903 * Uses: Conditional Uses. What conditionals are for.
1904 * Syntax: Conditional Syntax. How conditionals are written.
1905 * Deletion: Deleted Code. Making code into a comment.
1906 * Macros: Conditionals-Macros. Why conditionals are used with macros.
1907 * Assertions:: How and why to use assertions.
1908 * Errors: #error Directive. Detecting inconsistent compilation parameters.
1911 @node Conditional Uses
1912 @subsection Why Conditionals are Used
1914 Generally there are three kinds of reason to use a conditional.
1918 A program may need to use different code depending on the machine or
1919 operating system it is to run on. In some cases the code for one
1920 operating system may be erroneous on another operating system; for
1921 example, it might refer to library routines that do not exist on the
1922 other system. When this happens, it is not enough to avoid executing
1923 the invalid code: merely having it in the program makes it impossible
1924 to link the program and run it. With a preprocessing conditional, the
1925 offending code can be effectively excised from the program when it is
1929 You may want to be able to compile the same source file into two
1930 different programs. Sometimes the difference between the programs is
1931 that one makes frequent time-consuming consistency checks on its
1932 intermediate data, or prints the values of those data for debugging,
1933 while the other does not.
1936 A conditional whose condition is always false is a good way to exclude
1937 code from the program but keep it as a sort of comment for future
1941 Most simple programs that are intended to run on only one machine will
1942 not need to use preprocessing conditionals.
1944 @node Conditional Syntax
1945 @subsection Syntax of Conditionals
1948 A conditional in the C preprocessor begins with a @dfn{conditional
1949 directive}: @samp{#if}, @samp{#ifdef} or @samp{#ifndef}.
1950 @xref{Conditionals-Macros}, for information on @samp{#ifdef} and
1951 @samp{#ifndef}; only @samp{#if} is explained here.
1954 * If: #if Directive. Basic conditionals using @samp{#if} and @samp{#endif}.
1955 * Else: #else Directive. Including some text if the condition fails.
1956 * Elif: #elif Directive. Testing several alternative possibilities.
1960 @subsubsection The @samp{#if} Directive
1962 The @samp{#if} directive in its simplest form consists of
1965 #if @var{expression}
1966 @var{controlled text}
1967 #endif /* @var{expression} */
1970 The comment following the @samp{#endif} is not required, but it is a good
1971 practice because it helps people match the @samp{#endif} to the
1972 corresponding @samp{#if}. Such comments should always be used, except in
1973 short conditionals that are not nested. In fact, you can put anything at
1974 all after the @samp{#endif} and it will be ignored by the GNU C preprocessor,
1975 but only comments are acceptable in ANSI Standard C@.
1977 @var{expression} is a C expression of integer type, subject to stringent
1978 restrictions. It may contain
1982 Integer constants, which are all regarded as @code{long} or
1983 @code{unsigned long}.
1986 Character constants, which are interpreted according to the character
1987 set and conventions of the machine and operating system on which the
1988 preprocessor is running. The GNU C preprocessor uses the C data type
1989 @samp{char} for these character constants; therefore, whether some
1990 character codes are negative is determined by the C compiler used to
1991 compile the preprocessor. If it treats @samp{char} as signed, then
1992 character codes large enough to set the sign bit will be considered
1993 negative; otherwise, no character code is considered negative.
1996 Arithmetic operators for addition, subtraction, multiplication,
1997 division, bitwise operations, shifts, comparisons, and logical
1998 operations (@samp{&&} and @samp{||}).
2001 Identifiers that are not macros, which are all treated as zero(!).
2004 Macro calls. All macro calls in the expression are expanded before
2005 actual computation of the expression's value begins.
2008 Note that @samp{sizeof} operators and @code{enum}-type values are not allowed.
2009 @code{enum}-type values, like all other identifiers that are not taken
2010 as macro calls and expanded, are treated as zero.
2012 The @var{controlled text} inside of a conditional can include
2013 preprocessing directives. Then the directives inside the conditional are
2014 obeyed only if that branch of the conditional succeeds. The text can
2015 also contain other conditional groups. However, the @samp{#if} and
2016 @samp{#endif} directives must balance.
2018 @node #else Directive
2019 @subsubsection The @samp{#else} Directive
2022 The @samp{#else} directive can be added to a conditional to provide
2023 alternative text to be used if the condition is false. This is what
2027 #if @var{expression}
2029 #else /* Not @var{expression} */
2031 #endif /* Not @var{expression} */
2034 If @var{expression} is nonzero, and thus the @var{text-if-true} is
2035 active, then @samp{#else} acts like a failing conditional and the
2036 @var{text-if-false} is ignored. Contrariwise, if the @samp{#if}
2037 conditional fails, the @var{text-if-false} is considered included.
2039 @node #elif Directive
2040 @subsubsection The @samp{#elif} Directive
2043 One common case of nested conditionals is used to check for more than two
2044 possible alternatives. For example, you might have
2058 Another conditional directive, @samp{#elif}, allows this to be abbreviated
2066 #else /* X != 2 and X != 1*/
2068 #endif /* X != 2 and X != 1*/
2071 @samp{#elif} stands for ``else if''. Like @samp{#else}, it goes in the
2072 middle of a @samp{#if}-@samp{#endif} pair and subdivides it; it does not
2073 require a matching @samp{#endif} of its own. Like @samp{#if}, the
2074 @samp{#elif} directive includes an expression to be tested.
2076 The text following the @samp{#elif} is processed only if the original
2077 @samp{#if}-condition failed and the @samp{#elif} condition succeeds.
2078 More than one @samp{#elif} can go in the same @samp{#if}-@samp{#endif}
2079 group. Then the text after each @samp{#elif} is processed only if the
2080 @samp{#elif} condition succeeds after the original @samp{#if} and any
2081 previous @samp{#elif} directives within it have failed. @samp{#else} is
2082 equivalent to @samp{#elif 1}, and @samp{#else} is allowed after any
2083 number of @samp{#elif} directives, but @samp{#elif} may not follow
2087 @subsection Keeping Deleted Code for Future Reference
2088 @cindex commenting out code
2090 If you replace or delete a part of the program but want to keep the old
2091 code around as a comment for future reference, the easy way to do this
2092 is to put @samp{#if 0} before it and @samp{#endif} after it. This is
2093 better than using comment delimiters @samp{/*} and @samp{*/} since those
2094 won't work if the code already contains comments (C comments do not
2097 This works even if the code being turned off contains conditionals, but
2098 they must be entire conditionals (balanced @samp{#if} and @samp{#endif}).
2100 Conversely, do not use @samp{#if 0} for comments which are not C code.
2101 Use the comment delimiters @samp{/*} and @samp{*/} instead. The
2102 interior of @samp{#if 0} must consist of complete tokens; in particular,
2103 singlequote characters must balance. But comments often contain
2104 unbalanced singlequote characters (known in English as apostrophes).
2105 These confuse @samp{#if 0}. They do not confuse @samp{/*}.
2107 @node Conditionals-Macros
2108 @subsection Conditionals and Macros
2110 Conditionals are useful in connection with macros or assertions, because
2111 those are the only ways that an expression's value can vary from one
2112 compilation to another. A @samp{#if} directive whose expression uses no
2113 macros or assertions is equivalent to @samp{#if 1} or @samp{#if 0}; you
2114 might as well determine which one, by computing the value of the
2115 expression yourself, and then simplify the program.
2117 For example, here is a conditional that tests the expression
2118 @samp{BUFSIZE == 1020}, where @samp{BUFSIZE} must be a macro.
2122 printf ("Large buffers!\n");
2123 #endif /* BUFSIZE is large */
2126 (Programmers often wish they could test the size of a variable or data
2127 type in @samp{#if}, but this does not work. The preprocessor does not
2128 understand @code{sizeof}, or typedef names, or even the type keywords
2129 such as @code{int}.)
2132 The special operator @samp{defined} is used in @samp{#if} expressions to
2133 test whether a certain name is defined as a macro. Either @samp{defined
2134 @var{name}} or @samp{defined (@var{name})} is an expression whose value
2135 is 1 if @var{name} is defined as macro at the current point in the
2136 program, and 0 otherwise. For the @samp{defined} operator it makes no
2137 difference what the definition of the macro is; all that matters is
2138 whether there is a definition. Thus, for example,@refill
2141 #if defined (vax) || defined (ns16000)
2145 would succeed if either of the names @samp{vax} and @samp{ns16000} is
2146 defined as a macro. You can test the same condition using assertions
2147 (@pxref{Assertions}), like this:
2150 #if #cpu (vax) || #cpu (ns16000)
2153 If a macro is defined and later undefined with @samp{#undef},
2154 subsequent use of the @samp{defined} operator returns 0, because
2155 the name is no longer defined. If the macro is defined again with
2156 another @samp{#define}, @samp{defined} will recommence returning 1.
2160 Conditionals that test whether just one name is defined are very common,
2161 so there are two special short conditional directives for this case.
2164 @item #ifdef @var{name}
2165 is equivalent to @samp{#if defined (@var{name})}.
2167 @item #ifndef @var{name}
2168 is equivalent to @samp{#if ! defined (@var{name})}.
2171 Macro definitions can vary between compilations for several reasons.
2175 Some macros are predefined on each kind of machine. For example, on a
2176 Vax, the name @samp{vax} is a predefined macro. On other machines, it
2177 would not be defined.
2180 Many more macros are defined by system header files. Different
2181 systems and machines define different macros, or give them different
2182 values. It is useful to test these macros with conditionals to avoid
2183 using a system feature on a machine where it is not implemented.
2186 Macros are a common way of allowing users to customize a program for
2187 different machines or applications. For example, the macro
2188 @samp{BUFSIZE} might be defined in a configuration file for your
2189 program that is included as a header file in each source file. You
2190 would use @samp{BUFSIZE} in a preprocessing conditional in order to
2191 generate different code depending on the chosen configuration.
2194 Macros can be defined or undefined with @samp{-D} and @samp{-U}
2195 command options when you compile the program. You can arrange to
2196 compile the same source file into two different programs by choosing
2197 a macro name to specify which program you want, writing conditionals
2198 to test whether or how this macro is defined, and then controlling
2199 the state of the macro with compiler command options.
2204 Assertions are usually predefined, but can be defined with preprocessor
2205 directives or command-line options.
2209 @subsection Assertions
2212 @dfn{Assertions} are a more systematic alternative to macros in writing
2213 conditionals to test what sort of computer or system the compiled
2214 program will run on. Assertions are usually predefined, but you can
2215 define them with preprocessing directives or command-line options.
2218 The macros traditionally used to describe the type of target are not
2219 classified in any way according to which question they answer; they may
2220 indicate a hardware architecture, a particular hardware model, an
2221 operating system, a particular version of an operating system, or
2222 specific configuration options. These are jumbled together in a single
2223 namespace. In contrast, each assertion consists of a named question and
2224 an answer. The question is usually called the @dfn{predicate}.
2225 An assertion looks like this:
2228 #@var{predicate} (@var{answer})
2232 You must use a properly formed identifier for @var{predicate}. The
2233 value of @var{answer} can be any sequence of words; all characters are
2234 significant except for leading and trailing whitespace, and differences
2235 in internal whitespace sequences are ignored. Thus, @samp{x + y} is
2236 different from @samp{x+y} but equivalent to @samp{x + y}. @samp{)} is
2237 not allowed in an answer.
2239 @cindex testing predicates
2240 Here is a conditional to test whether the answer @var{answer} is asserted
2241 for the predicate @var{predicate}:
2244 #if #@var{predicate} (@var{answer})
2248 There may be more than one answer asserted for a given predicate. If
2249 you omit the answer, you can test whether @emph{any} answer is asserted
2250 for @var{predicate}:
2253 #if #@var{predicate}
2259 Most of the time, the assertions you test will be predefined assertions.
2260 GNU C provides three predefined predicates: @code{system}, @code{cpu},
2261 and @code{machine}. @code{system} is for assertions about the type of
2262 software, @code{cpu} describes the type of computer architecture, and
2263 @code{machine} gives more information about the computer. For example,
2264 on a GNU system, the following assertions would be true:
2270 #system (mach 3.@var{subversion})
2272 #system (hurd @var{version})
2276 and perhaps others. The alternatives with
2277 more or less version information let you ask more or less detailed
2278 questions about the type of system software.
2280 On a Unix system, you would find @code{#system (unix)} and perhaps one of:
2281 @code{#system (aix)}, @code{#system (bsd)}, @code{#system (hpux)},
2282 @code{#system (lynx)}, @code{#system (mach)}, @code{#system (posix)},
2283 @code{#system (svr3)}, @code{#system (svr4)}, or @code{#system (xpg4)}
2284 with possible version numbers following.
2286 Other values for @code{system} are @code{#system (mvs)}
2287 and @code{#system (vms)}.
2289 @strong{Portability note:} Many Unix C compilers provide only one answer
2290 for the @code{system} assertion: @code{#system (unix)}, if they support
2291 assertions at all. This is less than useful.
2293 An assertion with a multi-word answer is completely different from several
2294 assertions with individual single-word answers. For example, the presence
2295 of @code{system (mach 3.0)} does not mean that @code{system (3.0)} is true.
2296 It also does not directly imply @code{system (mach)}, but in GNU C, that
2297 last will normally be asserted as well.
2299 The current list of possible assertion values for @code{cpu} is:
2300 @code{#cpu (a29k)}, @code{#cpu (alpha)}, @code{#cpu (arm)}, @code{#cpu
2301 (clipper)}, @code{#cpu (convex)}, @code{#cpu (elxsi)}, @code{#cpu
2302 (tron)}, @code{#cpu (h8300)}, @code{#cpu (i370)}, @code{#cpu (i386)},
2303 @code{#cpu (i860)}, @code{#cpu (i960)}, @code{#cpu (m68k)}, @code{#cpu
2304 (m88k)}, @code{#cpu (mips)}, @code{#cpu (ns32k)}, @code{#cpu (hppa)},
2305 @code{#cpu (pyr)}, @code{#cpu (ibm032)}, @code{#cpu (rs6000)},
2306 @code{#cpu (sh)}, @code{#cpu (sparc)}, @code{#cpu (spur)}, @code{#cpu
2307 (tahoe)}, @code{#cpu (vax)}, @code{#cpu (we32000)}.
2310 You can create assertions within a C program using @samp{#assert}, like
2314 #assert @var{predicate} (@var{answer})
2318 (Note the absence of a @samp{#} before @var{predicate}.)
2321 @cindex assertions, undoing
2322 @cindex retracting assertions
2324 Each time you do this, you assert a new true answer for @var{predicate}.
2325 Asserting one answer does not invalidate previously asserted answers;
2326 they all remain true. The only way to remove an assertion is with
2327 @samp{#unassert}. @samp{#unassert} has the same syntax as
2328 @samp{#assert}. You can also remove all assertions about
2329 @var{predicate} like this:
2332 #unassert @var{predicate}
2335 You can also add or cancel assertions using command options
2336 when you run @code{gcc} or @code{cpp}. @xref{Invocation}.
2338 @node #error Directive
2339 @subsection The @samp{#error} and @samp{#warning} Directives
2342 The directive @samp{#error} causes the preprocessor to report a fatal
2343 error. The rest of the line that follows @samp{#error} is used as the
2344 error message. The line must consist of complete tokens.
2346 You would use @samp{#error} inside of a conditional that detects a
2347 combination of parameters which you know the program does not properly
2348 support. For example, if you know that the program will not run
2349 properly on a Vax, you might write
2354 #error "Won't work on Vaxen. See comments at get_last_object."
2360 @xref{Nonstandard Predefined}, for why this works.
2362 If you have several configuration parameters that must be set up by
2363 the installation in a consistent way, you can use conditionals to detect
2364 an inconsistency and report it with @samp{#error}. For example,
2367 #if HASH_TABLE_SIZE % 2 == 0 || HASH_TABLE_SIZE % 3 == 0 \
2368 || HASH_TABLE_SIZE % 5 == 0
2369 #error HASH_TABLE_SIZE should not be divisible by a small prime
2374 The directive @samp{#warning} is like the directive @samp{#error}, but causes
2375 the preprocessor to issue a warning and continue preprocessing. The rest of
2376 the line that follows @samp{#warning} is used as the warning message.
2378 You might use @samp{#warning} in obsolete header files, with a message
2379 directing the user to the header file which should be used instead.
2381 @node Combining Sources, Other Directives, Conditionals, Top
2382 @section Combining Source Files
2384 @cindex line control
2385 One of the jobs of the C preprocessor is to inform the C compiler of where
2386 each line of C code came from: which source file and which line number.
2388 C code can come from multiple source files if you use @samp{#include};
2389 both @samp{#include} and the use of conditionals and macros can cause
2390 the line number of a line in the preprocessor output to be different
2391 from the line's number in the original source file. You will appreciate
2392 the value of making both the C compiler (in error messages) and symbolic
2393 debuggers such as GDB use the line numbers in your source file.
2395 The C preprocessor builds on this feature by offering a directive by which
2396 you can control the feature explicitly. This is useful when a file for
2397 input to the C preprocessor is the output from another program such as the
2398 @code{bison} parser generator, which operates on another file that is the
2399 true source file. Parts of the output from @code{bison} are generated from
2400 scratch, other parts come from a standard parser file. The rest are copied
2401 nearly verbatim from the source file, but their line numbers in the
2402 @code{bison} output are not the same as their original line numbers.
2403 Naturally you would like compiler error messages and symbolic debuggers to
2404 know the original source file and line number of each line in the
2408 @code{bison} arranges this by writing @samp{#line} directives into the output
2409 file. @samp{#line} is a directive that specifies the original line number
2410 and source file name for subsequent input in the current preprocessor input
2411 file. @samp{#line} has three variants:
2414 @item #line @var{linenum}
2415 Here @var{linenum} is a decimal integer constant. This specifies that
2416 the line number of the following line of input, in its original source file,
2419 @item #line @var{linenum} @var{filename}
2420 Here @var{linenum} is a decimal integer constant and @var{filename}
2421 is a string constant. This specifies that the following line of input
2422 came originally from source file @var{filename} and its line number there
2423 was @var{linenum}. Keep in mind that @var{filename} is not just a
2424 file name; it is surrounded by doublequote characters so that it looks
2425 like a string constant.
2427 @item #line @var{anything else}
2428 @var{anything else} is checked for macro calls, which are expanded.
2429 The result should be a decimal integer constant followed optionally
2430 by a string constant, as described above.
2433 @samp{#line} directives alter the results of the @samp{__FILE__} and
2434 @samp{__LINE__} predefined macros from that point on. @xref{Standard
2437 The output of the preprocessor (which is the input for the rest of the
2438 compiler) contains directives that look much like @samp{#line} directives.
2439 They start with just @samp{#} instead of @samp{#line}, but this is
2440 followed by a line number and file name as in @samp{#line}. @xref{Output}.
2442 @node Other Directives, Output, Combining Sources, Top
2443 @section Miscellaneous Preprocessing Directives
2445 @cindex null directive
2446 This section describes three additional preprocessing directives. They are
2447 not very useful, but are mentioned for completeness.
2449 The @dfn{null directive} consists of a @samp{#} followed by a Newline, with
2450 only whitespace (including comments) in between. A null directive is
2451 understood as a preprocessing directive but has no effect on the preprocessor
2452 output. The primary significance of the existence of the null directive is
2453 that an input line consisting of just a @samp{#} will produce no output,
2454 rather than a line of output containing just a @samp{#}. Supposedly
2455 some old C programs contain such lines.
2458 The ANSI standard specifies that the effect of the @samp{#pragma}
2459 directive is implementation-defined. In the GNU C preprocessor,
2460 @samp{#pragma} directives are not used, except for @samp{#pragma once}
2461 (@pxref{Once-Only}). However, they are left in the preprocessor output,
2462 so they are available to the compilation pass.
2465 The @samp{#ident} directive is supported for compatibility with certain
2466 other systems. It is followed by a line of text. On some systems, the
2467 text is copied into a special place in the object file; on most systems,
2468 the text is ignored and this directive has no effect. Typically
2469 @samp{#ident} is only used in header files supplied with those systems
2470 where it is meaningful.
2472 @node Output, Invocation, Other Directives, Top
2473 @section C Preprocessor Output
2475 @cindex output format
2476 The output from the C preprocessor looks much like the input, except
2477 that all preprocessing directive lines have been replaced with blank lines
2478 and all comments with spaces. Whitespace within a line is not altered;
2479 however, unless @samp{-traditional} is used, spaces may be inserted into
2480 the expansions of macro calls to prevent tokens from being concatenated.
2482 Source file name and line number information is conveyed by lines of
2486 # @var{linenum} @var{filename} @var{flags}
2490 which are inserted as needed into the middle of the input (but never
2491 within a string or character constant). Such a line means that the
2492 following line originated in file @var{filename} at line @var{linenum}.
2494 After the file name comes zero or more flags, which are @samp{1},
2495 @samp{2}, @samp{3}, or @samp{4}. If there are multiple flags, spaces separate
2496 them. Here is what the flags mean:
2500 This indicates the start of a new file.
2502 This indicates returning to a file (after having included another file).
2504 This indicates that the following text comes from a system header file,
2505 so certain warnings should be suppressed.
2507 This indicates that the following text should be treated as C@.
2508 @c maybe cross reference NO_IMPLICIT_EXTERN_C
2511 @node Invocation, Concept Index, Output, Top
2512 @section Invoking the C Preprocessor
2513 @cindex invocation of the preprocessor
2515 Most often when you use the C preprocessor you will not have to invoke it
2516 explicitly: the C compiler will do so automatically. However, the
2517 preprocessor is sometimes useful on its own.
2519 The C preprocessor expects two file names as arguments, @var{infile} and
2520 @var{outfile}. The preprocessor reads @var{infile} together with any other
2521 files it specifies with @samp{#include}. All the output generated by the
2522 combined input files is written in @var{outfile}.
2524 Either @var{infile} or @var{outfile} may be @samp{-}, which as @var{infile}
2525 means to read from standard input and as @var{outfile} means to write to
2526 standard output. Also, if @var{outfile} or both file names are omitted,
2527 the standard output and standard input are used for the omitted file names.
2530 Here is a table of command options accepted by the C preprocessor.
2531 These options can also be given when compiling a C program; they are
2532 passed along automatically to the preprocessor when it is invoked by the
2538 Inhibit generation of @samp{#}-lines with line-number information in
2539 the output from the preprocessor (@pxref{Output}). This might be
2540 useful when running the preprocessor on something that is not C code
2541 and will be sent to a program which might be confused by the
2546 Do not discard comments: pass them through to the output file.
2547 Comments appearing in arguments of a macro call will be copied to the
2548 output before the expansion of the macro call.
2551 @findex -traditional
2552 Try to imitate the behavior of old-fashioned C, as opposed to ANSI C@.
2556 Traditional macro expansion pays no attention to singlequote or
2557 doublequote characters; macro argument symbols are replaced by the
2558 argument values even when they appear within apparent string or
2559 character constants.
2562 Traditionally, it is permissible for a macro expansion to end in the
2563 middle of a string or character constant. The constant continues into
2564 the text surrounding the macro call.
2567 However, traditionally the end of the line terminates a string or
2568 character constant, with no error.
2571 In traditional C, a comment is equivalent to no text at all. (In ANSI
2572 C, a comment counts as whitespace.)
2575 Traditional C does not have the concept of a ``preprocessing number''.
2576 It considers @samp{1.0e+4} to be three tokens: @samp{1.0e}, @samp{+},
2580 A macro is not suppressed within its own definition, in traditional C@.
2581 Thus, any macro that is used recursively inevitably causes an error.
2584 The character @samp{#} has no special meaning within a macro definition
2588 In traditional C, the text at the end of a macro expansion can run
2589 together with the text after the macro call, to produce a single token.
2590 (This is impossible in ANSI C@.)
2593 Traditionally, @samp{\} inside a macro argument suppresses the syntactic
2594 significance of the following character.
2598 @cindex unterminated
2599 Use the @samp{-traditional} option when preprocessing Fortran code,
2600 so that singlequotes and doublequotes
2601 within Fortran comment lines
2602 (which are generally not recognized as such by the preprocessor)
2603 do not cause diagnostics
2604 about unterminated character or string constants.
2606 However, this option does not prevent diagnostics
2607 about unterminated comments
2608 when a C-style comment appears to start, but not end,
2609 within Fortran-style commentary.
2611 So, the following Fortran comment lines are accepted with
2612 @samp{-traditional}:
2615 C This isn't an unterminated character constant
2616 C Neither is "20000000000, an octal constant
2617 C in some dialects of Fortran
2620 However, this type of comment line will likely produce a diagnostic,
2621 or at least unexpected output from the preprocessor,
2622 due to the unterminated comment:
2625 C Some Fortran compilers accept /* as starting
2626 C an inline comment.
2630 Note that @code{g77} automatically supplies
2631 the @samp{-traditional} option
2632 when it invokes the preprocessor.
2633 However, a future version of @code{g77}
2634 might use a different, more-Fortran-aware preprocessor
2635 in place of @code{cpp}.
2639 Process ANSI standard trigraph sequences. These are three-character
2640 sequences, all starting with @samp{??}, that are defined by ANSI C to
2641 stand for single characters. For example, @samp{??/} stands for
2642 @samp{\}, so @samp{'??/n'} is a character constant for a newline.
2643 Strictly speaking, the GNU C preprocessor does not support all
2644 programs in ANSI Standard C unless @samp{-trigraphs} is used, but if
2645 you ever notice the difference it will be with relief.
2647 You don't want to know any more about trigraphs.
2651 Issue warnings required by the ANSI C standard in certain cases such
2652 as when text other than a comment follows @samp{#else} or @samp{#endif}.
2654 @item -pedantic-errors
2655 @findex -pedantic-errors
2656 Like @samp{-pedantic}, except that errors are produced rather than
2661 Warn if any trigraphs are encountered (assuming they are enabled).
2666 @c "Not worth documenting" both singular and plural forms of this
2667 @c option, per RMS. But also unclear which is better; hence may need to
2668 @c switch this at some future date. pesch@cygnus.com, 2jan92.
2670 (Both forms have the same effect).
2672 Warn whenever a comment-start sequence @samp{/*} appears in a @samp{/*}
2673 comment, or whenever a Backslash-Newline appears in a @samp{//} comment.
2677 Requests both @samp{-Wtrigraphs} and @samp{-Wcomment} (but not
2678 @samp{-Wtraditional} or @samp{-Wundef}).
2681 @findex -Wtraditional
2682 Warn about certain constructs that behave differently in traditional and
2687 Warn if an undefined identifier is evaluated in an @samp{#if} directive.
2689 @item -I @var{directory}
2691 Add the directory @var{directory} to the head of the list of
2692 directories to be searched for header files (@pxref{Include Syntax}).
2693 This can be used to override a system header file, substituting your
2694 own version, since these directories are searched before the system
2695 header file directories. If you use more than one @samp{-I} option,
2696 the directories are scanned in left-to-right order; the standard
2697 system directories come after.
2700 Any directories specified with @samp{-I} options before the @samp{-I-}
2701 option are searched only for the case of @samp{#include "@var{file}"};
2702 they are not searched for @samp{#include <@var{file}>}.
2704 If additional directories are specified with @samp{-I} options after
2705 the @samp{-I-}, these directories are searched for all @samp{#include}
2708 In addition, the @samp{-I-} option inhibits the use of the current
2709 directory as the first search directory for @samp{#include "@var{file}"}.
2710 Therefore, the current directory is searched only if it is requested
2711 explicitly with @samp{-I.}. Specifying both @samp{-I-} and @samp{-I.}
2712 allows you to control precisely which directories are searched before
2713 the current one and which are searched after.
2717 Do not search the standard system directories for header files.
2718 Only the directories you have specified with @samp{-I} options
2719 (and the current directory, if appropriate) are searched.
2723 Do not search for header files in the C++-specific standard directories,
2724 but do still search the other standard directories.
2725 (This option is used when building the C++ library.)
2729 When searching for a header file in a directory, remap file names if a
2730 file named @file{header.gcc} exists in that directory. This can be used
2731 to work around limitations of file systems with file name restrictions.
2732 The @file{header.gcc} file should contain a series of lines with two
2733 tokens on each line: the first token is the name to map, and the second
2734 token is the actual name to use.
2738 Predefine @var{name} as a macro, with definition @samp{1}.
2740 @item -D @var{name}=@var{definition}
2741 Predefine @var{name} as a macro, with definition @var{definition}.
2742 There are no restrictions on the contents of @var{definition}, but if
2743 you are invoking the preprocessor from a shell or shell-like program you
2744 may need to use the shell's quoting syntax to protect characters such as
2745 spaces that have a meaning in the shell syntax. If you use more than
2746 one @samp{-D} for the same @var{name}, the rightmost definition takes
2751 Do not predefine @var{name}. If both @samp{-U} and @samp{-D} are
2752 specified for one name, the @samp{-U} beats the @samp{-D} and the name
2757 Do not predefine any nonstandard macros.
2759 @item -A @var{predicate}(@var{answer})
2761 Make an assertion with the predicate @var{predicate} and answer
2762 @var{answer}. @xref{Assertions}.
2765 You can use @samp{-A-} to disable all predefined assertions; it also
2766 undefines all predefined macros that identify the type of target system.
2770 Instead of outputting the result of preprocessing, output a list of
2771 @samp{#define} directives for all the macros defined during the
2772 execution of the preprocessor, including predefined macros. This gives
2773 you a way of finding out what is predefined in your version of the
2774 preprocessor; assuming you have no file @samp{foo.h}, the command
2777 touch foo.h; cpp -dM foo.h
2781 will show the values of any predefined macros.
2785 Like @samp{-dM} except in two respects: it does @emph{not} include the
2786 predefined macros, and it outputs @emph{both} the @samp{#define}
2787 directives and the result of preprocessing. Both kinds of output go to
2788 the standard output file.
2792 Output @samp{#include} directives in addition to the result of preprocessing.
2796 Instead of outputting the result of preprocessing, output a rule
2797 suitable for @code{make} describing the dependencies of the main
2798 source file. The preprocessor outputs one @code{make} rule containing
2799 the object file name for that source file, a colon, and the names of
2800 all the included files. If there are many included files then the
2801 rule is split into several lines using @samp{\}-newline.
2803 @samp{-MG} says to treat missing header files as generated files and assume
2804 they live in the same directory as the source file. It must be specified
2805 in addition to @samp{-M}.
2807 This feature is used in automatic updating of makefiles.
2811 Like @samp{-M} but mention only the files included with @samp{#include
2812 "@var{file}"}. System header files included with @samp{#include
2813 <@var{file}>} are omitted.
2815 @item -MD @var{file}
2817 Like @samp{-M} but the dependency information is written to @var{file}.
2818 This is in addition to compiling the file as specified---@samp{-MD} does
2819 not inhibit ordinary compilation the way @samp{-M} does.
2821 When invoking @code{gcc}, do not specify the @var{file} argument.
2822 @code{gcc} will create file names made by replacing ".c" with ".d" at
2823 the end of the input file names.
2825 In Mach, you can use the utility @code{md} to merge multiple dependency
2826 files into a single dependency file suitable for using with the @samp{make}
2829 @item -MMD @var{file}
2831 Like @samp{-MD} except mention only user header files, not system
2836 Print the name of each header file used, in addition to other normal
2839 @item -imacros @var{file}
2841 Process @var{file} as input, discarding the resulting output, before
2842 processing the regular input file. Because the output generated from
2843 @var{file} is discarded, the only effect of @samp{-imacros @var{file}}
2844 is to make the macros defined in @var{file} available for use in the
2847 @item -include @var{file}
2849 Process @var{file} as input, and include all the resulting output,
2850 before processing the regular input file.
2852 @item -idirafter @var{dir}
2854 @cindex second include path
2855 Add the directory @var{dir} to the second include path. The directories
2856 on the second include path are searched when a header file is not found
2857 in any of the directories in the main include path (the one that
2860 @item -iprefix @var{prefix}
2862 Specify @var{prefix} as the prefix for subsequent @samp{-iwithprefix}
2865 @item -iwithprefix @var{dir}
2866 @findex -iwithprefix
2867 Add a directory to the second include path. The directory's name is
2868 made by concatenating @var{prefix} and @var{dir}, where @var{prefix}
2869 was specified previously with @samp{-iprefix}.
2871 @item -isystem @var{dir}
2873 Add a directory to the beginning of the second include path, marking it
2874 as a system directory, so that it gets the same special treatment as
2875 is applied to the standard system directories.
2886 @findex -lang-objc++
2887 Specify the source language. @samp{-lang-c} is the default; it
2888 allows recognition of C++ comments (comments that begin with
2889 @samp{//} and end at end of line) and hexadecimal floating-point constants,
2890 since these features will most likely appear in the next C standard.
2891 @samp{-lang-c89} disables recognition of C++ comments and
2892 hexadecimal floating-point constants. @samp{-lang-c++}
2893 handles C++ comment syntax and includes extra default include
2894 directories for C++. @samp{-lang-objc} enables the Objective C
2895 @samp{#import} directive. @samp{-lang-objc++} enables both C++ and Objective C
2898 These options are generated by the compiler driver @code{gcc}, but not
2899 passed from the @samp{gcc} command line unless you use the driver's
2903 Look for commands to the program checker @code{lint} embedded in
2904 comments, and emit them preceded by @samp{#pragma lint}. For example,
2905 the comment @samp{/* NOTREACHED */} becomes @samp{#pragma lint
2908 This option is available only when you call @code{cpp} directly;
2909 @code{gcc} will not pass it from its command line.
2913 Forbid the use of @samp{$} in identifiers. This was formerly required
2914 for strict conformance to the C Standard before the standard was
2917 This option is available only when you call @code{cpp} directly;
2918 @code{gcc} will not pass it from its command line.
2922 @node Concept Index, Index, Invocation, Top
2923 @unnumbered Concept Index
2926 @node Index,, Concept Index, Top
2927 @unnumbered Index of Directives, Macros and Options