1 @node I/O on Streams, Low-Level I/O, I/O Overview, Top
2 @chapter Input/Output on Streams
4 This chapter describes the functions for creating streams and performing
5 input and output operations on them. As discussed in @ref{I/O
6 Overview}, a stream is a fairly abstract, high-level concept
7 representing a communications channel to a file, device, or process.
10 * Streams:: About the data type representing a stream.
11 * Standard Streams:: Streams to the standard input and output
12 devices are created for you.
13 * Opening Streams:: How to create a stream to talk to a file.
14 * Closing Streams:: Close a stream when you are finished with it.
15 * Simple Output:: Unformatted output by characters and lines.
16 * Character Input:: Unformatted input by characters and words.
17 * Line Input:: Reading a line or a record from a stream.
18 * Unreading:: Peeking ahead/pushing back input just read.
19 * Block Input/Output:: Input and output operations on blocks of data.
20 * Formatted Output:: @code{printf} and related functions.
21 * Customizing Printf:: You can define new conversion specifiers for
22 @code{printf} and friends.
23 * Formatted Input:: @code{scanf} and related functions.
24 * EOF and Errors:: How you can tell if an I/O error happens.
25 * Binary Streams:: Some systems distinguish between text files
27 * File Positioning:: About random-access streams.
28 * Portable Positioning:: Random access on peculiar ISO C systems.
29 * Stream Buffering:: How to control buffering of streams.
30 * Other Kinds of Streams:: Streams that do not necessarily correspond
37 For historical reasons, the type of the C data structure that represents
38 a stream is called @code{FILE} rather than ``stream''. Since most of
39 the library functions deal with objects of type @code{FILE *}, sometimes
40 the term @dfn{file pointer} is also used to mean ``stream''. This leads
41 to unfortunate confusion over terminology in many books on C. This
42 manual, however, is careful to use the terms ``file'' and ``stream''
43 only in the technical sense.
47 The @code{FILE} type is declared in the header file @file{stdio.h}.
51 @deftp {Data Type} FILE
52 This is the data type used to represent stream objects. A @code{FILE}
53 object holds all of the internal state information about the connection
54 to the associated file, including such things as the file position
55 indicator and buffering information. Each stream also has error and
56 end-of-file status indicators that can be tested with the @code{ferror}
57 and @code{feof} functions; see @ref{EOF and Errors}.
60 @code{FILE} objects are allocated and managed internally by the
61 input/output library functions. Don't try to create your own objects of
62 type @code{FILE}; let the library do it. Your programs should
63 deal only with pointers to these objects (that is, @code{FILE *} values)
64 rather than the objects themselves.
65 @c !!! should say that FILE's have "No user-serviceable parts inside."
67 @node Standard Streams
68 @section Standard Streams
69 @cindex standard streams
70 @cindex streams, standard
72 When the @code{main} function of your program is invoked, it already has
73 three predefined streams open and available for use. These represent
74 the ``standard'' input and output channels that have been established
77 These streams are declared in the header file @file{stdio.h}.
82 @deftypevar {FILE *} stdin
83 The @dfn{standard input} stream, which is the normal source of input for the
86 @cindex standard input stream
90 @deftypevar {FILE *} stdout
91 The @dfn{standard output} stream, which is used for normal output from
94 @cindex standard output stream
98 @deftypevar {FILE *} stderr
99 The @dfn{standard error} stream, which is used for error messages and
100 diagnostics issued by the program.
102 @cindex standard error stream
104 In the GNU system, you can specify what files or processes correspond to
105 these streams using the pipe and redirection facilities provided by the
106 shell. (The primitives shells use to implement these facilities are
107 described in @ref{File System Interface}.) Most other operating systems
108 provide similar mechanisms, but the details of how to use them can vary.
110 In the GNU C library, @code{stdin}, @code{stdout}, and @code{stderr} are
111 normal variables which you can set just like any others. For example, to redirect
112 the standard output to a file, you could do:
116 stdout = fopen ("standard-output-file", "w");
119 Note however, that in other systems @code{stdin}, @code{stdout}, and
120 @code{stderr} are macros that you cannot assign to in the normal way.
121 But you can use @code{freopen} to get the effect of closing one and
122 reopening it. @xref{Opening Streams}.
124 @node Opening Streams
125 @section Opening Streams
127 @cindex opening a stream
128 Opening a file with the @code{fopen} function creates a new stream and
129 establishes a connection between the stream and a file. This may
130 involve creating a new file.
133 Everything described in this section is declared in the header file
138 @deftypefun {FILE *} fopen (const char *@var{filename}, const char *@var{opentype})
139 The @code{fopen} function opens a stream for I/O to the file
140 @var{filename}, and returns a pointer to the stream.
142 The @var{opentype} argument is a string that controls how the file is
143 opened and specifies attributes of the resulting stream. It must begin
144 with one of the following sequences of characters:
148 Open an existing file for reading only.
151 Open the file for writing only. If the file already exists, it is
152 truncated to zero length. Otherwise a new file is created.
155 Open a file for append access; that is, writing at the end of file only.
156 If the file already exists, its initial contents are unchanged and
157 output to the stream is appended to the end of the file.
158 Otherwise, a new, empty file is created.
161 Open an existing file for both reading and writing. The initial contents
162 of the file are unchanged and the initial file position is at the
163 beginning of the file.
166 Open a file for both reading and writing. If the file already exists, it
167 is truncated to zero length. Otherwise, a new file is created.
170 Open or create file for both reading and appending. If the file exists,
171 its initial contents are unchanged. Otherwise, a new file is created.
172 The initial file position for reading is at the beginning of the file,
173 but output is always appended to the end of the file.
176 As you can see, @samp{+} requests a stream that can do both input and
177 output. The ISO standard says that when using such a stream, you must
178 call @code{fflush} (@pxref{Stream Buffering}) or a file positioning
179 function such as @code{fseek} (@pxref{File Positioning}) when switching
180 from reading to writing or vice versa. Otherwise, internal buffers
181 might not be emptied properly. The GNU C library does not have this
182 limitation; you can do arbitrary reading and writing operations on a
183 stream in whatever order.
185 Additional characters may appear after these to specify flags for the
186 call. Always put the mode (@samp{r}, @samp{w+}, etc.) first; that is
187 the only part you are guaranteed will be understood by all systems.
189 The GNU C library defines one additional character for use in
190 @var{opentype}: the character @samp{x} insists on creating a new
191 file---if a file @var{filename} already exists, @code{fopen} fails
192 rather than opening it. If you use @samp{x} you can are guaranteed that
193 you will not clobber an existing file. This is equivalent to the
194 @code{O_EXCL} option to the @code{open} function (@pxref{Opening and
197 The character @samp{b} in @var{opentype} has a standard meaning; it
198 requests a binary stream rather than a text stream. But this makes no
199 difference in POSIX systems (including the GNU system). If both
200 @samp{+} and @samp{b} are specified, they can appear in either order.
201 @xref{Binary Streams}.
203 Any other characters in @var{opentype} are simply ignored. They may be
204 meaningful in other systems.
206 If the open fails, @code{fopen} returns a null pointer.
209 You can have multiple streams (or file descriptors) pointing to the same
210 file open at the same time. If you do only input, this works
211 straightforwardly, but you must be careful if any output streams are
212 included. @xref{Stream/Descriptor Precautions}. This is equally true
213 whether the streams are in one program (not usual) or in several
214 programs (which can easily happen). It may be advantageous to use the
215 file locking facilities to avoid simultaneous access. @xref{File
220 @deftypevr Macro int FOPEN_MAX
221 The value of this macro is an integer constant expression that
222 represents the minimum number of streams that the implementation
223 guarantees can be open simultaneously. You might be able to open more
224 than this many streams, but that is not guaranteed. The value of this
225 constant is at least eight, which includes the three standard streams
226 @code{stdin}, @code{stdout}, and @code{stderr}. In POSIX.1 systems this
227 value is determined by the @code{OPEN_MAX} parameter; @pxref{General
228 Limits}. In BSD and GNU, it is controlled by the @code{RLIMIT_NOFILE}
229 resource limit; @pxref{Limits on Resources}.
234 @deftypefun {FILE *} freopen (const char *@var{filename}, const char *@var{opentype}, FILE *@var{stream})
235 This function is like a combination of @code{fclose} and @code{fopen}.
236 It first closes the stream referred to by @var{stream}, ignoring any
237 errors that are detected in the process. (Because errors are ignored,
238 you should not use @code{freopen} on an output stream if you have
239 actually done any output using the stream.) Then the file named by
240 @var{filename} is opened with mode @var{opentype} as for @code{fopen},
241 and associated with the same stream object @var{stream}.
243 If the operation fails, a null pointer is returned; otherwise,
244 @code{freopen} returns @var{stream}.
246 @code{freopen} has traditionally been used to connect a standard stream
247 such as @code{stdin} with a file of your own choice. This is useful in
248 programs in which use of a standard stream for certain purposes is
249 hard-coded. In the GNU C library, you can simply close the standard
250 streams and open new ones with @code{fopen}. But other systems lack
251 this ability, so using @code{freopen} is more portable.
255 @node Closing Streams
256 @section Closing Streams
258 @cindex closing a stream
259 When a stream is closed with @code{fclose}, the connection between the
260 stream and the file is cancelled. After you have closed a stream, you
261 cannot perform any additional operations on it.
265 @deftypefun int fclose (FILE *@var{stream})
266 This function causes @var{stream} to be closed and the connection to
267 the corresponding file to be broken. Any buffered output is written
268 and any buffered input is discarded. The @code{fclose} function returns
269 a value of @code{0} if the file was closed successfully, and @code{EOF}
270 if an error was detected.
272 It is important to check for errors when you call @code{fclose} to close
273 an output stream, because real, everyday errors can be detected at this
274 time. For example, when @code{fclose} writes the remaining buffered
275 output, it might get an error because the disk is full. Even if you
276 know the buffer is empty, errors can still occur when closing a file if
279 The function @code{fclose} is declared in @file{stdio.h}.
282 To close all streams currently available the GNU C Library provides
287 @deftypefun int fcloseall (void)
288 This function causes all open streams of the process to be closed and
289 the connection to corresponding files to be broken. All buffered data
290 is written and any buffered inputis discarded. The @code{fcloseall}
291 function returns a value of @code{0} if all the files were closed
292 successfully, and @code{EOF} if an error was detected.
294 This function should be used in only in special situation, e.g., when an
295 error occurred and the program must be aborted. Normally each single
296 stream should be closed separately so that problems with one stream can
297 be identifier. It is also problematic since the standard streams
298 (@pxref{Standard Streams}) will also be closed.
300 The function @code{fcloseall} is declared in @file{stdio.h}.
303 If the @code{main} function to your program returns, or if you call the
304 @code{exit} function (@pxref{Normal Termination}), all open streams are
305 automatically closed properly. If your program terminates in any other
306 manner, such as by calling the @code{abort} function (@pxref{Aborting a
307 Program}) or from a fatal signal (@pxref{Signal Handling}), open streams
308 might not be closed properly. Buffered output might not be flushed and
309 files may be incomplete. For more information on buffering of streams,
310 see @ref{Stream Buffering}.
313 @section Simple Output by Characters or Lines
315 @cindex writing to a stream, by characters
316 This section describes functions for performing character- and
317 line-oriented output.
319 These functions are declared in the header file @file{stdio.h}.
324 @deftypefun int fputc (int @var{c}, FILE *@var{stream})
325 The @code{fputc} function converts the character @var{c} to type
326 @code{unsigned char}, and writes it to the stream @var{stream}.
327 @code{EOF} is returned if a write error occurs; otherwise the
328 character @var{c} is returned.
333 @deftypefun int putc (int @var{c}, FILE *@var{stream})
334 This is just like @code{fputc}, except that most systems implement it as
335 a macro, making it faster. One consequence is that it may evaluate the
336 @var{stream} argument more than once, which is an exception to the
337 general rule for macros. @code{putc} is usually the best function to
338 use for writing a single character.
343 @deftypefun int putchar (int @var{c})
344 The @code{putchar} function is equivalent to @code{putc} with
345 @code{stdout} as the value of the @var{stream} argument.
350 @deftypefun int fputs (const char *@var{s}, FILE *@var{stream})
351 The function @code{fputs} writes the string @var{s} to the stream
352 @var{stream}. The terminating null character is not written.
353 This function does @emph{not} add a newline character, either.
354 It outputs only the characters in the string.
356 This function returns @code{EOF} if a write error occurs, and otherwise
357 a non-negative value.
362 fputs ("Are ", stdout);
363 fputs ("you ", stdout);
364 fputs ("hungry?\n", stdout);
368 outputs the text @samp{Are you hungry?} followed by a newline.
373 @deftypefun int puts (const char *@var{s})
374 The @code{puts} function writes the string @var{s} to the stream
375 @code{stdout} followed by a newline. The terminating null character of
376 the string is not written. (Note that @code{fputs} does @emph{not}
377 write a newline as this function does.)
379 @code{puts} is the most convenient function for printing simple
380 messages. For example:
383 puts ("This is a message.");
389 @deftypefun int putw (int @var{w}, FILE *@var{stream})
390 This function writes the word @var{w} (that is, an @code{int}) to
391 @var{stream}. It is provided for compatibility with SVID, but we
392 recommend you use @code{fwrite} instead (@pxref{Block Input/Output}).
395 @node Character Input
396 @section Character Input
398 @cindex reading from a stream, by characters
399 This section describes functions for performing character-oriented input.
400 These functions are declared in the header file @file{stdio.h}.
403 These functions return an @code{int} value that is either a character of
404 input, or the special value @code{EOF} (usually -1). It is important to
405 store the result of these functions in a variable of type @code{int}
406 instead of @code{char}, even when you plan to use it only as a
407 character. Storing @code{EOF} in a @code{char} variable truncates its
408 value to the size of a character, so that it is no longer
409 distinguishable from the valid character @samp{(char) -1}. So always
410 use an @code{int} for the result of @code{getc} and friends, and check
411 for @code{EOF} after the call; once you've verified that the result is
412 not @code{EOF}, you can be sure that it will fit in a @samp{char}
413 variable without loss of information.
417 @deftypefun int fgetc (FILE *@var{stream})
418 This function reads the next character as an @code{unsigned char} from
419 the stream @var{stream} and returns its value, converted to an
420 @code{int}. If an end-of-file condition or read error occurs,
421 @code{EOF} is returned instead.
426 @deftypefun int getc (FILE *@var{stream})
427 This is just like @code{fgetc}, except that it is permissible (and
428 typical) for it to be implemented as a macro that evaluates the
429 @var{stream} argument more than once. @code{getc} is often highly
430 optimized, so it is usually the best function to use to read a single
436 @deftypefun int getchar (void)
437 The @code{getchar} function is equivalent to @code{getc} with @code{stdin}
438 as the value of the @var{stream} argument.
441 Here is an example of a function that does input using @code{fgetc}. It
442 would work just as well using @code{getc} instead, or using
443 @code{getchar ()} instead of @w{@code{fgetc (stdin)}}.
447 y_or_n_p (const char *question)
449 fputs (question, stdout);
453 /* @r{Write a space to separate answer from question.} */
455 /* @r{Read the first character of the line.}
456 @r{This should be the answer character, but might not be.} */
457 c = tolower (fgetc (stdin));
459 /* @r{Discard rest of input line.} */
460 while (c != '\n' && c != EOF)
462 /* @r{Obey the answer if it was valid.} */
467 /* @r{Answer was invalid: ask for valid answer.} */
468 fputs ("Please answer y or n:", stdout);
475 @deftypefun int getw (FILE *@var{stream})
476 This function reads a word (that is, an @code{int}) from @var{stream}.
477 It's provided for compatibility with SVID. We recommend you use
478 @code{fread} instead (@pxref{Block Input/Output}). Unlike @code{getc},
479 any @code{int} value could be a valid result. @code{getw} returns
480 @code{EOF} when it encounters end-of-file or an error, but there is no
481 way to distinguish this from an input word with value -1.
485 @section Line-Oriented Input
487 Since many programs interpret input on the basis of lines, it's
488 convenient to have functions to read a line of text from a stream.
490 Standard C has functions to do this, but they aren't very safe: null
491 characters and even (for @code{gets}) long lines can confuse them. So
492 the GNU library provides the nonstandard @code{getline} function that
493 makes it easy to read lines reliably.
495 Another GNU extension, @code{getdelim}, generalizes @code{getline}. It
496 reads a delimited record, defined as everything through the next
497 occurrence of a specified delimiter character.
499 All these functions are declared in @file{stdio.h}.
503 @deftypefun ssize_t getline (char **@var{lineptr}, size_t *@var{n}, FILE *@var{stream})
504 This function reads an entire line from @var{stream}, storing the text
505 (including the newline and a terminating null character) in a buffer
506 and storing the buffer address in @code{*@var{lineptr}}.
508 Before calling @code{getline}, you should place in @code{*@var{lineptr}}
509 the address of a buffer @code{*@var{n}} bytes long, allocated with
510 @code{malloc}. If this buffer is long enough to hold the line,
511 @code{getline} stores the line in this buffer. Otherwise,
512 @code{getline} makes the buffer bigger using @code{realloc}, storing the
513 new buffer address back in @code{*@var{lineptr}} and the increased size
514 back in @code{*@var{n}}.
515 @xref{Unconstrained Allocation}.
517 If you set @code{*@var{lineptr}} to a null pointer, and @code{*@var{n}}
518 to zero, before the call, then @code{getline} allocates the initial
519 buffer for you by calling @code{malloc}.
521 In either case, when @code{getline} returns, @code{*@var{lineptr}} is
522 a @code{char *} which points to the text of the line.
524 When @code{getline} is successful, it returns the number of characters
525 read (including the newline, but not including the terminating null).
526 This value enables you to distinguish null characters that are part of
527 the line from the null character inserted as a terminator.
529 This function is a GNU extension, but it is the recommended way to read
530 lines from a stream. The alternative standard functions are unreliable.
532 If an error occurs or end of file is reached, @code{getline} returns
538 @deftypefun ssize_t getdelim (char **@var{lineptr}, size_t *@var{n}, int @var{delimiter}, FILE *@var{stream})
539 This function is like @code{getline} except that the character which
540 tells it to stop reading is not necessarily newline. The argument
541 @var{delimiter} specifies the delimiter character; @code{getdelim} keeps
542 reading until it sees that character (or end of file).
544 The text is stored in @var{lineptr}, including the delimiter character
545 and a terminating null. Like @code{getline}, @code{getdelim} makes
546 @var{lineptr} bigger if it isn't big enough.
548 @code{getline} is in fact implemented in terms of @code{getdelim}, just
553 getline (char **lineptr, size_t *n, FILE *stream)
555 return getdelim (lineptr, n, '\n', stream);
562 @deftypefun {char *} fgets (char *@var{s}, int @var{count}, FILE *@var{stream})
563 The @code{fgets} function reads characters from the stream @var{stream}
564 up to and including a newline character and stores them in the string
565 @var{s}, adding a null character to mark the end of the string. You
566 must supply @var{count} characters worth of space in @var{s}, but the
567 number of characters read is at most @var{count} @minus{} 1. The extra
568 character space is used to hold the null character at the end of the
571 If the system is already at end of file when you call @code{fgets}, then
572 the contents of the array @var{s} are unchanged and a null pointer is
573 returned. A null pointer is also returned if a read error occurs.
574 Otherwise, the return value is the pointer @var{s}.
576 @strong{Warning:} If the input data has a null character, you can't tell.
577 So don't use @code{fgets} unless you know the data cannot contain a null.
578 Don't use it to read files edited by the user because, if the user inserts
579 a null character, you should either handle it properly or print a clear
580 error message. We recommend using @code{getline} instead of @code{fgets}.
585 @deftypefn {Deprecated function} {char *} gets (char *@var{s})
586 The function @code{gets} reads characters from the stream @code{stdin}
587 up to the next newline character, and stores them in the string @var{s}.
588 The newline character is discarded (note that this differs from the
589 behavior of @code{fgets}, which copies the newline character into the
590 string). If @code{gets} encounters a read error or end-of-file, it
591 returns a null pointer; otherwise it returns @var{s}.
593 @strong{Warning:} The @code{gets} function is @strong{very dangerous}
594 because it provides no protection against overflowing the string
595 @var{s}. The GNU library includes it for compatibility only. You
596 should @strong{always} use @code{fgets} or @code{getline} instead. To
597 remind you of this, the linker (if using GNU @code{ld}) will issue a
598 warning whenever you use @code{gets}.
603 @cindex peeking at input
604 @cindex unreading characters
605 @cindex pushing input back
607 In parser programs it is often useful to examine the next character in
608 the input stream without removing it from the stream. This is called
609 ``peeking ahead'' at the input because your program gets a glimpse of
610 the input it will read next.
612 Using stream I/O, you can peek ahead at input by first reading it and
613 then @dfn{unreading} it (also called @dfn{pushing it back} on the stream).
614 Unreading a character makes it available to be input again from the stream,
615 by the next call to @code{fgetc} or other input function on that stream.
618 * Unreading Idea:: An explanation of unreading with pictures.
619 * How Unread:: How to call @code{ungetc} to do unreading.
623 @subsection What Unreading Means
625 Here is a pictorial explanation of unreading. Suppose you have a
626 stream reading a file that contains just six characters, the letters
627 @samp{foobar}. Suppose you have read three characters so far. The
628 situation looks like this:
636 so the next input character will be @samp{b}.
638 @c @group Invalid outside @example
639 If instead of reading @samp{b} you unread the letter @samp{o}, you get a
650 so that the next input characters will be @samp{o} and @samp{b}.
654 If you unread @samp{9} instead of @samp{o}, you get this situation:
664 so that the next input characters will be @samp{9} and @samp{b}.
668 @subsection Using @code{ungetc} To Do Unreading
670 The function to unread a character is called @code{ungetc}, because it
671 reverses the action of @code{getc}.
675 @deftypefun int ungetc (int @var{c}, FILE *@var{stream})
676 The @code{ungetc} function pushes back the character @var{c} onto the
677 input stream @var{stream}. So the next input from @var{stream} will
678 read @var{c} before anything else.
680 If @var{c} is @code{EOF}, @code{ungetc} does nothing and just returns
681 @code{EOF}. This lets you call @code{ungetc} with the return value of
682 @code{getc} without needing to check for an error from @code{getc}.
684 The character that you push back doesn't have to be the same as the last
685 character that was actually read from the stream. In fact, it isn't
686 necessary to actually read any characters from the stream before
687 unreading them with @code{ungetc}! But that is a strange way to write
688 a program; usually @code{ungetc} is used only to unread a character
689 that was just read from the same stream.
691 The GNU C library only supports one character of pushback---in other
692 words, it does not work to call @code{ungetc} twice without doing input
693 in between. Other systems might let you push back multiple characters;
694 then reading from the stream retrieves the characters in the reverse
695 order that they were pushed.
697 Pushing back characters doesn't alter the file; only the internal
698 buffering for the stream is affected. If a file positioning function
699 (such as @code{fseek} or @code{rewind}; @pxref{File Positioning}) is
700 called, any pending pushed-back characters are discarded.
702 Unreading a character on a stream that is at end of file clears the
703 end-of-file indicator for the stream, because it makes the character of
704 input available. After you read that character, trying to read again
705 will encounter end of file.
708 Here is an example showing the use of @code{getc} and @code{ungetc} to
709 skip over whitespace characters. When this function reaches a
710 non-whitespace character, it unreads that character to be seen again on
711 the next read operation on the stream.
718 skip_whitespace (FILE *stream)
722 /* @r{No need to check for @code{EOF} because it is not}
723 @r{@code{isspace}, and @code{ungetc} ignores @code{EOF}.} */
730 @node Block Input/Output
731 @section Block Input/Output
733 This section describes how to do input and output operations on blocks
734 of data. You can use these functions to read and write binary data, as
735 well as to read and write text in fixed-size blocks instead of by
737 @cindex binary I/O to a stream
738 @cindex block I/O to a stream
739 @cindex reading from a stream, by blocks
740 @cindex writing to a stream, by blocks
742 Binary files are typically used to read and write blocks of data in the
743 same format as is used to represent the data in a running program. In
744 other words, arbitrary blocks of memory---not just character or string
745 objects---can be written to a binary file, and meaningfully read in
746 again by the same program.
748 Storing data in binary form is often considerably more efficient than
749 using the formatted I/O functions. Also, for floating-point numbers,
750 the binary form avoids possible loss of precision in the conversion
751 process. On the other hand, binary files can't be examined or modified
752 easily using many standard file utilities (such as text editors), and
753 are not portable between different implementations of the language, or
754 different kinds of computers.
756 These functions are declared in @file{stdio.h}.
761 @deftypefun size_t fread (void *@var{data}, size_t @var{size}, size_t @var{count}, FILE *@var{stream})
762 This function reads up to @var{count} objects of size @var{size} into
763 the array @var{data}, from the stream @var{stream}. It returns the
764 number of objects actually read, which might be less than @var{count} if
765 a read error occurs or the end of the file is reached. This function
766 returns a value of zero (and doesn't read anything) if either @var{size}
767 or @var{count} is zero.
769 If @code{fread} encounters end of file in the middle of an object, it
770 returns the number of complete objects read, and discards the partial
771 object. Therefore, the stream remains at the actual end of the file.
776 @deftypefun size_t fwrite (const void *@var{data}, size_t @var{size}, size_t @var{count}, FILE *@var{stream})
777 This function writes up to @var{count} objects of size @var{size} from
778 the array @var{data}, to the stream @var{stream}. The return value is
779 normally @var{count}, if the call succeeds. Any other value indicates
780 some sort of error, such as running out of space.
783 @node Formatted Output
784 @section Formatted Output
786 @cindex format string, for @code{printf}
787 @cindex template, for @code{printf}
788 @cindex formatted output to a stream
789 @cindex writing to a stream, formatted
790 The functions described in this section (@code{printf} and related
791 functions) provide a convenient way to perform formatted output. You
792 call @code{printf} with a @dfn{format string} or @dfn{template string}
793 that specifies how to format the values of the remaining arguments.
795 Unless your program is a filter that specifically performs line- or
796 character-oriented processing, using @code{printf} or one of the other
797 related functions described in this section is usually the easiest and
798 most concise way to perform output. These functions are especially
799 useful for printing error messages, tables of data, and the like.
802 * Formatted Output Basics:: Some examples to get you started.
803 * Output Conversion Syntax:: General syntax of conversion
805 * Table of Output Conversions:: Summary of output conversions and
807 * Integer Conversions:: Details about formatting of integers.
808 * Floating-Point Conversions:: Details about formatting of
809 floating-point numbers.
810 * Other Output Conversions:: Details about formatting of strings,
811 characters, pointers, and the like.
812 * Formatted Output Functions:: Descriptions of the actual functions.
813 * Dynamic Output:: Functions that allocate memory for the output.
814 * Variable Arguments Output:: @code{vprintf} and friends.
815 * Parsing a Template String:: What kinds of args does a given template
817 * Example of Parsing:: Sample program using @code{parse_printf_format}.
820 @node Formatted Output Basics
821 @subsection Formatted Output Basics
823 The @code{printf} function can be used to print any number of arguments.
824 The template string argument you supply in a call provides
825 information not only about the number of additional arguments, but also
826 about their types and what style should be used for printing them.
828 Ordinary characters in the template string are simply written to the
829 output stream as-is, while @dfn{conversion specifications} introduced by
830 a @samp{%} character in the template cause subsequent arguments to be
831 formatted and written to the output stream. For example,
832 @cindex conversion specifications (@code{printf})
836 char filename[] = "foo.txt";
837 printf ("Processing of `%s' is %d%% finished.\nPlease be patient.\n",
845 Processing of `foo.txt' is 37% finished.
849 This example shows the use of the @samp{%d} conversion to specify that
850 an @code{int} argument should be printed in decimal notation, the
851 @samp{%s} conversion to specify printing of a string argument, and
852 the @samp{%%} conversion to print a literal @samp{%} character.
854 There are also conversions for printing an integer argument as an
855 unsigned value in octal, decimal, or hexadecimal radix (@samp{%o},
856 @samp{%u}, or @samp{%x}, respectively); or as a character value
859 Floating-point numbers can be printed in normal, fixed-point notation
860 using the @samp{%f} conversion or in exponential notation using the
861 @samp{%e} conversion. The @samp{%g} conversion uses either @samp{%e}
862 or @samp{%f} format, depending on what is more appropriate for the
863 magnitude of the particular number.
865 You can control formatting more precisely by writing @dfn{modifiers}
866 between the @samp{%} and the character that indicates which conversion
867 to apply. These slightly alter the ordinary behavior of the conversion.
868 For example, most conversion specifications permit you to specify a
869 minimum field width and a flag indicating whether you want the result
870 left- or right-justified within the field.
872 The specific flags and modifiers that are permitted and their
873 interpretation vary depending on the particular conversion. They're all
874 described in more detail in the following sections. Don't worry if this
875 all seems excessively complicated at first; you can almost always get
876 reasonable free-format output without using any of the modifiers at all.
877 The modifiers are mostly used to make the output look ``prettier'' in
880 @node Output Conversion Syntax
881 @subsection Output Conversion Syntax
883 This section provides details about the precise syntax of conversion
884 specifications that can appear in a @code{printf} template
887 Characters in the template string that are not part of a
888 conversion specification are printed as-is to the output stream.
889 Multibyte character sequences (@pxref{Extended Characters}) are permitted in
892 The conversion specifications in a @code{printf} template string have
896 % @var{flags} @var{width} @r{[} . @var{precision} @r{]} @var{type} @var{conversion}
899 For example, in the conversion specifier @samp{%-10.8ld}, the @samp{-}
900 is a flag, @samp{10} specifies the field width, the precision is
901 @samp{8}, the letter @samp{l} is a type modifier, and @samp{d} specifies
902 the conversion style. (This particular type specifier says to
903 print a @code{long int} argument in decimal notation, with a minimum of
904 8 digits left-justified in a field at least 10 characters wide.)
906 In more detail, output conversion specifications consist of an
907 initial @samp{%} character followed in sequence by:
911 Zero or more @dfn{flag characters} that modify the normal behavior of
912 the conversion specification.
913 @cindex flag character (@code{printf})
916 An optional decimal integer specifying the @dfn{minimum field width}.
917 If the normal conversion produces fewer characters than this, the field
918 is padded with spaces to the specified width. This is a @emph{minimum}
919 value; if the normal conversion produces more characters than this, the
920 field is @emph{not} truncated. Normally, the output is right-justified
922 @cindex minimum field width (@code{printf})
924 You can also specify a field width of @samp{*}. This means that the
925 next argument in the argument list (before the actual value to be
926 printed) is used as the field width. The value must be an @code{int}.
927 If the value is negative, this means to set the @samp{-} flag (see
928 below) and to use the absolute value as the field width.
931 An optional @dfn{precision} to specify the number of digits to be
932 written for the numeric conversions. If the precision is specified, it
933 consists of a period (@samp{.}) followed optionally by a decimal integer
934 (which defaults to zero if omitted).
935 @cindex precision (@code{printf})
937 You can also specify a precision of @samp{*}. This means that the next
938 argument in the argument list (before the actual value to be printed) is
939 used as the precision. The value must be an @code{int}, and is ignored
940 if it is negative. If you specify @samp{*} for both the field width and
941 precision, the field width argument precedes the precision argument.
942 Other C library versions may not recognize this syntax.
945 An optional @dfn{type modifier character}, which is used to specify the
946 data type of the corresponding argument if it differs from the default
947 type. (For example, the integer conversions assume a type of @code{int},
948 but you can specify @samp{h}, @samp{l}, or @samp{L} for other integer
950 @cindex type modifier character (@code{printf})
953 A character that specifies the conversion to be applied.
956 The exact options that are permitted and how they are interpreted vary
957 between the different conversion specifiers. See the descriptions of the
958 individual conversions for information about the particular options that
961 With the @samp{-Wformat} option, the GNU C compiler checks calls to
962 @code{printf} and related functions. It examines the format string and
963 verifies that the correct number and types of arguments are supplied.
964 There is also a GNU C syntax to tell the compiler that a function you
965 write uses a @code{printf}-style format string.
966 @xref{Function Attributes, , Declaring Attributes of Functions,
967 gcc.info, Using GNU CC}, for more information.
969 @node Table of Output Conversions
970 @subsection Table of Output Conversions
971 @cindex output conversions, for @code{printf}
973 Here is a table summarizing what all the different conversions do:
976 @item @samp{%d}, @samp{%i}
977 Print an integer as a signed decimal number. @xref{Integer
978 Conversions}, for details. @samp{%d} and @samp{%i} are synonymous for
979 output, but are different when used with @code{scanf} for input
980 (@pxref{Table of Input Conversions}).
983 Print an integer as an unsigned octal number. @xref{Integer
984 Conversions}, for details.
987 Print an integer as an unsigned decimal number. @xref{Integer
988 Conversions}, for details.
990 @item @samp{%x}, @samp{%X}
991 Print an integer as an unsigned hexadecimal number. @samp{%x} uses
992 lower-case letters and @samp{%X} uses upper-case. @xref{Integer
993 Conversions}, for details.
996 Print a floating-point number in normal (fixed-point) notation.
997 @xref{Floating-Point Conversions}, for details.
999 @item @samp{%e}, @samp{%E}
1000 Print a floating-point number in exponential notation. @samp{%e} uses
1001 lower-case letters and @samp{%E} uses upper-case. @xref{Floating-Point
1002 Conversions}, for details.
1004 @item @samp{%g}, @samp{%G}
1005 Print a floating-point number in either normal or exponential notation,
1006 whichever is more appropriate for its magnitude. @samp{%g} uses
1007 lower-case letters and @samp{%G} uses upper-case. @xref{Floating-Point
1008 Conversions}, for details.
1011 Print a single character. @xref{Other Output Conversions}.
1014 Print a string. @xref{Other Output Conversions}.
1017 Print the value of a pointer. @xref{Other Output Conversions}.
1020 Get the number of characters printed so far. @xref{Other Output Conversions}.
1021 Note that this conversion specification never produces any output.
1024 Print the string corresponding to the value of @code{errno}.
1025 (This is a GNU extension.)
1026 @xref{Other Output Conversions}.
1029 Print a literal @samp{%} character. @xref{Other Output Conversions}.
1032 If the syntax of a conversion specification is invalid, unpredictable
1033 things will happen, so don't do this. If there aren't enough function
1034 arguments provided to supply values for all the conversion
1035 specifications in the template string, or if the arguments are not of
1036 the correct types, the results are unpredictable. If you supply more
1037 arguments than conversion specifications, the extra argument values are
1038 simply ignored; this is sometimes useful.
1040 @node Integer Conversions
1041 @subsection Integer Conversions
1043 This section describes the options for the @samp{%d}, @samp{%i},
1044 @samp{%o}, @samp{%u}, @samp{%x}, and @samp{%X} conversion
1045 specifications. These conversions print integers in various formats.
1047 The @samp{%d} and @samp{%i} conversion specifications both print an
1048 @code{int} argument as a signed decimal number; while @samp{%o},
1049 @samp{%u}, and @samp{%x} print the argument as an unsigned octal,
1050 decimal, or hexadecimal number (respectively). The @samp{%X} conversion
1051 specification is just like @samp{%x} except that it uses the characters
1052 @samp{ABCDEF} as digits instead of @samp{abcdef}.
1054 The following flags are meaningful:
1058 Left-justify the result in the field (instead of the normal
1059 right-justification).
1062 For the signed @samp{%d} and @samp{%i} conversions, print a
1063 plus sign if the value is positive.
1066 For the signed @samp{%d} and @samp{%i} conversions, if the result
1067 doesn't start with a plus or minus sign, prefix it with a space
1068 character instead. Since the @samp{+} flag ensures that the result
1069 includes a sign, this flag is ignored if you supply both of them.
1072 For the @samp{%o} conversion, this forces the leading digit to be
1073 @samp{0}, as if by increasing the precision. For @samp{%x} or
1074 @samp{%X}, this prefixes a leading @samp{0x} or @samp{0X} (respectively)
1075 to the result. This doesn't do anything useful for the @samp{%d},
1076 @samp{%i}, or @samp{%u} conversions. Using this flag produces output
1077 which can be parsed by the @code{strtoul} function (@pxref{Parsing of
1078 Integers}) and @code{scanf} with the @samp{%i} conversion
1079 (@pxref{Numeric Input Conversions}).
1082 Separate the digits into groups as specified by the locale specified for
1083 the @code{LC_NUMERIC} category; @pxref{General Numeric}. This flag is a
1087 Pad the field with zeros instead of spaces. The zeros are placed after
1088 any indication of sign or base. This flag is ignored if the @samp{-}
1089 flag is also specified, or if a precision is specified.
1092 If a precision is supplied, it specifies the minimum number of digits to
1093 appear; leading zeros are produced if necessary. If you don't specify a
1094 precision, the number is printed with as many digits as it needs. If
1095 you convert a value of zero with an explicit precision of zero, then no
1096 characters at all are produced.
1098 Without a type modifier, the corresponding argument is treated as an
1099 @code{int} (for the signed conversions @samp{%i} and @samp{%d}) or
1100 @code{unsigned int} (for the unsigned conversions @samp{%o}, @samp{%u},
1101 @samp{%x}, and @samp{%X}). Recall that since @code{printf} and friends
1102 are variadic, any @code{char} and @code{short} arguments are
1103 automatically converted to @code{int} by the default argument
1104 promotions. For arguments of other integer types, you can use these
1109 Specifies that the argument is a @code{short int} or @code{unsigned
1110 short int}, as appropriate. A @code{short} argument is converted to an
1111 @code{int} or @code{unsigned int} by the default argument promotions
1112 anyway, but the @samp{h} modifier says to convert it back to a
1116 Specifies that the argument is a @code{long int} or @code{unsigned long
1117 int}, as appropriate. Two @samp{l} characters is like the @samp{L}
1123 Specifies that the argument is a @code{long long int}. (This type is
1124 an extension supported by the GNU C compiler. On systems that don't
1125 support extra-long integers, this is the same as @code{long int}.)
1127 The @samp{q} modifier is another name for the same thing, which comes
1128 from 4.4 BSD; a @w{@code{long long int}} is sometimes called a ``quad''
1132 Specifies that the argument is a @code{size_t}. This is a GNU extension.
1135 Here is an example. Using the template string:
1138 "|%5d|%-5d|%+5d|%+-5d|% 5d|%05d|%5.0d|%5.2d|%d|\n"
1142 to print numbers using the different options for the @samp{%d}
1143 conversion gives results like:
1146 | 0|0 | +0|+0 | 0|00000| | 00|0|
1147 | 1|1 | +1|+1 | 1|00001| 1| 01|1|
1148 | -1|-1 | -1|-1 | -1|-0001| -1| -01|-1|
1149 |100000|100000|+100000| 100000|100000|100000|100000|100000|
1152 In particular, notice what happens in the last case where the number
1153 is too large to fit in the minimum field width specified.
1155 Here are some more examples showing how unsigned integers print under
1156 various format options, using the template string:
1159 "|%5u|%5o|%5x|%5X|%#5o|%#5x|%#5X|%#10.8x|\n"
1163 | 0| 0| 0| 0| 0| 0x0| 0X0|0x00000000|
1164 | 1| 1| 1| 1| 01| 0x1| 0X1|0x00000001|
1165 |100000|303240|186a0|186A0|0303240|0x186a0|0X186A0|0x000186a0|
1169 @node Floating-Point Conversions
1170 @subsection Floating-Point Conversions
1172 This section discusses the conversion specifications for floating-point
1173 numbers: the @samp{%f}, @samp{%e}, @samp{%E}, @samp{%g}, and @samp{%G}
1176 The @samp{%f} conversion prints its argument in fixed-point notation,
1177 producing output of the form
1178 @w{[@code{-}]@var{ddd}@code{.}@var{ddd}},
1179 where the number of digits following the decimal point is controlled
1180 by the precision you specify.
1182 The @samp{%e} conversion prints its argument in exponential notation,
1183 producing output of the form
1184 @w{[@code{-}]@var{d}@code{.}@var{ddd}@code{e}[@code{+}|@code{-}]@var{dd}}.
1185 Again, the number of digits following the decimal point is controlled by
1186 the precision. The exponent always contains at least two digits. The
1187 @samp{%E} conversion is similar but the exponent is marked with the letter
1188 @samp{E} instead of @samp{e}.
1190 The @samp{%g} and @samp{%G} conversions print the argument in the style
1191 of @samp{%e} or @samp{%E} (respectively) if the exponent would be less
1192 than -4 or greater than or equal to the precision; otherwise they use the
1193 @samp{%f} style. Trailing zeros are removed from the fractional portion
1194 of the result and a decimal-point character appears only if it is
1195 followed by a digit.
1197 The following flags can be used to modify the behavior:
1199 @comment We use @asis instead of @samp so we can have ` ' as an item.
1202 Left-justify the result in the field. Normally the result is
1206 Always include a plus or minus sign in the result.
1209 If the result doesn't start with a plus or minus sign, prefix it with a
1210 space instead. Since the @samp{+} flag ensures that the result includes
1211 a sign, this flag is ignored if you supply both of them.
1214 Specifies that the result should always include a decimal point, even
1215 if no digits follow it. For the @samp{%g} and @samp{%G} conversions,
1216 this also forces trailing zeros after the decimal point to be left
1217 in place where they would otherwise be removed.
1220 Separate the digits of the integer part of the result into groups as
1221 specified by the locale specified for the @code{LC_NUMERIC} category;
1222 @pxref{General Numeric}. This flag is a GNU extension.
1225 Pad the field with zeros instead of spaces; the zeros are placed
1226 after any sign. This flag is ignored if the @samp{-} flag is also
1230 The precision specifies how many digits follow the decimal-point
1231 character for the @samp{%f}, @samp{%e}, and @samp{%E} conversions. For
1232 these conversions, the default precision is @code{6}. If the precision
1233 is explicitly @code{0}, this suppresses the decimal point character
1234 entirely. For the @samp{%g} and @samp{%G} conversions, the precision
1235 specifies how many significant digits to print. Significant digits are
1236 the first digit before the decimal point, and all the digits after it.
1237 If the precision @code{0} or not specified for @samp{%g} or @samp{%G},
1238 it is treated like a value of @code{1}. If the value being printed
1239 cannot be expressed accurately in the specified number of digits, the
1240 value is rounded to the nearest number that fits.
1242 Without a type modifier, the floating-point conversions use an argument
1243 of type @code{double}. (By the default argument promotions, any
1244 @code{float} arguments are automatically converted to @code{double}.)
1245 The following type modifier is supported:
1249 An uppercase @samp{L} specifies that the argument is a @code{long
1253 Here are some examples showing how numbers print using the various
1254 floating-point conversions. All of the numbers were printed using
1255 this template string:
1258 "|%12.4f|%12.4e|%12.4g|\n"
1264 | 0.0000| 0.0000e+00| 0|
1265 | 1.0000| 1.0000e+00| 1|
1266 | -1.0000| -1.0000e+00| -1|
1267 | 100.0000| 1.0000e+02| 100|
1268 | 1000.0000| 1.0000e+03| 1000|
1269 | 10000.0000| 1.0000e+04| 1e+04|
1270 | 12345.0000| 1.2345e+04| 1.234e+04|
1271 | 100000.0000| 1.0000e+05| 1e+05|
1272 | 123456.0000| 1.2346e+05| 1.234e+05|
1275 Notice how the @samp{%g} conversion drops trailing zeros.
1277 @node Other Output Conversions
1278 @subsection Other Output Conversions
1280 This section describes miscellaneous conversions for @code{printf}.
1282 The @samp{%c} conversion prints a single character. The @code{int}
1283 argument is first converted to an @code{unsigned char}. The @samp{-}
1284 flag can be used to specify left-justification in the field, but no
1285 other flags are defined, and no precision or type modifier can be given.
1289 printf ("%c%c%c%c%c", 'h', 'e', 'l', 'l', 'o');
1293 prints @samp{hello}.
1295 The @samp{%s} conversion prints a string. The corresponding argument
1296 must be of type @code{char *} (or @code{const char *}). A precision can
1297 be specified to indicate the maximum number of characters to write;
1298 otherwise characters in the string up to but not including the
1299 terminating null character are written to the output stream. The
1300 @samp{-} flag can be used to specify left-justification in the field,
1301 but no other flags or type modifiers are defined for this conversion.
1305 printf ("%3s%-6s", "no", "where");
1309 prints @samp{ nowhere }.
1311 If you accidentally pass a null pointer as the argument for a @samp{%s}
1312 conversion, the GNU library prints it as @samp{(null)}. We think this
1313 is more useful than crashing. But it's not good practice to pass a null
1314 argument intentionally.
1316 The @samp{%m} conversion prints the string corresponding to the error
1317 code in @code{errno}. @xref{Error Messages}. Thus:
1320 fprintf (stderr, "can't open `%s': %m\n", filename);
1327 fprintf (stderr, "can't open `%s': %s\n", filename, strerror (errno));
1331 The @samp{%m} conversion is a GNU C library extension.
1333 The @samp{%p} conversion prints a pointer value. The corresponding
1334 argument must be of type @code{void *}. In practice, you can use any
1337 In the GNU system, non-null pointers are printed as unsigned integers,
1338 as if a @samp{%#x} conversion were used. Null pointers print as
1339 @samp{(nil)}. (Pointers might print differently in other systems.)
1344 printf ("%p", "testing");
1348 prints @samp{0x} followed by a hexadecimal number---the address of the
1349 string constant @code{"testing"}. It does not print the word
1352 You can supply the @samp{-} flag with the @samp{%p} conversion to
1353 specify left-justification, but no other flags, precision, or type
1354 modifiers are defined.
1356 The @samp{%n} conversion is unlike any of the other output conversions.
1357 It uses an argument which must be a pointer to an @code{int}, but
1358 instead of printing anything it stores the number of characters printed
1359 so far by this call at that location. The @samp{h} and @samp{l} type
1360 modifiers are permitted to specify that the argument is of type
1361 @code{short int *} or @code{long int *} instead of @code{int *}, but no
1362 flags, field width, or precision are permitted.
1368 printf ("%d %s%n\n", 3, "bears", &nchar);
1379 and sets @code{nchar} to @code{7}, because @samp{3 bears} is seven
1383 The @samp{%%} conversion prints a literal @samp{%} character. This
1384 conversion doesn't use an argument, and no flags, field width,
1385 precision, or type modifiers are permitted.
1388 @node Formatted Output Functions
1389 @subsection Formatted Output Functions
1391 This section describes how to call @code{printf} and related functions.
1392 Prototypes for these functions are in the header file @file{stdio.h}.
1393 Because these functions take a variable number of arguments, you
1394 @emph{must} declare prototypes for them before using them. Of course,
1395 the easiest way to make sure you have all the right prototypes is to
1396 just include @file{stdio.h}.
1401 @deftypefun int printf (const char *@var{template}, @dots{})
1402 The @code{printf} function prints the optional arguments under the
1403 control of the template string @var{template} to the stream
1404 @code{stdout}. It returns the number of characters printed, or a
1405 negative value if there was an output error.
1410 @deftypefun int fprintf (FILE *@var{stream}, const char *@var{template}, @dots{})
1411 This function is just like @code{printf}, except that the output is
1412 written to the stream @var{stream} instead of @code{stdout}.
1417 @deftypefun int sprintf (char *@var{s}, const char *@var{template}, @dots{})
1418 This is like @code{printf}, except that the output is stored in the character
1419 array @var{s} instead of written to a stream. A null character is written
1420 to mark the end of the string.
1422 The @code{sprintf} function returns the number of characters stored in
1423 the array @var{s}, not including the terminating null character.
1425 The behavior of this function is undefined if copying takes place
1426 between objects that overlap---for example, if @var{s} is also given
1427 as an argument to be printed under control of the @samp{%s} conversion.
1428 @xref{Copying and Concatenation}.
1430 @strong{Warning:} The @code{sprintf} function can be @strong{dangerous}
1431 because it can potentially output more characters than can fit in the
1432 allocation size of the string @var{s}. Remember that the field width
1433 given in a conversion specification is only a @emph{minimum} value.
1435 To avoid this problem, you can use @code{snprintf} or @code{asprintf},
1441 @deftypefun int snprintf (char *@var{s}, size_t @var{size}, const char *@var{template}, @dots{})
1442 The @code{snprintf} function is similar to @code{sprintf}, except that
1443 the @var{size} argument specifies the maximum number of characters to
1444 produce. The trailing null character is counted towards this limit, so
1445 you should allocate at least @var{size} characters for the string @var{s}.
1447 The return value is the number of characters stored, not including the
1448 terminating null. If this value equals @code{@var{size} - 1}, then
1449 there was not enough space in @var{s} for all the output. You should
1450 try again with a bigger output string. Here is an example of doing
1455 /* @r{Construct a message describing the value of a variable}
1456 @r{whose name is @var{name} and whose value is @var{value}.} */
1458 make_message (char *name, char *value)
1460 /* @r{Guess we need no more than 100 chars of space.} */
1462 char *buffer = (char *) xmalloc (size);
1467 /* @r{Try to print in the allocated space.} */
1468 int nchars = snprintf (buffer, size,
1469 "value of %s is %s",
1471 /* @r{If that worked, return the string.} */
1474 /* @r{Else try again with twice as much space.} */
1476 buffer = (char *) xrealloc (size, buffer);
1482 In practice, it is often easier just to use @code{asprintf}, below.
1485 @node Dynamic Output
1486 @subsection Dynamically Allocating Formatted Output
1488 The functions in this section do formatted output and place the results
1489 in dynamically allocated memory.
1493 @deftypefun int asprintf (char **@var{ptr}, const char *@var{template}, @dots{})
1494 This function is similar to @code{sprintf}, except that it dynamically
1495 allocates a string (as with @code{malloc}; @pxref{Unconstrained
1496 Allocation}) to hold the output, instead of putting the output in a
1497 buffer you allocate in advance. The @var{ptr} argument should be the
1498 address of a @code{char *} object, and @code{asprintf} stores a pointer
1499 to the newly allocated string at that location.
1501 Here is how to use @code{asprintf} to get the same result as the
1502 @code{snprintf} example, but more easily:
1505 /* @r{Construct a message describing the value of a variable}
1506 @r{whose name is @var{name} and whose value is @var{value}.} */
1508 make_message (char *name, char *value)
1511 asprintf (&result, "value of %s is %s", name, value);
1519 @deftypefun int obstack_printf (struct obstack *@var{obstack}, const char *@var{template}, @dots{})
1520 This function is similar to @code{asprintf}, except that it uses the
1521 obstack @var{obstack} to allocate the space. @xref{Obstacks}.
1523 The characters are written onto the end of the current object.
1524 To get at them, you must finish the object with @code{obstack_finish}
1525 (@pxref{Growing Objects}).@refill
1528 @node Variable Arguments Output
1529 @subsection Variable Arguments Output Functions
1531 The functions @code{vprintf} and friends are provided so that you can
1532 define your own variadic @code{printf}-like functions that make use of
1533 the same internals as the built-in formatted output functions.
1535 The most natural way to define such functions would be to use a language
1536 construct to say, ``Call @code{printf} and pass this template plus all
1537 of my arguments after the first five.'' But there is no way to do this
1538 in C, and it would be hard to provide a way, since at the C language
1539 level there is no way to tell how many arguments your function received.
1541 Since that method is impossible, we provide alternative functions, the
1542 @code{vprintf} series, which lets you pass a @code{va_list} to describe
1543 ``all of my arguments after the first five.''
1545 When it is sufficient to define a macro rather than a real function,
1546 the GNU C compiler provides a way to do this much more easily with macros.
1550 #define myprintf(a, b, c, d, e, rest...) printf (mytemplate , ## rest...)
1554 @xref{Macro Varargs, , Macros with Variable Numbers of Arguments,
1555 gcc.info, Using GNU CC}, for details. But this is limited to macros,
1556 and does not apply to real functions at all.
1558 Before calling @code{vprintf} or the other functions listed in this
1559 section, you @emph{must} call @code{va_start} (@pxref{Variadic
1560 Functions}) to initialize a pointer to the variable arguments. Then you
1561 can call @code{va_arg} to fetch the arguments that you want to handle
1562 yourself. This advances the pointer past those arguments.
1564 Once your @code{va_list} pointer is pointing at the argument of your
1565 choice, you are ready to call @code{vprintf}. That argument and all
1566 subsequent arguments that were passed to your function are used by
1567 @code{vprintf} along with the template that you specified separately.
1569 In some other systems, the @code{va_list} pointer may become invalid
1570 after the call to @code{vprintf}, so you must not use @code{va_arg}
1571 after you call @code{vprintf}. Instead, you should call @code{va_end}
1572 to retire the pointer from service. However, you can safely call
1573 @code{va_start} on another pointer variable and begin fetching the
1574 arguments again through that pointer. Calling @code{vprintf} does not
1575 destroy the argument list of your function, merely the particular
1576 pointer that you passed to it.
1578 GNU C does not have such restrictions. You can safely continue to fetch
1579 arguments from a @code{va_list} pointer after passing it to
1580 @code{vprintf}, and @code{va_end} is a no-op. (Note, however, that
1581 subsequent @code{va_arg} calls will fetch the same arguments which
1582 @code{vprintf} previously used.)
1584 Prototypes for these functions are declared in @file{stdio.h}.
1589 @deftypefun int vprintf (const char *@var{template}, va_list @var{ap})
1590 This function is similar to @code{printf} except that, instead of taking
1591 a variable number of arguments directly, it takes an argument list
1597 @deftypefun int vfprintf (FILE *@var{stream}, const char *@var{template}, va_list @var{ap})
1598 This is the equivalent of @code{fprintf} with the variable argument list
1599 specified directly as for @code{vprintf}.
1604 @deftypefun int vsprintf (char *@var{s}, const char *@var{template}, va_list @var{ap})
1605 This is the equivalent of @code{sprintf} with the variable argument list
1606 specified directly as for @code{vprintf}.
1611 @deftypefun int vsnprintf (char *@var{s}, size_t @var{size}, const char *@var{template}, va_list @var{ap})
1612 This is the equivalent of @code{snprintf} with the variable argument list
1613 specified directly as for @code{vprintf}.
1618 @deftypefun int vasprintf (char **@var{ptr}, const char *@var{template}, va_list @var{ap})
1619 The @code{vasprintf} function is the equivalent of @code{asprintf} with the
1620 variable argument list specified directly as for @code{vprintf}.
1625 @deftypefun int obstack_vprintf (struct obstack *@var{obstack}, const char *@var{template}, va_list @var{ap})
1626 The @code{obstack_vprintf} function is the equivalent of
1627 @code{obstack_printf} with the variable argument list specified directly
1628 as for @code{vprintf}.@refill
1631 Here's an example showing how you might use @code{vfprintf}. This is a
1632 function that prints error messages to the stream @code{stderr}, along
1633 with a prefix indicating the name of the program
1634 (@pxref{Error Messages}, for a description of
1635 @code{program_invocation_short_name}).
1643 eprintf (const char *template, ...)
1646 extern char *program_invocation_short_name;
1648 fprintf (stderr, "%s: ", program_invocation_short_name);
1649 va_start (ap, count);
1650 vfprintf (stderr, template, ap);
1657 You could call @code{eprintf} like this:
1660 eprintf ("file `%s' does not exist\n", filename);
1663 In GNU C, there is a special construct you can use to let the compiler
1664 know that a function uses a @code{printf}-style format string. Then it
1665 can check the number and types of arguments in each call to the
1666 function, and warn you when they do not match the format string.
1667 For example, take this declaration of @code{eprintf}:
1670 void eprintf (const char *template, ...)
1671 __attribute__ ((format (printf, 1, 2)));
1675 This tells the compiler that @code{eprintf} uses a format string like
1676 @code{printf} (as opposed to @code{scanf}; @pxref{Formatted Input});
1677 the format string appears as the first argument;
1678 and the arguments to satisfy the format begin with the second.
1679 @xref{Function Attributes, , Declaring Attributes of Functions,
1680 gcc.info, Using GNU CC}, for more information.
1682 @node Parsing a Template String
1683 @subsection Parsing a Template String
1684 @cindex parsing a template string
1686 You can use the function @code{parse_printf_format} to obtain
1687 information about the number and types of arguments that are expected by
1688 a given template string. This function permits interpreters that
1689 provide interfaces to @code{printf} to avoid passing along invalid
1690 arguments from the user's program, which could cause a crash.
1692 All the symbols described in this section are declared in the header
1693 file @file{printf.h}.
1697 @deftypefun size_t parse_printf_format (const char *@var{template}, size_t @var{n}, int *@var{argtypes})
1698 This function returns information about the number and types of
1699 arguments expected by the @code{printf} template string @var{template}.
1700 The information is stored in the array @var{argtypes}; each element of
1701 this array describes one argument. This information is encoded using
1702 the various @samp{PA_} macros, listed below.
1704 The @var{n} argument specifies the number of elements in the array
1705 @var{argtypes}. This is the most elements that
1706 @code{parse_printf_format} will try to write.
1708 @code{parse_printf_format} returns the total number of arguments required
1709 by @var{template}. If this number is greater than @var{n}, then the
1710 information returned describes only the first @var{n} arguments. If you
1711 want information about more than that many arguments, allocate a bigger
1712 array and call @code{parse_printf_format} again.
1715 The argument types are encoded as a combination of a basic type and
1720 @deftypevr Macro int PA_FLAG_MASK
1721 This macro is a bitmask for the type modifier flag bits. You can write
1722 the expression @code{(argtypes[i] & PA_FLAG_MASK)} to extract just the
1723 flag bits for an argument, or @code{(argtypes[i] & ~PA_FLAG_MASK)} to
1724 extract just the basic type code.
1727 Here are symbolic constants that represent the basic types; they stand
1735 This specifies that the base type is @code{int}.
1741 This specifies that the base type is @code{int}, cast to @code{char}.
1747 This specifies that the base type is @code{char *}, a null-terminated string.
1753 This specifies that the base type is @code{void *}, an arbitrary pointer.
1759 This specifies that the base type is @code{float}.
1765 This specifies that the base type is @code{double}.
1771 You can define additional base types for your own programs as offsets
1772 from @code{PA_LAST}. For example, if you have data types @samp{foo}
1773 and @samp{bar} with their own specialized @code{printf} conversions,
1774 you could define encodings for these types as:
1777 #define PA_FOO PA_LAST
1778 #define PA_BAR (PA_LAST + 1)
1782 Here are the flag bits that modify a basic type. They are combined with
1783 the code for the basic type using inclusive-or.
1790 If this bit is set, it indicates that the encoded type is a pointer to
1791 the base type, rather than an immediate value.
1792 For example, @samp{PA_INT|PA_FLAG_PTR} represents the type @samp{int *}.
1797 @vindex PA_FLAG_SHORT
1798 If this bit is set, it indicates that the base type is modified with
1799 @code{short}. (This corresponds to the @samp{h} type modifier.)
1804 @vindex PA_FLAG_LONG
1805 If this bit is set, it indicates that the base type is modified with
1806 @code{long}. (This corresponds to the @samp{l} type modifier.)
1810 @item PA_FLAG_LONG_LONG
1811 @vindex PA_FLAG_LONG_LONG
1812 If this bit is set, it indicates that the base type is modified with
1813 @code{long long}. (This corresponds to the @samp{L} type modifier.)
1817 @item PA_FLAG_LONG_DOUBLE
1818 @vindex PA_FLAG_LONG_DOUBLE
1819 This is a synonym for @code{PA_FLAG_LONG_LONG}, used by convention with
1820 a base type of @code{PA_DOUBLE} to indicate a type of @code{long double}.
1824 For an example of using these facilities, see @ref{Example of Parsing}.
1827 @node Example of Parsing
1828 @subsection Example of Parsing a Template String
1830 Here is an example of decoding argument types for a format string. We
1831 assume this is part of an interpreter which contains arguments of type
1832 @code{NUMBER}, @code{CHAR}, @code{STRING} and @code{STRUCTURE} (and
1833 perhaps others which are not valid here).
1836 /* @r{Test whether the @var{nargs} specified objects}
1837 @r{in the vector @var{args} are valid}
1838 @r{for the format string @var{format}:}
1839 @r{if so, return 1.}
1840 @r{If not, return 0 after printing an error message.} */
1843 validate_args (char *format, int nargs, OBJECT *args)
1848 /* @r{Get the information about the arguments.}
1849 @r{Each conversion specification must be at least two characters}
1850 @r{long, so there cannot be more specifications than half the}
1851 @r{length of the string.} */
1853 argtypes = (int *) alloca (strlen (format) / 2 * sizeof (int));
1854 nwanted = parse_printf_format (string, nelts, argtypes);
1856 /* @r{Check the number of arguments.} */
1857 if (nwanted > nargs)
1859 error ("too few arguments (at least %d required)", nwanted);
1863 /* @r{Check the C type wanted for each argument}
1864 @r{and see if the object given is suitable.} */
1865 for (i = 0; i < nwanted; i++)
1869 if (argtypes[i] & PA_FLAG_PTR)
1872 switch (argtypes[i] & ~PA_FLAG_MASK)
1889 if (TYPE (args[i]) != wanted)
1891 error ("type mismatch for arg number %d", i);
1899 @node Customizing Printf
1900 @section Customizing @code{printf}
1901 @cindex customizing @code{printf}
1902 @cindex defining new @code{printf} conversions
1903 @cindex extending @code{printf}
1905 The GNU C library lets you define your own custom conversion specifiers
1906 for @code{printf} template strings, to teach @code{printf} clever ways
1907 to print the important data structures of your program.
1909 The way you do this is by registering the conversion with the function
1910 @code{register_printf_function}; see @ref{Registering New Conversions}.
1911 One of the arguments you pass to this function is a pointer to a handler
1912 function that produces the actual output; see @ref{Defining the Output
1913 Handler}, for information on how to write this function.
1915 You can also install a function that just returns information about the
1916 number and type of arguments expected by the conversion specifier.
1917 @xref{Parsing a Template String}, for information about this.
1919 The facilities of this section are declared in the header file
1923 * Registering New Conversions:: Using @code{register_printf_function}
1924 to register a new output conversion.
1925 * Conversion Specifier Options:: The handler must be able to get
1926 the options specified in the
1927 template when it is called.
1928 * Defining the Output Handler:: Defining the handler and arginfo
1929 functions that are passed as arguments
1930 to @code{register_printf_function}.
1931 * Printf Extension Example:: How to define a @code{printf}
1935 @strong{Portability Note:} The ability to extend the syntax of
1936 @code{printf} template strings is a GNU extension. ISO standard C has
1939 @node Registering New Conversions
1940 @subsection Registering New Conversions
1942 The function to register a new output conversion is
1943 @code{register_printf_function}, declared in @file{printf.h}.
1948 @deftypefun int register_printf_function (int @var{spec}, printf_function @var{handler-function}, printf_arginfo_function @var{arginfo-function})
1949 This function defines the conversion specifier character @var{spec}.
1950 Thus, if @var{spec} is @code{'z'}, it defines the conversion @samp{%z}.
1951 You can redefine the built-in conversions like @samp{%s}, but flag
1952 characters like @samp{#} and type modifiers like @samp{l} can never be
1953 used as conversions; calling @code{register_printf_function} for those
1954 characters has no effect.
1956 The @var{handler-function} is the function called by @code{printf} and
1957 friends when this conversion appears in a template string.
1958 @xref{Defining the Output Handler}, for information about how to define
1959 a function to pass as this argument. If you specify a null pointer, any
1960 existing handler function for @var{spec} is removed.
1962 The @var{arginfo-function} is the function called by
1963 @code{parse_printf_format} when this conversion appears in a
1964 template string. @xref{Parsing a Template String}, for information
1967 @c The following is not true anymore. The `parse_printf_format' function
1968 @c is now also called from `vfprintf' via `parse_one_spec'.
1969 @c --drepper@gnu, 1996/11/14
1971 @c Normally, you install both functions for a conversion at the same time,
1972 @c but if you are never going to call @code{parse_printf_format}, you do
1973 @c not need to define an arginfo function.
1975 @strong{Attention:} In the GNU C library version before 2.0 the
1976 @var{arginfo-function} function did not need to be installed unless
1977 the user uses the @code{parse_printf_format} function. This changed.
1978 Now a call to any of the @code{printf} functions will call this
1979 function when this format specifier appears in the format string.
1981 The return value is @code{0} on success, and @code{-1} on failure
1982 (which occurs if @var{spec} is out of range).
1984 You can redefine the standard output conversions, but this is probably
1985 not a good idea because of the potential for confusion. Library routines
1986 written by other people could break if you do this.
1989 @node Conversion Specifier Options
1990 @subsection Conversion Specifier Options
1992 If you define a meaning for @samp{%A}, what if the template contains
1993 @samp{%+23A} or @samp{%-#A}? To implement a sensible meaning for these,
1994 the handler when called needs to be able to get the options specified in
1997 Both the @var{handler-function} and @var{arginfo-function} arguments
1998 to @code{register_printf_function} accept an argument that points to a
1999 @code{struct printf_info}, which contains information about the options
2000 appearing in an instance of the conversion specifier. This data type
2001 is declared in the header file @file{printf.h}.
2006 @deftp {Type} {struct printf_info}
2007 This structure is used to pass information about the options appearing
2008 in an instance of a conversion specifier in a @code{printf} template
2009 string to the handler and arginfo functions for that specifier. It
2010 contains the following members:
2014 This is the precision specified. The value is @code{-1} if no precision
2015 was specified. If the precision was given as @samp{*}, the
2016 @code{printf_info} structure passed to the handler function contains the
2017 actual value retrieved from the argument list. But the structure passed
2018 to the arginfo function contains a value of @code{INT_MIN}, since the
2019 actual value is not known.
2022 This is the minimum field width specified. The value is @code{0} if no
2023 width was specified. If the field width was given as @samp{*}, the
2024 @code{printf_info} structure passed to the handler function contains the
2025 actual value retrieved from the argument list. But the structure passed
2026 to the arginfo function contains a value of @code{INT_MIN}, since the
2027 actual value is not known.
2030 This is the conversion specifier character specified. It's stored in
2031 the structure so that you can register the same handler function for
2032 multiple characters, but still have a way to tell them apart when the
2033 handler function is called.
2035 @item unsigned int is_long_double
2036 This is a boolean that is true if the @samp{L}, @samp{ll}, or @samp{q}
2037 type modifier was specified. For integer conversions, this indicates
2038 @code{long long int}, as opposed to @code{long double} for floating
2041 @item unsigned int is_short
2042 This is a boolean that is true if the @samp{h} type modifier was specified.
2044 @item unsigned int is_long
2045 This is a boolean that is true if the @samp{l} type modifier was specified.
2047 @item unsigned int alt
2048 This is a boolean that is true if the @samp{#} flag was specified.
2050 @item unsigned int space
2051 This is a boolean that is true if the @samp{ } flag was specified.
2053 @item unsigned int left
2054 This is a boolean that is true if the @samp{-} flag was specified.
2056 @item unsigned int showsign
2057 This is a boolean that is true if the @samp{+} flag was specified.
2059 @item unsigned int group
2060 This is a boolean that is true if the @samp{'} flag was specified.
2062 @item unsigned int extra
2063 This flag has a special meaning depending on the context. It could
2064 be used freely by the user-defined handlers but when called from
2065 the @code{printf} function this variable always contains the value
2069 This is the character to use for padding the output to the minimum field
2070 width. The value is @code{'0'} if the @samp{0} flag was specified, and
2071 @code{' '} otherwise.
2076 @node Defining the Output Handler
2077 @subsection Defining the Output Handler
2079 Now let's look at how to define the handler and arginfo functions
2080 which are passed as arguments to @code{register_printf_function}.
2082 @strong{Compatibility Note:} The interface change in the GNU libc
2083 version 2.0. Previously the third argument was of type
2086 You should define your handler functions with a prototype like:
2089 int @var{function} (FILE *stream, const struct printf_info *info,
2090 const void *const *args)
2093 The @var{stream} argument passed to the handler function is the stream to
2094 which it should write output.
2096 The @var{info} argument is a pointer to a structure that contains
2097 information about the various options that were included with the
2098 conversion in the template string. You should not modify this structure
2099 inside your handler function. @xref{Conversion Specifier Options}, for
2100 a description of this data structure.
2102 @c The following changes some time back. --drepper@gnu, 1996/11/14
2104 @c The @code{ap_pointer} argument is used to pass the tail of the variable
2105 @c argument list containing the values to be printed to your handler.
2106 @c Unlike most other functions that can be passed an explicit variable
2107 @c argument list, this is a @emph{pointer} to a @code{va_list}, rather than
2108 @c the @code{va_list} itself. Thus, you should fetch arguments by
2109 @c means of @code{va_arg (*ap_pointer, @var{type})}.
2111 @c (Passing a pointer here allows the function that calls your handler
2112 @c function to update its own @code{va_list} variable to account for the
2113 @c arguments that your handler processes. @xref{Variadic Functions}.)
2115 The @var{args} is a vector of pointers to the arguments data.
2116 The number of arguments were determined by calling the argument
2117 information function provided by the user.
2119 Your handler function should return a value just like @code{printf}
2120 does: it should return the number of characters it has written, or a
2121 negative value to indicate an error.
2125 @deftp {Data Type} printf_function
2126 This is the data type that a handler function should have.
2129 If you are going to use @w{@code{parse_printf_format}} in your
2130 application, you must also define a function to pass as the
2131 @var{arginfo-function} argument for each new conversion you install with
2132 @code{register_printf_function}.
2134 You have to define these functions with a prototype like:
2137 int @var{function} (const struct printf_info *info,
2138 size_t n, int *argtypes)
2141 The return value from the function should be the number of arguments the
2142 conversion expects. The function should also fill in no more than
2143 @var{n} elements of the @var{argtypes} array with information about the
2144 types of each of these arguments. This information is encoded using the
2145 various @samp{PA_} macros. (You will notice that this is the same
2146 calling convention @code{parse_printf_format} itself uses.)
2150 @deftp {Data Type} printf_arginfo_function
2151 This type is used to describe functions that return information about
2152 the number and type of arguments used by a conversion specifier.
2155 @node Printf Extension Example
2156 @subsection @code{printf} Extension Example
2158 Here is an example showing how to define a @code{printf} handler function.
2159 This program defines a data structure called a @code{Widget} and
2160 defines the @samp{%W} conversion to print information about @w{@code{Widget *}}
2161 arguments, including the pointer value and the name stored in the data
2162 structure. The @samp{%W} conversion supports the minimum field width and
2163 left-justification options, but ignores everything else.
2166 @include rprintf.c.texi
2169 The output produced by this program looks like:
2172 |<Widget 0xffeffb7c: mywidget>|
2173 | <Widget 0xffeffb7c: mywidget>|
2174 |<Widget 0xffeffb7c: mywidget> |
2177 @node Formatted Input
2178 @section Formatted Input
2180 @cindex formatted input from a stream
2181 @cindex reading from a stream, formatted
2182 @cindex format string, for @code{scanf}
2183 @cindex template, for @code{scanf}
2184 The functions described in this section (@code{scanf} and related
2185 functions) provide facilities for formatted input analogous to the
2186 formatted output facilities. These functions provide a mechanism for
2187 reading arbitrary values under the control of a @dfn{format string} or
2188 @dfn{template string}.
2191 * Formatted Input Basics:: Some basics to get you started.
2192 * Input Conversion Syntax:: Syntax of conversion specifications.
2193 * Table of Input Conversions:: Summary of input conversions and what they do.
2194 * Numeric Input Conversions:: Details of conversions for reading numbers.
2195 * String Input Conversions:: Details of conversions for reading strings.
2196 * Dynamic String Input:: String conversions that @code{malloc} the buffer.
2197 * Other Input Conversions:: Details of miscellaneous other conversions.
2198 * Formatted Input Functions:: Descriptions of the actual functions.
2199 * Variable Arguments Input:: @code{vscanf} and friends.
2202 @node Formatted Input Basics
2203 @subsection Formatted Input Basics
2205 Calls to @code{scanf} are superficially similar to calls to
2206 @code{printf} in that arbitrary arguments are read under the control of
2207 a template string. While the syntax of the conversion specifications in
2208 the template is very similar to that for @code{printf}, the
2209 interpretation of the template is oriented more towards free-format
2210 input and simple pattern matching, rather than fixed-field formatting.
2211 For example, most @code{scanf} conversions skip over any amount of
2212 ``white space'' (including spaces, tabs, and newlines) in the input
2213 file, and there is no concept of precision for the numeric input
2214 conversions as there is for the corresponding output conversions.
2215 Ordinarily, non-whitespace characters in the template are expected to
2216 match characters in the input stream exactly, but a matching failure is
2217 distinct from an input error on the stream.
2218 @cindex conversion specifications (@code{scanf})
2220 Another area of difference between @code{scanf} and @code{printf} is
2221 that you must remember to supply pointers rather than immediate values
2222 as the optional arguments to @code{scanf}; the values that are read are
2223 stored in the objects that the pointers point to. Even experienced
2224 programmers tend to forget this occasionally, so if your program is
2225 getting strange errors that seem to be related to @code{scanf}, you
2226 might want to double-check this.
2228 When a @dfn{matching failure} occurs, @code{scanf} returns immediately,
2229 leaving the first non-matching character as the next character to be
2230 read from the stream. The normal return value from @code{scanf} is the
2231 number of values that were assigned, so you can use this to determine if
2232 a matching error happened before all the expected values were read.
2233 @cindex matching failure, in @code{scanf}
2235 The @code{scanf} function is typically used for things like reading in
2236 the contents of tables. For example, here is a function that uses
2237 @code{scanf} to initialize an array of @code{double}:
2241 readarray (double *array, int n)
2245 if (scanf (" %lf", &(array[i])) != 1)
2246 invalid_input_error ();
2250 The formatted input functions are not used as frequently as the
2251 formatted output functions. Partly, this is because it takes some care
2252 to use them properly. Another reason is that it is difficult to recover
2253 from a matching error.
2255 If you are trying to read input that doesn't match a single, fixed
2256 pattern, you may be better off using a tool such as Flex to generate a
2257 lexical scanner, or Bison to generate a parser, rather than using
2258 @code{scanf}. For more information about these tools, see @ref{, , ,
2259 flex.info, Flex: The Lexical Scanner Generator}, and @ref{, , ,
2260 bison.info, The Bison Reference Manual}.
2262 @node Input Conversion Syntax
2263 @subsection Input Conversion Syntax
2265 A @code{scanf} template string is a string that contains ordinary
2266 multibyte characters interspersed with conversion specifications that
2267 start with @samp{%}.
2269 Any whitespace character (as defined by the @code{isspace} function;
2270 @pxref{Classification of Characters}) in the template causes any number
2271 of whitespace characters in the input stream to be read and discarded.
2272 The whitespace characters that are matched need not be exactly the same
2273 whitespace characters that appear in the template string. For example,
2274 write @samp{ , } in the template to recognize a comma with optional
2275 whitespace before and after.
2277 Other characters in the template string that are not part of conversion
2278 specifications must match characters in the input stream exactly; if
2279 this is not the case, a matching failure occurs.
2281 The conversion specifications in a @code{scanf} template string
2282 have the general form:
2285 % @var{flags} @var{width} @var{type} @var{conversion}
2288 In more detail, an input conversion specification consists of an initial
2289 @samp{%} character followed in sequence by:
2293 An optional @dfn{flag character} @samp{*}, which says to ignore the text
2294 read for this specification. When @code{scanf} finds a conversion
2295 specification that uses this flag, it reads input as directed by the
2296 rest of the conversion specification, but it discards this input, does
2297 not use a pointer argument, and does not increment the count of
2298 successful assignments.
2299 @cindex flag character (@code{scanf})
2302 An optional flag character @samp{a} (valid with string conversions only)
2303 which requests allocation of a buffer long enough to store the string in.
2304 (This is a GNU extension.)
2305 @xref{Dynamic String Input}.
2308 An optional decimal integer that specifies the @dfn{maximum field
2309 width}. Reading of characters from the input stream stops either when
2310 this maximum is reached or when a non-matching character is found,
2311 whichever happens first. Most conversions discard initial whitespace
2312 characters (those that don't are explicitly documented), and these
2313 discarded characters don't count towards the maximum field width.
2314 String input conversions store a null character to mark the end of the
2315 input; the maximum field width does not include this terminator.
2316 @cindex maximum field width (@code{scanf})
2319 An optional @dfn{type modifier character}. For example, you can
2320 specify a type modifier of @samp{l} with integer conversions such as
2321 @samp{%d} to specify that the argument is a pointer to a @code{long int}
2322 rather than a pointer to an @code{int}.
2323 @cindex type modifier character (@code{scanf})
2326 A character that specifies the conversion to be applied.
2329 The exact options that are permitted and how they are interpreted vary
2330 between the different conversion specifiers. See the descriptions of the
2331 individual conversions for information about the particular options that
2334 With the @samp{-Wformat} option, the GNU C compiler checks calls to
2335 @code{scanf} and related functions. It examines the format string and
2336 verifies that the correct number and types of arguments are supplied.
2337 There is also a GNU C syntax to tell the compiler that a function you
2338 write uses a @code{scanf}-style format string.
2339 @xref{Function Attributes, , Declaring Attributes of Functions,
2340 gcc.info, Using GNU CC}, for more information.
2342 @node Table of Input Conversions
2343 @subsection Table of Input Conversions
2344 @cindex input conversions, for @code{scanf}
2346 Here is a table that summarizes the various conversion specifications:
2350 Matches an optionally signed integer written in decimal. @xref{Numeric
2354 Matches an optionally signed integer in any of the formats that the C
2355 language defines for specifying an integer constant. @xref{Numeric
2359 Matches an unsigned integer written in octal radix.
2360 @xref{Numeric Input Conversions}.
2363 Matches an unsigned integer written in decimal radix.
2364 @xref{Numeric Input Conversions}.
2366 @item @samp{%x}, @samp{%X}
2367 Matches an unsigned integer written in hexadecimal radix.
2368 @xref{Numeric Input Conversions}.
2370 @item @samp{%e}, @samp{%f}, @samp{%g}, @samp{%E}, @samp{%G}
2371 Matches an optionally signed floating-point number. @xref{Numeric Input
2375 Matches a string containing only non-whitespace characters.
2376 @xref{String Input Conversions}.
2379 Matches a string of characters that belong to a specified set.
2380 @xref{String Input Conversions}.
2383 Matches a string of one or more characters; the number of characters
2384 read is controlled by the maximum field width given for the conversion.
2385 @xref{String Input Conversions}.
2388 Matches a pointer value in the same implementation-defined format used
2389 by the @samp{%p} output conversion for @code{printf}. @xref{Other Input
2393 This conversion doesn't read any characters; it records the number of
2394 characters read so far by this call. @xref{Other Input Conversions}.
2397 This matches a literal @samp{%} character in the input stream. No
2398 corresponding argument is used. @xref{Other Input Conversions}.
2401 If the syntax of a conversion specification is invalid, the behavior is
2402 undefined. If there aren't enough function arguments provided to supply
2403 addresses for all the conversion specifications in the template strings
2404 that perform assignments, or if the arguments are not of the correct
2405 types, the behavior is also undefined. On the other hand, extra
2406 arguments are simply ignored.
2408 @node Numeric Input Conversions
2409 @subsection Numeric Input Conversions
2411 This section describes the @code{scanf} conversions for reading numeric
2414 The @samp{%d} conversion matches an optionally signed integer in decimal
2415 radix. The syntax that is recognized is the same as that for the
2416 @code{strtol} function (@pxref{Parsing of Integers}) with the value
2417 @code{10} for the @var{base} argument.
2419 The @samp{%i} conversion matches an optionally signed integer in any of
2420 the formats that the C language defines for specifying an integer
2421 constant. The syntax that is recognized is the same as that for the
2422 @code{strtol} function (@pxref{Parsing of Integers}) with the value
2423 @code{0} for the @var{base} argument. (You can print integers in this
2424 syntax with @code{printf} by using the @samp{#} flag character with the
2425 @samp{%x}, @samp{%o}, or @samp{%d} conversion. @xref{Integer Conversions}.)
2427 For example, any of the strings @samp{10}, @samp{0xa}, or @samp{012}
2428 could be read in as integers under the @samp{%i} conversion. Each of
2429 these specifies a number with decimal value @code{10}.
2431 The @samp{%o}, @samp{%u}, and @samp{%x} conversions match unsigned
2432 integers in octal, decimal, and hexadecimal radices, respectively. The
2433 syntax that is recognized is the same as that for the @code{strtoul}
2434 function (@pxref{Parsing of Integers}) with the appropriate value
2435 (@code{8}, @code{10}, or @code{16}) for the @var{base} argument.
2437 The @samp{%X} conversion is identical to the @samp{%x} conversion. They
2438 both permit either uppercase or lowercase letters to be used as digits.
2440 The default type of the corresponding argument for the @code{%d} and
2441 @code{%i} conversions is @code{int *}, and @code{unsigned int *} for the
2442 other integer conversions. You can use the following type modifiers to
2443 specify other sizes of integer:
2447 Specifies that the argument is a @code{short int *} or @code{unsigned
2451 Specifies that the argument is a @code{long int *} or @code{unsigned
2452 long int *}. Two @samp{l} characters is like the @samp{L} modifier, below.
2458 Specifies that the argument is a @code{long long int *} or @code{unsigned long long int *}. (The @code{long long} type is an extension supported by the
2459 GNU C compiler. For systems that don't provide extra-long integers, this
2460 is the same as @code{long int}.)
2462 The @samp{q} modifier is another name for the same thing, which comes
2463 from 4.4 BSD; a @w{@code{long long int}} is sometimes called a ``quad''
2467 All of the @samp{%e}, @samp{%f}, @samp{%g}, @samp{%E}, and @samp{%G}
2468 input conversions are interchangeable. They all match an optionally
2469 signed floating point number, in the same syntax as for the
2470 @code{strtod} function (@pxref{Parsing of Floats}).
2472 For the floating-point input conversions, the default argument type is
2473 @code{float *}. (This is different from the corresponding output
2474 conversions, where the default type is @code{double}; remember that
2475 @code{float} arguments to @code{printf} are converted to @code{double}
2476 by the default argument promotions, but @code{float *} arguments are
2477 not promoted to @code{double *}.) You can specify other sizes of float
2478 using these type modifiers:
2482 Specifies that the argument is of type @code{double *}.
2485 Specifies that the argument is of type @code{long double *}.
2488 For all the above number parsing formats there is an additional optional
2489 flag @samp{'}. When this flag is given the @code{scanf} function
2490 expects the number represented in the input string to be formatted
2491 according to the grouping rules of the currently selected locale
2492 (@pxref{General Numeric}).
2494 If the @code{"C"} or @code{"POSIX"} locale is selected there is no
2495 difference. But for a locale which specifies values for the appropriate
2496 fields in the locale the input must have the correct form in the input.
2497 Otherwise the longest prefix with a correct form is processed.
2499 @node String Input Conversions
2500 @subsection String Input Conversions
2502 This section describes the @code{scanf} input conversions for reading
2503 string and character values: @samp{%s}, @samp{%[}, and @samp{%c}.
2505 You have two options for how to receive the input from these
2510 Provide a buffer to store it in. This is the default. You
2511 should provide an argument of type @code{char *}.
2513 @strong{Warning:} To make a robust program, you must make sure that the
2514 input (plus its terminating null) cannot possibly exceed the size of the
2515 buffer you provide. In general, the only way to do this is to specify a
2516 maximum field width one less than the buffer size. @strong{If you
2517 provide the buffer, always specify a maximum field width to prevent
2521 Ask @code{scanf} to allocate a big enough buffer, by specifying the
2522 @samp{a} flag character. This is a GNU extension. You should provide
2523 an argument of type @code{char **} for the buffer address to be stored
2524 in. @xref{Dynamic String Input}.
2527 The @samp{%c} conversion is the simplest: it matches a fixed number of
2528 characters, always. The maximum field with says how many characters to
2529 read; if you don't specify the maximum, the default is 1. This
2530 conversion doesn't append a null character to the end of the text it
2531 reads. It also does not skip over initial whitespace characters. It
2532 reads precisely the next @var{n} characters, and fails if it cannot get
2533 that many. Since there is always a maximum field width with @samp{%c}
2534 (whether specified, or 1 by default), you can always prevent overflow by
2535 making the buffer long enough.
2537 The @samp{%s} conversion matches a string of non-whitespace characters.
2538 It skips and discards initial whitespace, but stops when it encounters
2539 more whitespace after having read something. It stores a null character
2540 at the end of the text that it reads.
2542 For example, reading the input:
2549 with the conversion @samp{%10c} produces @code{" hello, wo"}, but
2550 reading the same input with the conversion @samp{%10s} produces
2553 @strong{Warning:} If you do not specify a field width for @samp{%s},
2554 then the number of characters read is limited only by where the next
2555 whitespace character appears. This almost certainly means that invalid
2556 input can make your program crash---which is a bug.
2558 To read in characters that belong to an arbitrary set of your choice,
2559 use the @samp{%[} conversion. You specify the set between the @samp{[}
2560 character and a following @samp{]} character, using the same syntax used
2561 in regular expressions. As special cases:
2565 A literal @samp{]} character can be specified as the first character
2569 An embedded @samp{-} character (that is, one that is not the first or
2570 last character of the set) is used to specify a range of characters.
2573 If a caret character @samp{^} immediately follows the initial @samp{[},
2574 then the set of allowed input characters is the everything @emph{except}
2575 the characters listed.
2578 The @samp{%[} conversion does not skip over initial whitespace
2581 Here are some examples of @samp{%[} conversions and what they mean:
2584 @item %25[1234567890]
2585 Matches a string of up to 25 digits.
2588 Matches a string of up to 25 square brackets.
2590 @item %25[^ \f\n\r\t\v]
2591 Matches a string up to 25 characters long that doesn't contain any of
2592 the standard whitespace characters. This is slightly different from
2593 @samp{%s}, because if the input begins with a whitespace character,
2594 @samp{%[} reports a matching failure while @samp{%s} simply discards the
2598 Matches up to 25 lowercase characters.
2601 One more reminder: the @samp{%s} and @samp{%[} conversions are
2602 @strong{dangerous} if you don't specify a maximum width or use the
2603 @samp{a} flag, because input too long would overflow whatever buffer you
2604 have provided for it. No matter how long your buffer is, a user could
2605 supply input that is longer. A well-written program reports invalid
2606 input with a comprehensible error message, not with a crash.
2608 @node Dynamic String Input
2609 @subsection Dynamically Allocating String Conversions
2611 A GNU extension to formatted input lets you safely read a string with no
2612 maximum size. Using this feature, you don't supply a buffer; instead,
2613 @code{scanf} allocates a buffer big enough to hold the data and gives
2614 you its address. To use this feature, write @samp{a} as a flag
2615 character, as in @samp{%as} or @samp{%a[0-9a-z]}.
2617 The pointer argument you supply for where to store the input should have
2618 type @code{char **}. The @code{scanf} function allocates a buffer and
2619 stores its address in the word that the argument points to. You should
2620 free the buffer with @code{free} when you no longer need it.
2622 Here is an example of using the @samp{a} flag with the @samp{%[@dots{}]}
2623 conversion specification to read a ``variable assignment'' of the form
2624 @samp{@var{variable} = @var{value}}.
2628 char *variable, *value;
2630 if (2 > scanf ("%a[a-zA-Z0-9] = %a[^\n]\n",
2633 invalid_input_error ();
2641 @node Other Input Conversions
2642 @subsection Other Input Conversions
2644 This section describes the miscellaneous input conversions.
2646 The @samp{%p} conversion is used to read a pointer value. It recognizes
2647 the same syntax as is used by the @samp{%p} output conversion for
2648 @code{printf} (@pxref{Other Output Conversions}); that is, a hexadecimal
2649 number just as the @samp{%x} conversion accepts. The corresponding
2650 argument should be of type @code{void **}; that is, the address of a
2651 place to store a pointer.
2653 The resulting pointer value is not guaranteed to be valid if it was not
2654 originally written during the same program execution that reads it in.
2656 The @samp{%n} conversion produces the number of characters read so far
2657 by this call. The corresponding argument should be of type @code{int *}.
2658 This conversion works in the same way as the @samp{%n} conversion for
2659 @code{printf}; see @ref{Other Output Conversions}, for an example.
2661 The @samp{%n} conversion is the only mechanism for determining the
2662 success of literal matches or conversions with suppressed assignments.
2663 If the @samp{%n} follows the locus of a matching failure, then no value
2664 is stored for it since @code{scanf} returns before processing the
2665 @samp{%n}. If you store @code{-1} in that argument slot before calling
2666 @code{scanf}, the presence of @code{-1} after @code{scanf} indicates an
2667 error occurred before the @samp{%n} was reached.
2669 Finally, the @samp{%%} conversion matches a literal @samp{%} character
2670 in the input stream, without using an argument. This conversion does
2671 not permit any flags, field width, or type modifier to be specified.
2673 @node Formatted Input Functions
2674 @subsection Formatted Input Functions
2676 Here are the descriptions of the functions for performing formatted
2678 Prototypes for these functions are in the header file @file{stdio.h}.
2683 @deftypefun int scanf (const char *@var{template}, @dots{})
2684 The @code{scanf} function reads formatted input from the stream
2685 @code{stdin} under the control of the template string @var{template}.
2686 The optional arguments are pointers to the places which receive the
2689 The return value is normally the number of successful assignments. If
2690 an end-of-file condition is detected before any matches are performed
2691 (including matches against whitespace and literal characters in the
2692 template), then @code{EOF} is returned.
2697 @deftypefun int fscanf (FILE *@var{stream}, const char *@var{template}, @dots{})
2698 This function is just like @code{scanf}, except that the input is read
2699 from the stream @var{stream} instead of @code{stdin}.
2704 @deftypefun int sscanf (const char *@var{s}, const char *@var{template}, @dots{})
2705 This is like @code{scanf}, except that the characters are taken from the
2706 null-terminated string @var{s} instead of from a stream. Reaching the
2707 end of the string is treated as an end-of-file condition.
2709 The behavior of this function is undefined if copying takes place
2710 between objects that overlap---for example, if @var{s} is also given
2711 as an argument to receive a string read under control of the @samp{%s}
2715 @node Variable Arguments Input
2716 @subsection Variable Arguments Input Functions
2718 The functions @code{vscanf} and friends are provided so that you can
2719 define your own variadic @code{scanf}-like functions that make use of
2720 the same internals as the built-in formatted output functions.
2721 These functions are analogous to the @code{vprintf} series of output
2722 functions. @xref{Variable Arguments Output}, for important
2723 information on how to use them.
2725 @strong{Portability Note:} The functions listed in this section are GNU
2730 @deftypefun int vscanf (const char *@var{template}, va_list @var{ap})
2731 This function is similar to @code{scanf} except that, instead of taking
2732 a variable number of arguments directly, it takes an argument list
2733 pointer @var{ap} of type @code{va_list} (@pxref{Variadic Functions}).
2738 @deftypefun int vfscanf (FILE *@var{stream}, const char *@var{template}, va_list @var{ap})
2739 This is the equivalent of @code{fscanf} with the variable argument list
2740 specified directly as for @code{vscanf}.
2745 @deftypefun int vsscanf (const char *@var{s}, const char *@var{template}, va_list @var{ap})
2746 This is the equivalent of @code{sscanf} with the variable argument list
2747 specified directly as for @code{vscanf}.
2750 In GNU C, there is a special construct you can use to let the compiler
2751 know that a function uses a @code{scanf}-style format string. Then it
2752 can check the number and types of arguments in each call to the
2753 function, and warn you when they do not match the format string.
2754 @xref{Function Attributes, , Declaring Attributes of Functions,
2755 gcc.info, Using GNU CC}, for details.
2757 @node EOF and Errors
2758 @section End-Of-File and Errors
2760 @cindex end of file, on a stream
2761 Many of the functions described in this chapter return the value of the
2762 macro @code{EOF} to indicate unsuccessful completion of the operation.
2763 Since @code{EOF} is used to report both end of file and random errors,
2764 it's often better to use the @code{feof} function to check explicitly
2765 for end of file and @code{ferror} to check for errors. These functions
2766 check indicators that are part of the internal state of the stream
2767 object, indicators set if the appropriate condition was detected by a
2768 previous I/O operation on that stream.
2770 These symbols are declared in the header file @file{stdio.h}.
2775 @deftypevr Macro int EOF
2776 This macro is an integer value that is returned by a number of functions
2777 to indicate an end-of-file condition, or some other error situation.
2778 With the GNU library, @code{EOF} is @code{-1}. In other libraries, its
2779 value may be some other negative number.
2784 @deftypefun void clearerr (FILE *@var{stream})
2785 This function clears the end-of-file and error indicators for the
2786 stream @var{stream}.
2788 The file positioning functions (@pxref{File Positioning}) also clear the
2789 end-of-file indicator for the stream.
2794 @deftypefun int feof (FILE *@var{stream})
2795 The @code{feof} function returns nonzero if and only if the end-of-file
2796 indicator for the stream @var{stream} is set.
2801 @deftypefun int ferror (FILE *@var{stream})
2802 The @code{ferror} function returns nonzero if and only if the error
2803 indicator for the stream @var{stream} is set, indicating that an error
2804 has occurred on a previous operation on the stream.
2807 In addition to setting the error indicator associated with the stream,
2808 the functions that operate on streams also set @code{errno} in the same
2809 way as the corresponding low-level functions that operate on file
2810 descriptors. For example, all of the functions that perform output to a
2811 stream---such as @code{fputc}, @code{printf}, and @code{fflush}---are
2812 implemented in terms of @code{write}, and all of the @code{errno} error
2813 conditions defined for @code{write} are meaningful for these functions.
2814 For more information about the descriptor-level I/O functions, see
2815 @ref{Low-Level I/O}.
2817 @node Binary Streams
2818 @section Text and Binary Streams
2820 The GNU system and other POSIX-compatible operating systems organize all
2821 files as uniform sequences of characters. However, some other systems
2822 make a distinction between files containing text and files containing
2823 binary data, and the input and output facilities of @w{ISO C} provide for
2824 this distinction. This section tells you how to write programs portable
2828 @cindex binary stream
2829 When you open a stream, you can specify either a @dfn{text stream} or a
2830 @dfn{binary stream}. You indicate that you want a binary stream by
2831 specifying the @samp{b} modifier in the @var{opentype} argument to
2832 @code{fopen}; see @ref{Opening Streams}. Without this
2833 option, @code{fopen} opens the file as a text stream.
2835 Text and binary streams differ in several ways:
2839 The data read from a text stream is divided into @dfn{lines} which are
2840 terminated by newline (@code{'\n'}) characters, while a binary stream is
2841 simply a long series of characters. A text stream might on some systems
2842 fail to handle lines more than 254 characters long (including the
2843 terminating newline character).
2844 @cindex lines (in a text file)
2847 On some systems, text files can contain only printing characters,
2848 horizontal tab characters, and newlines, and so text streams may not
2849 support other characters. However, binary streams can handle any
2853 Space characters that are written immediately preceding a newline
2854 character in a text stream may disappear when the file is read in again.
2857 More generally, there need not be a one-to-one mapping between
2858 characters that are read from or written to a text stream, and the
2859 characters in the actual file.
2862 Since a binary stream is always more capable and more predictable than a
2863 text stream, you might wonder what purpose text streams serve. Why not
2864 simply always use binary streams? The answer is that on these operating
2865 systems, text and binary streams use different file formats, and the
2866 only way to read or write ``an ordinary file of text'' that can work
2867 with other text-oriented programs is through a text stream.
2869 In the GNU library, and on all POSIX systems, there is no difference
2870 between text streams and binary streams. When you open a stream, you
2871 get the same kind of stream regardless of whether you ask for binary.
2872 This stream can handle any file content, and has none of the
2873 restrictions that text streams sometimes have.
2875 @node File Positioning
2876 @section File Positioning
2877 @cindex file positioning on a stream
2878 @cindex positioning a stream
2879 @cindex seeking on a stream
2881 The @dfn{file position} of a stream describes where in the file the
2882 stream is currently reading or writing. I/O on the stream advances the
2883 file position through the file. In the GNU system, the file position is
2884 represented as an integer, which counts the number of bytes from the
2885 beginning of the file. @xref{File Position}.
2887 During I/O to an ordinary disk file, you can change the file position
2888 whenever you wish, so as to read or write any portion of the file. Some
2889 other kinds of files may also permit this. Files which support changing
2890 the file position are sometimes referred to as @dfn{random-access}
2893 You can use the functions in this section to examine or modify the file
2894 position indicator associated with a stream. The symbols listed below
2895 are declared in the header file @file{stdio.h}.
2900 @deftypefun {long int} ftell (FILE *@var{stream})
2901 This function returns the current file position of the stream
2904 This function can fail if the stream doesn't support file positioning,
2905 or if the file position can't be represented in a @code{long int}, and
2906 possibly for other reasons as well. If a failure occurs, a value of
2907 @code{-1} is returned.
2912 @deftypefun int fseek (FILE *@var{stream}, long int @var{offset}, int @var{whence})
2913 The @code{fseek} function is used to change the file position of the
2914 stream @var{stream}. The value of @var{whence} must be one of the
2915 constants @code{SEEK_SET}, @code{SEEK_CUR}, or @code{SEEK_END}, to
2916 indicate whether the @var{offset} is relative to the beginning of the
2917 file, the current file position, or the end of the file, respectively.
2919 This function returns a value of zero if the operation was successful,
2920 and a nonzero value to indicate failure. A successful call also clears
2921 the end-of-file indicator of @var{stream} and discards any characters
2922 that were ``pushed back'' by the use of @code{ungetc}.
2924 @code{fseek} either flushes any buffered output before setting the file
2925 position or else remembers it so it will be written later in its proper
2929 @strong{Portability Note:} In non-POSIX systems, @code{ftell} and
2930 @code{fseek} might work reliably only on binary streams. @xref{Binary
2933 The following symbolic constants are defined for use as the @var{whence}
2934 argument to @code{fseek}. They are also used with the @code{lseek}
2935 function (@pxref{I/O Primitives}) and to specify offsets for file locks
2936 (@pxref{Control Operations}).
2940 @deftypevr Macro int SEEK_SET
2941 This is an integer constant which, when used as the @var{whence}
2942 argument to the @code{fseek} function, specifies that the offset
2943 provided is relative to the beginning of the file.
2948 @deftypevr Macro int SEEK_CUR
2949 This is an integer constant which, when used as the @var{whence}
2950 argument to the @code{fseek} function, specifies that the offset
2951 provided is relative to the current file position.
2956 @deftypevr Macro int SEEK_END
2957 This is an integer constant which, when used as the @var{whence}
2958 argument to the @code{fseek} function, specifies that the offset
2959 provided is relative to the end of the file.
2964 @deftypefun void rewind (FILE *@var{stream})
2965 The @code{rewind} function positions the stream @var{stream} at the
2966 begining of the file. It is equivalent to calling @code{fseek} on the
2967 @var{stream} with an @var{offset} argument of @code{0L} and a
2968 @var{whence} argument of @code{SEEK_SET}, except that the return
2969 value is discarded and the error indicator for the stream is reset.
2972 These three aliases for the @samp{SEEK_@dots{}} constants exist for the
2973 sake of compatibility with older BSD systems. They are defined in two
2974 different header files: @file{fcntl.h} and @file{sys/file.h}.
2981 An alias for @code{SEEK_SET}.
2987 An alias for @code{SEEK_CUR}.
2993 An alias for @code{SEEK_END}.
2996 @node Portable Positioning
2997 @section Portable File-Position Functions
2999 On the GNU system, the file position is truly a character count. You
3000 can specify any character count value as an argument to @code{fseek} and
3001 get reliable results for any random access file. However, some @w{ISO C}
3002 systems do not represent file positions in this way.
3004 On some systems where text streams truly differ from binary streams, it
3005 is impossible to represent the file position of a text stream as a count
3006 of characters from the beginning of the file. For example, the file
3007 position on some systems must encode both a record offset within the
3008 file, and a character offset within the record.
3010 As a consequence, if you want your programs to be portable to these
3011 systems, you must observe certain rules:
3015 The value returned from @code{ftell} on a text stream has no predictable
3016 relationship to the number of characters you have read so far. The only
3017 thing you can rely on is that you can use it subsequently as the
3018 @var{offset} argument to @code{fseek} to move back to the same file
3022 In a call to @code{fseek} on a text stream, either the @var{offset} must
3023 either be zero; or @var{whence} must be @code{SEEK_SET} and the
3024 @var{offset} must be the result of an earlier call to @code{ftell} on
3028 The value of the file position indicator of a text stream is undefined
3029 while there are characters that have been pushed back with @code{ungetc}
3030 that haven't been read or discarded. @xref{Unreading}.
3033 But even if you observe these rules, you may still have trouble for long
3034 files, because @code{ftell} and @code{fseek} use a @code{long int} value
3035 to represent the file position. This type may not have room to encode
3036 all the file positions in a large file.
3038 So if you do want to support systems with peculiar encodings for the
3039 file positions, it is better to use the functions @code{fgetpos} and
3040 @code{fsetpos} instead. These functions represent the file position
3041 using the data type @code{fpos_t}, whose internal representation varies
3042 from system to system.
3044 These symbols are declared in the header file @file{stdio.h}.
3049 @deftp {Data Type} fpos_t
3050 This is the type of an object that can encode information about the
3051 file position of a stream, for use by the functions @code{fgetpos} and
3054 In the GNU system, @code{fpos_t} is equivalent to @code{off_t} or
3055 @code{long int}. In other systems, it might have a different internal
3061 @deftypefun int fgetpos (FILE *@var{stream}, fpos_t *@var{position})
3062 This function stores the value of the file position indicator for the
3063 stream @var{stream} in the @code{fpos_t} object pointed to by
3064 @var{position}. If successful, @code{fgetpos} returns zero; otherwise
3065 it returns a nonzero value and stores an implementation-defined positive
3066 value in @code{errno}.
3071 @deftypefun int fsetpos (FILE *@var{stream}, const fpos_t @var{position})
3072 This function sets the file position indicator for the stream @var{stream}
3073 to the position @var{position}, which must have been set by a previous
3074 call to @code{fgetpos} on the same stream. If successful, @code{fsetpos}
3075 clears the end-of-file indicator on the stream, discards any characters
3076 that were ``pushed back'' by the use of @code{ungetc}, and returns a value
3077 of zero. Otherwise, @code{fsetpos} returns a nonzero value and stores
3078 an implementation-defined positive value in @code{errno}.
3081 @node Stream Buffering
3082 @section Stream Buffering
3084 @cindex buffering of streams
3085 Characters that are written to a stream are normally accumulated and
3086 transmitted asynchronously to the file in a block, instead of appearing
3087 as soon as they are output by the application program. Similarly,
3088 streams often retrieve input from the host environment in blocks rather
3089 than on a character-by-character basis. This is called @dfn{buffering}.
3091 If you are writing programs that do interactive input and output using
3092 streams, you need to understand how buffering works when you design the
3093 user interface to your program. Otherwise, you might find that output
3094 (such as progress or prompt messages) doesn't appear when you intended
3095 it to, or other unexpected behavior.
3097 This section deals only with controlling when characters are transmitted
3098 between the stream and the file or device, and @emph{not} with how
3099 things like echoing, flow control, and the like are handled on specific
3100 classes of devices. For information on common control operations on
3101 terminal devices, see @ref{Low-Level Terminal Interface}.
3103 You can bypass the stream buffering facilities altogether by using the
3104 low-level input and output functions that operate on file descriptors
3105 instead. @xref{Low-Level I/O}.
3108 * Buffering Concepts:: Terminology is defined here.
3109 * Flushing Buffers:: How to ensure that output buffers are flushed.
3110 * Controlling Buffering:: How to specify what kind of buffering to use.
3113 @node Buffering Concepts
3114 @subsection Buffering Concepts
3116 There are three different kinds of buffering strategies:
3120 Characters written to or read from an @dfn{unbuffered} stream are
3121 transmitted individually to or from the file as soon as possible.
3122 @cindex unbuffered stream
3125 Characters written to a @dfn{line buffered} stream are transmitted to
3126 the file in blocks when a newline character is encountered.
3127 @cindex line buffered stream
3130 Characters written to or read from a @dfn{fully buffered} stream are
3131 transmitted to or from the file in blocks of arbitrary size.
3132 @cindex fully buffered stream
3135 Newly opened streams are normally fully buffered, with one exception: a
3136 stream connected to an interactive device such as a terminal is
3137 initially line buffered. @xref{Controlling Buffering}, for information
3138 on how to select a different kind of buffering. Usually the automatic
3139 selection gives you the most convenient kind of buffering for the file
3142 The use of line buffering for interactive devices implies that output
3143 messages ending in a newline will appear immediately---which is usually
3144 what you want. Output that doesn't end in a newline might or might not
3145 show up immediately, so if you want them to appear immediately, you
3146 should flush buffered output explicitly with @code{fflush}, as described
3147 in @ref{Flushing Buffers}.
3149 @node Flushing Buffers
3150 @subsection Flushing Buffers
3152 @cindex flushing a stream
3153 @dfn{Flushing} output on a buffered stream means transmitting all
3154 accumulated characters to the file. There are many circumstances when
3155 buffered output on a stream is flushed automatically:
3159 When you try to do output and the output buffer is full.
3162 When the stream is closed. @xref{Closing Streams}.
3165 When the program terminates by calling @code{exit}.
3166 @xref{Normal Termination}.
3169 When a newline is written, if the stream is line buffered.
3172 Whenever an input operation on @emph{any} stream actually reads data
3176 If you want to flush the buffered output at another time, call
3177 @code{fflush}, which is declared in the header file @file{stdio.h}.
3182 @deftypefun int fflush (FILE *@var{stream})
3183 This function causes any buffered output on @var{stream} to be delivered
3184 to the file. If @var{stream} is a null pointer, then
3185 @code{fflush} causes buffered output on @emph{all} open output streams
3188 This function returns @code{EOF} if a write error occurs, or zero
3192 @strong{Compatibility Note:} Some brain-damaged operating systems have
3193 been known to be so thoroughly fixated on line-oriented input and output
3194 that flushing a line buffered stream causes a newline to be written!
3195 Fortunately, this ``feature'' seems to be becoming less common. You do
3196 not need to worry about this in the GNU system.
3199 @node Controlling Buffering
3200 @subsection Controlling Which Kind of Buffering
3202 After opening a stream (but before any other operations have been
3203 performed on it), you can explicitly specify what kind of buffering you
3204 want it to have using the @code{setvbuf} function.
3205 @cindex buffering, controlling
3207 The facilities listed in this section are declared in the header
3208 file @file{stdio.h}.
3213 @deftypefun int setvbuf (FILE *@var{stream}, char *@var{buf}, int @var{mode}, size_t @var{size})
3214 This function is used to specify that the stream @var{stream} should
3215 have the buffering mode @var{mode}, which can be either @code{_IOFBF}
3216 (for full buffering), @code{_IOLBF} (for line buffering), or
3217 @code{_IONBF} (for unbuffered input/output).
3219 If you specify a null pointer as the @var{buf} argument, then @code{setvbuf}
3220 allocates a buffer itself using @code{malloc}. This buffer will be freed
3221 when you close the stream.
3223 Otherwise, @var{buf} should be a character array that can hold at least
3224 @var{size} characters. You should not free the space for this array as
3225 long as the stream remains open and this array remains its buffer. You
3226 should usually either allocate it statically, or @code{malloc}
3227 (@pxref{Unconstrained Allocation}) the buffer. Using an automatic array
3228 is not a good idea unless you close the file before exiting the block
3229 that declares the array.
3231 While the array remains a stream buffer, the stream I/O functions will
3232 use the buffer for their internal purposes. You shouldn't try to access
3233 the values in the array directly while the stream is using it for
3236 The @code{setvbuf} function returns zero on success, or a nonzero value
3237 if the value of @var{mode} is not valid or if the request could not
3243 @deftypevr Macro int _IOFBF
3244 The value of this macro is an integer constant expression that can be
3245 used as the @var{mode} argument to the @code{setvbuf} function to
3246 specify that the stream should be fully buffered.
3251 @deftypevr Macro int _IOLBF
3252 The value of this macro is an integer constant expression that can be
3253 used as the @var{mode} argument to the @code{setvbuf} function to
3254 specify that the stream should be line buffered.
3259 @deftypevr Macro int _IONBF
3260 The value of this macro is an integer constant expression that can be
3261 used as the @var{mode} argument to the @code{setvbuf} function to
3262 specify that the stream should be unbuffered.
3267 @deftypevr Macro int BUFSIZ
3268 The value of this macro is an integer constant expression that is good
3269 to use for the @var{size} argument to @code{setvbuf}. This value is
3270 guaranteed to be at least @code{256}.
3272 The value of @code{BUFSIZ} is chosen on each system so as to make stream
3273 I/O efficient. So it is a good idea to use @code{BUFSIZ} as the size
3274 for the buffer when you call @code{setvbuf}.
3276 Actually, you can get an even better value to use for the buffer size
3277 by means of the @code{fstat} system call: it is found in the
3278 @code{st_blksize} field of the file attributes. @xref{Attribute Meanings}.
3280 Sometimes people also use @code{BUFSIZ} as the allocation size of
3281 buffers used for related purposes, such as strings used to receive a
3282 line of input with @code{fgets} (@pxref{Character Input}). There is no
3283 particular reason to use @code{BUFSIZ} for this instead of any other
3284 integer, except that it might lead to doing I/O in chunks of an
3290 @deftypefun void setbuf (FILE *@var{stream}, char *@var{buf})
3291 If @var{buf} is a null pointer, the effect of this function is
3292 equivalent to calling @code{setvbuf} with a @var{mode} argument of
3293 @code{_IONBF}. Otherwise, it is equivalent to calling @code{setvbuf}
3294 with @var{buf}, and a @var{mode} of @code{_IOFBF} and a @var{size}
3295 argument of @code{BUFSIZ}.
3297 The @code{setbuf} function is provided for compatibility with old code;
3298 use @code{setvbuf} in all new programs.
3303 @deftypefun void setbuffer (FILE *@var{stream}, char *@var{buf}, size_t @var{size})
3304 If @var{buf} is a null pointer, this function makes @var{stream} unbuffered.
3305 Otherwise, it makes @var{stream} fully buffered using @var{buf} as the
3306 buffer. The @var{size} argument specifies the length of @var{buf}.
3308 This function is provided for compatibility with old BSD code. Use
3309 @code{setvbuf} instead.
3314 @deftypefun void setlinebuf (FILE *@var{stream})
3315 This function makes @var{stream} be line buffered, and allocates the
3318 This function is provided for compatibility with old BSD code. Use
3319 @code{setvbuf} instead.
3322 @node Other Kinds of Streams
3323 @section Other Kinds of Streams
3325 The GNU library provides ways for you to define additional kinds of
3326 streams that do not necessarily correspond to an open file.
3328 One such type of stream takes input from or writes output to a string.
3329 These kinds of streams are used internally to implement the
3330 @code{sprintf} and @code{sscanf} functions. You can also create such a
3331 stream explicitly, using the functions described in @ref{String Streams}.
3333 More generally, you can define streams that do input/output to arbitrary
3334 objects using functions supplied by your program. This protocol is
3335 discussed in @ref{Custom Streams}.
3337 @strong{Portability Note:} The facilities described in this section are
3338 specific to GNU. Other systems or C implementations might or might not
3339 provide equivalent functionality.
3342 * String Streams:: Streams that get data from or put data in
3343 a string or memory buffer.
3344 * Obstack Streams:: Streams that store data in an obstack.
3345 * Custom Streams:: Defining your own streams with an arbitrary
3346 input data source and/or output data sink.
3349 @node String Streams
3350 @subsection String Streams
3352 @cindex stream, for I/O to a string
3353 @cindex string stream
3354 The @code{fmemopen} and @code{open_memstream} functions allow you to do
3355 I/O to a string or memory buffer. These facilities are declared in
3361 @deftypefun {FILE *} fmemopen (void *@var{buf}, size_t @var{size}, const char *@var{opentype})
3362 This function opens a stream that allows the access specified by the
3363 @var{opentype} argument, that reads from or writes to the buffer specified
3364 by the argument @var{buf}. This array must be at least @var{size} bytes long.
3366 If you specify a null pointer as the @var{buf} argument, @code{fmemopen}
3367 dynamically allocates (as with @code{malloc}; @pxref{Unconstrained
3368 Allocation}) an array @var{size} bytes long. This is really only useful
3369 if you are going to write things to the buffer and then read them back
3370 in again, because you have no way of actually getting a pointer to the
3371 buffer (for this, try @code{open_memstream}, below). The buffer is
3372 freed when the stream is open.
3374 The argument @var{opentype} is the same as in @code{fopen}
3375 (@xref{Opening Streams}). If the @var{opentype} specifies
3376 append mode, then the initial file position is set to the first null
3377 character in the buffer. Otherwise the initial file position is at the
3378 beginning of the buffer.
3380 When a stream open for writing is flushed or closed, a null character
3381 (zero byte) is written at the end of the buffer if it fits. You
3382 should add an extra byte to the @var{size} argument to account for this.
3383 Attempts to write more than @var{size} bytes to the buffer result
3386 For a stream open for reading, null characters (zero bytes) in the
3387 buffer do not count as ``end of file''. Read operations indicate end of
3388 file only when the file position advances past @var{size} bytes. So, if
3389 you want to read characters from a null-terminated string, you should
3390 supply the length of the string as the @var{size} argument.
3393 Here is an example of using @code{fmemopen} to create a stream for
3394 reading from a string:
3397 @include memopen.c.texi
3400 This program produces the following output:
3413 @deftypefun {FILE *} open_memstream (char **@var{ptr}, size_t *@var{sizeloc})
3414 This function opens a stream for writing to a buffer. The buffer is
3415 allocated dynamically (as with @code{malloc}; @pxref{Unconstrained
3416 Allocation}) and grown as necessary.
3418 When the stream is closed with @code{fclose} or flushed with
3419 @code{fflush}, the locations @var{ptr} and @var{sizeloc} are updated to
3420 contain the pointer to the buffer and its size. The values thus stored
3421 remain valid only as long as no further output on the stream takes
3422 place. If you do more output, you must flush the stream again to store
3423 new values before you use them again.
3425 A null character is written at the end of the buffer. This null character
3426 is @emph{not} included in the size value stored at @var{sizeloc}.
3428 You can move the stream's file position with @code{fseek} (@pxref{File
3429 Positioning}). Moving the file position past the end of the data
3430 already written fills the intervening space with zeroes.
3433 Here is an example of using @code{open_memstream}:
3436 @include memstrm.c.texi
3439 This program produces the following output:
3442 buf = `hello', size = 5
3443 buf = `hello, world', size = 12
3446 @c @group Invalid outside @example.
3447 @node Obstack Streams
3448 @subsection Obstack Streams
3450 You can open an output stream that puts it data in an obstack.
3455 @deftypefun {FILE *} open_obstack_stream (struct obstack *@var{obstack})
3456 This function opens a stream for writing data into the obstack @var{obstack}.
3457 This starts an object in the obstack and makes it grow as data is
3458 written (@pxref{Growing Objects}).
3459 @c @end group Doubly invalid because not nested right.
3461 Calling @code{fflush} on this stream updates the current size of the
3462 object to match the amount of data that has been written. After a call
3463 to @code{fflush}, you can examine the object temporarily.
3465 You can move the file position of an obstack stream with @code{fseek}
3466 (@pxref{File Positioning}). Moving the file position past the end of
3467 the data written fills the intervening space with zeros.
3469 To make the object permanent, update the obstack with @code{fflush}, and
3470 then use @code{obstack_finish} to finalize the object and get its address.
3471 The following write to the stream starts a new object in the obstack,
3472 and later writes add to that object until you do another @code{fflush}
3473 and @code{obstack_finish}.
3475 But how do you find out how long the object is? You can get the length
3476 in bytes by calling @code{obstack_object_size} (@pxref{Status of an
3477 Obstack}), or you can null-terminate the object like this:
3480 obstack_1grow (@var{obstack}, 0);
3483 Whichever one you do, you must do it @emph{before} calling
3484 @code{obstack_finish}. (You can do both if you wish.)
3487 Here is a sample function that uses @code{open_obstack_stream}:
3491 make_message_string (const char *a, int b)
3493 FILE *stream = open_obstack_stream (&message_obstack);
3494 output_task (stream);
3495 fprintf (stream, ": ");
3496 fprintf (stream, a, b);
3497 fprintf (stream, "\n");
3499 obstack_1grow (&message_obstack, 0);
3500 return obstack_finish (&message_obstack);
3504 @node Custom Streams
3505 @subsection Programming Your Own Custom Streams
3506 @cindex custom streams
3507 @cindex programming your own streams
3509 This section describes how you can make a stream that gets input from an
3510 arbitrary data source or writes output to an arbitrary data sink
3511 programmed by you. We call these @dfn{custom streams}.
3513 @c !!! this does not talk at all about the higher-level hooks
3516 * Streams and Cookies:: The @dfn{cookie} records where to fetch or
3517 store data that is read or written.
3518 * Hook Functions:: How you should define the four @dfn{hook
3519 functions} that a custom stream needs.
3522 @node Streams and Cookies
3523 @subsubsection Custom Streams and Cookies
3524 @cindex cookie, for custom stream
3526 Inside every custom stream is a special object called the @dfn{cookie}.
3527 This is an object supplied by you which records where to fetch or store
3528 the data read or written. It is up to you to define a data type to use
3529 for the cookie. The stream functions in the library never refer
3530 directly to its contents, and they don't even know what the type is;
3531 they record its address with type @code{void *}.
3533 To implement a custom stream, you must specify @emph{how} to fetch or
3534 store the data in the specified place. You do this by defining
3535 @dfn{hook functions} to read, write, change ``file position'', and close
3536 the stream. All four of these functions will be passed the stream's
3537 cookie so they can tell where to fetch or store the data. The library
3538 functions don't know what's inside the cookie, but your functions will
3541 When you create a custom stream, you must specify the cookie pointer,
3542 and also the four hook functions stored in a structure of type
3543 @code{cookie_io_functions_t}.
3545 These facilities are declared in @file{stdio.h}.
3550 @deftp {Data Type} {cookie_io_functions_t}
3551 This is a structure type that holds the functions that define the
3552 communications protocol between the stream and its cookie. It has
3553 the following members:
3556 @item cookie_read_function_t *read
3557 This is the function that reads data from the cookie. If the value is a
3558 null pointer instead of a function, then read operations on ths stream
3559 always return @code{EOF}.
3561 @item cookie_write_function_t *write
3562 This is the function that writes data to the cookie. If the value is a
3563 null pointer instead of a function, then data written to the stream is
3566 @item cookie_seek_function_t *seek
3567 This is the function that performs the equivalent of file positioning on
3568 the cookie. If the value is a null pointer instead of a function, calls
3569 to @code{fseek} on this stream can only seek to locations within the
3570 buffer; any attempt to seek outside the buffer will return an
3571 @code{ESPIPE} error.
3573 @item cookie_close_function_t *close
3574 This function performs any appropriate cleanup on the cookie when
3575 closing the stream. If the value is a null pointer instead of a
3576 function, nothing special is done to close the cookie when the stream is
3583 @deftypefun {FILE *} fopencookie (void *@var{cookie}, const char *@var{opentype}, cookie_io_functions_t @var{io-functions})
3584 This function actually creates the stream for communicating with the
3585 @var{cookie} using the functions in the @var{io-functions} argument.
3586 The @var{opentype} argument is interpreted as for @code{fopen};
3587 see @ref{Opening Streams}. (But note that the ``truncate on
3588 open'' option is ignored.) The new stream is fully buffered.
3590 The @code{fopencookie} function returns the newly created stream, or a null
3591 pointer in case of an error.
3594 @node Hook Functions
3595 @subsubsection Custom Stream Hook Functions
3596 @cindex hook functions (of custom streams)
3598 Here are more details on how you should define the four hook functions
3599 that a custom stream needs.
3601 You should define the function to read data from the cookie as:
3604 ssize_t @var{reader} (void *@var{cookie}, void *@var{buffer}, size_t @var{size})
3607 This is very similar to the @code{read} function; see @ref{I/O
3608 Primitives}. Your function should transfer up to @var{size} bytes into
3609 the @var{buffer}, and return the number of bytes read, or zero to
3610 indicate end-of-file. You can return a value of @code{-1} to indicate
3613 You should define the function to write data to the cookie as:
3616 ssize_t @var{writer} (void *@var{cookie}, const void *@var{buffer}, size_t @var{size})
3619 This is very similar to the @code{write} function; see @ref{I/O
3620 Primitives}. Your function should transfer up to @var{size} bytes from
3621 the buffer, and return the number of bytes written. You can return a
3622 value of @code{-1} to indicate an error.
3624 You should define the function to perform seek operations on the cookie
3628 int @var{seeker} (void *@var{cookie}, fpos_t *@var{position}, int @var{whence})
3631 For this function, the @var{position} and @var{whence} arguments are
3632 interpreted as for @code{fgetpos}; see @ref{Portable Positioning}. In
3633 the GNU library, @code{fpos_t} is equivalent to @code{off_t} or
3634 @code{long int}, and simply represents the number of bytes from the
3635 beginning of the file.
3637 After doing the seek operation, your function should store the resulting
3638 file position relative to the beginning of the file in @var{position}.
3639 Your function should return a value of @code{0} on success and @code{-1}
3640 to indicate an error.
3642 You should define the function to do cleanup operations on the cookie
3643 appropriate for closing the stream as:
3646 int @var{cleaner} (void *@var{cookie})
3649 Your function should return @code{-1} to indicate an error, and @code{0}
3654 @deftp {Data Type} cookie_read_function
3655 This is the data type that the read function for a custom stream should have.
3656 If you declare the function as shown above, this is the type it will have.
3661 @deftp {Data Type} cookie_write_function
3662 The data type of the write function for a custom stream.
3667 @deftp {Data Type} cookie_seek_function
3668 The data type of the seek function for a custom stream.
3673 @deftp {Data Type} cookie_close_function
3674 The data type of the close function for a custom stream.
3681 There is another set of functions one can give a stream, the
3682 input-room and output-room functions. These functions must
3683 understand stdio internals. To describe how to use these
3684 functions, you also need to document lots of how stdio works
3685 internally (which isn't relevant for other uses of stdio).
3686 Perhaps I can write an interface spec from which you can write
3687 good documentation. But it's pretty complex and deals with lots
3688 of nitty-gritty details. I think it might be better to let this
3689 wait until the rest of the manual is more done and polished.
3693 @c ??? This section could use an example.