1 @node File System Interface, Pipes and FIFOs, Low-Level I/O, Top
2 @c %MENU% Functions for manipulating files
3 @chapter File System Interface
5 This chapter describes @theglibc{}'s functions for manipulating
6 files. Unlike the input and output functions (@pxref{I/O on Streams};
7 @pxref{Low-Level I/O}), these functions are concerned with operating
8 on the files themselves rather than on their contents.
10 Among the facilities described in this chapter are functions for
11 examining or modifying directories, functions for renaming and deleting
12 files, and functions for examining and setting file attributes such as
13 access permissions and modification times.
16 * Working Directory:: This is used to resolve relative
18 * Accessing Directories:: Finding out what files a directory
20 * Working with Directory Trees:: Apply actions to all files or a selectable
21 subset of a directory hierarchy.
22 * Hard Links:: Adding alternate names to a file.
23 * Symbolic Links:: A file that ``points to'' a file name.
24 * Deleting Files:: How to delete a file, and what that means.
25 * Renaming Files:: Changing a file's name.
26 * Creating Directories:: A system call just for creating a directory.
27 * File Attributes:: Attributes of individual files.
28 * Making Special Files:: How to create special files.
29 * Temporary Files:: Naming and creating temporary files.
32 @node Working Directory
33 @section Working Directory
35 @cindex current working directory
36 @cindex working directory
37 @cindex change working directory
38 Each process has associated with it a directory, called its @dfn{current
39 working directory} or simply @dfn{working directory}, that is used in
40 the resolution of relative file names (@pxref{File Name Resolution}).
42 When you log in and begin a new session, your working directory is
43 initially set to the home directory associated with your login account
44 in the system user database. You can find any user's home directory
45 using the @code{getpwuid} or @code{getpwnam} functions; see @ref{User
48 Users can change the working directory using shell commands like
49 @code{cd}. The functions described in this section are the primitives
50 used by those commands and by other programs for examining and changing
51 the working directory.
54 Prototypes for these functions are declared in the header file
60 @deftypefun {char *} getcwd (char *@var{buffer}, size_t @var{size})
61 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
62 @c If buffer is NULL, this function calls malloc and realloc, and, in
63 @c case of error, free. Linux offers a getcwd syscall that we use on
64 @c GNU/Linux systems, but it may fail if the pathname is too long. As a
65 @c fallback, and on other systems, the generic implementation opens each
66 @c parent directory with opendir, which allocates memory for the
67 @c directory stream with malloc. If a fstatat64 syscall is not
68 @c available, very deep directory trees may also have to malloc to build
69 @c longer sequences of ../../../... than those supported by a global
70 @c const read-only string.
74 @c malloc/realloc/free if buffer is NULL, or if dir is too deep
75 @c lstat64 -> see its own entry
77 @c direct syscall if possible, alloca+snprintf+*stat64 otherwise
78 @c openat64_not_cancel_3, close_not_cancel_no_status
79 @c __fdopendir, __opendir, __readdir, rewinddir
80 The @code{getcwd} function returns an absolute file name representing
81 the current working directory, storing it in the character array
82 @var{buffer} that you provide. The @var{size} argument is how you tell
83 the system the allocation size of @var{buffer}.
85 The @glibcadj{} version of this function also permits you to specify a
86 null pointer for the @var{buffer} argument. Then @code{getcwd}
87 allocates a buffer automatically, as with @code{malloc}
88 (@pxref{Unconstrained Allocation}). If the @var{size} is greater than
89 zero, then the buffer is that large; otherwise, the buffer is as large
90 as necessary to hold the result.
92 The return value is @var{buffer} on success and a null pointer on failure.
93 The following @code{errno} error conditions are defined for this function:
97 The @var{size} argument is zero and @var{buffer} is not a null pointer.
100 The @var{size} argument is less than the length of the working directory
101 name. You need to allocate a bigger array and try again.
104 Permission to read or search a component of the file name was denied.
108 You could implement the behavior of GNU's @w{@code{getcwd (NULL, 0)}}
109 using only the standard behavior of @code{getcwd}:
119 char *buffer = (char *) xmalloc (size);
120 if (getcwd (buffer, size) == buffer)
131 @xref{Malloc Examples}, for information about @code{xmalloc}, which is
132 not a library function but is a customary name used in most GNU
137 @deftypefn {Deprecated Function} {char *} getwd (char *@var{buffer})
138 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @ascuintl{}}@acunsafe{@acsmem{} @acsfd{}}}
139 @c Besides the getcwd safety issues, it calls strerror_r on error, which
140 @c brings in all of the i18n issues.
141 This is similar to @code{getcwd}, but has no way to specify the size of
142 the buffer. @Theglibc{} provides @code{getwd} only
143 for backwards compatibility with BSD.
145 The @var{buffer} argument should be a pointer to an array at least
146 @code{PATH_MAX} bytes long (@pxref{Limits for Files}). On @gnuhurdsystems{}
147 there is no limit to the size of a file name, so this is not
148 necessarily enough space to contain the directory name. That is why
149 this function is deprecated.
154 @deftypefun {char *} get_current_dir_name (void)
155 @safety{@prelim{}@mtsafe{@mtsenv{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
156 @c Besides getcwd, which this function calls as a fallback, it calls
157 @c getenv, with the potential thread-safety issues that brings about.
159 This @code{get_current_dir_name} function is basically equivalent to
160 @w{@code{getcwd (NULL, 0)}}. The only difference is that the value of
161 the @code{PWD} variable is returned if this value is correct. This is a
162 subtle difference which is visible if the path described by the
163 @code{PWD} value is using one or more symbol links in which case the
164 value returned by @code{getcwd} can resolve the symbol links and
165 therefore yield a different result.
167 This function is a GNU extension.
172 @deftypefun int chdir (const char *@var{filename})
173 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
174 This function is used to set the process's working directory to
177 The normal, successful return value from @code{chdir} is @code{0}. A
178 value of @code{-1} is returned to indicate an error. The @code{errno}
179 error conditions defined for this function are the usual file name
180 syntax errors (@pxref{File Name Errors}), plus @code{ENOTDIR} if the
181 file @var{filename} is not a directory.
186 @deftypefun int fchdir (int @var{filedes})
187 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
188 This function is used to set the process's working directory to
189 directory associated with the file descriptor @var{filedes}.
191 The normal, successful return value from @code{fchdir} is @code{0}. A
192 value of @code{-1} is returned to indicate an error. The following
193 @code{errno} error conditions are defined for this function:
197 Read permission is denied for the directory named by @code{dirname}.
200 The @var{filedes} argument is not a valid file descriptor.
203 The file descriptor @var{filedes} is not associated with a directory.
206 The function call was interrupt by a signal.
209 An I/O error occurred.
214 @node Accessing Directories
215 @section Accessing Directories
216 @cindex accessing directories
217 @cindex reading from a directory
218 @cindex directories, accessing
220 The facilities described in this section let you read the contents of a
221 directory file. This is useful if you want your program to list all the
222 files in a directory, perhaps as part of a menu.
224 @cindex directory stream
225 The @code{opendir} function opens a @dfn{directory stream} whose
226 elements are directory entries. Alternatively @code{fdopendir} can be
227 used which can have advantages if the program needs to have more
228 control over the way the directory is opened for reading. This
229 allows, for instance, to pass the @code{O_NOATIME} flag to
232 You use the @code{readdir} function on the directory stream to
233 retrieve these entries, represented as @w{@code{struct dirent}}
234 objects. The name of the file for each entry is stored in the
235 @code{d_name} member of this structure. There are obvious parallels
236 here to the stream facilities for ordinary files, described in
237 @ref{I/O on Streams}.
240 * Directory Entries:: Format of one directory entry.
241 * Opening a Directory:: How to open a directory stream.
242 * Reading/Closing Directory:: How to read directory entries from the stream.
243 * Simple Directory Lister:: A very simple directory listing program.
244 * Random Access Directory:: Rereading part of the directory
245 already read with the same stream.
246 * Scanning Directory Content:: Get entries for user selected subset of
247 contents in given directory.
248 * Simple Directory Lister Mark II:: Revised version of the program.
251 @node Directory Entries
252 @subsection Format of a Directory Entry
255 This section describes what you find in a single directory entry, as you
256 might obtain it from a directory stream. All the symbols are declared
257 in the header file @file{dirent.h}.
261 @deftp {Data Type} {struct dirent}
262 This is a structure type used to return information about directory
263 entries. It contains the following fields:
267 This is the null-terminated file name component. This is the only
268 field you can count on in all POSIX systems.
271 This is the file serial number. For BSD compatibility, you can also
272 refer to this member as @code{d_ino}. On @gnulinuxhurdsystems{} and most POSIX
273 systems, for most files this the same as the @code{st_ino} member that
274 @code{stat} will return for the file. @xref{File Attributes}.
276 @item unsigned char d_namlen
277 This is the length of the file name, not including the terminating
278 null character. Its type is @code{unsigned char} because that is the
279 integer type of the appropriate size. This member is a BSD extension.
280 The symbol @code{_DIRENT_HAVE_D_NAMLEN} is defined if this member is
283 @item unsigned char d_type
284 This is the type of the file, possibly unknown. The following constants
285 are defined for its value:
289 The type is unknown. Only some filesystems have full support to
290 return the type of the file, others might always return this value.
299 A named pipe, or FIFO. @xref{FIFO Special Files}.
302 A local-domain socket. @c !!! @xref{Local Domain}.
314 This member is a BSD extension. The symbol @code{_DIRENT_HAVE_D_TYPE}
315 is defined if this member is available. On systems where it is used, it
316 corresponds to the file type bits in the @code{st_mode} member of
317 @code{struct stat}. If the value cannot be determine the member
318 value is DT_UNKNOWN. These two macros convert between @code{d_type}
319 values and @code{st_mode} values:
323 @deftypefun int IFTODT (mode_t @var{mode})
324 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
325 This returns the @code{d_type} value corresponding to @var{mode}.
330 @deftypefun mode_t DTTOIF (int @var{dtype})
331 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
332 This returns the @code{st_mode} value corresponding to @var{dtype}.
336 This structure may contain additional members in the future. Their
337 availability is always announced in the compilation environment by a
338 macro names @code{_DIRENT_HAVE_D_@var{xxx}} where @var{xxx} is replaced
339 by the name of the new member. For instance, the member @code{d_reclen}
340 available on some systems is announced through the macro
341 @code{_DIRENT_HAVE_D_RECLEN}.
343 When a file has multiple names, each name has its own directory entry.
344 The only way you can tell that the directory entries belong to a
345 single file is that they have the same value for the @code{d_fileno}
348 File attributes such as size, modification times etc., are part of the
349 file itself, not of any particular directory entry. @xref{File
353 @node Opening a Directory
354 @subsection Opening a Directory Stream
357 This section describes how to open a directory stream. All the symbols
358 are declared in the header file @file{dirent.h}.
362 @deftp {Data Type} DIR
363 The @code{DIR} data type represents a directory stream.
366 You shouldn't ever allocate objects of the @code{struct dirent} or
367 @code{DIR} data types, since the directory access functions do that for
368 you. Instead, you refer to these objects using the pointers returned by
369 the following functions.
373 @deftypefun {DIR *} opendir (const char *@var{dirname})
374 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
375 @c Besides the safe syscall, we have to allocate the DIR object with
376 @c __alloc_dir, that calls malloc.
377 The @code{opendir} function opens and returns a directory stream for
378 reading the directory whose file name is @var{dirname}. The stream has
381 If unsuccessful, @code{opendir} returns a null pointer. In addition to
382 the usual file name errors (@pxref{File Name Errors}), the
383 following @code{errno} error conditions are defined for this function:
387 Read permission is denied for the directory named by @code{dirname}.
390 The process has too many files open.
393 The entire system, or perhaps the file system which contains the
394 directory, cannot support any additional open files at the moment.
395 (This problem cannot happen on @gnuhurdsystems{}.)
398 Not enough memory available.
401 The @code{DIR} type is typically implemented using a file descriptor,
402 and the @code{opendir} function in terms of the @code{open} function.
403 @xref{Low-Level I/O}. Directory streams and the underlying
404 file descriptors are closed on @code{exec} (@pxref{Executing a File}).
407 The directory which is opened for reading by @code{opendir} is
408 identified by the name. In some situations this is not sufficient.
409 Or the way @code{opendir} implicitly creates a file descriptor for the
410 directory is not the way a program might want it. In these cases an
411 alternative interface can be used.
415 @deftypefun {DIR *} fdopendir (int @var{fd})
416 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
417 @c The DIR object is allocated with __alloc_dir, that calls malloc.
418 The @code{fdopendir} function works just like @code{opendir} but
419 instead of taking a file name and opening a file descriptor for the
420 directory the caller is required to provide a file descriptor. This
421 file descriptor is then used in subsequent uses of the returned
422 directory stream object.
424 The caller must make sure the file descriptor is associated with a
425 directory and it allows reading.
427 If the @code{fdopendir} call returns successfully the file descriptor
428 is now under the control of the system. It can be used in the same
429 way the descriptor implicitly created by @code{opendir} can be used
430 but the program must not close the descriptor.
432 In case the function is unsuccessful it returns a null pointer and the
433 file descriptor remains to be usable by the program. The following
434 @code{errno} error conditions are defined for this function:
438 The file descriptor is not valid.
441 The file descriptor is not associated with a directory.
444 The descriptor does not allow reading the directory content.
447 Not enough memory available.
451 In some situations it can be desirable to get hold of the file
452 descriptor which is created by the @code{opendir} call. For instance,
453 to switch the current working directory to the directory just read the
454 @code{fchdir} function could be used. Historically the @code{DIR} type
455 was exposed and programs could access the fields. This does not happen
456 in @theglibc{}. Instead a separate function is provided to allow
461 @deftypefun int dirfd (DIR *@var{dirstream})
462 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
463 The function @code{dirfd} returns the file descriptor associated with
464 the directory stream @var{dirstream}. This descriptor can be used until
465 the directory is closed with @code{closedir}. If the directory stream
466 implementation is not using file descriptors the return value is
470 @node Reading/Closing Directory
471 @subsection Reading and Closing a Directory Stream
474 This section describes how to read directory entries from a directory
475 stream, and how to close the stream when you are done with it. All the
476 symbols are declared in the header file @file{dirent.h}.
480 @deftypefun {struct dirent *} readdir (DIR *@var{dirstream})
481 @safety{@prelim{}@mtunsafe{@mtasurace{:dirstream}}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
482 @c This function holds dirstream's non-recursive lock, which brings
483 @c about the usual issues with locks and async signals and cancellation,
484 @c but the lock taking is not enough to make the returned value safe to
485 @c use, since it points to a stream's internal buffer that can be
486 @c overwritten by subsequent calls or even released by closedir.
487 This function reads the next entry from the directory. It normally
488 returns a pointer to a structure containing information about the
489 file. This structure is associated with the @var{dirstream} handle
490 and can be rewritten by a subsequent call.
492 @strong{Portability Note:} On some systems @code{readdir} may not
493 return entries for @file{.} and @file{..}, even though these are always
494 valid file names in any directory. @xref{File Name Resolution}.
496 If there are no more entries in the directory or an error is detected,
497 @code{readdir} returns a null pointer. The following @code{errno} error
498 conditions are defined for this function:
502 The @var{dirstream} argument is not valid.
505 To distinguish between an end-of-directory condition or an error, you
506 must set @code{errno} to zero before calling @code{readdir}. To avoid
507 entering an infinite loop, you should stop reading from the directory
508 after the first error.
510 In POSIX.1-2008, @code{readdir} is not thread-safe. In @theglibc{}
511 implementation, it is safe to call @code{readdir} concurrently on
512 different @var{dirstream}s, but multiple threads accessing the same
513 @var{dirstream} result in undefined behavior. @code{readdir_r} is a
514 fully thread-safe alternative, but suffers from poor portability (see
515 below). It is recommended that you use @code{readdir}, with external
516 locking if multiple threads access the same @var{dirstream}.
521 @deftypefun int readdir_r (DIR *@var{dirstream}, struct dirent *@var{entry}, struct dirent **@var{result})
522 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
523 This function is a version of @code{readdir} which performs internal
524 locking. Like @code{readdir} it returns the next entry from the
525 directory. To prevent conflicts between simultaneously running
526 threads the result is stored inside the @var{entry} object.
528 @strong{Portability Note:} @code{readdir_r} is deprecated. It is
529 recommended to use @code{readdir} instead of @code{readdir_r} for the
534 On systems which do not define @code{NAME_MAX}, it may not be possible
535 to use @code{readdir_r} safely because the caller does not specify the
536 length of the buffer for the directory entry.
539 On some systems, @code{readdir_r} cannot read directory entries with
540 very long names. If such a name is encountered, @theglibc{}
541 implementation of @code{readdir_r} returns with an error code of
542 @code{ENAMETOOLONG} after the final directory entry has been read. On
543 other systems, @code{readdir_r} may return successfully, but the
544 @code{d_name} member may not be NUL-terminated or may be truncated.
547 POSIX-1.2008 does not guarantee that @code{readdir} is thread-safe,
548 even when access to the same @var{dirstream} is serialized. But in
549 current implementations (including @theglibc{}), it is safe to call
550 @code{readdir} concurrently on different @var{dirstream}s, so there is
551 no need to use @code{readdir_r} in most multi-threaded programs. In
552 the rare case that multiple threads need to read from the same
553 @var{dirstream}, it is still better to use @code{readdir} and external
557 It is expected that future versions of POSIX will obsolete
558 @code{readdir_r} and mandate the level of thread safety for
559 @code{readdir} which is provided by @theglibc{} and other
560 implementations today.
563 Normally @code{readdir_r} returns zero and sets @code{*@var{result}}
564 to @var{entry}. If there are no more entries in the directory or an
565 error is detected, @code{readdir_r} sets @code{*@var{result}} to a
566 null pointer and returns a nonzero error code, also stored in
567 @code{errno}, as described for @code{readdir}.
569 It is also important to look at the definition of the @code{struct
570 dirent} type. Simply passing a pointer to an object of this type for
571 the second parameter of @code{readdir_r} might not be enough. Some
572 systems don't define the @code{d_name} element sufficiently long. In
573 this case the user has to provide additional space. There must be room
574 for at least @code{NAME_MAX + 1} characters in the @code{d_name} array.
575 Code to call @code{readdir_r} could look like this:
581 char b[offsetof (struct dirent, d_name) + NAME_MAX + 1];
584 if (readdir_r (dir, &u.d, &res) == 0)
589 To support large filesystems on 32-bit machines there are LFS variants
590 of the last two functions.
594 @deftypefun {struct dirent64 *} readdir64 (DIR *@var{dirstream})
595 @safety{@prelim{}@mtunsafe{@mtasurace{:dirstream}}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
596 The @code{readdir64} function is just like the @code{readdir} function
597 except that it returns a pointer to a record of type @code{struct
598 dirent64}. Some of the members of this data type (notably @code{d_ino})
599 might have a different size to allow large filesystems.
601 In all other aspects this function is equivalent to @code{readdir}.
606 @deftypefun int readdir64_r (DIR *@var{dirstream}, struct dirent64 *@var{entry}, struct dirent64 **@var{result})
607 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
608 The deprecated @code{readdir64_r} function is equivalent to the
609 @code{readdir_r} function except that it takes parameters of base type
610 @code{struct dirent64} instead of @code{struct dirent} in the second and
611 third position. The same precautions mentioned in the documentation of
612 @code{readdir_r} also apply here.
617 @deftypefun int closedir (DIR *@var{dirstream})
618 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{/hurd}}@acunsafe{@acsmem{} @acsfd{} @aculock{/hurd}}}
619 @c No synchronization in the posix implementation, only in the hurd
620 @c one. This is regarded as safe because it is undefined behavior if
621 @c other threads could still be using the dir stream while it's closed.
622 This function closes the directory stream @var{dirstream}. It returns
623 @code{0} on success and @code{-1} on failure.
625 The following @code{errno} error conditions are defined for this
630 The @var{dirstream} argument is not valid.
634 @node Simple Directory Lister
635 @subsection Simple Program to List a Directory
637 Here's a simple program that prints the names of the files in
638 the current working directory:
644 The order in which files appear in a directory tends to be fairly
645 random. A more useful program would sort the entries (perhaps by
646 alphabetizing them) before printing them; see
647 @ref{Scanning Directory Content}, and @ref{Array Sort Function}.
650 @node Random Access Directory
651 @subsection Random Access in a Directory Stream
654 This section describes how to reread parts of a directory that you have
655 already read from an open directory stream. All the symbols are
656 declared in the header file @file{dirent.h}.
660 @deftypefun void rewinddir (DIR *@var{dirstream})
661 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
662 The @code{rewinddir} function is used to reinitialize the directory
663 stream @var{dirstream}, so that if you call @code{readdir} it
664 returns information about the first entry in the directory again. This
665 function also notices if files have been added or removed to the
666 directory since it was opened with @code{opendir}. (Entries for these
667 files might or might not be returned by @code{readdir} if they were
668 added or removed since you last called @code{opendir} or
674 @deftypefun {long int} telldir (DIR *@var{dirstream})
675 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{/bsd} @asulock{/bsd}}@acunsafe{@acsmem{/bsd} @aculock{/bsd}}}
676 @c The implementation is safe on most platforms, but on BSD it uses
677 @c cookies, buckets and records, and the global array of pointers to
678 @c dynamically allocated records is guarded by a non-recursive lock.
679 The @code{telldir} function returns the file position of the directory
680 stream @var{dirstream}. You can use this value with @code{seekdir} to
681 restore the directory stream to that position.
686 @deftypefun void seekdir (DIR *@var{dirstream}, long int @var{pos})
687 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{/bsd} @asulock{/bsd}}@acunsafe{@acsmem{/bsd} @aculock{/bsd}}}
688 @c The implementation is safe on most platforms, but on BSD it uses
689 @c cookies, buckets and records, and the global array of pointers to
690 @c dynamically allocated records is guarded by a non-recursive lock.
691 The @code{seekdir} function sets the file position of the directory
692 stream @var{dirstream} to @var{pos}. The value @var{pos} must be the
693 result of a previous call to @code{telldir} on this particular stream;
694 closing and reopening the directory can invalidate values returned by
699 @node Scanning Directory Content
700 @subsection Scanning the Content of a Directory
702 A higher-level interface to the directory handling functions is the
703 @code{scandir} function. With its help one can select a subset of the
704 entries in a directory, possibly sort them and get a list of names as
709 @deftypefun int scandir (const char *@var{dir}, struct dirent ***@var{namelist}, int (*@var{selector}) (const struct dirent *), int (*@var{cmp}) (const struct dirent **, const struct dirent **))
710 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
711 @c The scandir function calls __opendirat, __readdir, and __closedir to
712 @c go over the named dir; malloc and realloc to allocate the namelist
713 @c and copies of each selected dirent, besides the selector, if given,
714 @c and qsort and the cmp functions if the latter is given. In spite of
715 @c the cleanup handler that releases memory and the file descriptor in
716 @c case of synchronous cancellation, an asynchronous cancellation may
717 @c still leak memory and a file descriptor. Although readdir is unsafe
718 @c in general, the use of an internal dir stream for sequential scanning
719 @c of the directory with copying of dirents before subsequent calls
720 @c makes the use safe, and the fact that the dir stream is private to
721 @c each scandir call does away with the lock issues in readdir and
724 The @code{scandir} function scans the contents of the directory selected
725 by @var{dir}. The result in *@var{namelist} is an array of pointers to
726 structure of type @code{struct dirent} which describe all selected
727 directory entries and which is allocated using @code{malloc}. Instead
728 of always getting all directory entries returned, the user supplied
729 function @var{selector} can be used to decide which entries are in the
730 result. Only the entries for which @var{selector} returns a non-zero
733 Finally the entries in *@var{namelist} are sorted using the
734 user-supplied function @var{cmp}. The arguments passed to the @var{cmp}
735 function are of type @code{struct dirent **}, therefore one cannot
736 directly use the @code{strcmp} or @code{strcoll} functions; instead see
737 the functions @code{alphasort} and @code{versionsort} below.
739 The return value of the function is the number of entries placed in
740 *@var{namelist}. If it is @code{-1} an error occurred (either the
741 directory could not be opened for reading or the malloc call failed) and
742 the global variable @code{errno} contains more information on the error.
745 As described above the fourth argument to the @code{scandir} function
746 must be a pointer to a sorting function. For the convenience of the
747 programmer @theglibc{} contains implementations of functions which
748 are very helpful for this purpose.
752 @deftypefun int alphasort (const struct dirent **@var{a}, const struct dirent **@var{b})
753 @safety{@prelim{}@mtsafe{@mtslocale{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
755 The @code{alphasort} function behaves like the @code{strcoll} function
756 (@pxref{String/Array Comparison}). The difference is that the arguments
757 are not string pointers but instead they are of type
758 @code{struct dirent **}.
760 The return value of @code{alphasort} is less than, equal to, or greater
761 than zero depending on the order of the two entries @var{a} and @var{b}.
766 @deftypefun int versionsort (const struct dirent **@var{a}, const struct dirent **@var{b})
767 @safety{@prelim{}@mtsafe{@mtslocale{}}@assafe{}@acsafe{}}
768 @c Calls strverscmp, which will accesses the locale object multiple
770 The @code{versionsort} function is like @code{alphasort} except that it
771 uses the @code{strverscmp} function internally.
774 If the filesystem supports large files we cannot use the @code{scandir}
775 anymore since the @code{dirent} structure might not able to contain all
776 the information. The LFS provides the new type @w{@code{struct
777 dirent64}}. To use this we need a new function.
781 @deftypefun int scandir64 (const char *@var{dir}, struct dirent64 ***@var{namelist}, int (*@var{selector}) (const struct dirent64 *), int (*@var{cmp}) (const struct dirent64 **, const struct dirent64 **))
782 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
784 The @code{scandir64} function works like the @code{scandir} function
785 except that the directory entries it returns are described by elements
786 of type @w{@code{struct dirent64}}. The function pointed to by
787 @var{selector} is again used to select the desired entries, except that
788 @var{selector} now must point to a function which takes a
789 @w{@code{struct dirent64 *}} parameter.
791 Similarly the @var{cmp} function should expect its two arguments to be
792 of type @code{struct dirent64 **}.
795 As @var{cmp} is now a function of a different type, the functions
796 @code{alphasort} and @code{versionsort} cannot be supplied for that
797 argument. Instead we provide the two replacement functions below.
801 @deftypefun int alphasort64 (const struct dirent64 **@var{a}, const struct dirent **@var{b})
802 @safety{@prelim{}@mtsafe{@mtslocale{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
804 The @code{alphasort64} function behaves like the @code{strcoll} function
805 (@pxref{String/Array Comparison}). The difference is that the arguments
806 are not string pointers but instead they are of type
807 @code{struct dirent64 **}.
809 Return value of @code{alphasort64} is less than, equal to, or greater
810 than zero depending on the order of the two entries @var{a} and @var{b}.
815 @deftypefun int versionsort64 (const struct dirent64 **@var{a}, const struct dirent64 **@var{b})
816 @safety{@prelim{}@mtsafe{@mtslocale{}}@assafe{}@acsafe{}}
818 The @code{versionsort64} function is like @code{alphasort64}, excepted that it
819 uses the @code{strverscmp} function internally.
822 It is important not to mix the use of @code{scandir} and the 64-bit
823 comparison functions or vice versa. There are systems on which this
824 works but on others it will fail miserably.
826 @node Simple Directory Lister Mark II
827 @subsection Simple Program to List a Directory, Mark II
829 Here is a revised version of the directory lister found above
830 (@pxref{Simple Directory Lister}). Using the @code{scandir} function we
831 can avoid the functions which work directly with the directory contents.
832 After the call the returned entries are available for direct use.
838 Note the simple selector function in this example. Since we want to see
839 all directory entries we always return @code{1}.
842 @node Working with Directory Trees
843 @section Working with Directory Trees
844 @cindex directory hierarchy
845 @cindex hierarchy, directory
846 @cindex tree, directory
848 The functions described so far for handling the files in a directory
849 have allowed you to either retrieve the information bit by bit, or to
850 process all the files as a group (see @code{scandir}). Sometimes it is
851 useful to process whole hierarchies of directories and their contained
852 files. The X/Open specification defines two functions to do this. The
853 simpler form is derived from an early definition in @w{System V} systems
854 and therefore this function is available on SVID-derived systems. The
855 prototypes and required definitions can be found in the @file{ftw.h}
858 There are four functions in this family: @code{ftw}, @code{nftw} and
859 their 64-bit counterparts @code{ftw64} and @code{nftw64}. These
860 functions take as one of their arguments a pointer to a callback
861 function of the appropriate type.
865 @deftp {Data Type} __ftw_func_t
868 int (*) (const char *, const struct stat *, int)
871 The type of callback functions given to the @code{ftw} function. The
872 first parameter points to the file name, the second parameter to an
873 object of type @code{struct stat} which is filled in for the file named
874 in the first parameter.
877 The last parameter is a flag giving more information about the current
878 file. It can have the following values:
882 The item is either a normal file or a file which does not fit into one
883 of the following categories. This could be special files, sockets etc.
885 The item is a directory.
887 The @code{stat} call failed and so the information pointed to by the
888 second parameter is invalid.
890 The item is a directory which cannot be read.
892 The item is a symbolic link. Since symbolic links are normally followed
893 seeing this value in a @code{ftw} callback function means the referenced
894 file does not exist. The situation for @code{nftw} is different.
896 This value is only available if the program is compiled with
897 @code{_XOPEN_EXTENDED} defined before including
898 the first header. The original SVID systems do not have symbolic links.
901 If the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
902 type is in fact @code{__ftw64_func_t} since this mode changes
903 @code{struct stat} to be @code{struct stat64}.
906 For the LFS interface and for use in the function @code{ftw64}, the
907 header @file{ftw.h} defines another function type.
911 @deftp {Data Type} __ftw64_func_t
914 int (*) (const char *, const struct stat64 *, int)
917 This type is used just like @code{__ftw_func_t} for the callback
918 function, but this time is called from @code{ftw64}. The second
919 parameter to the function is a pointer to a variable of type
920 @code{struct stat64} which is able to represent the larger values.
925 @deftp {Data Type} __nftw_func_t
928 int (*) (const char *, const struct stat *, int, struct FTW *)
933 The first three arguments are the same as for the @code{__ftw_func_t}
934 type. However for the third argument some additional values are defined
935 to allow finer differentiation:
938 The current item is a directory and all subdirectories have already been
939 visited and reported. This flag is returned instead of @code{FTW_D} if
940 the @code{FTW_DEPTH} flag is passed to @code{nftw} (see below).
942 The current item is a stale symbolic link. The file it points to does
946 The last parameter of the callback function is a pointer to a structure
947 with some extra information as described below.
949 If the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
950 type is in fact @code{__nftw64_func_t} since this mode changes
951 @code{struct stat} to be @code{struct stat64}.
954 For the LFS interface there is also a variant of this data type
955 available which has to be used with the @code{nftw64} function.
959 @deftp {Data Type} __nftw64_func_t
962 int (*) (const char *, const struct stat64 *, int, struct FTW *)
965 This type is used just like @code{__nftw_func_t} for the callback
966 function, but this time is called from @code{nftw64}. The second
967 parameter to the function is this time a pointer to a variable of type
968 @code{struct stat64} which is able to represent the larger values.
973 @deftp {Data Type} {struct FTW}
974 The information contained in this structure helps in interpreting the
975 name parameter and gives some information about the current state of the
976 traversal of the directory hierarchy.
980 The value is the offset into the string passed in the first parameter to
981 the callback function of the beginning of the file name. The rest of
982 the string is the path of the file. This information is especially
983 important if the @code{FTW_CHDIR} flag was set in calling @code{nftw}
984 since then the current directory is the one the current item is found
987 Whilst processing, the code tracks how many directories down it has gone
988 to find the current file. This nesting level starts at @math{0} for
989 files in the initial directory (or is zero for the initial file if a
997 @deftypefun int ftw (const char *@var{filename}, __ftw_func_t @var{func}, int @var{descriptors})
998 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
999 @c see nftw for safety details
1000 The @code{ftw} function calls the callback function given in the
1001 parameter @var{func} for every item which is found in the directory
1002 specified by @var{filename} and all directories below. The function
1003 follows symbolic links if necessary but does not process an item twice.
1004 If @var{filename} is not a directory then it itself is the only object
1005 returned to the callback function.
1007 The file name passed to the callback function is constructed by taking
1008 the @var{filename} parameter and appending the names of all passed
1009 directories and then the local file name. So the callback function can
1010 use this parameter to access the file. @code{ftw} also calls
1011 @code{stat} for the file and passes that information on to the callback
1012 function. If this @code{stat} call was not successful the failure is
1013 indicated by setting the third argument of the callback function to
1014 @code{FTW_NS}. Otherwise it is set according to the description given
1015 in the account of @code{__ftw_func_t} above.
1017 The callback function is expected to return @math{0} to indicate that no
1018 error occurred and that processing should continue. If an error
1019 occurred in the callback function or it wants @code{ftw} to return
1020 immediately, the callback function can return a value other than
1021 @math{0}. This is the only correct way to stop the function. The
1022 program must not use @code{setjmp} or similar techniques to continue
1023 from another place. This would leave resources allocated by the
1024 @code{ftw} function unfreed.
1026 The @var{descriptors} parameter to @code{ftw} specifies how many file
1027 descriptors it is allowed to consume. The function runs faster the more
1028 descriptors it can use. For each level in the directory hierarchy at
1029 most one descriptor is used, but for very deep ones any limit on open
1030 file descriptors for the process or the system may be exceeded.
1031 Moreover, file descriptor limits in a multi-threaded program apply to
1032 all the threads as a group, and therefore it is a good idea to supply a
1033 reasonable limit to the number of open descriptors.
1035 The return value of the @code{ftw} function is @math{0} if all callback
1036 function calls returned @math{0} and all actions performed by the
1037 @code{ftw} succeeded. If a function call failed (other than calling
1038 @code{stat} on an item) the function returns @math{-1}. If a callback
1039 function returns a value other than @math{0} this value is returned as
1040 the return value of @code{ftw}.
1042 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1043 32-bit system this function is in fact @code{ftw64}, i.e., the LFS
1044 interface transparently replaces the old interface.
1049 @deftypefun int ftw64 (const char *@var{filename}, __ftw64_func_t @var{func}, int @var{descriptors})
1050 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1051 This function is similar to @code{ftw} but it can work on filesystems
1052 with large files. File information is reported using a variable of type
1053 @code{struct stat64} which is passed by reference to the callback
1056 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1057 32-bit system this function is available under the name @code{ftw} and
1058 transparently replaces the old implementation.
1063 @deftypefun int nftw (const char *@var{filename}, __nftw_func_t @var{func}, int @var{descriptors}, int @var{flag})
1064 @safety{@prelim{}@mtsafe{@mtasscwd{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{} @acscwd{}}}
1065 @c ftw_startup calls alloca, malloc, free, xstat/lxstat, tdestroy, and ftw_dir
1066 @c if FTW_CHDIR, call open, and fchdir, or chdir and getcwd
1067 @c ftw_dir calls open_dir_stream, readdir64, process_entry, closedir
1068 @c if FTW_CHDIR, also calls fchdir
1069 @c open_dir_stream calls malloc, realloc, readdir64, free, closedir,
1070 @c then openat64_not_cancel_3 and fdopendir or opendir, then dirfd.
1071 @c process_entry may cal realloc, fxstatat/lxstat/xstat, ftw_dir, and
1072 @c find_object (tsearch) and add_object (tfind).
1073 @c Since each invocation of *ftw uses its own private search tree, none
1074 @c of the search tree concurrency issues apply.
1075 The @code{nftw} function works like the @code{ftw} functions. They call
1076 the callback function @var{func} for all items found in the directory
1077 @var{filename} and below. At most @var{descriptors} file descriptors
1078 are consumed during the @code{nftw} call.
1080 One difference is that the callback function is of a different type. It
1081 is of type @w{@code{struct FTW *}} and provides the callback function
1082 with the extra information described above.
1084 A second difference is that @code{nftw} takes a fourth argument, which
1085 is @math{0} or a bitwise-OR combination of any of the following values.
1089 While traversing the directory symbolic links are not followed. Instead
1090 symbolic links are reported using the @code{FTW_SL} value for the type
1091 parameter to the callback function. If the file referenced by a
1092 symbolic link does not exist @code{FTW_SLN} is returned instead.
1094 The callback function is only called for items which are on the same
1095 mounted filesystem as the directory given by the @var{filename}
1096 parameter to @code{nftw}.
1098 If this flag is given the current working directory is changed to the
1099 directory of the reported object before the callback function is called.
1100 When @code{ntfw} finally returns the current directory is restored to
1103 If this option is specified then all subdirectories and files within
1104 them are processed before processing the top directory itself
1105 (depth-first processing). This also means the type flag given to the
1106 callback function is @code{FTW_DP} and not @code{FTW_D}.
1107 @item FTW_ACTIONRETVAL
1108 If this option is specified then return values from callbacks
1109 are handled differently. If the callback returns @code{FTW_CONTINUE},
1110 walking continues normally. @code{FTW_STOP} means walking stops
1111 and @code{FTW_STOP} is returned to the caller. If @code{FTW_SKIP_SUBTREE}
1112 is returned by the callback with @code{FTW_D} argument, the subtree
1113 is skipped and walking continues with next sibling of the directory.
1114 If @code{FTW_SKIP_SIBLINGS} is returned by the callback, all siblings
1115 of the current entry are skipped and walking continues in its parent.
1116 No other return values should be returned from the callbacks if
1117 this option is set. This option is a GNU extension.
1120 The return value is computed in the same way as for @code{ftw}.
1121 @code{nftw} returns @math{0} if no failures occurred and all callback
1122 functions returned @math{0}. In case of internal errors, such as memory
1123 problems, the return value is @math{-1} and @var{errno} is set
1124 accordingly. If the return value of a callback invocation was non-zero
1125 then that value is returned.
1127 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1128 32-bit system this function is in fact @code{nftw64}, i.e., the LFS
1129 interface transparently replaces the old interface.
1134 @deftypefun int nftw64 (const char *@var{filename}, __nftw64_func_t @var{func}, int @var{descriptors}, int @var{flag})
1135 @safety{@prelim{}@mtsafe{@mtasscwd{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{} @acscwd{}}}
1136 This function is similar to @code{nftw} but it can work on filesystems
1137 with large files. File information is reported using a variable of type
1138 @code{struct stat64} which is passed by reference to the callback
1141 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1142 32-bit system this function is available under the name @code{nftw} and
1143 transparently replaces the old implementation.
1151 @cindex multiple names for one file
1152 @cindex file names, multiple
1154 In POSIX systems, one file can have many names at the same time. All of
1155 the names are equally real, and no one of them is preferred to the
1158 To add a name to a file, use the @code{link} function. (The new name is
1159 also called a @dfn{hard link} to the file.) Creating a new link to a
1160 file does not copy the contents of the file; it simply makes a new name
1161 by which the file can be known, in addition to the file's existing name
1164 One file can have names in several directories, so the organization
1165 of the file system is not a strict hierarchy or tree.
1167 In most implementations, it is not possible to have hard links to the
1168 same file in multiple file systems. @code{link} reports an error if you
1169 try to make a hard link to the file from another file system when this
1172 The prototype for the @code{link} function is declared in the header
1173 file @file{unistd.h}.
1178 @deftypefun int link (const char *@var{oldname}, const char *@var{newname})
1179 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1180 The @code{link} function makes a new link to the existing file named by
1181 @var{oldname}, under the new name @var{newname}.
1183 This function returns a value of @code{0} if it is successful and
1184 @code{-1} on failure. In addition to the usual file name errors
1185 (@pxref{File Name Errors}) for both @var{oldname} and @var{newname}, the
1186 following @code{errno} error conditions are defined for this function:
1190 You are not allowed to write to the directory in which the new link is
1193 Some implementations also require that the existing file be accessible
1194 by the caller, and use this error to report failure for that reason.
1198 There is already a file named @var{newname}. If you want to replace
1199 this link with a new link, you must remove the old link explicitly first.
1202 There are already too many links to the file named by @var{oldname}.
1203 (The maximum number of links to a file is @w{@code{LINK_MAX}}; see
1204 @ref{Limits for Files}.)
1207 The file named by @var{oldname} doesn't exist. You can't make a link to
1208 a file that doesn't exist.
1211 The directory or file system that would contain the new link is full
1212 and cannot be extended.
1215 On @gnulinuxhurdsystems{} and some others, you cannot make links to
1217 Many systems allow only privileged users to do so. This error
1218 is used to report the problem.
1221 The directory containing the new link can't be modified because it's on
1222 a read-only file system.
1225 The directory specified in @var{newname} is on a different file system
1226 than the existing file.
1229 A hardware error occurred while trying to read or write the to filesystem.
1233 @node Symbolic Links
1234 @section Symbolic Links
1237 @cindex symbolic link
1238 @cindex link, symbolic
1240 @gnusystems{} support @dfn{soft links} or @dfn{symbolic links}. This
1241 is a kind of ``file'' that is essentially a pointer to another file
1242 name. Unlike hard links, symbolic links can be made to directories or
1243 across file systems with no restrictions. You can also make a symbolic
1244 link to a name which is not the name of any file. (Opening this link
1245 will fail until a file by that name is created.) Likewise, if the
1246 symbolic link points to an existing file which is later deleted, the
1247 symbolic link continues to point to the same file name even though the
1248 name no longer names any file.
1250 The reason symbolic links work the way they do is that special things
1251 happen when you try to open the link. The @code{open} function realizes
1252 you have specified the name of a link, reads the file name contained in
1253 the link, and opens that file name instead. The @code{stat} function
1254 likewise operates on the file that the symbolic link points to, instead
1255 of on the link itself.
1257 By contrast, other operations such as deleting or renaming the file
1258 operate on the link itself. The functions @code{readlink} and
1259 @code{lstat} also refrain from following symbolic links, because their
1260 purpose is to obtain information about the link. @code{link}, the
1261 function that makes a hard link, does too. It makes a hard link to the
1262 symbolic link, which one rarely wants.
1264 Some systems have for some functions operating on files have a limit on
1265 how many symbolic links are followed when resolving a path name. The
1266 limit if it exists is published in the @file{sys/param.h} header file.
1268 @comment sys/param.h
1270 @deftypevr Macro int MAXSYMLINKS
1272 The macro @code{MAXSYMLINKS} specifies how many symlinks some function
1273 will follow before returning @code{ELOOP}. Not all functions behave the
1274 same and this value is not the same a that returned for
1275 @code{_SC_SYMLOOP} by @code{sysconf}. In fact, the @code{sysconf}
1276 result can indicate that there is no fixed limit although
1277 @code{MAXSYMLINKS} exists and has a finite value.
1280 Prototypes for most of the functions listed in this section are in
1286 @deftypefun int symlink (const char *@var{oldname}, const char *@var{newname})
1287 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1288 The @code{symlink} function makes a symbolic link to @var{oldname} named
1291 The normal return value from @code{symlink} is @code{0}. A return value
1292 of @code{-1} indicates an error. In addition to the usual file name
1293 syntax errors (@pxref{File Name Errors}), the following @code{errno}
1294 error conditions are defined for this function:
1298 There is already an existing file named @var{newname}.
1301 The file @var{newname} would exist on a read-only file system.
1304 The directory or file system cannot be extended to make the new link.
1307 A hardware error occurred while reading or writing data on the disk.
1310 @comment not sure about these
1312 There are too many levels of indirection. This can be the result of
1313 circular symbolic links to directories.
1316 The new link can't be created because the user's disk quota has been
1324 @deftypefun ssize_t readlink (const char *@var{filename}, char *@var{buffer}, size_t @var{size})
1325 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1326 The @code{readlink} function gets the value of the symbolic link
1327 @var{filename}. The file name that the link points to is copied into
1328 @var{buffer}. This file name string is @emph{not} null-terminated;
1329 @code{readlink} normally returns the number of characters copied. The
1330 @var{size} argument specifies the maximum number of characters to copy,
1331 usually the allocation size of @var{buffer}.
1333 If the return value equals @var{size}, you cannot tell whether or not
1334 there was room to return the entire name. So make a bigger buffer and
1335 call @code{readlink} again. Here is an example:
1339 readlink_malloc (const char *filename)
1342 char *buffer = NULL;
1346 buffer = (char *) xrealloc (buffer, size);
1347 int nchars = readlink (filename, buffer, size);
1360 @c @group Invalid outside example.
1361 A value of @code{-1} is returned in case of error. In addition to the
1362 usual file name errors (@pxref{File Name Errors}), the following
1363 @code{errno} error conditions are defined for this function:
1367 The named file is not a symbolic link.
1370 A hardware error occurred while reading or writing data on the disk.
1375 In some situations it is desirable to resolve all the
1376 symbolic links to get the real
1377 name of a file where no prefix names a symbolic link which is followed
1378 and no filename in the path is @code{.} or @code{..}. This is for
1379 instance desirable if files have to be compare in which case different
1380 names can refer to the same inode.
1384 @deftypefun {char *} canonicalize_file_name (const char *@var{name})
1385 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1388 The @code{canonicalize_file_name} function returns the absolute name of
1389 the file named by @var{name} which contains no @code{.}, @code{..}
1390 components nor any repeated path separators (@code{/}) or symlinks. The
1391 result is passed back as the return value of the function in a block of
1392 memory allocated with @code{malloc}. If the result is not used anymore
1393 the memory should be freed with a call to @code{free}.
1395 If any of the path components is missing the function returns a NULL
1396 pointer. This is also what is returned if the length of the path
1397 reaches or exceeds @code{PATH_MAX} characters. In any case
1398 @code{errno} is set accordingly.
1402 The resulting path is too long. This error only occurs on systems which
1403 have a limit on the file name length.
1406 At least one of the path components is not readable.
1409 The input file name is empty.
1412 At least one of the path components does not exist.
1415 More than @code{MAXSYMLINKS} many symlinks have been followed.
1418 This function is a GNU extension and is declared in @file{stdlib.h}.
1421 The Unix standard includes a similar function which differs from
1422 @code{canonicalize_file_name} in that the user has to provide the buffer
1423 where the result is placed in.
1427 @deftypefun {char *} realpath (const char *restrict @var{name}, char *restrict @var{resolved})
1428 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1429 @c Calls malloc, realloc, getcwd, lxstat64, readlink, alloca.
1431 A call to @code{realpath} where the @var{resolved} parameter is
1432 @code{NULL} behaves exactly like @code{canonicalize_file_name}. The
1433 function allocates a buffer for the file name and returns a pointer to
1434 it. If @var{resolved} is not @code{NULL} it points to a buffer into
1435 which the result is copied. It is the callers responsibility to
1436 allocate a buffer which is large enough. On systems which define
1437 @code{PATH_MAX} this means the buffer must be large enough for a
1438 pathname of this size. For systems without limitations on the pathname
1439 length the requirement cannot be met and programs should not call
1440 @code{realpath} with anything but @code{NULL} for the second parameter.
1442 One other difference is that the buffer @var{resolved} (if nonzero) will
1443 contain the part of the path component which does not exist or is not
1444 readable if the function returns @code{NULL} and @code{errno} is set to
1445 @code{EACCES} or @code{ENOENT}.
1447 This function is declared in @file{stdlib.h}.
1450 The advantage of using this function is that it is more widely
1451 available. The drawback is that it reports failures for long path on
1452 systems which have no limits on the file name length.
1454 @node Deleting Files
1455 @section Deleting Files
1456 @cindex deleting a file
1457 @cindex removing a file
1458 @cindex unlinking a file
1460 You can delete a file with @code{unlink} or @code{remove}.
1462 Deletion actually deletes a file name. If this is the file's only name,
1463 then the file is deleted as well. If the file has other remaining names
1464 (@pxref{Hard Links}), it remains accessible under those names.
1468 @deftypefun int unlink (const char *@var{filename})
1469 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1470 The @code{unlink} function deletes the file name @var{filename}. If
1471 this is a file's sole name, the file itself is also deleted. (Actually,
1472 if any process has the file open when this happens, deletion is
1473 postponed until all processes have closed the file.)
1476 The function @code{unlink} is declared in the header file @file{unistd.h}.
1478 This function returns @code{0} on successful completion, and @code{-1}
1479 on error. In addition to the usual file name errors
1480 (@pxref{File Name Errors}), the following @code{errno} error conditions are
1481 defined for this function:
1485 Write permission is denied for the directory from which the file is to be
1486 removed, or the directory has the sticky bit set and you do not own the file.
1489 This error indicates that the file is being used by the system in such a
1490 way that it can't be unlinked. For example, you might see this error if
1491 the file name specifies the root directory or a mount point for a file
1495 The file name to be deleted doesn't exist.
1498 On some systems @code{unlink} cannot be used to delete the name of a
1499 directory, or at least can only be used this way by a privileged user.
1500 To avoid such problems, use @code{rmdir} to delete directories. (On
1501 @gnulinuxhurdsystems{} @code{unlink} can never delete the name of a directory.)
1504 The directory containing the file name to be deleted is on a read-only
1505 file system and can't be modified.
1511 @deftypefun int rmdir (const char *@var{filename})
1512 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1513 @cindex directories, deleting
1514 @cindex deleting a directory
1515 The @code{rmdir} function deletes a directory. The directory must be
1516 empty before it can be removed; in other words, it can only contain
1517 entries for @file{.} and @file{..}.
1519 In most other respects, @code{rmdir} behaves like @code{unlink}. There
1520 are two additional @code{errno} error conditions defined for
1526 The directory to be deleted is not empty.
1529 These two error codes are synonymous; some systems use one, and some use
1530 the other. @gnulinuxhurdsystems{} always use @code{ENOTEMPTY}.
1532 The prototype for this function is declared in the header file
1539 @deftypefun int remove (const char *@var{filename})
1540 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1541 @c Calls unlink and rmdir.
1542 This is the @w{ISO C} function to remove a file. It works like
1543 @code{unlink} for files and like @code{rmdir} for directories.
1544 @code{remove} is declared in @file{stdio.h}.
1548 @node Renaming Files
1549 @section Renaming Files
1551 The @code{rename} function is used to change a file's name.
1553 @cindex renaming a file
1556 @deftypefun int rename (const char *@var{oldname}, const char *@var{newname})
1557 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1558 @c In the absence of a rename syscall, there's an emulation with link
1559 @c and unlink, but it's racy, even more so if newname exists and is
1561 The @code{rename} function renames the file @var{oldname} to
1562 @var{newname}. The file formerly accessible under the name
1563 @var{oldname} is afterwards accessible as @var{newname} instead. (If
1564 the file had any other names aside from @var{oldname}, it continues to
1567 The directory containing the name @var{newname} must be on the same file
1568 system as the directory containing the name @var{oldname}.
1570 One special case for @code{rename} is when @var{oldname} and
1571 @var{newname} are two names for the same file. The consistent way to
1572 handle this case is to delete @var{oldname}. However, in this case
1573 POSIX requires that @code{rename} do nothing and report success---which
1574 is inconsistent. We don't know what your operating system will do.
1576 If @var{oldname} is not a directory, then any existing file named
1577 @var{newname} is removed during the renaming operation. However, if
1578 @var{newname} is the name of a directory, @code{rename} fails in this
1581 If @var{oldname} is a directory, then either @var{newname} must not
1582 exist or it must name a directory that is empty. In the latter case,
1583 the existing directory named @var{newname} is deleted first. The name
1584 @var{newname} must not specify a subdirectory of the directory
1585 @code{oldname} which is being renamed.
1587 One useful feature of @code{rename} is that the meaning of @var{newname}
1588 changes ``atomically'' from any previously existing file by that name to
1589 its new meaning (i.e., the file that was called @var{oldname}). There is
1590 no instant at which @var{newname} is non-existent ``in between'' the old
1591 meaning and the new meaning. If there is a system crash during the
1592 operation, it is possible for both names to still exist; but
1593 @var{newname} will always be intact if it exists at all.
1595 If @code{rename} fails, it returns @code{-1}. In addition to the usual
1596 file name errors (@pxref{File Name Errors}), the following
1597 @code{errno} error conditions are defined for this function:
1601 One of the directories containing @var{newname} or @var{oldname}
1602 refuses write permission; or @var{newname} and @var{oldname} are
1603 directories and write permission is refused for one of them.
1606 A directory named by @var{oldname} or @var{newname} is being used by
1607 the system in a way that prevents the renaming from working. This includes
1608 directories that are mount points for filesystems, and directories
1609 that are the current working directories of processes.
1613 The directory @var{newname} isn't empty. @gnulinuxhurdsystems{} always return
1614 @code{ENOTEMPTY} for this, but some other systems return @code{EEXIST}.
1617 @var{oldname} is a directory that contains @var{newname}.
1620 @var{newname} is a directory but the @var{oldname} isn't.
1623 The parent directory of @var{newname} would have too many links
1627 The file @var{oldname} doesn't exist.
1630 The directory that would contain @var{newname} has no room for another
1631 entry, and there is no space left in the file system to expand it.
1634 The operation would involve writing to a directory on a read-only file
1638 The two file names @var{newname} and @var{oldname} are on different
1643 @node Creating Directories
1644 @section Creating Directories
1645 @cindex creating a directory
1646 @cindex directories, creating
1649 Directories are created with the @code{mkdir} function. (There is also
1650 a shell command @code{mkdir} which does the same thing.)
1655 @deftypefun int mkdir (const char *@var{filename}, mode_t @var{mode})
1656 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1657 The @code{mkdir} function creates a new, empty directory with name
1660 The argument @var{mode} specifies the file permissions for the new
1661 directory file. @xref{Permission Bits}, for more information about
1664 A return value of @code{0} indicates successful completion, and
1665 @code{-1} indicates failure. In addition to the usual file name syntax
1666 errors (@pxref{File Name Errors}), the following @code{errno} error
1667 conditions are defined for this function:
1671 Write permission is denied for the parent directory in which the new
1672 directory is to be added.
1675 A file named @var{filename} already exists.
1678 The parent directory has too many links (entries).
1680 Well-designed file systems never report this error, because they permit
1681 more links than your disk could possibly hold. However, you must still
1682 take account of the possibility of this error, as it could result from
1683 network access to a file system on another machine.
1686 The file system doesn't have enough room to create the new directory.
1689 The parent directory of the directory being created is on a read-only
1690 file system and cannot be modified.
1693 To use this function, your program should include the header file
1698 @node File Attributes
1699 @section File Attributes
1702 When you issue an @samp{ls -l} shell command on a file, it gives you
1703 information about the size of the file, who owns it, when it was last
1704 modified, etc. These are called the @dfn{file attributes}, and are
1705 associated with the file itself and not a particular one of its names.
1707 This section contains information about how you can inquire about and
1708 modify the attributes of a file.
1711 * Attribute Meanings:: The names of the file attributes,
1712 and what their values mean.
1713 * Reading Attributes:: How to read the attributes of a file.
1714 * Testing File Type:: Distinguishing ordinary files,
1715 directories, links@dots{}
1716 * File Owner:: How ownership for new files is determined,
1717 and how to change it.
1718 * Permission Bits:: How information about a file's access
1720 * Access Permission:: How the system decides who can access a file.
1721 * Setting Permissions:: How permissions for new files are assigned,
1722 and how to change them.
1723 * Testing File Access:: How to find out if your process can
1725 * File Times:: About the time attributes of a file.
1726 * File Size:: Manually changing the size of a file.
1727 * Storage Allocation:: Allocate backing storage for files.
1730 @node Attribute Meanings
1731 @subsection The meaning of the File Attributes
1732 @cindex status of a file
1733 @cindex attributes of a file
1734 @cindex file attributes
1736 When you read the attributes of a file, they come back in a structure
1737 called @code{struct stat}. This section describes the names of the
1738 attributes, their data types, and what they mean. For the functions
1739 to read the attributes of a file, see @ref{Reading Attributes}.
1741 The header file @file{sys/stat.h} declares all the symbols defined
1747 @deftp {Data Type} {struct stat}
1748 The @code{stat} structure type is used to return information about the
1749 attributes of a file. It contains at least the following members:
1752 @item mode_t st_mode
1753 Specifies the mode of the file. This includes file type information
1754 (@pxref{Testing File Type}) and the file permission bits
1755 (@pxref{Permission Bits}).
1758 The file serial number, which distinguishes this file from all other
1759 files on the same device.
1762 Identifies the device containing the file. The @code{st_ino} and
1763 @code{st_dev}, taken together, uniquely identify the file. The
1764 @code{st_dev} value is not necessarily consistent across reboots or
1765 system crashes, however.
1767 @item nlink_t st_nlink
1768 The number of hard links to the file. This count keeps track of how
1769 many directories have entries for this file. If the count is ever
1770 decremented to zero, then the file itself is discarded as soon as no
1771 process still holds it open. Symbolic links are not counted in the
1775 The user ID of the file's owner. @xref{File Owner}.
1778 The group ID of the file. @xref{File Owner}.
1781 This specifies the size of a regular file in bytes. For files that are
1782 really devices this field isn't usually meaningful. For symbolic links
1783 this specifies the length of the file name the link refers to.
1785 @item time_t st_atime
1786 This is the last access time for the file. @xref{File Times}.
1788 @item unsigned long int st_atime_usec
1789 This is the fractional part of the last access time for the file.
1792 @item time_t st_mtime
1793 This is the time of the last modification to the contents of the file.
1796 @item unsigned long int st_mtime_usec
1797 This is the fractional part of the time of the last modification to the
1798 contents of the file. @xref{File Times}.
1800 @item time_t st_ctime
1801 This is the time of the last modification to the attributes of the file.
1804 @item unsigned long int st_ctime_usec
1805 This is the fractional part of the time of the last modification to the
1806 attributes of the file. @xref{File Times}.
1809 @item blkcnt_t st_blocks
1810 This is the amount of disk space that the file occupies, measured in
1811 units of 512-byte blocks.
1813 The number of disk blocks is not strictly proportional to the size of
1814 the file, for two reasons: the file system may use some blocks for
1815 internal record keeping; and the file may be sparse---it may have
1816 ``holes'' which contain zeros but do not actually take up space on the
1819 You can tell (approximately) whether a file is sparse by comparing this
1820 value with @code{st_size}, like this:
1823 (st.st_blocks * 512 < st.st_size)
1826 This test is not perfect because a file that is just slightly sparse
1827 might not be detected as sparse at all. For practical applications,
1828 this is not a problem.
1830 @item unsigned int st_blksize
1831 The optimal block size for reading of writing this file, in bytes. You
1832 might use this size for allocating the buffer space for reading of
1833 writing the file. (This is unrelated to @code{st_blocks}.)
1837 The extensions for the Large File Support (LFS) require, even on 32-bit
1838 machines, types which can handle file sizes up to @twoexp{63}.
1839 Therefore a new definition of @code{struct stat} is necessary.
1843 @deftp {Data Type} {struct stat64}
1844 The members of this type are the same and have the same names as those
1845 in @code{struct stat}. The only difference is that the members
1846 @code{st_ino}, @code{st_size}, and @code{st_blocks} have a different
1847 type to support larger values.
1850 @item mode_t st_mode
1851 Specifies the mode of the file. This includes file type information
1852 (@pxref{Testing File Type}) and the file permission bits
1853 (@pxref{Permission Bits}).
1855 @item ino64_t st_ino
1856 The file serial number, which distinguishes this file from all other
1857 files on the same device.
1860 Identifies the device containing the file. The @code{st_ino} and
1861 @code{st_dev}, taken together, uniquely identify the file. The
1862 @code{st_dev} value is not necessarily consistent across reboots or
1863 system crashes, however.
1865 @item nlink_t st_nlink
1866 The number of hard links to the file. This count keeps track of how
1867 many directories have entries for this file. If the count is ever
1868 decremented to zero, then the file itself is discarded as soon as no
1869 process still holds it open. Symbolic links are not counted in the
1873 The user ID of the file's owner. @xref{File Owner}.
1876 The group ID of the file. @xref{File Owner}.
1878 @item off64_t st_size
1879 This specifies the size of a regular file in bytes. For files that are
1880 really devices this field isn't usually meaningful. For symbolic links
1881 this specifies the length of the file name the link refers to.
1883 @item time_t st_atime
1884 This is the last access time for the file. @xref{File Times}.
1886 @item unsigned long int st_atime_usec
1887 This is the fractional part of the last access time for the file.
1890 @item time_t st_mtime
1891 This is the time of the last modification to the contents of the file.
1894 @item unsigned long int st_mtime_usec
1895 This is the fractional part of the time of the last modification to the
1896 contents of the file. @xref{File Times}.
1898 @item time_t st_ctime
1899 This is the time of the last modification to the attributes of the file.
1902 @item unsigned long int st_ctime_usec
1903 This is the fractional part of the time of the last modification to the
1904 attributes of the file. @xref{File Times}.
1907 @item blkcnt64_t st_blocks
1908 This is the amount of disk space that the file occupies, measured in
1909 units of 512-byte blocks.
1911 @item unsigned int st_blksize
1912 The optimal block size for reading of writing this file, in bytes. You
1913 might use this size for allocating the buffer space for reading of
1914 writing the file. (This is unrelated to @code{st_blocks}.)
1918 Some of the file attributes have special data type names which exist
1919 specifically for those attributes. (They are all aliases for well-known
1920 integer types that you know and love.) These typedef names are defined
1921 in the header file @file{sys/types.h} as well as in @file{sys/stat.h}.
1922 Here is a list of them.
1924 @comment sys/types.h
1926 @deftp {Data Type} mode_t
1927 This is an integer data type used to represent file modes. In
1928 @theglibc{}, this is an unsigned type no narrower than @code{unsigned
1932 @cindex inode number
1933 @comment sys/types.h
1935 @deftp {Data Type} ino_t
1936 This is an unsigned integer type used to represent file serial numbers.
1937 (In Unix jargon, these are sometimes called @dfn{inode numbers}.)
1938 In @theglibc{}, this type is no narrower than @code{unsigned int}.
1940 If the source is compiled with @code{_FILE_OFFSET_BITS == 64} this type
1941 is transparently replaced by @code{ino64_t}.
1944 @comment sys/types.h
1946 @deftp {Data Type} ino64_t
1947 This is an unsigned integer type used to represent file serial numbers
1948 for the use in LFS. In @theglibc{}, this type is no narrower than
1949 @code{unsigned int}.
1951 When compiling with @code{_FILE_OFFSET_BITS == 64} this type is
1952 available under the name @code{ino_t}.
1955 @comment sys/types.h
1957 @deftp {Data Type} dev_t
1958 This is an arithmetic data type used to represent file device numbers.
1959 In @theglibc{}, this is an integer type no narrower than @code{int}.
1962 @comment sys/types.h
1964 @deftp {Data Type} nlink_t
1965 This is an integer type used to represent file link counts.
1968 @comment sys/types.h
1970 @deftp {Data Type} blkcnt_t
1971 This is a signed integer type used to represent block counts.
1972 In @theglibc{}, this type is no narrower than @code{int}.
1974 If the source is compiled with @code{_FILE_OFFSET_BITS == 64} this type
1975 is transparently replaced by @code{blkcnt64_t}.
1978 @comment sys/types.h
1980 @deftp {Data Type} blkcnt64_t
1981 This is a signed integer type used to represent block counts for the
1982 use in LFS. In @theglibc{}, this type is no narrower than @code{int}.
1984 When compiling with @code{_FILE_OFFSET_BITS == 64} this type is
1985 available under the name @code{blkcnt_t}.
1988 @node Reading Attributes
1989 @subsection Reading the Attributes of a File
1991 To examine the attributes of files, use the functions @code{stat},
1992 @code{fstat} and @code{lstat}. They return the attribute information in
1993 a @code{struct stat} object. All three functions are declared in the
1994 header file @file{sys/stat.h}.
1998 @deftypefun int stat (const char *@var{filename}, struct stat *@var{buf})
1999 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2000 The @code{stat} function returns information about the attributes of the
2001 file named by @w{@var{filename}} in the structure pointed to by @var{buf}.
2003 If @var{filename} is the name of a symbolic link, the attributes you get
2004 describe the file that the link points to. If the link points to a
2005 nonexistent file name, then @code{stat} fails reporting a nonexistent
2008 The return value is @code{0} if the operation is successful, or
2009 @code{-1} on failure. In addition to the usual file name errors
2010 (@pxref{File Name Errors}, the following @code{errno} error conditions
2011 are defined for this function:
2015 The file named by @var{filename} doesn't exist.
2018 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2019 function is in fact @code{stat64} since the LFS interface transparently
2020 replaces the normal implementation.
2025 @deftypefun int stat64 (const char *@var{filename}, struct stat64 *@var{buf})
2026 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2027 This function is similar to @code{stat} but it is also able to work on
2028 files larger than @twoexp{31} bytes on 32-bit systems. To be able to do
2029 this the result is stored in a variable of type @code{struct stat64} to
2030 which @var{buf} must point.
2032 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2033 function is available under the name @code{stat} and so transparently
2034 replaces the interface for small files on 32-bit machines.
2039 @deftypefun int fstat (int @var{filedes}, struct stat *@var{buf})
2040 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2041 The @code{fstat} function is like @code{stat}, except that it takes an
2042 open file descriptor as an argument instead of a file name.
2043 @xref{Low-Level I/O}.
2045 Like @code{stat}, @code{fstat} returns @code{0} on success and @code{-1}
2046 on failure. The following @code{errno} error conditions are defined for
2051 The @var{filedes} argument is not a valid file descriptor.
2054 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2055 function is in fact @code{fstat64} since the LFS interface transparently
2056 replaces the normal implementation.
2061 @deftypefun int fstat64 (int @var{filedes}, struct stat64 *@var{buf})
2062 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2063 This function is similar to @code{fstat} but is able to work on large
2064 files on 32-bit platforms. For large files the file descriptor
2065 @var{filedes} should be obtained by @code{open64} or @code{creat64}.
2066 The @var{buf} pointer points to a variable of type @code{struct stat64}
2067 which is able to represent the larger values.
2069 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2070 function is available under the name @code{fstat} and so transparently
2071 replaces the interface for small files on 32-bit machines.
2074 @c fstatat will call alloca and snprintf if the syscall is not
2076 @c @safety{@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
2080 @deftypefun int lstat (const char *@var{filename}, struct stat *@var{buf})
2081 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2082 @c Direct system call through lxstat, sometimes with an xstat conv call
2084 The @code{lstat} function is like @code{stat}, except that it does not
2085 follow symbolic links. If @var{filename} is the name of a symbolic
2086 link, @code{lstat} returns information about the link itself; otherwise
2087 @code{lstat} works like @code{stat}. @xref{Symbolic Links}.
2089 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2090 function is in fact @code{lstat64} since the LFS interface transparently
2091 replaces the normal implementation.
2096 @deftypefun int lstat64 (const char *@var{filename}, struct stat64 *@var{buf})
2097 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2098 @c Direct system call through lxstat64, sometimes with an xstat conv
2100 This function is similar to @code{lstat} but it is also able to work on
2101 files larger than @twoexp{31} bytes on 32-bit systems. To be able to do
2102 this the result is stored in a variable of type @code{struct stat64} to
2103 which @var{buf} must point.
2105 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2106 function is available under the name @code{lstat} and so transparently
2107 replaces the interface for small files on 32-bit machines.
2110 @node Testing File Type
2111 @subsection Testing the Type of a File
2113 The @dfn{file mode}, stored in the @code{st_mode} field of the file
2114 attributes, contains two kinds of information: the file type code, and
2115 the access permission bits. This section discusses only the type code,
2116 which you can use to tell whether the file is a directory, socket,
2117 symbolic link, and so on. For details about access permissions see
2118 @ref{Permission Bits}.
2120 There are two ways you can access the file type information in a file
2121 mode. Firstly, for each file type there is a @dfn{predicate macro}
2122 which examines a given file mode and returns whether it is of that type
2123 or not. Secondly, you can mask out the rest of the file mode to leave
2124 just the file type code, and compare this against constants for each of
2125 the supported file types.
2127 All of the symbols listed in this section are defined in the header file
2131 The following predicate macros test the type of a file, given the value
2132 @var{m} which is the @code{st_mode} field returned by @code{stat} on
2137 @deftypefn Macro int S_ISDIR (mode_t @var{m})
2138 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2139 This macro returns non-zero if the file is a directory.
2144 @deftypefn Macro int S_ISCHR (mode_t @var{m})
2145 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2146 This macro returns non-zero if the file is a character special file (a
2147 device like a terminal).
2152 @deftypefn Macro int S_ISBLK (mode_t @var{m})
2153 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2154 This macro returns non-zero if the file is a block special file (a device
2160 @deftypefn Macro int S_ISREG (mode_t @var{m})
2161 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2162 This macro returns non-zero if the file is a regular file.
2167 @deftypefn Macro int S_ISFIFO (mode_t @var{m})
2168 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2169 This macro returns non-zero if the file is a FIFO special file, or a
2170 pipe. @xref{Pipes and FIFOs}.
2175 @deftypefn Macro int S_ISLNK (mode_t @var{m})
2176 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2177 This macro returns non-zero if the file is a symbolic link.
2178 @xref{Symbolic Links}.
2183 @deftypefn Macro int S_ISSOCK (mode_t @var{m})
2184 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2185 This macro returns non-zero if the file is a socket. @xref{Sockets}.
2188 An alternate non-POSIX method of testing the file type is supported for
2189 compatibility with BSD. The mode can be bitwise AND-ed with
2190 @code{S_IFMT} to extract the file type code, and compared to the
2191 appropriate constant. For example,
2194 S_ISCHR (@var{mode})
2201 ((@var{mode} & S_IFMT) == S_IFCHR)
2206 @deftypevr Macro int S_IFMT
2207 This is a bit mask used to extract the file type code from a mode value.
2210 These are the symbolic names for the different file type codes:
2217 This is the file type constant of a directory file.
2223 This is the file type constant of a character-oriented device file.
2229 This is the file type constant of a block-oriented device file.
2235 This is the file type constant of a regular file.
2241 This is the file type constant of a symbolic link.
2247 This is the file type constant of a socket.
2253 This is the file type constant of a FIFO or pipe.
2256 The POSIX.1b standard introduced a few more objects which possibly can
2257 be implemented as object in the filesystem. These are message queues,
2258 semaphores, and shared memory objects. To allow differentiating these
2259 objects from other files the POSIX standard introduces three new test
2260 macros. But unlike the other macros it does not take the value of the
2261 @code{st_mode} field as the parameter. Instead they expect a pointer to
2262 the whole @code{struct stat} structure.
2266 @deftypefn Macro int S_TYPEISMQ (struct stat *@var{s})
2267 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2268 If the system implement POSIX message queues as distinct objects and the
2269 file is a message queue object, this macro returns a non-zero value.
2270 In all other cases the result is zero.
2275 @deftypefn Macro int S_TYPEISSEM (struct stat *@var{s})
2276 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2277 If the system implement POSIX semaphores as distinct objects and the
2278 file is a semaphore object, this macro returns a non-zero value.
2279 In all other cases the result is zero.
2284 @deftypefn Macro int S_TYPEISSHM (struct stat *@var{s})
2285 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2286 If the system implement POSIX shared memory objects as distinct objects
2287 and the file is a shared memory object, this macro returns a non-zero
2288 value. In all other cases the result is zero.
2292 @subsection File Owner
2294 @cindex owner of a file
2295 @cindex group owner of a file
2297 Every file has an @dfn{owner} which is one of the registered user names
2298 defined on the system. Each file also has a @dfn{group} which is one of
2299 the defined groups. The file owner can often be useful for showing you
2300 who edited the file (especially when you edit with GNU Emacs), but its
2301 main purpose is for access control.
2303 The file owner and group play a role in determining access because the
2304 file has one set of access permission bits for the owner, another set
2305 that applies to users who belong to the file's group, and a third set of
2306 bits that applies to everyone else. @xref{Access Permission}, for the
2307 details of how access is decided based on this data.
2309 When a file is created, its owner is set to the effective user ID of the
2310 process that creates it (@pxref{Process Persona}). The file's group ID
2311 may be set to either the effective group ID of the process, or the group
2312 ID of the directory that contains the file, depending on the system
2313 where the file is stored. When you access a remote file system, it
2314 behaves according to its own rules, not according to the system your
2315 program is running on. Thus, your program must be prepared to encounter
2316 either kind of behavior no matter what kind of system you run it on.
2320 You can change the owner and/or group owner of an existing file using
2321 the @code{chown} function. This is the primitive for the @code{chown}
2322 and @code{chgrp} shell commands.
2325 The prototype for this function is declared in @file{unistd.h}.
2329 @deftypefun int chown (const char *@var{filename}, uid_t @var{owner}, gid_t @var{group})
2330 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2331 The @code{chown} function changes the owner of the file @var{filename} to
2332 @var{owner}, and its group owner to @var{group}.
2334 Changing the owner of the file on certain systems clears the set-user-ID
2335 and set-group-ID permission bits. (This is because those bits may not
2336 be appropriate for the new owner.) Other file permission bits are not
2339 The return value is @code{0} on success and @code{-1} on failure.
2340 In addition to the usual file name errors (@pxref{File Name Errors}),
2341 the following @code{errno} error conditions are defined for this function:
2345 This process lacks permission to make the requested change.
2347 Only privileged users or the file's owner can change the file's group.
2348 On most file systems, only privileged users can change the file owner;
2349 some file systems allow you to change the owner if you are currently the
2350 owner. When you access a remote file system, the behavior you encounter
2351 is determined by the system that actually holds the file, not by the
2352 system your program is running on.
2354 @xref{Options for Files}, for information about the
2355 @code{_POSIX_CHOWN_RESTRICTED} macro.
2358 The file is on a read-only file system.
2364 @deftypefun int fchown (int @var{filedes}, uid_t @var{owner}, gid_t @var{group})
2365 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2366 This is like @code{chown}, except that it changes the owner of the open
2367 file with descriptor @var{filedes}.
2369 The return value from @code{fchown} is @code{0} on success and @code{-1}
2370 on failure. The following @code{errno} error codes are defined for this
2375 The @var{filedes} argument is not a valid file descriptor.
2378 The @var{filedes} argument corresponds to a pipe or socket, not an ordinary
2382 This process lacks permission to make the requested change. For details
2383 see @code{chmod} above.
2386 The file resides on a read-only file system.
2390 @node Permission Bits
2391 @subsection The Mode Bits for Access Permission
2393 The @dfn{file mode}, stored in the @code{st_mode} field of the file
2394 attributes, contains two kinds of information: the file type code, and
2395 the access permission bits. This section discusses only the access
2396 permission bits, which control who can read or write the file.
2397 @xref{Testing File Type}, for information about the file type code.
2399 All of the symbols listed in this section are defined in the header file
2403 @cindex file permission bits
2404 These symbolic constants are defined for the file mode bits that control
2405 access permission for the file:
2416 Read permission bit for the owner of the file. On many systems this bit
2417 is 0400. @code{S_IREAD} is an obsolete synonym provided for BSD
2428 Write permission bit for the owner of the file. Usually 0200.
2429 @w{@code{S_IWRITE}} is an obsolete synonym provided for BSD compatibility.
2439 Execute (for ordinary files) or search (for directories) permission bit
2440 for the owner of the file. Usually 0100. @code{S_IEXEC} is an obsolete
2441 synonym provided for BSD compatibility.
2447 This is equivalent to @samp{(S_IRUSR | S_IWUSR | S_IXUSR)}.
2453 Read permission bit for the group owner of the file. Usually 040.
2459 Write permission bit for the group owner of the file. Usually 020.
2465 Execute or search permission bit for the group owner of the file.
2472 This is equivalent to @samp{(S_IRGRP | S_IWGRP | S_IXGRP)}.
2478 Read permission bit for other users. Usually 04.
2484 Write permission bit for other users. Usually 02.
2490 Execute or search permission bit for other users. Usually 01.
2496 This is equivalent to @samp{(S_IROTH | S_IWOTH | S_IXOTH)}.
2502 This is the set-user-ID on execute bit, usually 04000.
2503 @xref{How Change Persona}.
2509 This is the set-group-ID on execute bit, usually 02000.
2510 @xref{How Change Persona}.
2517 This is the @dfn{sticky} bit, usually 01000.
2519 For a directory it gives permission to delete a file in that directory
2520 only if you own that file. Ordinarily, a user can either delete all the
2521 files in a directory or cannot delete any of them (based on whether the
2522 user has write permission for the directory). The same restriction
2523 applies---you must have both write permission for the directory and own
2524 the file you want to delete. The one exception is that the owner of the
2525 directory can delete any file in the directory, no matter who owns it
2526 (provided the owner has given himself write permission for the
2527 directory). This is commonly used for the @file{/tmp} directory, where
2528 anyone may create files but not delete files created by other users.
2530 Originally the sticky bit on an executable file modified the swapping
2531 policies of the system. Normally, when a program terminated, its pages
2532 in core were immediately freed and reused. If the sticky bit was set on
2533 the executable file, the system kept the pages in core for a while as if
2534 the program were still running. This was advantageous for a program
2535 likely to be run many times in succession. This usage is obsolete in
2536 modern systems. When a program terminates, its pages always remain in
2537 core as long as there is no shortage of memory in the system. When the
2538 program is next run, its pages will still be in core if no shortage
2539 arose since the last run.
2541 On some modern systems where the sticky bit has no useful meaning for an
2542 executable file, you cannot set the bit at all for a non-directory.
2543 If you try, @code{chmod} fails with @code{EFTYPE};
2544 @pxref{Setting Permissions}.
2546 Some systems (particularly SunOS) have yet another use for the sticky
2547 bit. If the sticky bit is set on a file that is @emph{not} executable,
2548 it means the opposite: never cache the pages of this file at all. The
2549 main use of this is for the files on an NFS server machine which are
2550 used as the swap area of diskless client machines. The idea is that the
2551 pages of the file will be cached in the client's memory, so it is a
2552 waste of the server's memory to cache them a second time. With this
2553 usage the sticky bit also implies that the filesystem may fail to record
2554 the file's modification time onto disk reliably (the idea being that
2555 no-one cares for a swap file).
2557 This bit is only available on BSD systems (and those derived from
2558 them). Therefore one has to use the @code{_GNU_SOURCE} feature select
2559 macro, or not define any feature test macros, to get the definition
2560 (@pxref{Feature Test Macros}).
2563 The actual bit values of the symbols are listed in the table above
2564 so you can decode file mode values when debugging your programs.
2565 These bit values are correct for most systems, but they are not
2568 @strong{Warning:} Writing explicit numbers for file permissions is bad
2569 practice. Not only is it not portable, it also requires everyone who
2570 reads your program to remember what the bits mean. To make your program
2571 clean use the symbolic names.
2573 @node Access Permission
2574 @subsection How Your Access to a File is Decided
2575 @cindex permission to access a file
2576 @cindex access permission for a file
2577 @cindex file access permission
2579 Recall that the operating system normally decides access permission for
2580 a file based on the effective user and group IDs of the process and its
2581 supplementary group IDs, together with the file's owner, group and
2582 permission bits. These concepts are discussed in detail in @ref{Process
2585 If the effective user ID of the process matches the owner user ID of the
2586 file, then permissions for read, write, and execute/search are
2587 controlled by the corresponding ``user'' (or ``owner'') bits. Likewise,
2588 if any of the effective group ID or supplementary group IDs of the
2589 process matches the group owner ID of the file, then permissions are
2590 controlled by the ``group'' bits. Otherwise, permissions are controlled
2591 by the ``other'' bits.
2593 Privileged users, like @samp{root}, can access any file regardless of
2594 its permission bits. As a special case, for a file to be executable
2595 even by a privileged user, at least one of its execute bits must be set.
2597 @node Setting Permissions
2598 @subsection Assigning File Permissions
2600 @cindex file creation mask
2602 The primitive functions for creating files (for example, @code{open} or
2603 @code{mkdir}) take a @var{mode} argument, which specifies the file
2604 permissions to give the newly created file. This mode is modified by
2605 the process's @dfn{file creation mask}, or @dfn{umask}, before it is
2608 The bits that are set in the file creation mask identify permissions
2609 that are always to be disabled for newly created files. For example, if
2610 you set all the ``other'' access bits in the mask, then newly created
2611 files are not accessible at all to processes in the ``other'' category,
2612 even if the @var{mode} argument passed to the create function would
2613 permit such access. In other words, the file creation mask is the
2614 complement of the ordinary access permissions you want to grant.
2616 Programs that create files typically specify a @var{mode} argument that
2617 includes all the permissions that make sense for the particular file.
2618 For an ordinary file, this is typically read and write permission for
2619 all classes of users. These permissions are then restricted as
2620 specified by the individual user's own file creation mask.
2623 To change the permission of an existing file given its name, call
2624 @code{chmod}. This function uses the specified permission bits and
2625 ignores the file creation mask.
2628 In normal use, the file creation mask is initialized by the user's login
2629 shell (using the @code{umask} shell command), and inherited by all
2630 subprocesses. Application programs normally don't need to worry about
2631 the file creation mask. It will automatically do what it is supposed to
2634 When your program needs to create a file and bypass the umask for its
2635 access permissions, the easiest way to do this is to use @code{fchmod}
2636 after opening the file, rather than changing the umask. In fact,
2637 changing the umask is usually done only by shells. They use the
2638 @code{umask} function.
2640 The functions in this section are declared in @file{sys/stat.h}.
2645 @deftypefun mode_t umask (mode_t @var{mask})
2646 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2647 The @code{umask} function sets the file creation mask of the current
2648 process to @var{mask}, and returns the previous value of the file
2651 Here is an example showing how to read the mask with @code{umask}
2652 without changing it permanently:
2658 mode_t mask = umask (0);
2665 However, on @gnuhurdsystems{} it is better to use @code{getumask} if
2666 you just want to read the mask value, because it is reentrant.
2671 @deftypefun mode_t getumask (void)
2672 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2673 Return the current value of the file creation mask for the current
2674 process. This function is a GNU extension and is only available on
2680 @deftypefun int chmod (const char *@var{filename}, mode_t @var{mode})
2681 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2682 The @code{chmod} function sets the access permission bits for the file
2683 named by @var{filename} to @var{mode}.
2685 If @var{filename} is a symbolic link, @code{chmod} changes the
2686 permissions of the file pointed to by the link, not those of the link
2689 This function returns @code{0} if successful and @code{-1} if not. In
2690 addition to the usual file name errors (@pxref{File Name
2691 Errors}), the following @code{errno} error conditions are defined for
2696 The named file doesn't exist.
2699 This process does not have permission to change the access permissions
2700 of this file. Only the file's owner (as judged by the effective user ID
2701 of the process) or a privileged user can change them.
2704 The file resides on a read-only file system.
2707 @var{mode} has the @code{S_ISVTX} bit (the ``sticky bit'') set,
2708 and the named file is not a directory. Some systems do not allow setting the
2709 sticky bit on non-directory files, and some do (and only some of those
2710 assign a useful meaning to the bit for non-directory files).
2712 You only get @code{EFTYPE} on systems where the sticky bit has no useful
2713 meaning for non-directory files, so it is always safe to just clear the
2714 bit in @var{mode} and call @code{chmod} again. @xref{Permission Bits},
2715 for full details on the sticky bit.
2721 @deftypefun int fchmod (int @var{filedes}, mode_t @var{mode})
2722 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2723 This is like @code{chmod}, except that it changes the permissions of the
2724 currently open file given by @var{filedes}.
2726 The return value from @code{fchmod} is @code{0} on success and @code{-1}
2727 on failure. The following @code{errno} error codes are defined for this
2732 The @var{filedes} argument is not a valid file descriptor.
2735 The @var{filedes} argument corresponds to a pipe or socket, or something
2736 else that doesn't really have access permissions.
2739 This process does not have permission to change the access permissions
2740 of this file. Only the file's owner (as judged by the effective user ID
2741 of the process) or a privileged user can change them.
2744 The file resides on a read-only file system.
2748 @node Testing File Access
2749 @subsection Testing Permission to Access a File
2750 @cindex testing access permission
2751 @cindex access, testing for
2752 @cindex setuid programs and file access
2754 In some situations it is desirable to allow programs to access files or
2755 devices even if this is not possible with the permissions granted to the
2756 user. One possible solution is to set the setuid-bit of the program
2757 file. If such a program is started the @emph{effective} user ID of the
2758 process is changed to that of the owner of the program file. So to
2759 allow write access to files like @file{/etc/passwd}, which normally can
2760 be written only by the super-user, the modifying program will have to be
2761 owned by @code{root} and the setuid-bit must be set.
2763 But beside the files the program is intended to change the user should
2764 not be allowed to access any file to which s/he would not have access
2765 anyway. The program therefore must explicitly check whether @emph{the
2766 user} would have the necessary access to a file, before it reads or
2769 To do this, use the function @code{access}, which checks for access
2770 permission based on the process's @emph{real} user ID rather than the
2771 effective user ID. (The setuid feature does not alter the real user ID,
2772 so it reflects the user who actually ran the program.)
2774 There is another way you could check this access, which is easy to
2775 describe, but very hard to use. This is to examine the file mode bits
2776 and mimic the system's own access computation. This method is
2777 undesirable because many systems have additional access control
2778 features; your program cannot portably mimic them, and you would not
2779 want to try to keep track of the diverse features that different systems
2780 have. Using @code{access} is simple and automatically does whatever is
2781 appropriate for the system you are using.
2783 @code{access} is @emph{only} only appropriate to use in setuid programs.
2784 A non-setuid program will always use the effective ID rather than the
2788 The symbols in this section are declared in @file{unistd.h}.
2792 @deftypefun int access (const char *@var{filename}, int @var{how})
2793 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2794 The @code{access} function checks to see whether the file named by
2795 @var{filename} can be accessed in the way specified by the @var{how}
2796 argument. The @var{how} argument either can be the bitwise OR of the
2797 flags @code{R_OK}, @code{W_OK}, @code{X_OK}, or the existence test
2800 This function uses the @emph{real} user and group IDs of the calling
2801 process, rather than the @emph{effective} IDs, to check for access
2802 permission. As a result, if you use the function from a @code{setuid}
2803 or @code{setgid} program (@pxref{How Change Persona}), it gives
2804 information relative to the user who actually ran the program.
2806 The return value is @code{0} if the access is permitted, and @code{-1}
2807 otherwise. (In other words, treated as a predicate function,
2808 @code{access} returns true if the requested access is @emph{denied}.)
2810 In addition to the usual file name errors (@pxref{File Name
2811 Errors}), the following @code{errno} error conditions are defined for
2816 The access specified by @var{how} is denied.
2819 The file doesn't exist.
2822 Write permission was requested for a file on a read-only file system.
2826 These macros are defined in the header file @file{unistd.h} for use
2827 as the @var{how} argument to the @code{access} function. The values
2828 are integer constants.
2833 @deftypevr Macro int R_OK
2834 Flag meaning test for read permission.
2839 @deftypevr Macro int W_OK
2840 Flag meaning test for write permission.
2845 @deftypevr Macro int X_OK
2846 Flag meaning test for execute/search permission.
2851 @deftypevr Macro int F_OK
2852 Flag meaning test for existence of the file.
2856 @subsection File Times
2858 @cindex file access time
2859 @cindex file modification time
2860 @cindex file attribute modification time
2861 Each file has three time stamps associated with it: its access time,
2862 its modification time, and its attribute modification time. These
2863 correspond to the @code{st_atime}, @code{st_mtime}, and @code{st_ctime}
2864 members of the @code{stat} structure; see @ref{File Attributes}.
2866 All of these times are represented in calendar time format, as
2867 @code{time_t} objects. This data type is defined in @file{time.h}.
2868 For more information about representation and manipulation of time
2869 values, see @ref{Calendar Time}.
2872 Reading from a file updates its access time attribute, and writing
2873 updates its modification time. When a file is created, all three
2874 time stamps for that file are set to the current time. In addition, the
2875 attribute change time and modification time fields of the directory that
2876 contains the new entry are updated.
2878 Adding a new name for a file with the @code{link} function updates the
2879 attribute change time field of the file being linked, and both the
2880 attribute change time and modification time fields of the directory
2881 containing the new name. These same fields are affected if a file name
2882 is deleted with @code{unlink}, @code{remove} or @code{rmdir}. Renaming
2883 a file with @code{rename} affects only the attribute change time and
2884 modification time fields of the two parent directories involved, and not
2885 the times for the file being renamed.
2887 Changing the attributes of a file (for example, with @code{chmod})
2888 updates its attribute change time field.
2890 You can also change some of the time stamps of a file explicitly using
2891 the @code{utime} function---all except the attribute change time. You
2892 need to include the header file @file{utime.h} to use this facility.
2897 @deftp {Data Type} {struct utimbuf}
2898 The @code{utimbuf} structure is used with the @code{utime} function to
2899 specify new access and modification times for a file. It contains the
2904 This is the access time for the file.
2906 @item time_t modtime
2907 This is the modification time for the file.
2913 @deftypefun int utime (const char *@var{filename}, const struct utimbuf *@var{times})
2914 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2915 @c In the absence of a utime syscall, it non-atomically converts times
2916 @c to a struct timeval and calls utimes.
2917 This function is used to modify the file times associated with the file
2918 named @var{filename}.
2920 If @var{times} is a null pointer, then the access and modification times
2921 of the file are set to the current time. Otherwise, they are set to the
2922 values from the @code{actime} and @code{modtime} members (respectively)
2923 of the @code{utimbuf} structure pointed to by @var{times}.
2925 The attribute modification time for the file is set to the current time
2926 in either case (since changing the time stamps is itself a modification
2927 of the file attributes).
2929 The @code{utime} function returns @code{0} if successful and @code{-1}
2930 on failure. In addition to the usual file name errors
2931 (@pxref{File Name Errors}), the following @code{errno} error conditions
2932 are defined for this function:
2936 There is a permission problem in the case where a null pointer was
2937 passed as the @var{times} argument. In order to update the time stamp on
2938 the file, you must either be the owner of the file, have write
2939 permission for the file, or be a privileged user.
2942 The file doesn't exist.
2945 If the @var{times} argument is not a null pointer, you must either be
2946 the owner of the file or be a privileged user.
2949 The file lives on a read-only file system.
2953 Each of the three time stamps has a corresponding microsecond part,
2954 which extends its resolution. These fields are called
2955 @code{st_atime_usec}, @code{st_mtime_usec}, and @code{st_ctime_usec};
2956 each has a value between 0 and 999,999, which indicates the time in
2957 microseconds. They correspond to the @code{tv_usec} field of a
2958 @code{timeval} structure; see @ref{High-Resolution Calendar}.
2960 The @code{utimes} function is like @code{utime}, but also lets you specify
2961 the fractional part of the file times. The prototype for this function is
2962 in the header file @file{sys/time.h}.
2967 @deftypefun int utimes (const char *@var{filename}, const struct timeval @var{tvp}@t{[2]})
2968 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2969 @c In the absence of a utimes syscall, it non-atomically converts tvp
2970 @c to struct timespec array and issues a utimensat syscall, or to
2971 @c struct utimbuf and calls utime.
2972 This function sets the file access and modification times of the file
2973 @var{filename}. The new file access time is specified by
2974 @code{@var{tvp}[0]}, and the new modification time by
2975 @code{@var{tvp}[1]}. Similar to @code{utime}, if @var{tvp} is a null
2976 pointer then the access and modification times of the file are set to
2977 the current time. This function comes from BSD.
2979 The return values and error conditions are the same as for the @code{utime}
2985 @deftypefun int lutimes (const char *@var{filename}, const struct timeval @var{tvp}@t{[2]})
2986 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2987 @c Since there's no lutimes syscall, it non-atomically converts tvp
2988 @c to struct timespec array and issues a utimensat syscall.
2989 This function is like @code{utimes}, except that it does not follow
2990 symbolic links. If @var{filename} is the name of a symbolic link,
2991 @code{lutimes} sets the file access and modification times of the
2992 symbolic link special file itself (as seen by @code{lstat};
2993 @pxref{Symbolic Links}) while @code{utimes} sets the file access and
2994 modification times of the file the symbolic link refers to. This
2995 function comes from FreeBSD, and is not available on all platforms (if
2996 not available, it will fail with @code{ENOSYS}).
2998 The return values and error conditions are the same as for the @code{utime}
3004 @deftypefun int futimes (int @var{fd}, const struct timeval @var{tvp}@t{[2]})
3005 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3006 @c Since there's no futimes syscall, it non-atomically converts tvp
3007 @c to struct timespec array and issues a utimensat syscall, falling back
3008 @c to utimes on a /proc/self/fd symlink.
3009 This function is like @code{utimes}, except that it takes an open file
3010 descriptor as an argument instead of a file name. @xref{Low-Level
3011 I/O}. This function comes from FreeBSD, and is not available on all
3012 platforms (if not available, it will fail with @code{ENOSYS}).
3014 Like @code{utimes}, @code{futimes} returns @code{0} on success and @code{-1}
3015 on failure. The following @code{errno} error conditions are defined for
3020 There is a permission problem in the case where a null pointer was
3021 passed as the @var{times} argument. In order to update the time stamp on
3022 the file, you must either be the owner of the file, have write
3023 permission for the file, or be a privileged user.
3026 The @var{filedes} argument is not a valid file descriptor.
3029 If the @var{times} argument is not a null pointer, you must either be
3030 the owner of the file or be a privileged user.
3033 The file lives on a read-only file system.
3038 @subsection File Size
3040 Normally file sizes are maintained automatically. A file begins with a
3041 size of @math{0} and is automatically extended when data is written past
3042 its end. It is also possible to empty a file completely by an
3043 @code{open} or @code{fopen} call.
3045 However, sometimes it is necessary to @emph{reduce} the size of a file.
3046 This can be done with the @code{truncate} and @code{ftruncate} functions.
3047 They were introduced in BSD Unix. @code{ftruncate} was later added to
3050 Some systems allow you to extend a file (creating holes) with these
3051 functions. This is useful when using memory-mapped I/O
3052 (@pxref{Memory-mapped I/O}), where files are not automatically extended.
3053 However, it is not portable but must be implemented if @code{mmap}
3054 allows mapping of files (i.e., @code{_POSIX_MAPPED_FILES} is defined).
3056 Using these functions on anything other than a regular file gives
3057 @emph{undefined} results. On many systems, such a call will appear to
3058 succeed, without actually accomplishing anything.
3062 @deftypefun int truncate (const char *@var{filename}, off_t @var{length})
3063 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3064 @c In the absence of a truncate syscall, we use open and ftruncate.
3066 The @code{truncate} function changes the size of @var{filename} to
3067 @var{length}. If @var{length} is shorter than the previous length, data
3068 at the end will be lost. The file must be writable by the user to
3069 perform this operation.
3071 If @var{length} is longer, holes will be added to the end. However, some
3072 systems do not support this feature and will leave the file unchanged.
3074 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} the
3075 @code{truncate} function is in fact @code{truncate64} and the type
3076 @code{off_t} has 64 bits which makes it possible to handle files up to
3077 @twoexp{63} bytes in length.
3079 The return value is @math{0} for success, or @math{-1} for an error. In
3080 addition to the usual file name errors, the following errors may occur:
3085 The file is a directory or not writable.
3088 @var{length} is negative.
3091 The operation would extend the file beyond the limits of the operating system.
3094 A hardware I/O error occurred.
3097 The file is "append-only" or "immutable".
3100 The operation was interrupted by a signal.
3108 @deftypefun int truncate64 (const char *@var{name}, off64_t @var{length})
3109 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3110 @c In the absence of a syscall, try truncate if length fits.
3111 This function is similar to the @code{truncate} function. The
3112 difference is that the @var{length} argument is 64 bits wide even on 32
3113 bits machines, which allows the handling of files with sizes up to
3116 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} on a
3117 32 bits machine this function is actually available under the name
3118 @code{truncate} and so transparently replaces the 32 bits interface.
3123 @deftypefun int ftruncate (int @var{fd}, off_t @var{length})
3124 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3126 This is like @code{truncate}, but it works on a file descriptor @var{fd}
3127 for an opened file instead of a file name to identify the object. The
3128 file must be opened for writing to successfully carry out the operation.
3130 The POSIX standard leaves it implementation defined what happens if the
3131 specified new @var{length} of the file is bigger than the original size.
3132 The @code{ftruncate} function might simply leave the file alone and do
3133 nothing or it can increase the size to the desired size. In this later
3134 case the extended area should be zero-filled. So using @code{ftruncate}
3135 is no reliable way to increase the file size but if it is possible it is
3136 probably the fastest way. The function also operates on POSIX shared
3137 memory segments if these are implemented by the system.
3139 @code{ftruncate} is especially useful in combination with @code{mmap}.
3140 Since the mapped region must have a fixed size one cannot enlarge the
3141 file by writing something beyond the last mapped page. Instead one has
3142 to enlarge the file itself and then remap the file with the new size.
3143 The example below shows how this works.
3145 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} the
3146 @code{ftruncate} function is in fact @code{ftruncate64} and the type
3147 @code{off_t} has 64 bits which makes it possible to handle files up to
3148 @twoexp{63} bytes in length.
3150 The return value is @math{0} for success, or @math{-1} for an error. The
3151 following errors may occur:
3156 @var{fd} does not correspond to an open file.
3159 @var{fd} is a directory or not open for writing.
3162 @var{length} is negative.
3165 The operation would extend the file beyond the limits of the operating system.
3166 @c or the open() call -- with the not-yet-discussed feature of opening
3167 @c files with extra-large offsets.
3170 A hardware I/O error occurred.
3173 The file is "append-only" or "immutable".
3176 The operation was interrupted by a signal.
3178 @c ENOENT is also possible on Linux --- however it only occurs if the file
3179 @c descriptor has a `file' structure but no `inode' structure. I'm not
3180 @c sure how such an fd could be created. Perhaps it's a bug.
3188 @deftypefun int ftruncate64 (int @var{id}, off64_t @var{length})
3189 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3190 @c In the absence of a syscall, try ftruncate if length fits.
3191 This function is similar to the @code{ftruncate} function. The
3192 difference is that the @var{length} argument is 64 bits wide even on 32
3193 bits machines which allows the handling of files with sizes up to
3196 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} on a
3197 32 bits machine this function is actually available under the name
3198 @code{ftruncate} and so transparently replaces the 32 bits interface.
3201 As announced here is a little example of how to use @code{ftruncate} in
3202 combination with @code{mmap}:
3210 add (off_t at, void *block, size_t size)
3212 if (at + size > len)
3214 /* Resize the file and remap. */
3215 size_t ps = sysconf (_SC_PAGESIZE);
3216 size_t ns = (at + size + ps - 1) & ~(ps - 1);
3218 if (ftruncate (fd, ns) < 0)
3220 np = mmap (NULL, ns, PROT_READ|PROT_WRITE, MAP_SHARED, fd, 0);
3221 if (np == MAP_FAILED)
3226 memcpy ((char *) start + at, block, size);
3231 The function @code{add} writes a block of memory at an arbitrary
3232 position in the file. If the current size of the file is too small it
3233 is extended. Note the it is extended by a round number of pages. This
3234 is a requirement of @code{mmap}. The program has to keep track of the
3235 real size, and when it has finished a final @code{ftruncate} call should
3236 set the real size of the file.
3238 @node Storage Allocation
3239 @subsection Storage Allocation
3240 @cindex allocating file storage
3241 @cindex file allocation
3242 @cindex storage allocating
3244 @cindex file fragmentation
3245 @cindex fragmentation of files
3246 @cindex sparse files
3247 @cindex files, sparse
3248 Most file systems support allocating large files in a non-contiguous
3249 fashion: the file is split into @emph{fragments} which are allocated
3250 sequentially, but the fragments themselves can be scattered across the
3251 disk. File systems generally try to avoid such fragmentation because it
3252 decreases performance, but if a file gradually increases in size, there
3253 might be no other option than to fragment it. In addition, many file
3254 systems support @emph{sparse files} with @emph{holes}: regions of null
3255 bytes for which no backing storage has been allocated by the file
3256 system. When the holes are finally overwritten with data, fragmentation
3259 Explicit allocation of storage for yet-unwritten parts of the file can
3260 help the system to avoid fragmentation. Additionally, if storage
3261 pre-allocation fails, it is possible to report the out-of-disk error
3262 early, often without filling up the entire disk. However, due to
3263 deduplication, copy-on-write semantics, and file compression, such
3264 pre-allocation may not reliably prevent the out-of-disk-space error from
3265 occurring later. Checking for write errors is still required, and
3266 writes to memory-mapped regions created with @code{mmap} can still
3267 result in @code{SIGBUS}.
3269 @deftypefun int posix_fallocate (int @var{fd}, off_t @var{offset}, off_t @var{length})
3270 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3271 @c If the file system does not support allocation,
3272 @c @code{posix_fallocate} has a race with file extension (if
3273 @c @var{length} is zero) or with concurrent writes of non-NUL bytes (if
3274 @c @var{length} is positive).
3276 Allocate backing store for the region of @var{length} bytes starting at
3277 byte @var{offset} in the file for the descriptor @var{fd}. The file
3278 length is increased to @samp{@var{length} + @var{offset}} if necessary.
3280 @var{fd} must be a regular file opened for writing, or @code{EBADF} is
3281 returned. If there is insufficient disk space to fulfill the allocation
3282 request, @code{ENOSPC} is returned.
3284 @strong{Note:} If @code{fallocate} is not available (because the file
3285 system does not support it), @code{posix_fallocate} is emulated, which
3286 has the following drawbacks:
3290 It is very inefficient because all file system blocks in the requested
3291 range need to be examined (even if they have been allocated before) and
3292 potentially rewritten. In contrast, with proper @code{fallocate}
3293 support (see below), the file system can examine the internal file
3294 allocation data structures and eliminate holes directly, maybe even
3295 using unwritten extents (which are pre-allocated but uninitialized on
3299 There is a race condition if another thread or process modifies the
3300 underlying file in the to-be-allocated area. Non-null bytes could be
3301 overwritten with null bytes.
3304 If @var{fd} has been opened with the @code{O_WRONLY} flag, the function
3305 will fail with an @code{errno} value of @code{EBADF}.
3308 If @var{fd} has been opened with the @code{O_APPEND} flag, the function
3309 will fail with an @code{errno} value of @code{EBADF}.
3312 If @var{length} is zero, @code{ftruncate} is used to increase the file
3313 size as requested, without allocating file system blocks. There is a
3314 race condition which means that @code{ftruncate} can accidentally
3315 truncate the file if it has been extended concurrently.
3318 On Linux, if an application does not benefit from emulation or if the
3319 emulation is harmful due to its inherent race conditions, the
3320 application can use the Linux-specific @code{fallocate} function, with a
3321 zero flag argument. For the @code{fallocate} function, @theglibc{} does
3322 not perform allocation emulation if the file system does not support
3323 allocation. Instead, an @code{EOPNOTSUPP} is returned to the caller.
3327 @deftypefun int posix_fallocate64 (int @var{fd}, off64_t @var{offset}, off64_t @var{length})
3328 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3330 This function is a variant of @code{posix_fallocate64} which accepts
3331 64-bit file offsets on all platforms.
3335 @node Making Special Files
3336 @section Making Special Files
3337 @cindex creating special files
3338 @cindex special files
3340 The @code{mknod} function is the primitive for making special files,
3341 such as files that correspond to devices. @Theglibc{} includes
3342 this function for compatibility with BSD.
3344 The prototype for @code{mknod} is declared in @file{sys/stat.h}.
3349 @deftypefun int mknod (const char *@var{filename}, mode_t @var{mode}, dev_t @var{dev})
3350 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3351 @c Instead of issuing the syscall directly, we go through xmknod.
3352 @c Although the internal xmknod takes a dev_t*, that could lead to
3353 @c @mtsrace races, it's passed a pointer to mknod's dev.
3354 The @code{mknod} function makes a special file with name @var{filename}.
3355 The @var{mode} specifies the mode of the file, and may include the various
3356 special file bits, such as @code{S_IFCHR} (for a character special file)
3357 or @code{S_IFBLK} (for a block special file). @xref{Testing File Type}.
3359 The @var{dev} argument specifies which device the special file refers to.
3360 Its exact interpretation depends on the kind of special file being created.
3362 The return value is @code{0} on success and @code{-1} on error. In addition
3363 to the usual file name errors (@pxref{File Name Errors}), the
3364 following @code{errno} error conditions are defined for this function:
3368 The calling process is not privileged. Only the superuser can create
3372 The directory or file system that would contain the new file is full
3373 and cannot be extended.
3376 The directory containing the new file can't be modified because it's on
3377 a read-only file system.
3380 There is already a file named @var{filename}. If you want to replace
3381 this file, you must remove the old file explicitly first.
3385 @node Temporary Files
3386 @section Temporary Files
3388 If you need to use a temporary file in your program, you can use the
3389 @code{tmpfile} function to open it. Or you can use the @code{tmpnam}
3390 (better: @code{tmpnam_r}) function to provide a name for a temporary
3391 file and then you can open it in the usual way with @code{fopen}.
3393 The @code{tempnam} function is like @code{tmpnam} but lets you choose
3394 what directory temporary files will go in, and something about what
3395 their file names will look like. Important for multi-threaded programs
3396 is that @code{tempnam} is reentrant, while @code{tmpnam} is not since it
3397 returns a pointer to a static buffer.
3399 These facilities are declared in the header file @file{stdio.h}.
3404 @deftypefun {FILE *} tmpfile (void)
3405 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acsmem{} @acsfd{} @aculock{}}}
3406 @c The unsafety issues are those of fdopen, plus @acsfd because of the
3408 @c __path_search (internal buf, !dir, const pfx, !try_tmpdir) ok
3409 @c libc_secure_genenv only if try_tmpdir
3410 @c xstat64, strlen, strcmp, sprintf
3411 @c __gen_tempname (internal tmpl, __GT_FILE) ok
3412 @c strlen, memcmp, getpid, open/mkdir/lxstat64 ok
3413 @c HP_TIMING_NOW if available ok
3414 @c gettimeofday (!tz) first time, or every time if no HP_TIMING_NOW ok
3415 @c static value is used and modified without synchronization ok
3416 @c but the use is as a source of non-cryptographic randomness
3417 @c with retries in case of collision, so it should be safe
3419 This function creates a temporary binary file for update mode, as if by
3420 calling @code{fopen} with mode @code{"wb+"}. The file is deleted
3421 automatically when it is closed or when the program terminates. (On
3422 some other @w{ISO C} systems the file may fail to be deleted if the program
3423 terminates abnormally).
3425 This function is reentrant.
3427 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
3428 32-bit system this function is in fact @code{tmpfile64}, i.e., the LFS
3429 interface transparently replaces the old interface.
3434 @deftypefun {FILE *} tmpfile64 (void)
3435 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acsmem{} @acsfd{} @aculock{}}}
3436 This function is similar to @code{tmpfile}, but the stream it returns a
3437 pointer to was opened using @code{tmpfile64}. Therefore this stream can
3438 be used for files larger than @twoexp{31} bytes on 32-bit machines.
3440 Please note that the return type is still @code{FILE *}. There is no
3441 special @code{FILE} type for the LFS interface.
3443 If the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a 32
3444 bits machine this function is available under the name @code{tmpfile}
3445 and so transparently replaces the old interface.
3450 @deftypefun {char *} tmpnam (char *@var{result})
3451 @safety{@prelim{}@mtunsafe{@mtasurace{:tmpnam/!result}}@asunsafe{}@acsafe{}}
3452 @c The passed-in buffer should not be modified concurrently with the
3454 @c __path_search (static or passed-in buf, !dir, !pfx, !try_tmpdir) ok
3455 @c __gen_tempname (internal tmpl, __GT_NOCREATE) ok
3456 This function constructs and returns a valid file name that does not
3457 refer to any existing file. If the @var{result} argument is a null
3458 pointer, the return value is a pointer to an internal static string,
3459 which might be modified by subsequent calls and therefore makes this
3460 function non-reentrant. Otherwise, the @var{result} argument should be
3461 a pointer to an array of at least @code{L_tmpnam} characters, and the
3462 result is written into that array.
3464 It is possible for @code{tmpnam} to fail if you call it too many times
3465 without removing previously-created files. This is because the limited
3466 length of the temporary file names gives room for only a finite number
3467 of different names. If @code{tmpnam} fails it returns a null pointer.
3469 @strong{Warning:} Between the time the pathname is constructed and the
3470 file is created another process might have created a file with the same
3471 name using @code{tmpnam}, leading to a possible security hole. The
3472 implementation generates names which can hardly be predicted, but when
3473 opening the file you should use the @code{O_EXCL} flag. Using
3474 @code{tmpfile} or @code{mkstemp} is a safe way to avoid this problem.
3479 @deftypefun {char *} tmpnam_r (char *@var{result})
3480 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3481 This function is nearly identical to the @code{tmpnam} function, except
3482 that if @var{result} is a null pointer it returns a null pointer.
3484 This guarantees reentrancy because the non-reentrant situation of
3485 @code{tmpnam} cannot happen here.
3487 @strong{Warning}: This function has the same security problems as
3493 @deftypevr Macro int L_tmpnam
3494 The value of this macro is an integer constant expression that
3495 represents the minimum size of a string large enough to hold a file name
3496 generated by the @code{tmpnam} function.
3501 @deftypevr Macro int TMP_MAX
3502 The macro @code{TMP_MAX} is a lower bound for how many temporary names
3503 you can create with @code{tmpnam}. You can rely on being able to call
3504 @code{tmpnam} at least this many times before it might fail saying you
3505 have made too many temporary file names.
3507 With @theglibc{}, you can create a very large number of temporary
3508 file names. If you actually created the files, you would probably run
3509 out of disk space before you ran out of names. Some other systems have
3510 a fixed, small limit on the number of temporary files. The limit is
3511 never less than @code{25}.
3516 @deftypefun {char *} tempnam (const char *@var{dir}, const char *@var{prefix})
3517 @safety{@prelim{}@mtsafe{@mtsenv{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
3518 @c There's no way (short of being setuid) to avoid getenv("TMPDIR"),
3519 @c even with a non-NULL dir.
3521 @c __path_search (internal buf, dir, pfx, try_tmpdir) unsafe getenv
3522 @c __gen_tempname (internal tmpl, __GT_NOCREATE) ok
3524 This function generates a unique temporary file name. If @var{prefix}
3525 is not a null pointer, up to five characters of this string are used as
3526 a prefix for the file name. The return value is a string newly
3527 allocated with @code{malloc}, so you should release its storage with
3528 @code{free} when it is no longer needed.
3530 Because the string is dynamically allocated this function is reentrant.
3532 The directory prefix for the temporary file name is determined by
3533 testing each of the following in sequence. The directory must exist and
3538 The environment variable @code{TMPDIR}, if it is defined. For security
3539 reasons this only happens if the program is not SUID or SGID enabled.
3542 The @var{dir} argument, if it is not a null pointer.
3545 The value of the @code{P_tmpdir} macro.
3548 The directory @file{/tmp}.
3551 This function is defined for SVID compatibility.
3553 @strong{Warning:} Between the time the pathname is constructed and the
3554 file is created another process might have created a file with the same
3555 name using @code{tempnam}, leading to a possible security hole. The
3556 implementation generates names which can hardly be predicted, but when
3557 opening the file you should use the @code{O_EXCL} flag. Using
3558 @code{tmpfile} or @code{mkstemp} is a safe way to avoid this problem.
3560 @cindex TMPDIR environment variable
3564 @c !!! are we putting SVID/GNU/POSIX.1/BSD in here or not??
3565 @deftypevr {SVID Macro} {char *} P_tmpdir
3566 This macro is the name of the default directory for temporary files.
3569 Older Unix systems did not have the functions just described. Instead
3570 they used @code{mktemp} and @code{mkstemp}. Both of these functions
3571 work by modifying a file name template string you pass. The last six
3572 characters of this string must be @samp{XXXXXX}. These six @samp{X}s
3573 are replaced with six characters which make the whole string a unique
3574 file name. Usually the template string is something like
3575 @samp{/tmp/@var{prefix}XXXXXX}, and each program uses a unique @var{prefix}.
3577 @strong{NB:} Because @code{mktemp} and @code{mkstemp} modify the
3578 template string, you @emph{must not} pass string constants to them.
3579 String constants are normally in read-only storage, so your program
3580 would crash when @code{mktemp} or @code{mkstemp} tried to modify the
3581 string. These functions are declared in the header file @file{stdlib.h}.
3586 @deftypefun {char *} mktemp (char *@var{template})
3587 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3588 @c __gen_tempname (caller tmpl, __GT_NOCREATE) ok
3589 The @code{mktemp} function generates a unique file name by modifying
3590 @var{template} as described above. If successful, it returns
3591 @var{template} as modified. If @code{mktemp} cannot find a unique file
3592 name, it makes @var{template} an empty string and returns that. If
3593 @var{template} does not end with @samp{XXXXXX}, @code{mktemp} returns a
3596 @strong{Warning:} Between the time the pathname is constructed and the
3597 file is created another process might have created a file with the same
3598 name using @code{mktemp}, leading to a possible security hole. The
3599 implementation generates names which can hardly be predicted, but when
3600 opening the file you should use the @code{O_EXCL} flag. Using
3601 @code{mkstemp} is a safe way to avoid this problem.
3606 @deftypefun int mkstemp (char *@var{template})
3607 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{@acsfd{}}}
3608 @c __gen_tempname (caller tmpl, __GT_FILE) ok
3609 The @code{mkstemp} function generates a unique file name just as
3610 @code{mktemp} does, but it also opens the file for you with @code{open}
3611 (@pxref{Opening and Closing Files}). If successful, it modifies
3612 @var{template} in place and returns a file descriptor for that file open
3613 for reading and writing. If @code{mkstemp} cannot create a
3614 uniquely-named file, it returns @code{-1}. If @var{template} does not
3615 end with @samp{XXXXXX}, @code{mkstemp} returns @code{-1} and does not
3616 modify @var{template}.
3618 The file is opened using mode @code{0600}. If the file is meant to be
3619 used by other users this mode must be changed explicitly.
3622 Unlike @code{mktemp}, @code{mkstemp} is actually guaranteed to create a
3623 unique file that cannot possibly clash with any other program trying to
3624 create a temporary file. This is because it works by calling
3625 @code{open} with the @code{O_EXCL} flag, which says you want to create a
3626 new file and get an error if the file already exists.
3630 @deftypefun {char *} mkdtemp (char *@var{template})
3631 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3632 @c __gen_tempname (caller tmpl, __GT_DIR) ok
3633 The @code{mkdtemp} function creates a directory with a unique name. If
3634 it succeeds, it overwrites @var{template} with the name of the
3635 directory, and returns @var{template}. As with @code{mktemp} and
3636 @code{mkstemp}, @var{template} should be a string ending with
3639 If @code{mkdtemp} cannot create an uniquely named directory, it returns
3640 @code{NULL} and sets @var{errno} appropriately. If @var{template} does
3641 not end with @samp{XXXXXX}, @code{mkdtemp} returns @code{NULL} and does
3642 not modify @var{template}. @var{errno} will be set to @code{EINVAL} in
3645 The directory is created using mode @code{0700}.
3648 The directory created by @code{mkdtemp} cannot clash with temporary
3649 files or directories created by other users. This is because directory
3650 creation always works like @code{open} with @code{O_EXCL}.
3651 @xref{Creating Directories}.
3653 The @code{mkdtemp} function comes from OpenBSD.
3655 @c FIXME these are undocumented:
3660 @c fstatat (there's a commented-out safety assessment for this one)
3664 @c name_to_handle_at
3666 @c open_by_handle_at