1 \input texinfo @c -*-texinfo-*-
4 @settitle Finding Files
5 @c For double-sided printing, uncomment:
6 @c @setchapternewpage odd
18 * Finding Files: (find). Listing and operating on files
19 that match certain criteria.
23 This file documents the GNU utilities for finding files that match
24 certain criteria and performing various operations on them.
26 Copyright (C) 1994 Free Software Foundation, Inc.
28 Permission is granted to make and distribute verbatim copies of
29 this manual provided the copyright notice and this permission notice
30 are preserved on all copies.
33 Permission is granted to process this file through TeX and print the
34 results, provided the printed document carries copying permission
35 notice identical to this one except for the removal of this paragraph
36 (this paragraph not being relevant to the printed manual).
39 Permission is granted to copy and distribute modified versions of this
40 manual under the conditions for verbatim copying, provided that the entire
41 resulting derived work is distributed under the terms of a permission
42 notice identical to this one.
44 Permission is granted to copy and distribute translations of this manual
45 into another language, under the above conditions for modified versions,
46 except that this permission notice may be stated in a translation approved
52 @subtitle Edition @value{EDITION}, for GNU @code{find} version @value{VERSION}
53 @subtitle @value{UPDATED}
54 @author by David MacKenzie
57 @vskip 0pt plus 1filll
58 Copyright @copyright{} 1994 Free Software Foundation, Inc.
60 Permission is granted to make and distribute verbatim copies of
61 this manual provided the copyright notice and this permission notice
62 are preserved on all copies.
64 Permission is granted to copy and distribute modified versions of this
65 manual under the conditions for verbatim copying, provided that the entire
66 resulting derived work is distributed under the terms of a permission
67 notice identical to this one.
69 Permission is granted to copy and distribute translations of this manual
70 into another language, under the above conditions for modified versions,
71 except that this permission notice may be stated in a translation approved
75 @node Top, Introduction, , (dir)
76 @comment node-name, next, previous, up
79 This file documents the GNU utilities for finding files that match
80 certain criteria and performing various actions on them.
81 This is edition @value{EDITION}, for @code{find} version @value{VERSION}.
84 @c The master menu, created with texinfo-master-menu, goes here.
87 * Introduction:: Summary of the tasks this manual describes.
88 * Finding Files:: Finding files that match certain criteria.
89 * Actions:: Doing things to files you have found.
90 * Common Tasks:: Solutions to common real-world problems.
91 * Databases:: Maintaining file name databases.
92 * File Permissions:: How to control access to files.
93 * Reference:: Summary of how to invoke the programs.
94 * Primary Index:: The components of @code{find} expressions.
97 @node Introduction, Finding Files, Top, Top
100 This manual shows how to find files that meet criteria you specify, and
101 how to perform various actions on the files that you find. The
102 principal programs that you use to perform these tasks are @code{find},
103 @code{locate}, and @code{xargs}. Some of the examples in this manual
104 use capabilities specific to the GNU versions of those programs.
106 GNU @code{find} was originally written by Eric Decker, with enhancements
107 by David MacKenzie, Jay Plett, and Tim Wood. GNU @code{xargs} was
108 originally written by Mike Rendell, with enhancements by David
109 MacKenzie. GNU @code{locate} and its associated utilities were
110 originally written by James Woods, with enhancements by David MacKenzie.
111 The idea for @samp{find -print0} and @samp{xargs -0} came from Dan
112 Bernstein. Many other people have contributed bug fixes, small
113 improvements, and helpful suggestions. Thanks!
115 Mail suggestions and bug reports for these programs to
116 @code{bug-findutils@@gnu.org}. Please include the version
117 number, which you can get by running @samp{find --version}.
128 For brevity, the word @dfn{file} in this manual means a regular file, a
129 directory, a symbolic link, or any other kind of node that has a
130 directory entry. A directory entry is also called a @dfn{file name}. A
131 file name may contain some, all, or none of the directories in a path
132 that leads to the file. These are all examples of what this manual
133 calls ``file names'':
140 /usr/local/include/termcap.h
143 A @dfn{directory tree} is a directory and the files it contains, all of
144 its subdirectories and the files they contain, etc. It can also be a
145 single non-directory file.
147 These programs enable you to find the files in one or more directory
152 have names that contain certain text or match a certain pattern;
154 are links to certain files;
156 were last used during a certain period of time;
158 are within a certain size range;
160 are of a certain type (regular file, directory, symbolic link, etc.);
162 are owned by a certain user or group;
164 have certain access permissions;
166 contain text that matches a certain pattern;
168 are within a certain depth in the directory tree;
170 or some combination of the above.
173 Once you have found the files you're looking for (or files that are
174 potentially the ones you're looking for), you can do more to them than
175 simply list their names. You can get any combination of the files'
176 attributes, or process the files in many ways, either individually or in
177 groups of various sizes. Actions that you might want to perform on the
178 files you have found include, but are not limited to:
188 change access permissions
193 This manual describes how to perform each of those tasks, and more.
198 The principal programs used for making lists of files that match given
199 criteria and running commands on them are @code{find}, @code{locate},
200 and @code{xargs}. An additional command, @code{updatedb}, is used by
201 system administrators to create databases for @code{locate} to use.
203 @code{find} searches for files in a directory hierarchy and prints
204 information about the files it found. It is run like this:
207 find @r{[}@var{file}@dots{}@r{]} @r{[}@var{expression}@r{]}
211 Here is a typical use of @code{find}. This example prints the names of
212 all files in the directory tree rooted in @file{/usr/src} whose name
213 ends with @samp{.c} and that are larger than 100 Kilobytes.
215 find /usr/src -name '*.c' -size +100k -print
218 @code{locate} searches special file name databases for file names that
219 match patterns. The system administrator runs the @code{updatedb}
220 program to create the databases. @code{locate} is run like this:
223 locate @r{[}@var{option}@dots{}@r{]} @var{pattern}@dots{}
227 This example prints the names of all files in the default file name
228 database whose name ends with @samp{Makefile} or @samp{makefile}. Which
229 file names are stored in the database depends on how the system
230 administrator ran @code{updatedb}.
232 locate '*[Mm]akefile'
235 The name @code{xargs}, pronounced EX-args, means ``combine arguments.''
236 @code{xargs} builds and executes command lines by gathering together
237 arguments it reads on the standard input. Most often, these arguments
238 are lists of file names generated by @code{find}. @code{xargs} is run
242 xargs @r{[}@var{option}@dots{}@r{]} @r{[}@var{command} @r{[}@var{initial-arguments}@r{]}@r{]}
246 The following command searches the files listed in the file
247 @file{file-list} and prints all of the lines in them that contain the
250 xargs grep typedef < file-list
253 @node find Expressions
254 @section @code{find} Expressions
256 The expression that @code{find} uses to select files consists of one or
257 more @dfn{primaries}, each of which is a separate command line argument
258 to @code{find}. @code{find} evaluates the expression each time it
259 processes a file. An expression can contain any of the following types
264 affect overall operation rather than the processing of a specific file;
266 return a true or false value, depending on the file's attributes;
268 have side effects and return a true or false value; and
270 connect the other arguments and affect when and whether they are
274 You can omit the operator between two primaries; it defaults to
275 @samp{-and}. @xref{Combining Primaries With Operators}, for ways to
276 connect primaries into more complex expressions. If the expression
277 contains no actions other than @samp{-prune}, @samp{-print} is performed
278 on all files for which the entire expression is true (@pxref{Print File
281 Options take effect immediately, rather than being evaluated for each
282 file when their place in the expression is reached. Therefore, for
283 clarity, it is best to place them at the beginning of the expression.
285 Many of the primaries take arguments, which immediately follow them in
286 the next command line argument to @code{find}. Some arguments are file
287 names, patterns, or other strings; others are numbers. Numeric
288 arguments can be specified as
292 for greater than @var{n},
294 for less than @var{n},
299 @node Finding Files, Actions, Introduction, Top
300 @chapter Finding Files
302 By default, @code{find} prints to the standard output the names of the
303 files that match the given criteria. @xref{Actions}, for how to get more
304 information about the matching files.
317 * Combining Primaries With Operators::
323 Here are ways to search for files whose name matches a certain pattern.
324 @xref{Shell Pattern Matching}, for a description of the @var{pattern}
325 arguments to these tests.
327 Each of these tests has a case-sensitive version and a case-insensitive
328 version, whose name begins with @samp{i}. In a case-insensitive
329 comparison, the patterns @samp{fo*} and @samp{F??} match the file names
330 @file{Foo}, @samp{FOO}, @samp{foo}, @samp{fOo}, etc.
333 * Base Name Patterns::
334 * Full Name Patterns::
335 * Fast Full Name Search::
336 * Shell Pattern Matching:: Wildcards used by these programs.
339 @node Base Name Patterns
340 @subsection Base Name Patterns
342 @deffn Test -name pattern
343 @deffnx Test -iname pattern
344 True if the base of the file name (the path with the leading directories
345 removed) matches shell pattern @var{pattern}. For @samp{-iname}, the
346 match is case-insensitive. To ignore a whole directory tree, use
347 @samp{-prune} (@pxref{Directories}). As an example, to find Texinfo
348 source files in @file{/usr/local/doc}:
351 find /usr/local/doc -name '*.texi'
355 Patterns for @samp{-name} and @samp{-iname} will match a filename with
356 a leading @samp{.}. For example the command @samp{find /tmp -name
357 \*bar} will match the file @file{/tmp/.foobar}.
360 @node Full Name Patterns
361 @subsection Full Name Patterns
363 @deffn Test -wholename pattern
364 @deffnx Test -iwholename pattern
365 True if the entire file name, starting with the command line argument
366 under which the file was found, matches shell pattern @var{pattern}.
367 For @samp{-iwholename}, the match is case-insensitive. To ignore a whole
368 directory tree, use @samp{-prune} rather than checking every file in the
369 tree (@pxref{Directories}).
372 @deffn Test -path pattern
373 @deffnx Test -ipath pattern
374 These tests are deprecated, but work as for @samp{-wholename} and @samp{-iwholename},
375 respectively. The @samp{-ipath} test is a GNU extension, but @samp{-path} is also
376 provided by HP-UX @code{find}.
379 @deffn Test -regex expr
380 @deffnx Test -iregex expr
381 True if the entire file name matches regular expression @var{expr}.
382 This is a match on the whole path, not a search. For example, to match
383 a file named @file{./fubar3}, you can use the regular expression
384 @samp{.*bar.} or @samp{.*b.*3}, but not @samp{b.*r3}. @xref{Regexps, ,
385 Syntax of Regular Expressions, emacs, The GNU Emacs Manual}, for a
386 description of the syntax of regular expressions. For @samp{-iregex},
387 the match is case-insensitive.
390 @node Fast Full Name Search
391 @subsection Fast Full Name Search
393 To search for files by name without having to actually scan the
394 directories on the disk (which can be slow), you can use the
395 @code{locate} program. For each shell pattern you give it,
396 @code{locate} searches one or more databases of file names and displays
397 the file names that contain the pattern. @xref{Shell Pattern Matching},
398 for details about shell patterns.
400 If a pattern is a plain string---it contains no
401 metacharacters---@code{locate} displays all file names in the database
402 that contain that string. If a pattern contains
403 metacharacters, @code{locate} only displays file names that match the
404 pattern exactly. As a result, patterns that contain metacharacters
405 should usually begin with a @samp{*}, and will most often end with one
406 as well. The exceptions are patterns that are intended to explicitly
407 match the beginning or end of a file name.
414 is almost equivalent to
416 find @var{directories} -name @var{pattern}
419 where @var{directories} are the directories for which the file name
420 databases contain information. The differences are that the
421 @code{locate} information might be out of date, and that @code{locate}
422 handles wildcards in the pattern slightly differently than @code{find}
423 (@pxref{Shell Pattern Matching}).
425 The file name databases contain lists of files that were on the system
426 when the databases were last updated. The system administrator can
427 choose the file name of the default database, the frequency with which
428 the databases are updated, and the directories for which they contain
431 Here is how to select which file name databases @code{locate} searches.
432 The default is system-dependent.
435 @item --database=@var{path}
437 Instead of searching the default file name database, search the file
438 name databases in @var{path}, which is a colon-separated list of
439 database file names. You can also use the environment variable
440 @code{LOCATE_PATH} to set the list of database files to search. The
441 option overrides the environment variable if both are used.
444 @node Shell Pattern Matching
445 @subsection Shell Pattern Matching
447 @code{find} and @code{locate} can compare file names, or parts of file
448 names, to shell patterns. A @dfn{shell pattern} is a string that may
449 contain the following special characters, which are known as
450 @dfn{wildcards} or @dfn{metacharacters}.
452 You must quote patterns that contain metacharacters to prevent the shell
453 from expanding them itself. Double and single quotes both work; so does
454 escaping with a backslash.
458 Matches any zero or more characters.
461 Matches any one character.
464 Matches exactly one character that is a member of the string
465 @var{string}. This is called a @dfn{character class}. As a shorthand,
466 @var{string} may contain ranges, which consist of two characters with a
467 dash between them. For example, the class @samp{[a-z0-9_]} matches a
468 lowercase letter, a number, or an underscore. You can negate a class by
469 placing a @samp{!} or @samp{^} immediately after the opening bracket.
470 Thus, @samp{[^A-Z@@]} matches any character except an uppercase letter
474 Removes the special meaning of the character that follows it. This
475 works even in character classes.
478 In the @code{find} tests that do shell pattern matching (@samp{-name},
479 @samp{-wholename}, etc.), wildcards in the pattern do not match a @samp{.}
480 at the beginning of a file name. This is not the case for
481 @code{locate}. Thus, @samp{find -name '*macs'} does not match a file
482 named @file{.emacs}, but @samp{locate '*macs'} does.
484 Slash characters have no special significance in the shell pattern
485 matching that @code{find} and @code{locate} do, unlike in the shell, in
486 which wildcards do not match them. Therefore, a pattern @samp{foo*bar}
487 can match a file name @samp{foo3/bar}, and a pattern @samp{./sr*sc} can
488 match a file name @samp{./src/misc}.
493 There are two ways that files can be linked together. @dfn{Symbolic
494 links} are a special type of file whose contents are a portion of the
495 name of another file. @dfn{Hard links} are multiple directory entries
496 for one file; the file names all have the same index node (@dfn{inode})
505 @subsection Symbolic Links
507 @deffn Test -lname pattern
508 @deffnx Test -ilname pattern
509 True if the file is a symbolic link whose contents match shell pattern
510 @var{pattern}. For @samp{-ilname}, the match is case-insensitive.
511 @xref{Shell Pattern Matching}, for details about the @var{pattern}
512 argument. So, to list any symbolic links to @file{sysdep.c} in the
513 current directory and its subdirectories, you can do:
516 find . -lname '*sysdep.c'
520 @deffn Option -follow
521 Dereference symbolic links. The following differences in behavior occur
522 when this option is given:
526 @code{find} follows symbolic links to directories when searching
529 @samp{-lname} and @samp{-ilname} always return false.
531 @samp{-type} reports the types of the files that symbolic links point
534 Implies @samp{-noleaf} (@pxref{Directories}).
539 @subsection Hard Links
541 To find hard links, first get the inode number of the file whose links
542 you want to find. You can learn a file's inode number and the number of
543 links to it by running @samp{ls -i} or @samp{find -ls}. If the file has
544 more than one link, you can search for the other links by passing that
545 inode number to @samp{-inum}. Add the @samp{-xdev} option if you are
546 starting the search at a directory that has other filesystems mounted on
547 it, such as @file{/usr} on many systems. Doing this saves needless
548 searching, since hard links to a file must be on the same filesystem.
552 File has inode number @var{n}.
555 You can also search for files that have a certain number of links, with
556 @samp{-links}. Directories normally have at least two hard links; their
557 @file{.} entry is the second one. If they have subdirectories, each of
558 those also has a hard link called @file{..} to its parent directory.
561 File has @var{n} hard links.
567 Each file has three time stamps, which record the last time that certain
568 operations were performed on the file:
572 access (read the file's contents)
574 change the status (modify the file or its attributes)
576 modify (change the file's contents)
579 You can search for files whose time stamps are within a certain age
580 range, or compare them to other time stamps.
584 * Comparing Timestamps::
588 @subsection Age Ranges
590 These tests are mainly useful with ranges (@samp{+@var{n}} and
594 @deffnx Test -ctime n
595 @deffnx Test -mtime n
596 True if the file was last accessed (or its status changed, or it was
597 modified) @var{n}*24 hours ago.
603 True if the file was last accessed (or its status changed, or it was
604 modified) @var{n} minutes ago. These tests provide finer granularity of
605 measurement than @samp{-atime} et al. For example, to list files in
606 @file{/u/bill} that were last read from 2 to 6 minutes ago:
609 find /u/bill -amin +2 -amin -6
613 @deffn Option -daystart
614 Measure times from the beginning of today rather than from 24 hours ago.
615 So, to list the regular files in your home directory that were modified
619 find ~ -daystart -type f -mtime 1
623 @node Comparing Timestamps
624 @subsection Comparing Timestamps
626 As an alternative to comparing timestamps to the current time, you can
627 compare them to another file's timestamp. That file's timestamp could
628 be updated by another program when some event occurs. Or you could set
629 it to a particular fixed date using the @code{touch} command. For
630 example, to list files in @file{/usr} modified after February 1 of the
633 @c Idea from Rick Sladkey.
635 touch -t 02010000 /tmp/stamp$$
636 find /usr -newer /tmp/stamp$$
640 @deffn Test -anewer file
641 @deffnx Test -cnewer file
642 @deffnx Test -newer file
643 True if the file was last accessed (or its status changed, or it was
644 modified) more recently than @var{file} was modified. These tests are
645 affected by @samp{-follow} only if @samp{-follow} comes before them on
646 the command line. @xref{Symbolic Links}, for more information on
647 @samp{-follow}. As an example, to list any files modified since
648 @file{/bin/sh} was last modified:
651 find . -newer /bin/sh
656 True if the file was last accessed @var{n} days after its status was
657 last changed. Useful for finding files that are not being used, and
658 could perhaps be archived or removed to save disk space.
664 @deffn Test -size n@r{[}bckwMG@r{]}
665 True if the file uses @var{n} units of space, rounding up. The units
666 are 512-byte blocks by default, but they can be changed by adding a
667 one-character suffix to @var{n}:
671 512-byte blocks (never 1024)
675 kilobytes (1024 bytes)
684 The `b' suffix always considers blocks to be 512 bytes. This is not
685 affected by the setting (or non-setting) of the POSIXLY_CORRECT
686 environment variable. This behaviour is different to the behaviour of
687 the @samp{-ls} action). If you want to use 1024-byte units, use the
690 The number can be prefixed with a `+' or a `-'. A plus sign indicates
691 that the test should succeed if the file uses at least @var{n} units
692 of storage (this is the way I normally use this test) and a minus sign
693 indicates that the test should succeed if the file uses less than
694 @var{n} units of storage. There is no `=' prefix, because that's the
697 The size does not count indirect blocks, but it does count blocks in
698 sparse files that are not actually allocated. This handling of sparse
699 files differs from the output of the @samp{%k} and @samp{%b} format
700 specifiers for the @samp{-printf} predicate.
705 True if the file is empty and is either a regular file or a directory.
706 This might make it a good candidate for deletion. This test is useful
707 with @samp{-depth} (@pxref{Directories}) and @samp{-exec rm -rf '@{@}' ';'}
708 (@pxref{Single File}).
715 True if the file is of type @var{c}:
719 block (buffered) special
721 character (unbuffered) special
738 The same as @samp{-type} unless the file is a symbolic link. For
739 symbolic links: if @samp{-follow} has not been given, true if the file
740 is a link to a file of type @var{c}; if @samp{-follow} has been given,
741 true if @var{c} is @samp{l}. In other words, for symbolic links,
742 @samp{-xtype} checks the type of the file that @samp{-type} does not
743 check. @xref{Symbolic Links}, for more information on @samp{-follow}.
749 @deffn Test -user uname
750 @deffnx Test -group gname
751 True if the file is owned by user @var{uname} (belongs to group @var{gname}).
752 A numeric ID is allowed.
757 True if the file's numeric user ID (group ID) is @var{n}. These tests
758 support ranges (@samp{+@var{n}} and @samp{-@var{n}}), unlike
759 @samp{-user} and @samp{-group}.
763 @deffnx Test -nogroup
764 True if no user corresponds to the file's numeric user ID (no group
765 corresponds to the numeric group ID). These cases usually mean that the
766 files belonged to users who have since been removed from the system.
767 You probably should change the ownership of such files to an existing
768 user or group, using the @code{chown} or @code{chgrp} program.
774 @xref{File Permissions}, for information on how file permissions are
775 structured and how to specify them.
777 @deffn Test -perm mode
779 file's permissions are exactly @var{mode} (which can be numeric or symbolic).
780 Symbolic modes use mode 0 as a point of departure.
781 If @var{mode} starts with @samp{-}, true if
782 @emph{all} of the permissions set in @var{mode} are set for the file;
783 permissions not set in @var{mode} are ignored.
784 If @var{mode} starts with @samp{+}, true if
785 @emph{any} of the permissions set in @var{mode} are set for the file;
786 permissions not set in @var{mode} are ignored.
792 To search for files based on their contents, you can use the @code{grep}
793 program. For example, to find out which C source files in the current
794 directory contain the string @samp{thing}, you can do:
800 If you also want to search for the string in files in subdirectories,
801 you can combine @code{grep} with @code{find} and @code{xargs}, like
805 find . -name '*.[ch]' | xargs grep -l thing
808 The @samp{-l} option causes @code{grep} to print only the names of files
809 that contain the string, rather than the lines that contain it. The
810 string argument (@samp{thing}) is actually a regular expression, so it
811 can contain metacharacters. This method can be refined a little by
812 using the @samp{-r} option to make @code{xargs} not run @code{grep} if
813 @code{find} produces no output, and using the @code{find} action
814 @samp{-print0} and the @code{xargs} option @samp{-0} to avoid
815 misinterpreting files whose names contain spaces:
818 find . -name '*.[ch]' -print0 | xargs -r -0 grep -l thing
821 For a fuller treatment of finding files whose contents match a pattern,
822 see the manual page for @code{grep}.
827 Here is how to control which directories @code{find} searches, and how
828 it searches them. These two options allow you to process a horizontal
829 slice of a directory tree.
831 @deffn Option -maxdepth levels
832 Descend at most @var{levels} (a non-negative integer) levels of
833 directories below the command line arguments. @samp{-maxdepth 0} means
834 only apply the tests and actions to the command line arguments.
837 @deffn Option -mindepth levels
838 Do not apply any tests or actions at levels less than @var{levels} (a
839 non-negative integer). @samp{-mindepth 1} means process all files
840 except the command line arguments.
844 Process each directory's contents before the directory itself. Doing
845 this is a good idea when producing lists of files to archive with
846 @code{cpio} or @code{tar}. If a directory does not have write
847 permission for its owner, its contents can still be restored from the
848 archive since the directory's permissions are restored after its contents.
852 If @samp{-depth} is not given, true; do not descend into the current
853 directory. If @samp{-depth} is given, false; no effect. @samp{-prune}
854 only affects tests and actions that come after it in the expression, not
855 those that come before.
857 For example, to skip the directory @file{src/emacs} and all files and
858 directories under it, and print the names of the other files found:
861 find . -wholename './src/emacs' -prune -o -print
865 @deffn Option -noleaf
866 Do not optimize by assuming that directories contain 2 fewer
867 subdirectories than their hard link count. This option is needed when
868 searching filesystems that do not follow the Unix directory-link
869 convention, such as CD-ROM or MS-DOS filesystems or AFS volume mount
870 points. Each directory on a normal Unix filesystem has at least 2 hard
871 links: its name and its @file{.} entry. Additionally, its
872 subdirectories (if any) each have a @file{..} entry linked to that
873 directory. When @code{find} is examining a directory, after it has
874 statted 2 fewer subdirectories than the directory's link count, it knows
875 that the rest of the entries in the directory are non-directories
876 (@dfn{leaf} files in the directory tree). If only the files' names need
877 to be examined, there is no need to stat them; this gives a significant
878 increase in search speed.
881 @deffn Option -ignore_readdir_race
882 If a file disappears after its name has been read from a directory but
883 before @code{find} gets around to examining the file with @code{stat},
884 don't issue an error message. If you don't specify this option, an
885 error message will be issued. This option can be useful in system
886 scripts (cron scripts, for example) that examine areas of the
887 filesystem that change frequently (mail queues, temporary directories,
888 and so forth), because this scenario is common for those sorts of
889 directories. Completely silencing error messages from @code{find} is
890 undesirable, so this option neatly solves the problem. There is no
891 way to search one part of the filesystem with this option on and part
892 of it with this option off, though.
895 @deffn Option -noignore_readdir_race
896 This option reverses the effect of the @samp{-ignore_readdir_race} option.
903 A @dfn{filesystem} is a section of a disk, either on the local host or
904 mounted from a remote host over a network. Searching network
905 filesystems can be slow, so it is common to make @code{find} avoid them.
907 There are two ways to avoid searching certain filesystems. One way is
908 to tell @code{find} to only search one filesystem:
911 @deffnx Option -mount
912 Don't descend directories on other filesystems. These options are synonyms.
915 The other way is to check the type of filesystem each file is on, and
916 not descend directories that are on undesirable filesystem types:
918 @deffn Test -fstype type
919 True if the file is on a filesystem of type @var{type}. The valid
920 filesystem types vary among different versions of Unix; an incomplete
921 list of filesystem types that are accepted on some version of Unix or
924 ufs 4.2 4.3 nfs tmp mfs S51K S52K
926 You can use @samp{-printf} with the @samp{%F} directive to see the types
927 of your filesystems. @xref{Print File Information}. @samp{-fstype} is
928 usually used with @samp{-prune} to avoid searching remote filesystems
929 (@pxref{Directories}).
932 @node Combining Primaries With Operators
933 @section Combining Primaries With Operators
935 Operators build a complex expression from tests and actions.
936 The operators are, in order of decreasing precedence:
939 @item @asis{( @var{expr} )}
941 Force precedence. True if @var{expr} is true.
943 @item @asis{! @var{expr}}
944 @itemx @asis{-not @var{expr}}
947 True if @var{expr} is false.
949 @item @asis{@var{expr1 expr2}}
950 @itemx @asis{@var{expr1} -a @var{expr2}}
951 @itemx @asis{@var{expr1} -and @var{expr2}}
954 And; @var{expr2} is not evaluated if @var{expr1} is false.
956 @item @asis{@var{expr1} -o @var{expr2}}
957 @itemx @asis{@var{expr1} -or @var{expr2}}
960 Or; @var{expr2} is not evaluated if @var{expr1} is true.
962 @item @asis{@var{expr1} , @var{expr2}}
964 List; both @var{expr1} and @var{expr2} are always evaluated. True if
965 @var{expr2} is true. The value of @var{expr1} is discarded. This
966 operator lets you do multiple independent operations on one traversal,
967 without depending on whether other operations succeeded.
970 @code{find} searches the directory tree rooted at each file name by
971 evaluating the expression from left to right, according to the rules of
972 precedence, until the outcome is known (the left hand side is false for
973 @samp{-and}, true for @samp{-or}), at which point @code{find} moves on
974 to the next file name.
976 There are two other tests that can be useful in complex expressions:
986 @node Actions, Common Tasks, Finding Files, Top
989 There are several ways you can print information about the files that
990 match the criteria you gave in the @code{find} expression. You can
991 print the information either to the standard output or to a file that
992 you name. You can also execute commands that have the file names as
993 arguments. You can use those commands as further filters to select files.
997 * Print File Information::
1002 @node Print File Name
1003 @section Print File Name
1005 @deffn Action -print
1006 True; print the full file name on the standard output, followed by a
1010 @deffn Action -fprint file
1011 True; print the full file name into file @var{file}, followed by a
1012 newline. If @var{file} does not exist when @code{find} is run, it is
1013 created; if it does exist, it is truncated to 0 bytes. The file names
1014 @file{/dev/stdout} and @file{/dev/stderr} are handled specially; they
1015 refer to the standard output and standard error output, respectively.
1018 @node Print File Information
1019 @section Print File Information
1022 True; list the current file in @samp{ls -dils} format on the standard
1023 output. The output looks like this:
1026 204744 17 -rw-r--r-- 1 djm staff 17337 Nov 2 1992 ./lwall-quotes
1033 The inode number of the file. @xref{Hard Links}, for how to find files
1034 based on their inode number.
1037 the number of blocks in the file. The block counts are of 1K blocks,
1038 unless the environment variable @code{POSIXLY_CORRECT} is set, in which
1039 case 512-byte blocks are used. @xref{Size}, for how to find files based
1043 The file's type and permissions. The type is shown as a dash for a
1044 regular file; for other file types, a letter like for @samp{-type} is
1045 used (@pxref{Type}). The permissions are read, write, and execute for
1046 the file's owner, its group, and other users, respectively; a dash means
1047 the permission is not granted. @xref{File Permissions}, for more details
1048 about file permissions. @xref{Permissions}, for how to find files based
1049 on their permissions.
1052 The number of hard links to the file.
1055 The user who owns the file.
1061 The file's size in bytes.
1064 The date the file was last modified.
1067 The file's name. @samp{-ls} quotes non-printable characters in the file
1068 names using C-like backslash escapes.
1072 @deffn Action -fls file
1073 True; like @samp{-ls} but write to @var{file} like @samp{-fprint}
1074 (@pxref{Print File Name}).
1077 @deffn Action -printf format
1078 True; print @var{format} on the standard output, interpreting @samp{\}
1079 escapes and @samp{%} directives. Field widths and precisions can be
1080 specified as with the @code{printf} C function. Unlike @samp{-print},
1081 @samp{-printf} does not add a newline at the end of the string.
1084 @deffn Action -fprintf file format
1085 True; like @samp{-printf} but write to @var{file} like @samp{-fprint}
1086 (@pxref{Print File Name}).
1091 * Format Directives::
1098 The escapes that @samp{-printf} and @samp{-fprintf} recognize are:
1106 Stop printing from this format immediately and flush the output.
1118 A literal backslash (@samp{\}).
1120 The character whose ASCII code is NNN (octal).
1123 A @samp{\} character followed by any other character is treated as an
1124 ordinary character, so they both are printed, and a warning message is
1125 printed to the standard error output (because it was probably a typo).
1127 @node Format Directives
1128 @subsection Format Directives
1130 @samp{-printf} and @samp{-fprintf} support the following format
1131 directives to print information about the file being processed.
1132 The C @code{printf} function, field width and precision specifiers
1133 are supported, as applied to string (%s) types. I.E. you can specify
1134 "minimum field width"."maximum field width" for each directive.
1136 @samp{%%} is a literal percent sign. A @samp{%} character followed by
1137 an unrecognised character (i.e. not a known directive or printf field
1138 width and precision specifier), is discarded (but the unrecognised character
1139 is printed), and a warning message is printed to the standard error output
1140 (because it was probably a typo).
1144 * Ownership Directives::
1146 * Location Directives::
1150 @node Name Directives
1151 @subsubsection Name Directives
1157 File's name with any leading directories removed (only the last element).
1159 Leading directories of file's name (all but the last element and the
1162 File's name with the name of the command line argument under which
1163 it was found removed from the beginning.
1165 Command line argument under which file was found.
1168 @node Ownership Directives
1169 @subsubsection Ownership Directives
1173 File's group name, or numeric group ID if the group has no name.
1175 File's numeric group ID.
1177 File's user name, or numeric user ID if the user has no name.
1179 File's numeric user ID.
1181 File's permissions (in octal).
1184 @node Size Directives
1185 @subsubsection Size Directives
1189 Amount of disk space occupied by the file, measured in 1K blocks
1190 (rounded up). This can be less than the length of the file if
1191 it is a sparse file (that is, it has ``holes'').
1193 File's size in 512-byte blocks (rounded up). This also can be less
1194 than the length of the file, if the file is sparse.
1196 File's size in bytes.
1199 @node Location Directives
1200 @subsubsection Location Directives
1204 File's depth in the directory tree; files named on the command line
1207 Type of the filesystem the file is on; this value can be used for
1208 @samp{-fstype} (@pxref{Directories}).
1210 Object of symbolic link (empty string if file is not a symbolic link).
1212 File's inode number (in decimal).
1214 Number of hard links to file.
1217 @node Time Directives
1218 @subsubsection Time Directives
1220 Some of these directives use the C @code{ctime} function. Its output
1221 depends on the current locale, but it typically looks like
1224 Wed Nov 2 00:42:36 1994
1229 File's last access time in the format returned by the C @code{ctime} function.
1231 File's last access time in the format specified by @var{k}
1232 (@pxref{Time Formats}).
1234 File's last status change time in the format returned by the C @code{ctime}
1237 File's last status change time in the format specified by @var{k}
1238 (@pxref{Time Formats}).
1240 File's last modification time in the format returned by the C @code{ctime}
1243 File's last modification time in the format specified by @var{k}
1244 (@pxref{Time Formats}).
1248 @subsection Time Formats
1250 Below are the formats for the directives @samp{%A}, @samp{%C}, and
1251 @samp{%T}, which print the file's timestamps. Some of these formats
1252 might not be available on all systems, due to differences in the C
1253 @code{strftime} function between systems.
1258 * Combined Time Formats::
1261 @node Time Components
1262 @subsubsection Time Components
1264 The following format directives print single components of the time.
1278 time zone (e.g., EDT), or nothing if no time zone is determinable
1284 seconds since Jan. 1, 1970, 00:00 GMT.
1287 @node Date Components
1288 @subsubsection Date Components
1290 The following format directives print single components of the date.
1294 locale's abbreviated weekday name (Sun..Sat)
1296 locale's full weekday name, variable length (Sunday..Saturday)
1299 locale's abbreviated month name (Jan..Dec)
1301 locale's full month name, variable length (January..December)
1305 day of month (01..31)
1309 day of year (001..366)
1311 week number of year with Sunday as first day of week (00..53)
1313 week number of year with Monday as first day of week (00..53)
1317 last two digits of year (00..99)
1320 @node Combined Time Formats
1321 @subsubsection Combined Time Formats
1323 The following format directives print combinations of time and date
1328 time, 12-hour (hh:mm:ss [AP]M)
1330 time, 24-hour (hh:mm:ss)
1332 locale's time representation (H:M:S)
1334 locale's date and time (Sat Nov 04 12:02:33 EST 1989)
1338 locale's date representation (mm/dd/yy)
1342 @section Run Commands
1344 You can use the list of file names created by @code{find} or
1345 @code{locate} as arguments to other commands. In this way you can
1346 perform arbitrary actions on the files.
1355 @subsection Single File
1357 Here is how to run a command on one file at a time.
1359 @deffn Action -exec command ;
1360 Execute @var{command}; true if 0 status is returned. @code{find} takes
1361 all arguments after @samp{-exec} to be part of the command until an
1362 argument consisting of @samp{;} is reached. It replaces the string
1363 @samp{@{@}} by the current file name being processed everywhere it
1364 occurs in the command. Both of these constructions need to be escaped
1365 (with a @samp{\}) or quoted to protect them from expansion by the shell.
1366 The command is executed in the directory in which @code{find} was run.
1368 For example, to compare each C header file in the current directory with
1369 the file @file{/tmp/master}:
1372 find . -name '*.h' -exec diff -u '@{@}' /tmp/master ';'
1377 @deffn Action -exec command +
1378 This variant of @samp{-exec} not yet supported. Once it has been
1379 implemented as specified in the POSIX standard, a @samp{@{@}} will expand to
1380 a list of names of matching files. This expansion is done in such a
1381 way as to avoid exceeding the maximum command line length available on
1386 @node Multiple Files
1387 @subsection Multiple Files
1389 Sometimes you need to process files alone. But when you
1390 don't, it is faster to run a command on as many files as possible at a
1391 time, rather than once per file. Doing this saves on the time it takes
1392 to start up the command each time.
1394 To run a command on more than one file at once, use the @code{xargs}
1395 command, which is invoked like this:
1398 xargs @r{[}@var{option}@dots{}@r{]} @r{[}@var{command} @r{[}@var{initial-arguments}@r{]}@r{]}
1401 @code{xargs} reads arguments from the standard input, delimited by
1402 blanks (which can be protected with double or single quotes or a
1403 backslash) or newlines. It executes the @var{command} (default is
1404 @file{/bin/echo}) one or more times with any @var{initial-arguments}
1405 followed by arguments read from standard input. Blank lines on the
1406 standard input are ignored.
1408 Instead of blank-delimited names, it is safer to use @samp{find -print0}
1409 or @samp{find -fprint0} and process the output by giving the @samp{-0}
1410 or @samp{--null} option to GNU @code{xargs}, GNU @code{tar}, GNU
1411 @code{cpio}, or @code{perl}.
1413 You can use shell command substitution (backquotes) to process a list of
1414 arguments, like this:
1417 grep -l sprintf `find $HOME -name '*.c' -print`
1420 However, that method produces an error if the length of the @samp{.c}
1421 file names exceeds the operating system's command-line length limit.
1422 @code{xargs} avoids that problem by running the command as many times as
1423 necessary without exceeding the limit:
1426 find $HOME -name '*.c' -print | xargs grep -l sprintf
1429 However, if the command needs to have its standard input be a terminal
1430 (@code{less}, for example), you have to use the shell command
1431 substitution method.
1433 The @code{xargs} command will process all its input, building command
1434 lines and executing them, unless one of the commands exits with a
1435 status of 255 (this will cause xargs to issue an error message and
1436 stop) or it reads a line contains the end of file string specified
1437 with the @samp{--eof} option.
1440 * Unsafe File Name Handling::
1441 * Safe File Name Handling::
1442 * Limiting Command Size::
1443 * Interspersing File Names::
1446 @node Unsafe File Name Handling
1447 @subsubsection Unsafe File Name Handling
1449 Because file names can contain quotes, backslashes, blank characters,
1450 and even newlines, it is not safe to process them using @code{xargs} in its
1451 default mode of operation. But since most files' names do not contain
1452 blanks, this problem occurs only infrequently. If you are only
1453 searching through files that you know have safe names, then you need not
1454 be concerned about it.
1456 @c This example is adapted from:
1457 @c From: pfalstad@stone.Princeton.EDU (Paul John Falstad)
1458 @c Newsgroups: comp.unix.shell
1459 @c Subject: Re: Beware xargs security holes
1460 @c Date: 16 Oct 90 19:12:06 GMT
1462 In many applications, if @code{xargs} botches processing a file because
1463 its name contains special characters, some data might be lost. The
1464 importance of this problem depends on the importance of the data and
1465 whether anyone notices the loss soon enough to correct it. However,
1466 here is an extreme example of the problems that using blank-delimited
1467 names can cause. If the following command is run daily from
1468 @code{cron}, then any user can remove any file on the system:
1471 find / -name '#*' -atime +7 -print | xargs rm
1474 For example, you could do something like this:
1482 and then @code{cron} would delete @file{/vmunix}, if it ran
1483 @code{xargs} with @file{/} as its current directory.
1485 To delete other files, for example @file{/u/joeuser/.plan}, you could do
1493 eg$ mkdir u u/joeuser u/joeuser/.plan'
1495 eg$ echo > u/joeuser/.plan'
1498 eg$ find . -name '#*' -print | xargs echo
1499 ./# ./# /u/joeuser/.plan /#foo
1502 @node Safe File Name Handling
1503 @subsubsection Safe File Name Handling
1505 Here is how to make @code{find} output file names so that they can be
1506 used by other programs without being mangled or misinterpreted. You can
1507 process file names generated this way by giving the @samp{-0} or
1508 @samp{--null} option to GNU @code{xargs}, GNU @code{tar}, GNU
1509 @code{cpio}, or @code{perl}.
1511 @deffn Action -print0
1512 True; print the full file name on the standard output, followed by a
1516 @deffn Action -fprint0 file
1517 True; like @samp{-print0} but write to @var{file} like @samp{-fprint}
1518 (@pxref{Print File Name}).
1521 @node Limiting Command Size
1522 @subsubsection Limiting Command Size
1524 @code{xargs} gives you control over how many arguments it passes to the
1525 command each time it executes it. By default, it uses up to
1526 @code{ARG_MAX} - 2k, or 20k, whichever is smaller, characters per
1527 command. It uses as many lines and arguments as fit within that limit.
1528 The following options modify those values.
1531 @item --no-run-if-empty
1533 If the standard input does not contain any nonblanks, do not run the
1534 command. By default, the command is run once even if there is no input.
1536 @item --max-lines@r{[}=@var{max-lines}@r{]}
1537 @itemx -l@r{[}@var{max-lines}@r{]}
1538 Use at most @var{max-lines} nonblank input lines per command line;
1539 @var{max-lines} defaults to 1 if omitted. Trailing blanks cause an
1540 input line to be logically continued on the next input line, for the
1541 purpose of counting the lines. Implies @samp{-x}.
1543 @item --max-args=@var{max-args}
1544 @itemx -n @var{max-args}
1545 Use at most @var{max-args} arguments per command line. Fewer than
1546 @var{max-args} arguments will be used if the size (see the @samp{-s}
1547 option) is exceeded, unless the @samp{-x} option is given, in which case
1548 @code{xargs} will exit.
1550 @item --max-chars=@var{max-chars}
1551 @itemx -s @var{max-chars}
1552 Use at most @var{max-chars} characters per command line, including the
1553 command and initial arguments and the terminating nulls at the ends of
1554 the argument strings.
1556 @item --max-procs=@var{max-procs}
1557 @itemx -P @var{max-procs}
1558 Run up to @var{max-procs} processes at a time; the default is 1. If
1559 @var{max-procs} is 0, @code{xargs} will run as many processes as
1560 possible at a time. Use the @samp{-n}, @samp{-s}, or @samp{-l} option
1561 with @samp{-P}; otherwise chances are that the command will be run only
1565 @node Interspersing File Names
1566 @subsubsection Interspersing File Names
1568 @code{xargs} can insert the name of the file it is processing between
1569 arguments you give for the command. Unless you also give options to
1570 limit the command size (@pxref{Limiting Command Size}), this mode of
1571 operation is equivalent to @samp{find -exec} (@pxref{Single File}).
1574 @item --replace@r{[}=@var{replace-str}@r{]}
1575 @itemx -i@r{[}@var{replace-str}@r{]}
1576 Replace occurrences of @var{replace-str} in the initial arguments with
1577 names read from standard input. Also, unquoted blanks do not terminate
1578 arguments. If @var{replace-str} is omitted, it defaults to @samp{@{@}}
1579 (like for @samp{find -exec}). Implies @samp{-x} and @samp{-l 1}. As an
1580 example, to sort each file the @file{bills} directory, leaving the
1581 output in that file name with @file{.sorted} appended, you could do:
1584 find bills -type f | xargs -iXX sort -o XX.sorted XX
1588 The equivalent command using @samp{find -exec} is:
1591 find bills -type f -exec sort -o '@{@}.sorted' '@{@}' ';'
1596 @subsection Querying
1598 To ask the user whether to execute a command on a single file, you can
1599 use the @code{find} primary @samp{-ok} instead of @samp{-exec}:
1601 @deffn Action -ok command ;
1602 Like @samp{-exec} (@pxref{Single File}), but ask the user first (on
1603 the standard input); if the response does not start with @samp{y} or
1604 @samp{Y}, do not run the command, and return false.
1607 When processing multiple files with a single command, to query the user
1608 you give @code{xargs} the following option. When using this option, you
1609 might find it useful to control the number of files processed per
1610 invocation of the command (@pxref{Limiting Command Size}).
1615 Prompt the user about whether to run each command line and read a line
1616 from the terminal. Only run the command line if the response starts
1617 with @samp{y} or @samp{Y}. Implies @samp{-t}.
1621 @section Adding Tests
1623 You can test for file attributes that none of the @code{find} builtin
1624 tests check. To do this, use @code{xargs} to run a program that filters
1625 a list of files printed by @code{find}. If possible, use @code{find}
1626 builtin tests to pare down the list, so the program run by @code{xargs}
1627 has less work to do. The tests builtin to @code{find} will likely run
1628 faster than tests that other programs perform.
1630 For example, here is a way to print the names of all of the unstripped
1631 binaries in the @file{/usr/local} directory tree. Builtin tests avoid
1632 running @code{file} on files that are not regular files or are not
1636 find /usr/local -type f -perm +a=x | xargs file |
1637 grep 'not stripped' | cut -d: -f1
1641 The @code{cut} program removes everything after the file name from the
1642 output of @code{file}.
1644 @c Idea from Martin Weitzel.
1645 If you want to place a special test somewhere in the middle of a
1646 @code{find} expression, you can use @samp{-exec} to run a program that
1647 performs the test. Because @samp{-exec} evaluates to the exit status of
1648 the executed program, you can write a program (which can be a shell
1649 script) that tests for a special attribute and make it exit with a true
1650 (zero) or false (non-zero) status. It is a good idea to place such a
1651 special test @emph{after} the builtin tests, because it starts a new
1652 process which could be avoided if a builtin test evaluates to false.
1653 Use this method only when @code{xargs} is not flexible enough, because
1654 starting one or more new processes to test each file is slower than
1655 using @code{xargs} to start one process that tests many files.
1657 Here is a shell script called @code{unstripped} that checks whether its
1658 argument is an unstripped binary file:
1662 file $1 | grep 'not stripped' > /dev/null
1665 This script relies on the fact that the shell exits with the status of
1666 the last program it executed, in this case @code{grep}. @code{grep}
1667 exits with a true status if it found any matches, false if not. Here is
1668 an example of using the script (assuming it is in your search path). It
1669 lists the stripped executables in the file @file{sbins} and the
1670 unstripped ones in @file{ubins}.
1673 find /usr/local -type f -perm +a=x \
1674 \( -exec unstripped '@{@}' \; -fprint ubins -o -fprint sbins \)
1677 @node Common Tasks, Databases, Actions, Top
1678 @chapter Common Tasks
1680 The sections that follow contain some extended examples that both give a
1681 good idea of the power of these programs, and show you how to solve
1682 common real-world problems.
1685 * Viewing And Editing::
1688 * Strange File Names::
1689 * Fixing Permissions::
1690 * Classifying Files::
1693 @node Viewing And Editing
1694 @section Viewing And Editing
1696 To view a list of files that meet certain criteria, simply run your file
1697 viewing program with the file names as arguments. Shells substitute a
1698 command enclosed in backquotes with its output, so the whole command
1702 less `find /usr/include -name '*.h' | xargs grep -l mode_t`
1706 You can edit those files by giving an editor name instead of a file
1712 You can pass a list of files produced by @code{find} to a file archiving
1713 program. GNU @code{tar} and @code{cpio} can both read lists of file
1714 names from the standard input---either delimited by nulls (the safe way)
1715 or by blanks (the lazy, risky default way). To use null-delimited
1716 names, give them the @samp{--null} option. You can store a file archive
1717 in a file, write it on a tape, or send it over a network to extract on
1720 One common use of @code{find} to archive files is to send a list of the
1721 files in a directory tree to @code{cpio}. Use @samp{-depth} so if a
1722 directory does not have write permission for its owner, its contents can
1723 still be restored from the archive since the directory's permissions are
1724 restored after its contents. Here is an example of doing this using
1725 @code{cpio}; you could use a more complex @code{find} expression to
1726 archive only certain files.
1729 find . -depth -print0 |
1730 cpio --create --null --format=crc --file=/dev/nrst0
1733 You could restore that archive using this command:
1736 cpio --extract --null --make-dir --unconditional \
1737 --preserve --file=/dev/nrst0
1740 Here are the commands to do the same things using @code{tar}:
1743 find . -depth -print0 |
1744 tar --create --null --files-from=- --file=/dev/nrst0
1746 tar --extract --null --preserve-perm --same-owner \
1750 @c Idea from Rick Sladkey.
1751 Here is an example of copying a directory from one machine to another:
1754 find . -depth -print0 | cpio -0o -Hnewc |
1755 rsh @var{other-machine} "cd `pwd` && cpio -i0dum"
1759 @section Cleaning Up
1761 @c Idea from Jim Meyering.
1762 This section gives examples of removing unwanted files in various situations.
1763 Here is a command to remove the CVS backup files created when an update
1767 find . -name '.#*' -print0 | xargs -0r rm -f
1770 @c Idea from Franc,ois Pinard.
1771 You can run this command to clean out your clutter in @file{/tmp}. You
1772 might place it in the file your shell runs when you log out
1773 (@file{.bash_logout}, @file{.logout}, or @file{.zlogout}, depending on
1774 which shell you use).
1777 find /tmp -user $LOGNAME -type f -print0 | xargs -0 -r rm -f
1780 If your @code{find} command removes directories, you may find that
1781 you get a spurious error message when @code{find} tries to recurse
1782 into a directory that has now been removed. Using the @samp{-depth}
1783 option will normally resolve this problem.
1785 @c Idea from Noah Friedman.
1786 To remove old Emacs backup and auto-save files, you can use a command
1787 like the following. It is especially important in this case to use
1788 null-terminated file names because Emacs packages like the VM mailer
1789 often create temporary file names with spaces in them, like @file{#reply
1790 to David J. MacKenzie<1>#}.
1793 find ~ \( -name '*~' -o -name '#*#' \) -print0 |
1794 xargs --no-run-if-empty --null rm -vf
1797 Removing old files from @file{/tmp} is commonly done from @code{cron}:
1799 @c Idea from Kaveh Ghazi.
1801 find /tmp /var/tmp -not -type d -mtime +3 -print0 |
1802 xargs --null --no-run-if-empty rm -f
1804 find /tmp /var/tmp -depth -mindepth 1 -type d -empty -exec rmdir @{@} \;
1807 The second @code{find} command above uses @samp{-depth} so it cleans out
1808 empty directories depth-first, hoping that the parents become empty and
1809 can be removed too. It uses @samp{-mindepth} to avoid removing
1810 @file{/tmp} itself if it becomes totally empty.
1812 @node Strange File Names
1813 @section Strange File Names
1816 @c From: tmatimar@isgtec.com (Ted Timar)
1817 @c Newsgroups: comp.unix.questions,comp.unix.shell,comp.answers,news.answers
1818 @c Subject: Unix - Frequently Asked Questions (2/7) [Frequent posting]
1819 @c Subject: How do I remove a file with funny characters in the filename ?
1820 @c Date: Thu Mar 18 17:16:55 EST 1993
1821 @code{find} can help you remove or rename a file with strange characters
1822 in its name. People are sometimes stymied by files whose names contain
1823 characters such as spaces, tabs, control characters, or characters with
1824 the high bit set. The simplest way to remove such files is:
1827 rm -i @var{some*pattern*that*matches*the*problem*file}
1830 @code{rm} asks you whether to remove each file matching the given
1831 pattern. If you are using an old shell, this approach might not work if
1832 the file name contains a character with the high bit set; the shell may
1833 strip it off. A more reliable way is:
1836 find . -maxdepth 1 @var{tests} -ok rm '@{@}' \;
1840 where @var{tests} uniquely identify the file. The @samp{-maxdepth 1}
1841 option prevents @code{find} from wasting time searching for the file in
1842 any subdirectories; if there are no subdirectories, you may omit it. A
1843 good way to uniquely identify the problem file is to figure out its
1850 Suppose you have a file whose name contains control characters, and you
1851 have found that its inode number is 12345. This command prompts you for
1852 whether to remove it:
1855 find . -maxdepth 1 -inum 12345 -ok rm -f '@{@}' \;
1858 If you don't want to be asked, perhaps because the file name may contain
1859 a strange character sequence that will mess up your screen when printed,
1860 then use @samp{-exec} instead of @samp{-ok}.
1862 If you want to rename the file instead, you can use @code{mv} instead of
1866 find . -maxdepth 1 -inum 12345 -ok mv '@{@}' @var{new-file-name} \;
1869 @node Fixing Permissions
1870 @section Fixing Permissions
1872 Suppose you want to make sure that everyone can write to the directories in a
1873 certain directory tree. Here is a way to find directories lacking either
1874 user or group write permission (or both), and fix their permissions:
1877 find . -type d -not -perm -ug=w | xargs chmod ug+w
1881 You could also reverse the operations, if you want to make sure that
1882 directories do @emph{not} have world write permission.
1884 @node Classifying Files
1885 @section Classifying Files
1888 @c From: martin@mwtech.UUCP (Martin Weitzel)
1889 @c Newsgroups: comp.unix.wizards,comp.unix.questions
1890 @c Subject: Advanced usage of 'find' (Re: Unix security automating script)
1891 @c Date: 22 Mar 90 15:05:19 GMT
1892 If you want to classify a set of files into several groups based on
1893 different criteria, you can use the comma operator to perform multiple
1894 independent tests on the files. Here is an example:
1897 find / -type d \( -perm -o=w -fprint allwrite , \
1898 -perm -o=x -fprint allexec \)
1900 echo "Directories that can be written to by everyone:"
1903 echo "Directories with search permissions for everyone:"
1907 @code{find} has only to make one scan through the directory tree (which
1908 is one of the most time consuming parts of its work).
1910 @node Databases, File Permissions, Common Tasks, Top
1911 @chapter File Name Databases
1913 The file name databases used by @code{locate} contain lists of files
1914 that were in particular directory trees when the databases were last
1915 updated. The file name of the default database is determined when
1916 @code{locate} and @code{updatedb} are configured and installed. The
1917 frequency with which the databases are updated and the directories for
1918 which they contain entries depend on how often @code{updatedb} is run,
1919 and with which arguments.
1922 * Database Locations::
1923 * Database Formats::
1926 @node Database Locations
1927 @section Database Locations
1929 There can be multiple file name databases. Users can select which
1930 databases @code{locate} searches using an environment variable or a
1931 command line option. The system administrator can choose the file name
1932 of the default database, the frequency with which the databases are
1933 updated, and the directories for which they contain entries. File name
1934 databases are updated by running the @code{updatedb} program, typically
1937 In networked environments, it often makes sense to build a database at
1938 the root of each filesystem, containing the entries for that filesystem.
1939 @code{updatedb} is then run for each filesystem on the fileserver where
1940 that filesystem is on a local disk, to prevent thrashing the network.
1941 Here are the options to @code{updatedb} to select which directories each
1942 database contains entries for:
1945 @item --localpaths='@var{path}@dots{}'
1946 Non-network directories to put in the database.
1947 Default is @file{/}.
1949 @item --netpaths='@var{path}@dots{}'
1950 Network (NFS, AFS, RFS, etc.) directories to put in the database.
1951 The environment variable @code{NETPATHS} also sets this value.
1955 @item --prunepaths='@var{path}@dots{}'
1956 Directories to not put in the database, which would otherwise be.
1957 The environment variable @code{PRUNEPATHS} also sets this value.
1958 Default is @file{/tmp /usr/tmp /var/tmp /afs}.
1960 @item --prunefs='@var{path}@dots{}'
1961 File systems to not put in the database, which would otherwise be.
1962 Note that files are pruned when a file system is reached;
1963 Any file system mounted under an undesired file system will be
1965 The environment variable @code{PRUNEFS} also sets this value.
1966 Default is @file{nfs NFS proc}.
1968 @item --output=@var{dbfile}
1969 The database file to build.
1970 Default is system-dependent, but typically @file{/usr/local/var/locatedb}.
1972 @item --localuser=@var{user}
1973 The user to search the non-network directories as, using @code{su}.
1974 Default is to search the non-network directories as the current user.
1975 You can also use the environment variable @code{LOCALUSER} to set this user.
1977 @item --netuser=@var{user}
1978 The user to search network directories as, using @code{su}.
1979 Default is @code{daemon}.
1980 You can also use the environment variable @code{NETUSER} to set this user.
1983 @node Database Formats
1984 @section Database Formats
1986 The file name databases contain lists of files that were in particular
1987 directory trees when the databases were last updated. The file name
1988 database format changed starting with GNU @code{locate} version 4.0 to
1989 allow machines with different byte orderings to share the databases. The
1990 new GNU @code{locate} can read both the old and new database formats.
1991 However, old versions of @code{locate} and @code{find} produce incorrect
1992 results if given a new-format database.
1995 * New Database Format::
1997 * Old Database Format::
2000 @node New Database Format
2001 @subsection New Database Format
2003 @code{updatedb} runs a program called @code{frcode} to
2004 @dfn{front-compress} the list of file names, which reduces the database
2005 size by a factor of 4 to 5. Front-compression (also known as
2006 incremental encoding) works as follows.
2008 The database entries are a sorted list (case-insensitively, for users'
2009 convenience). Since the list is sorted, each entry is likely to share a
2010 prefix (initial string) with the previous entry. Each database entry
2011 begins with an offset-differential count byte, which is the additional
2012 number of characters of prefix of the preceding entry to use beyond the
2013 number that the preceding entry is using of its predecessor. (The
2014 counts can be negative.) Following the count is a null-terminated ASCII
2015 remainder---the part of the name that follows the shared prefix.
2017 If the offset-differential count is larger than can be stored in a byte
2018 (+/-127), the byte has the value 0x80 and the count follows in a 2-byte
2019 word, with the high byte first (network byte order).
2021 Every database begins with a dummy entry for a file called
2022 @file{LOCATE02}, which @code{locate} checks for to ensure that the
2023 database file has the correct format; it ignores the entry in doing the
2026 Databases can not be concatenated together, even if the first (dummy)
2027 entry is trimmed from all but the first database. This is because the
2028 offset-differential count in the first entry of the second and following
2029 databases will be wrong.
2031 @node Sample Database
2032 @subsection Sample Database
2034 Sample input to @code{frcode}:
2035 @c with nulls changed to newlines:
2039 /usr/src/cmd/aardvark.c
2040 /usr/src/cmd/armadillo.c
2044 Length of the longest prefix of the preceding entry to share:
2053 Output from @code{frcode}, with trailing nulls changed to newlines
2054 and count bytes made printable:
2064 (6 = 14 - 8, and -9 = 5 - 14)
2066 @node Old Database Format
2067 @subsection Old Database Format
2069 The old database format is used by Unix @code{locate} and @code{find}
2070 programs and earlier releases of the GNU ones. @code{updatedb} produces
2071 this format if given the @samp{--old-format} option.
2073 @code{updatedb} runs programs called @code{bigram} and @code{code} to
2074 produce old-format databases. The old format differs from the new one
2075 in the following ways. Instead of each entry starting with an
2076 offset-differential count byte and ending with a null, byte values from
2077 0 through 28 indicate offset-differential counts from -14 through 14.
2078 The byte value indicating that a long offset-differential count follows
2079 is 0x1e (30), not 0x80. The long counts are stored in host byte order,
2080 which is not necessarily network byte order, and host integer word size,
2081 which is usually 4 bytes. They also represent a count 14 less than
2082 their value. The database lines have no termination byte; the start of
2083 the next line is indicated by its first byte having a value <= 30.
2085 In addition, instead of starting with a dummy entry, the old database
2086 format starts with a 256 byte table containing the 128 most common
2087 bigrams in the file list. A bigram is a pair of adjacent bytes. Bytes
2088 in the database that have the high bit set are indexes (with the high
2089 bit cleared) into the bigram table. The bigram and offset-differential
2090 count coding makes these databases 20-25% smaller than the new format,
2091 but makes them not 8-bit clean. Any byte in a file name that is in the
2092 ranges used for the special codes is replaced in the database by a
2093 question mark, which not coincidentally is the shell wildcard to match a
2096 The old format therefore can not faithfully store entries with non-ASCII
2097 characters. It therefore should not be used in internationalized
2100 @node File Permissions, Reference, Databases, Top
2101 @chapter File Permissions
2105 @node Reference, Primary Index, File Permissions, Top
2108 Below are summaries of the command line syntax for the programs
2109 discussed in this manual.
2114 * Invoking updatedb::
2118 @node Invoking find, Invoking locate, , Reference
2119 @section Invoking @code{find}
2122 find @r{[}@var{file}@dots{}@r{]} @r{[}@var{expression}@r{]}
2125 @code{find} searches the directory tree rooted at each file name
2126 @var{file} by evaluating the @var{expression} on each file it finds in
2129 @code{find} considers the first argument that begins with @samp{-},
2130 @samp{(}, @samp{)}, @samp{,}, or @samp{!} to be the beginning of the
2131 expression; any arguments before it are paths to search, and any
2132 arguments after it are the rest of the expression. If no paths are
2133 given, the current directory is used. If no expression is given, the
2134 expression @samp{-print} is used.
2136 @code{find} exits with status 0 if all files are processed successfully,
2137 greater than 0 if errors occur.
2139 @xref{Primary Index}, for a summary of all of the tests, actions, and
2140 options that the expression can contain.
2142 @code{find} also recognizes two options for administrative use:
2146 Print a summary of the command-line argument format and exit.
2148 Print the version number of @code{find} and exit.
2151 @node Invoking locate, Invoking updatedb, Invoking find, Reference
2152 @section Invoking @code{locate}
2155 locate @r{[}@var{option}@dots{}@r{]} @var{pattern}@dots{}
2159 @item --database=@var{path}
2160 @itemx -d @var{path}
2161 Instead of searching the default file name database, search the file
2162 name databases in @var{path}, which is a colon-separated list of
2163 database file names. You can also use the environment variable
2164 @code{LOCATE_PATH} to set the list of database files to search. The
2165 option overrides the environment variable if both are used.
2169 Only print out such names which currently exist (instead of such names
2170 which existed when the database was created).
2171 Note that this may slow down the program a lot, if there are many matches
2176 Ignore case distinctions in both the pattern and the file names.
2179 Print a summary of the options to @code{locate} and exit.
2182 Print the version number of @code{locate} and exit.
2185 @node Invoking updatedb, Invoking xargs, Invoking locate, Reference
2186 @section Invoking @code{updatedb}
2189 updatedb @r{[}@var{option}@dots{}@r{]}
2193 @item --localpaths='@var{path}@dots{}'
2194 Non-network directories to put in the database.
2195 Default is @file{/}.
2197 @item --netpaths='@var{path}@dots{}'
2198 Network (NFS, AFS, RFS, etc.) directories to put in the database.
2199 The environment variable @code{NETPATHS} also sets this value.
2202 @item --prunepaths='@var{path}@dots{}'
2203 Directories to not put in the database, which would otherwise be.
2204 The environment variable @code{PRUNEPATHS} also sets this value.
2205 Default is @file{/tmp /usr/tmp /var/tmp /afs}.
2207 @item --prunefs='@var{path}@dots{}'
2208 File systems to not put in the database, which would otherwise be.
2209 Note that files are pruned when a file system is reached;
2210 Any file system mounted under an undesired file system will be
2212 The environment variable @code{PRUNEFS} also sets this value.
2213 Default is @file{nfs NFS proc}.
2215 @item --output=@var{dbfile}
2216 The database file to build.
2217 Default is system-dependent, but typically @file{/usr/local/var/locatedb}.
2219 @item --localuser=@var{user}
2220 The user to search the non-network directories as, using @code{su}.
2221 Default is to search the non-network directories as the current user.
2222 You can also use the environment variable @code{LOCALUSER} to set this user.
2224 @item --netuser=@var{user}
2225 The user to search network directories as, using @code{su}(1).
2226 Default is @code{daemon}.
2227 You can also use the environment variable @code{NETUSER} to set this user.
2230 @node Invoking xargs, , Invoking updatedb, Reference
2231 @section Invoking @code{xargs}
2234 xargs @r{[}@var{option}@dots{}@r{]} @r{[}@var{command} @r{[}@var{initial-arguments}@r{]}@r{]}
2237 @code{xargs} exits with the following status:
2243 if any invocation of the command exited with status 1-125
2245 if the command exited with status 255
2247 if the command is killed by a signal
2249 if the command cannot be run
2251 if the command is not found
2253 if some other error occurred.
2259 Input filenames are terminated by a null character instead of by
2260 whitespace, and the quotes and backslash are not special (every
2261 character is taken literally). Disables the end of file string, which
2262 is treated like any other argument.
2264 @item --eof@r{[}=@var{eof-str}@r{]}
2265 @itemx -e@r{[}@var{eof-str}@r{]}
2266 Set the end of file string to @var{eof-str}. If the end of file string
2267 occurs as a line of input, the rest of the input is ignored. If
2268 @var{eof-str} is omitted, there is no end of file string. If this
2269 option is not given, the end of file string defaults to @samp{_}.
2272 Print a summary of the options to @code{xargs} and exit.
2274 @item --replace@r{[}=@var{replace-str}@r{]}
2275 @itemx -i@r{[}@var{replace-str}@r{]}
2276 Replace occurrences of @var{replace-str} in the initial arguments with
2277 names read from standard input. Also, unquoted blanks do not terminate
2278 arguments. If @var{replace-str} is omitted, it defaults to @samp{@{@}}
2279 (like for @samp{find -exec}). Implies @samp{-x} and @samp{-l 1}.
2281 @item --max-lines@r{[}=@var{max-lines}@r{]}
2282 @itemx -l@r{[}@var{max-lines}@r{]}
2283 Use at most @var{max-lines} nonblank input lines per command line;
2284 @var{max-lines} defaults to 1 if omitted. Trailing blanks cause an
2285 input line to be logically continued on the next input line, for the
2286 purpose of counting the lines. Implies @samp{-x}.
2288 @item --max-args=@var{max-args}
2289 @itemx -n @var{max-args}
2290 Use at most @var{max-args} arguments per command line. Fewer than
2291 @var{max-args} arguments will be used if the size (see the @samp{-s}
2292 option) is exceeded, unless the @samp{-x} option is given, in which case
2293 @code{xargs} will exit.
2297 Prompt the user about whether to run each command line and read a line
2298 from the terminal. Only run the command line if the response starts
2299 with @samp{y} or @samp{Y}. Implies @samp{-t}.
2301 @item --no-run-if-empty
2303 If the standard input does not contain any nonblanks, do not run the
2304 command. By default, the command is run once even if there is no input.
2306 @item --max-chars=@var{max-chars}
2307 @itemx -s @var{max-chars}
2308 Use at most @var{max-chars} characters per command line, including the
2309 command and initial arguments and the terminating nulls at the ends of
2310 the argument strings.
2314 Print the command line on the standard error output before executing
2318 Print the version number of @code{xargs} and exit.
2322 Exit if the size (see the @samp{-s} option) is exceeded.
2325 @item --max-procs=@var{max-procs}
2326 @itemx -P @var{max-procs}
2327 Run up to @var{max-procs} processes at a time; the default is 1. If
2328 @var{max-procs} is 0, @code{xargs} will run as many processes as
2332 @node Primary Index, , Reference, Top
2333 @unnumbered @code{find} Primary Index
2335 This is a list of all of the primaries (tests, actions, and options)
2336 that make up @code{find} expressions for selecting files. @xref{find
2337 Expressions}, for more information on expressions.
2344 @comment texi related words used by Emacs' spell checker ispell.el
2346 @comment LocalWords: texinfo setfilename settitle setchapternewpage
2347 @comment LocalWords: iftex finalout ifinfo DIR titlepage vskip pt
2348 @comment LocalWords: filll dir samp dfn noindent xref pxref
2349 @comment LocalWords: var deffn texi deffnx itemx emph asis
2350 @comment LocalWords: findex smallexample subsubsection cindex
2352 @comment other words used by Emacs' spell checker ispell.el
2353 @comment LocalWords: README fred updatedb xargs Plett Rendell akefile
2354 @comment LocalWords: args grep Filesystems fo foo fOo wildcards iname
2355 @comment LocalWords: ipath regex iregex expr fubar regexps
2356 @comment LocalWords: metacharacters macs sr sc inode lname ilname
2357 @comment LocalWords: sysdep noleaf ls inum xdev filesystems usr atime
2358 @comment LocalWords: ctime mtime amin cmin mmin al daystart Sladkey rm
2359 @comment LocalWords: anewer cnewer bckw rf xtype uname gname uid gid
2360 @comment LocalWords: nouser nogroup chown chgrp perm ch maxdepth
2361 @comment LocalWords: mindepth cpio src CD AFS statted stat fstype ufs
2362 @comment LocalWords: nfs tmp mfs printf fprint dils rw djm Nov lwall
2363 @comment LocalWords: POSIXLY fls fprintf strftime locale's EDT GMT AP
2364 @comment LocalWords: EST diff perl backquotes sprintf Falstad Oct cron
2365 @comment LocalWords: eg vmunix mkdir afs allexec allwrite ARG bigram
2366 @comment LocalWords: bigrams cd chmod comp crc CVS dbfile dum eof
2367 @comment LocalWords: fileserver filesystem fn frcode Ghazi Hnewc iXX
2368 @comment LocalWords: joeuser Kaveh localpaths localuser LOGNAME
2369 @comment LocalWords: Meyering mv netpaths netuser nonblank nonblanks
2370 @comment LocalWords: ois ok Pinard printindex proc procs prunefs
2371 @comment LocalWords: prunepaths pwd RFS rmadillo rmdir rsh sbins str
2372 @comment LocalWords: su Timar ubins ug unstripped vf VM Weitzel
2373 @comment LocalWords: wildcard zlogout