doc/find.texi

   1 \input texinfo @c -*-texinfo-*-
   2 @c %**start of header
   3 @setfilename find.info
   4 @settitle Finding Files
   5 @c For double-sided printing, uncomment:
   6 @c @setchapternewpage odd
   7 @c %**end of header
   8
   9 @include version.texi
  10
  11 @iftex
  12 @finalout
  13 @end iftex
  14
  15 @ifinfo
  16 @format
  17 START-INFO-DIR-ENTRY
  18 * Finding Files: (find).        Listing and operating on files
  19                                 that match certain criteria.
  20 END-INFO-DIR-ENTRY
  21 @end format
  22
  23 This file documents the GNU utilities for finding files that match
  24 certain criteria and performing various operations on them.
  25
  26 Copyright (C) 1994 Free Software Foundation, Inc.
  27
  28 Permission is granted to make and distribute verbatim copies of
  29 this manual provided the copyright notice and this permission notice
  30 are preserved on all copies.
  31
  32 @ignore
  33 Permission is granted to process this file through TeX and print the
  34 results, provided the printed document carries copying permission
  35 notice identical to this one except for the removal of this paragraph
  36 (this paragraph not being relevant to the printed manual).
  37
  38 @end ignore
  39 Permission is granted to copy and distribute modified versions of this
  40 manual under the conditions for verbatim copying, provided that the entire
  41 resulting derived work is distributed under the terms of a permission
  42 notice identical to this one.
  43
  44 Permission is granted to copy and distribute translations of this manual
  45 into another language, under the above conditions for modified versions,
  46 except that this permission notice may be stated in a translation approved
  47 by the Foundation.
  48 @end ifinfo
  49
  50 @titlepage
  51 @title Finding Files
  52 @subtitle Edition @value{EDITION}, for GNU @code{find} version @value{VERSION}
  53 @subtitle @value{UPDATED}
  54 @author by David MacKenzie
  55
  56 @page
  57 @vskip 0pt plus 1filll
  58 Copyright @copyright{} 1994 Free Software Foundation, Inc.
  59
  60 Permission is granted to make and distribute verbatim copies of
  61 this manual provided the copyright notice and this permission notice
  62 are preserved on all copies.
  63
  64 Permission is granted to copy and distribute modified versions of this
  65 manual under the conditions for verbatim copying, provided that the entire
  66 resulting derived work is distributed under the terms of a permission
  67 notice identical to this one.
  68
  69 Permission is granted to copy and distribute translations of this manual
  70 into another language, under the above conditions for modified versions,
  71 except that this permission notice may be stated in a translation approved
  72 by the Foundation.
  73 @end titlepage
  74
  75 @node Top, Introduction, , (dir)
  76 @comment  node-name,  next,  previous,  up
  77
  78 @ifinfo
  79 This file documents the GNU utilities for finding files that match
  80 certain criteria and performing various actions on them.
  81 This is edition @value{EDITION}, for @code{find} version @value{VERSION}.
  82 @end ifinfo
  83
  84 @c The master menu, created with texinfo-master-menu, goes here.
  85
  86 @menu
  87 * Introduction::                Summary of the tasks this manual describes.
  88 * Finding Files::               Finding files that match certain criteria.
  89 * Actions::                     Doing things to files you have found.
  90 * Common Tasks::                Solutions to common real-world problems.
  91 * Databases::                   Maintaining file name databases.
  92 * File Permissions::            How to control access to files.
  93 * Reference::                   Summary of how to invoke the programs.
  94 * Primary Index::               The components of @code{find} expressions.
  95 @end menu
  96
  97 @node Introduction, Finding Files, Top, Top
  98 @chapter Introduction
  99
 100 This manual shows how to find files that meet criteria you specify, and
 101 how to perform various actions on the files that you find.  The
 102 principal programs that you use to perform these tasks are @code{find},
 103 @code{locate}, and @code{xargs}.  Some of the examples in this manual
 104 use capabilities specific to the GNU versions of those programs.
 105
 106 GNU @code{find} was originally written by Eric Decker, with enhancements
 107 by David MacKenzie, Jay Plett, and Tim Wood.  GNU @code{xargs} was
 108 originally written by Mike Rendell, with enhancements by David
 109 MacKenzie.  GNU @code{locate} and its associated utilities were
 110 originally written by James Woods, with enhancements by David MacKenzie.
 111 The idea for @samp{find -print0} and @samp{xargs -0} came from Dan
 112 Bernstein.  Many other people have contributed bug fixes, small
 113 improvements, and helpful suggestions.  Thanks!
 114
 115 Mail suggestions and bug reports for these programs to
 116 @code{bug-findutils@@gnu.org}.  Please include the version
 117 number, which you can get by running @samp{find --version}.
 118
 119 @menu
 120 * Scope::
 121 * Overview::
 122 * find Expressions::
 123 @end menu
 124
 125 @node Scope
 126 @section Scope
 127
 128 For brevity, the word @dfn{file} in this manual means a regular file, a
 129 directory, a symbolic link, or any other kind of node that has a
 130 directory entry.  A directory entry is also called a @dfn{file name}.  A
 131 file name may contain some, all, or none of the directories in a path
 132 that leads to the file.  These are all examples of what this manual
 133 calls ``file names'':
 134
 135 @example
 136 parser.c
 137 README
 138 ./budget/may-94.sc
 139 fred/.cshrc
 140 /usr/local/include/termcap.h
 141 @end example
 142
 143 A @dfn{directory tree} is a directory and the files it contains, all of
 144 its subdirectories and the files they contain, etc.  It can also be a
 145 single non-directory file.
 146
 147 These programs enable you to find the files in one or more directory
 148 trees that:
 149
 150 @itemize @bullet
 151 @item
 152 have names that contain certain text or match a certain pattern;
 153 @item
 154 are links to certain files;
 155 @item
 156 were last used during a certain period of time;
 157 @item
 158 are within a certain size range;
 159 @item
 160 are of a certain type (regular file, directory, symbolic link, etc.);
 161 @item
 162 are owned by a certain user or group;
 163 @item
 164 have certain access permissions;
 165 @item
 166 contain text that matches a certain pattern;
 167 @item
 168 are within a certain depth in the directory tree;
 169 @item
 170 or some combination of the above.
 171 @end itemize
 172
 173 Once you have found the files you're looking for (or files that are
 174 potentially the ones you're looking for), you can do more to them than
 175 simply list their names.  You can get any combination of the files'
 176 attributes, or process the files in many ways, either individually or in
 177 groups of various sizes.  Actions that you might want to perform on the
 178 files you have found include, but are not limited to:
 179
 180 @itemize @bullet
 181 @item
 182 view or edit
 183 @item
 184 store in an archive
 185 @item
 186 remove or rename
 187 @item
 188 change access permissions
 189 @item
 190 classify into groups
 191 @end itemize
 192
 193 This manual describes how to perform each of those tasks, and more.
 194
 195 @node Overview
 196 @section Overview
 197
 198 The principal programs used for making lists of files that match given
 199 criteria and running commands on them are @code{find}, @code{locate},
 200 and @code{xargs}.  An additional command, @code{updatedb}, is used by
 201 system administrators to create databases for @code{locate} to use.
 202
 203 @code{find} searches for files in a directory hierarchy and prints
 204 information about the files it found.  It is run like this:
 205
 206 @example
 207 find @r{[}@var{file}@dots{}@r{]} @r{[}@var{expression}@r{]}
 208 @end example
 209
 210 @noindent
 211 Here is a typical use of @code{find}.  This example prints the names of
 212 all files in the directory tree rooted in @file{/usr/src} whose name
 213 ends with @samp{.c} and that are larger than 100 Kilobytes.
 214 @example
 215 find /usr/src -name '*.c' -size +100k -print
 216 @end example
 217
 218 @code{locate} searches special file name databases for file names that
 219 match patterns.  The system administrator runs the @code{updatedb}
 220 program to create the databases.  @code{locate} is run like this:
 221
 222 @example
 223 locate @r{[}@var{option}@dots{}@r{]} @var{pattern}@dots{}
 224 @end example
 225
 226 @noindent
 227 This example prints the names of all files in the default file name
 228 database whose name ends with @samp{Makefile} or @samp{makefile}.  Which
 229 file names are stored in the database depends on how the system
 230 administrator ran @code{updatedb}.
 231 @example
 232 locate '*[Mm]akefile'
 233 @end example
 234
 235 The name @code{xargs}, pronounced EX-args, means ``combine arguments.''
 236 @code{xargs} builds and executes command lines by gathering together
 237 arguments it reads on the standard input.  Most often, these arguments
 238 are lists of file names generated by @code{find}.  @code{xargs} is run
 239 like this:
 240
 241 @example
 242 xargs @r{[}@var{option}@dots{}@r{]} @r{[}@var{command} @r{[}@var{initial-arguments}@r{]}@r{]}
 243 @end example
 244
 245 @noindent
 246 The following command searches the files listed in the file
 247 @file{file-list} and prints all of the lines in them that contain the
 248 word @samp{typedef}.
 249 @example
 250 xargs grep typedef < file-list
 251 @end example
 252
 253 @node find Expressions
 254 @section @code{find} Expressions
 255
 256 The expression that @code{find} uses to select files consists of one or
 257 more @dfn{primaries}, each of which is a separate command line argument
 258 to @code{find}.  @code{find} evaluates the expression each time it
 259 processes a file.  An expression can contain any of the following types
 260 of primaries:
 261
 262 @table @dfn
 263 @item options
 264 affect overall operation rather than the processing of a specific file;
 265 @item tests
 266 return a true or false value, depending on the file's attributes;
 267 @item actions
 268 have side effects and return a true or false value; and
 269 @item operators
 270 connect the other arguments and affect when and whether they are
 271 evaluated.
 272 @end table
 273
 274 You can omit the operator between two primaries; it defaults to
 275 @samp{-and}.  @xref{Combining Primaries With Operators}, for ways to
 276 connect primaries into more complex expressions.  If the expression
 277 contains no actions other than @samp{-prune}, @samp{-print} is performed
 278 on all files for which the entire expression is true (@pxref{Print File
 279 Name}).
 280
 281 Options take effect immediately, rather than being evaluated for each
 282 file when their place in the expression is reached.  Therefore, for
 283 clarity, it is best to place them at the beginning of the expression.
 284
 285 Many of the primaries take arguments, which immediately follow them in
 286 the next command line argument to @code{find}.  Some arguments are file
 287 names, patterns, or other strings; others are numbers.  Numeric
 288 arguments can be specified as
 289
 290 @table @code
 291 @item +@var{n}
 292 for greater than @var{n},
 293 @item -@var{n}
 294 for less than @var{n},
 295 @item @var{n}
 296 for exactly @var{n}.
 297 @end table
 298
 299 @node Finding Files, Actions, Introduction, Top
 300 @chapter Finding Files
 301
 302 By default, @code{find} prints to the standard output the names of the
 303 files that match the given criteria.  @xref{Actions}, for how to get more
 304 information about the matching files.
 305
 306 @menu
 307 * Name::
 308 * Links::
 309 * Time::
 310 * Size::
 311 * Type::
 312 * Owner::
 313 * Permissions::
 314 * Contents::
 315 * Directories::
 316 * Filesystems::
 317 * Combining Primaries With Operators::
 318 @end menu
 319
 320 @node Name
 321 @section Name
 322
 323 Here are ways to search for files whose name matches a certain pattern.
 324 @xref{Shell Pattern Matching}, for a description of the @var{pattern}
 325 arguments to these tests.
 326
 327 Each of these tests has a case-sensitive version and a case-insensitive
 328 version, whose name begins with @samp{i}.  In a case-insensitive
 329 comparison, the patterns @samp{fo*} and @samp{F??} match the file names
 330 @file{Foo}, @samp{FOO}, @samp{foo}, @samp{fOo}, etc.
 331
 332 @menu
 333 * Base Name Patterns::
 334 * Full Name Patterns::
 335 * Fast Full Name Search::
 336 * Shell Pattern Matching::      Wildcards used by these programs.
 337 @end menu
 338
 339 @node Base Name Patterns
 340 @subsection Base Name Patterns
 341
 342 @deffn Test -name pattern
 343 @deffnx Test -iname pattern
 344 True if the base of the file name (the path with the leading directories
 345 removed) matches shell pattern @var{pattern}.  For @samp{-iname}, the
 346 match is case-insensitive.  To ignore a whole directory tree, use
 347 @samp{-prune} (@pxref{Directories}).  As an example, to find Texinfo
 348 source files in @file{/usr/local/doc}:
 349
 350 @example
 351 find /usr/local/doc -name '*.texi'
 352 @end example
 353 @end deffn
 354
 355 Patterns for @samp{-name} and @samp{-iname} will match a filename with
 356 a leading @samp{.}.  For example the command @samp{find /tmp -name
 357 \*bar} will match the file @file{/tmp/.foobar}.
 358
 359
 360 @node Full Name Patterns
 361 @subsection Full Name Patterns
 362
 363 @deffn Test -wholename pattern
 364 @deffnx Test -iwholename pattern
 365 True if the entire file name, starting with the command line argument
 366 under which the file was found, matches shell pattern @var{pattern}.
 367 For @samp{-iwholename}, the match is case-insensitive.  To ignore a whole
 368 directory tree, use @samp{-prune} rather than checking every file in the
 369 tree (@pxref{Directories}).
 370 @end deffn
 371
 372 @deffn Test -path pattern
 373 @deffnx Test -ipath pattern
 374 These tests are deprecated, but work as for @samp{-wholename} and @samp{-iwholename},
 375 respectively.  The @samp{-ipath} test is a GNU extension, but @samp{-path} is also
 376 provided by HP-UX @code{find}.
 377 @end deffn
 378
 379 @deffn Test -regex expr
 380 @deffnx Test -iregex expr
 381 True if the entire file name matches regular expression @var{expr}.
 382 This is a match on the whole path, not a search.  For example, to match
 383 a file named @file{./fubar3}, you can use the regular expression
 384 @samp{.*bar.} or @samp{.*b.*3}, but not @samp{b.*r3}.  @xref{Regexps, ,
 385 Syntax of Regular Expressions, emacs, The GNU Emacs Manual}, for a
 386 description of the syntax of regular expressions.  For @samp{-iregex},
 387 the match is case-insensitive.
 388 @end deffn
 389
 390 @node Fast Full Name Search
 391 @subsection Fast Full Name Search
 392
 393 To search for files by name without having to actually scan the
 394 directories on the disk (which can be slow), you can use the
 395 @code{locate} program.  For each shell pattern you give it,
 396 @code{locate} searches one or more databases of file names and displays
 397 the file names that contain the pattern.  @xref{Shell Pattern Matching},
 398 for details about shell patterns.
 399
 400 If a pattern is a plain string---it contains no
 401 metacharacters---@code{locate} displays all file names in the database
 402 that contain that string.  If a pattern contains
 403 metacharacters, @code{locate} only displays file names that match the
 404 pattern exactly.  As a result, patterns that contain metacharacters
 405 should usually begin with a @samp{*}, and will most often end with one
 406 as well.  The exceptions are patterns that are intended to explicitly
 407 match the beginning or end of a file name.
 408
 409 The command
 410 @example
 411 locate @var{pattern}
 412 @end example
 413
 414 is almost equivalent to
 415 @example
 416 find @var{directories} -name @var{pattern}
 417 @end example
 418
 419 where @var{directories} are the directories for which the file name
 420 databases contain information.  The differences are that the
 421 @code{locate} information might be out of date, and that @code{locate}
 422 handles wildcards in the pattern slightly differently than @code{find}
 423 (@pxref{Shell Pattern Matching}).
 424
 425 The file name databases contain lists of files that were on the system
 426 when the databases were last updated.  The system administrator can
 427 choose the file name of the default database, the frequency with which
 428 the databases are updated, and the directories for which they contain
 429 entries.
 430
 431 Here is how to select which file name databases @code{locate} searches.
 432 The default is system-dependent.
 433
 434 @table @code
 435 @item --database=@var{path}
 436 @itemx -d @var{path}
 437 Instead of searching the default file name database, search the file
 438 name databases in @var{path}, which is a colon-separated list of
 439 database file names.  You can also use the environment variable
 440 @code{LOCATE_PATH} to set the list of database files to search.  The
 441 option overrides the environment variable if both are used.
 442 @end table
 443
 444 @node Shell Pattern Matching
 445 @subsection Shell Pattern Matching
 446
 447 @code{find} and @code{locate} can compare file names, or parts of file
 448 names, to shell patterns.  A @dfn{shell pattern} is a string that may
 449 contain the following special characters, which are known as
 450 @dfn{wildcards} or @dfn{metacharacters}.
 451
 452 You must quote patterns that contain metacharacters to prevent the shell
 453 from expanding them itself.  Double and single quotes both work; so does
 454 escaping with a backslash.
 455
 456 @table @code
 457 @item *
 458 Matches any zero or more characters.
 459
 460 @item ?
 461 Matches any one character.
 462
 463 @item [@var{string}]
 464 Matches exactly one character that is a member of the string
 465 @var{string}.  This is called a @dfn{character class}.  As a shorthand,
 466 @var{string} may contain ranges, which consist of two characters with a
 467 dash between them.  For example, the class @samp{[a-z0-9_]} matches a
 468 lowercase letter, a number, or an underscore.  You can negate a class by
 469 placing a @samp{!} or @samp{^} immediately after the opening bracket.
 470 Thus, @samp{[^A-Z@@]} matches any character except an uppercase letter
 471 or an at sign.
 472
 473 @item \
 474 Removes the special meaning of the character that follows it.  This
 475 works even in character classes.
 476 @end table
 477
 478 In the @code{find} tests that do shell pattern matching (@samp{-name},
 479 @samp{-wholename}, etc.), wildcards in the pattern do not match a @samp{.}
 480 at the beginning of a file name.  This is not the case for
 481 @code{locate}.  Thus, @samp{find -name '*macs'} does not match a file
 482 named @file{.emacs}, but @samp{locate '*macs'} does.
 483
 484 Slash characters have no special significance in the shell pattern
 485 matching that @code{find} and @code{locate} do, unlike in the shell, in
 486 which wildcards do not match them.  Therefore, a pattern @samp{foo*bar}
 487 can match a file name @samp{foo3/bar}, and a pattern @samp{./sr*sc} can
 488 match a file name @samp{./src/misc}.
 489
 490 @node Links
 491 @section Links
 492
 493 There are two ways that files can be linked together.  @dfn{Symbolic
 494 links} are a special type of file whose contents are a portion of the
 495 name of another file.  @dfn{Hard links} are multiple directory entries
 496 for one file; the file names all have the same index node (@dfn{inode})
 497 number on the disk.
 498
 499 @menu
 500 * Symbolic Links::
 501 * Hard Links::
 502 @end menu
 503
 504 @node Symbolic Links
 505 @subsection Symbolic Links
 506
 507 @deffn Test -lname pattern
 508 @deffnx Test -ilname pattern
 509 True if the file is a symbolic link whose contents match shell pattern
 510 @var{pattern}.  For @samp{-ilname}, the match is case-insensitive.
 511 @xref{Shell Pattern Matching}, for details about the @var{pattern}
 512 argument.  So, to list any symbolic links to @file{sysdep.c} in the
 513 current directory and its subdirectories, you can do:
 514
 515 @example
 516 find . -lname '*sysdep.c'
 517 @end example
 518 @end deffn
 519
 520 @deffn Option -follow
 521 Dereference symbolic links.  The following differences in behavior occur
 522 when this option is given:
 523
 524 @itemize @bullet
 525 @item
 526 @code{find} follows symbolic links to directories when searching
 527 directory trees.
 528 @item
 529 @samp{-lname} and @samp{-ilname} always return false.
 530 @item
 531 @samp{-type} reports the types of the files that symbolic links point
 532 to.
 533 @item
 534 Implies @samp{-noleaf} (@pxref{Directories}).
 535 @end itemize
 536 @end deffn
 537
 538 @node Hard Links
 539 @subsection Hard Links
 540
 541 To find hard links, first get the inode number of the file whose links
 542 you want to find.  You can learn a file's inode number and the number of
 543 links to it by running @samp{ls -i} or @samp{find -ls}.  If the file has
 544 more than one link, you can search for the other links by passing that
 545 inode number to @samp{-inum}.  Add the @samp{-xdev} option if you are
 546 starting the search at a directory that has other filesystems mounted on
 547 it, such as @file{/usr} on many systems.  Doing this saves needless
 548 searching, since hard links to a file must be on the same filesystem.
 549 @xref{Filesystems}.
 550
 551 @deffn Test -inum n
 552 File has inode number @var{n}.
 553 @end deffn
 554
 555 You can also search for files that have a certain number of links, with
 556 @samp{-links}.  Directories normally have at least two hard links; their
 557 @file{.} entry is the second one.  If they have subdirectories, each of
 558 those also has a hard link called @file{..} to its parent directory.
 559
 560 @deffn Test -links n
 561 File has @var{n} hard links.
 562 @end deffn
 563
 564 @node Time
 565 @section Time
 566
 567 Each file has three time stamps, which record the last time that certain
 568 operations were performed on the file:
 569
 570 @enumerate
 571 @item
 572 access (read the file's contents)
 573 @item
 574 change the status (modify the file or its attributes)
 575 @item
 576 modify (change the file's contents)
 577 @end enumerate
 578
 579 You can search for files whose time stamps are within a certain age
 580 range, or compare them to other time stamps.
 581
 582 @menu
 583 * Age Ranges::
 584 * Comparing Timestamps::
 585 @end menu
 586
 587 @node Age Ranges
 588 @subsection Age Ranges
 589
 590 These tests are mainly useful with ranges (@samp{+@var{n}} and
 591 @samp{-@var{n}}).
 592
 593 @deffn Test -atime n
 594 @deffnx Test -ctime n
 595 @deffnx Test -mtime n
 596 True if the file was last accessed (or its status changed, or it was
 597 modified) @var{n}*24 hours ago.
 598 @end deffn
 599
 600 @deffn Test -amin n
 601 @deffnx Test -cmin n
 602 @deffnx Test -mmin n
 603 True if the file was last accessed (or its status changed, or it was
 604 modified) @var{n} minutes ago.  These tests provide finer granularity of
 605 measurement than @samp{-atime} et al.  For example, to list files in
 606 @file{/u/bill} that were last read from 2 to 6 minutes ago:
 607
 608 @example
 609 find /u/bill -amin +2 -amin -6
 610 @end example
 611 @end deffn
 612
 613 @deffn Option -daystart
 614 Measure times from the beginning of today rather than from 24 hours ago.
 615 So, to list the regular files in your home directory that were modified
 616 yesterday, do
 617
 618 @example
 619 find ~ -daystart -type f -mtime 1
 620 @end example
 621 @end deffn
 622
 623 @node Comparing Timestamps
 624 @subsection Comparing Timestamps
 625
 626 As an alternative to comparing timestamps to the current time, you can
 627 compare them to another file's timestamp.  That file's timestamp could
 628 be updated by another program when some event occurs.  Or you could set
 629 it to a particular fixed date using the @code{touch} command.  For
 630 example, to list files in @file{/usr} modified after February 1 of the
 631 current year:
 632
 633 @c Idea from Rick Sladkey.
 634 @example
 635 touch -t 02010000 /tmp/stamp$$
 636 find /usr -newer /tmp/stamp$$
 637 rm -f /tmp/stamp$$
 638 @end example
 639
 640 @deffn Test -anewer file
 641 @deffnx Test -cnewer file
 642 @deffnx Test -newer file
 643 True if the file was last accessed (or its status changed, or it was
 644 modified) more recently than @var{file} was modified.  These tests are
 645 affected by @samp{-follow} only if @samp{-follow} comes before them on
 646 the command line.  @xref{Symbolic Links}, for more information on
 647 @samp{-follow}.  As an example, to list any files modified since
 648 @file{/bin/sh} was last modified:
 649
 650 @example
 651 find . -newer /bin/sh
 652 @end example
 653 @end deffn
 654
 655 @deffn Test -used n
 656 True if the file was last accessed @var{n} days after its status was
 657 last changed.  Useful for finding files that are not being used, and
 658 could perhaps be archived or removed to save disk space.
 659 @end deffn
 660
 661 @node Size
 662 @section Size
 663
 664 @deffn Test -size n@r{[}bckwMG@r{]}
 665 True if the file uses @var{n} units of space, rounding up.  The units
 666 are 512-byte blocks by default, but they can be changed by adding a
 667 one-character suffix to @var{n}:
 668
 669 @table @code
 670 @item b
 671 512-byte blocks (never 1024)
 672 @item c
 673 bytes
 674 @item k
 675 kilobytes (1024 bytes)
 676 @item w
 677 2-byte words
 678 @item M
 679 Megabytes
 680 @item G
 681 Gigabytes
 682 @end table
 683
 684 The `b' suffix always considers blocks to be 512 bytes.  This is not
 685 affected by the setting (or non-setting) of the POSIXLY_CORRECT
 686 environment variable.  This behaviour is different to the behaviour of
 687 the @samp{-ls} action).  If you want to use 1024-byte units, use the
 688 `k' suffix instead.
 689
 690 The number can be prefixed with a `+' or a `-'.  A plus sign indicates
 691 that the test should succeed if the file uses at least @var{n} units
 692 of storage (this is the way I normally use this test) and a minus sign
 693 indicates that the test should succeed if the file uses less than
 694 @var{n} units of storage.   There is no `=' prefix, because that's the
 695 default anyway.
 696
 697 The size does not count indirect blocks, but it does count blocks in
 698 sparse files that are not actually allocated.  This handling of sparse
 699 files differs from the output of the @samp{%k} and @samp{%b} format
 700 specifiers for the @samp{-printf} predicate.
 701
 702 @end deffn
 703
 704 @deffn Test -empty
 705 True if the file is empty and is either a regular file or a directory.
 706 This might make it a good candidate for deletion.  This test is useful
 707 with @samp{-depth} (@pxref{Directories}) and @samp{-exec rm -rf '@{@}' ';'}
 708 (@pxref{Single File}).
 709 @end deffn
 710
 711 @node Type
 712 @section Type
 713
 714 @deffn Test -type c
 715 True if the file is of type @var{c}:
 716
 717 @table @code
 718 @item b
 719 block (buffered) special
 720 @item c
 721 character (unbuffered) special
 722 @item d
 723 directory
 724 @item p
 725 named pipe (FIFO)
 726 @item f
 727 regular file
 728 @item l
 729 symbolic link
 730 @item s
 731 socket
 732 @item D
 733 door (Solaris)
 734 @end table
 735 @end deffn
 736
 737 @deffn Test -xtype c
 738 The same as @samp{-type} unless the file is a symbolic link.  For
 739 symbolic links: if @samp{-follow} has not been given, true if the file
 740 is a link to a file of type @var{c}; if @samp{-follow} has been given,
 741 true if @var{c} is @samp{l}.  In other words, for symbolic links,
 742 @samp{-xtype} checks the type of the file that @samp{-type} does not
 743 check.  @xref{Symbolic Links}, for more information on @samp{-follow}.
 744 @end deffn
 745
 746 @node Owner
 747 @section Owner
 748
 749 @deffn Test -user uname
 750 @deffnx Test -group gname
 751 True if the file is owned by user @var{uname} (belongs to group @var{gname}).
 752 A numeric ID is allowed.
 753 @end deffn
 754
 755 @deffn Test -uid n
 756 @deffnx Test -gid n
 757 True if the file's numeric user ID (group ID) is @var{n}.  These tests
 758 support ranges (@samp{+@var{n}} and @samp{-@var{n}}), unlike
 759 @samp{-user} and @samp{-group}.
 760 @end deffn
 761
 762 @deffn Test -nouser
 763 @deffnx Test -nogroup
 764 True if no user corresponds to the file's numeric user ID (no group
 765 corresponds to the numeric group ID).  These cases usually mean that the
 766 files belonged to users who have since been removed from the system.
 767 You probably should change the ownership of such files to an existing
 768 user or group, using the @code{chown} or @code{chgrp} program.
 769 @end deffn
 770
 771 @node Permissions
 772 @section Permissions
 773
 774 @xref{File Permissions}, for information on how file permissions are
 775 structured and how to specify them.
 776
 777 @deffn Test -perm mode
 778 True if the
 779 file's permissions are exactly @var{mode} (which can be numeric or symbolic).
 780 Symbolic modes use mode 0 as a point of departure.
 781 If @var{mode} starts with @samp{-}, true if
 782 @emph{all} of the permissions set in @var{mode} are set for the file;
 783 permissions not set in @var{mode} are ignored.
 784 If @var{mode} starts with @samp{+}, true if
 785 @emph{any} of the permissions set in @var{mode} are set for the file;
 786 permissions not set in @var{mode} are ignored.
 787 @end deffn
 788
 789 @node Contents
 790 @section Contents
 791
 792 To search for files based on their contents, you can use the @code{grep}
 793 program.  For example, to find out which C source files in the current
 794 directory contain the string @samp{thing}, you can do:
 795
 796 @example
 797 grep -l thing *.[ch]
 798 @end example
 799
 800 If you also want to search for the string in files in subdirectories,
 801 you can combine @code{grep} with @code{find} and @code{xargs}, like
 802 this:
 803
 804 @example
 805 find . -name '*.[ch]' | xargs grep -l thing
 806 @end example
 807
 808 The @samp{-l} option causes @code{grep} to print only the names of files
 809 that contain the string, rather than the lines that contain it.  The
 810 string argument (@samp{thing}) is actually a regular expression, so it
 811 can contain metacharacters.  This method can be refined a little by
 812 using the @samp{-r} option to make @code{xargs} not run @code{grep} if
 813 @code{find} produces no output, and using the @code{find} action
 814 @samp{-print0} and the @code{xargs} option @samp{-0} to avoid
 815 misinterpreting files whose names contain spaces:
 816
 817 @example
 818 find . -name '*.[ch]' -print0 | xargs -r -0 grep -l thing
 819 @end example
 820
 821 For a fuller treatment of finding files whose contents match a pattern,
 822 see the manual page for @code{grep}.
 823
 824 @node Directories
 825 @section Directories
 826
 827 Here is how to control which directories @code{find} searches, and how
 828 it searches them.  These two options allow you to process a horizontal
 829 slice of a directory tree.
 830
 831 @deffn Option -maxdepth levels
 832 Descend at most @var{levels} (a non-negative integer) levels of
 833 directories below the command line arguments.  @samp{-maxdepth 0} means
 834 only apply the tests and actions to the command line arguments.
 835 @end deffn
 836
 837 @deffn Option -mindepth levels
 838 Do not apply any tests or actions at levels less than @var{levels} (a
 839 non-negative integer).  @samp{-mindepth 1} means process all files
 840 except the command line arguments.
 841 @end deffn
 842
 843 @deffn Option -depth
 844 Process each directory's contents before the directory itself.  Doing
 845 this is a good idea when producing lists of files to archive with
 846 @code{cpio} or @code{tar}.  If a directory does not have write
 847 permission for its owner, its contents can still be restored from the
 848 archive since the directory's permissions are restored after its contents.
 849 @end deffn
 850
 851 @deffn Action -prune
 852 If @samp{-depth} is not given, true; do not descend into the current
 853 directory.  If @samp{-depth} is given, false; no effect.  @samp{-prune}
 854 only affects tests and actions that come after it in the expression, not
 855 those that come before.
 856
 857 For example, to skip the directory @file{src/emacs} and all files and
 858 directories under it, and print the names of the other files found:
 859
 860 @example
 861 find . -wholename './src/emacs' -prune -o -print
 862 @end example
 863 @end deffn
 864
 865 @deffn Option -noleaf
 866 Do not optimize by assuming that directories contain 2 fewer
 867 subdirectories than their hard link count.  This option is needed when
 868 searching filesystems that do not follow the Unix directory-link
 869 convention, such as CD-ROM or MS-DOS filesystems or AFS volume mount
 870 points.  Each directory on a normal Unix filesystem has at least 2 hard
 871 links: its name and its @file{.}  entry.  Additionally, its
 872 subdirectories (if any) each have a @file{..}  entry linked to that
 873 directory.  When @code{find} is examining a directory, after it has
 874 statted 2 fewer subdirectories than the directory's link count, it knows
 875 that the rest of the entries in the directory are non-directories
 876 (@dfn{leaf} files in the directory tree).  If only the files' names need
 877 to be examined, there is no need to stat them; this gives a significant
 878 increase in search speed.
 879 @end deffn
 880
 881 @deffn Option -ignore_readdir_race
 882 If a file disappears after its name has been read from a directory but
 883 before @code{find} gets around to examining the file with @code{stat},
 884 don't issue an error message.  If you don't specify this option, an
 885 error message will be issued.  This option can be useful in system
 886 scripts (cron scripts, for example) that examine areas of the
 887 filesystem that change frequently (mail queues, temporary directories,
 888 and so forth), because this scenario is common for those sorts of
 889 directories.  Completely silencing error messages from @code{find} is
 890 undesirable, so this option neatly solves the problem.  There is no
 891 way to search one part of the filesystem with this option on and part
 892 of it with this option off, though.
 893 @end deffn
 894
 895 @deffn Option -noignore_readdir_race
 896 This option reverses the effect of the @samp{-ignore_readdir_race} option.
 897 @end deffn
 898
 899
 900 @node Filesystems
 901 @section Filesystems
 902
 903 A @dfn{filesystem} is a section of a disk, either on the local host or
 904 mounted from a remote host over a network.  Searching network
 905 filesystems can be slow, so it is common to make @code{find} avoid them.
 906
 907 There are two ways to avoid searching certain filesystems.  One way is
 908 to tell @code{find} to only search one filesystem:
 909
 910 @deffn Option -xdev
 911 @deffnx Option -mount
 912 Don't descend directories on other filesystems.  These options are synonyms.
 913 @end deffn
 914
 915 The other way is to check the type of filesystem each file is on, and
 916 not descend directories that are on undesirable filesystem types:
 917
 918 @deffn Test -fstype type
 919 True if the file is on a filesystem of type @var{type}.  The valid
 920 filesystem types vary among different versions of Unix; an incomplete
 921 list of filesystem types that are accepted on some version of Unix or
 922 another is:
 923 @example
 924 ufs 4.2 4.3 nfs tmp mfs S51K S52K
 925 @end example
 926 You can use @samp{-printf} with the @samp{%F} directive to see the types
 927 of your filesystems.  @xref{Print File Information}.  @samp{-fstype} is
 928 usually used with @samp{-prune} to avoid searching remote filesystems
 929 (@pxref{Directories}).
 930 @end deffn
 931
 932 @node Combining Primaries With Operators
 933 @section Combining Primaries With Operators
 934
 935 Operators build a complex expression from tests and actions.
 936 The operators are, in order of decreasing precedence:
 937
 938 @table @code
 939 @item @asis{( @var{expr} )}
 940 @findex ()
 941 Force precedence.  True if @var{expr} is true.
 942
 943 @item @asis{! @var{expr}}
 944 @itemx @asis{-not @var{expr}}
 945 @findex !
 946 @findex -not
 947 True if @var{expr} is false.
 948
 949 @item @asis{@var{expr1 expr2}}
 950 @itemx @asis{@var{expr1} -a @var{expr2}}
 951 @itemx @asis{@var{expr1} -and @var{expr2}}
 952 @findex -a
 953 @findex -and
 954 And; @var{expr2} is not evaluated if @var{expr1} is false.
 955
 956 @item @asis{@var{expr1} -o @var{expr2}}
 957 @itemx @asis{@var{expr1} -or @var{expr2}}
 958 @findex -o
 959 @findex -or
 960 Or; @var{expr2} is not evaluated if @var{expr1} is true.
 961
 962 @item @asis{@var{expr1} , @var{expr2}}
 963 @findex ,
 964 List; both @var{expr1} and @var{expr2} are always evaluated.  True if
 965 @var{expr2} is true.  The value of @var{expr1} is discarded.  This
 966 operator lets you do multiple independent operations on one traversal,
 967 without depending on whether other operations succeeded.
 968 @end table
 969
 970 @code{find} searches the directory tree rooted at each file name by
 971 evaluating the expression from left to right, according to the rules of
 972 precedence, until the outcome is known (the left hand side is false for
 973 @samp{-and}, true for @samp{-or}), at which point @code{find} moves on
 974 to the next file name.
 975
 976 There are two other tests that can be useful in complex expressions:
 977
 978 @deffn Test -true
 979 Always true.
 980 @end deffn
 981
 982 @deffn Test -false
 983 Always false.
 984 @end deffn
 985
 986 @node Actions, Common Tasks, Finding Files, Top
 987 @chapter Actions
 988
 989 There are several ways you can print information about the files that
 990 match the criteria you gave in the @code{find} expression.  You can
 991 print the information either to the standard output or to a file that
 992 you name.  You can also execute commands that have the file names as
 993 arguments.  You can use those commands as further filters to select files.
 994
 995 @menu
 996 * Print File Name::
 997 * Print File Information::
 998 * Run Commands::
 999 * Adding Tests::
1000 @end menu
1001
1002 @node Print File Name
1003 @section Print File Name
1004
1005 @deffn Action -print
1006 True; print the full file name on the standard output, followed by a
1007 newline.
1008 @end deffn
1009
1010 @deffn Action -fprint file
1011 True; print the full file name into file @var{file}, followed by a
1012 newline.  If @var{file} does not exist when @code{find} is run, it is
1013 created; if it does exist, it is truncated to 0 bytes.  The file names
1014 @file{/dev/stdout} and @file{/dev/stderr} are handled specially; they
1015 refer to the standard output and standard error output, respectively.
1016 @end deffn
1017
1018 @node Print File Information
1019 @section Print File Information
1020
1021 @deffn Action -ls
1022 True; list the current file in @samp{ls -dils} format on the standard
1023 output.  The output looks like this:
1024
1025 @smallexample
1026 204744   17 -rw-r--r--   1 djm      staff       17337 Nov  2  1992 ./lwall-quotes
1027 @end smallexample
1028
1029 The fields are:
1030
1031 @enumerate
1032 @item
1033 The inode number of the file.  @xref{Hard Links}, for how to find files
1034 based on their inode number.
1035
1036 @item
1037 the number of blocks in the file.  The block counts are of 1K blocks,
1038 unless the environment variable @code{POSIXLY_CORRECT} is set, in which
1039 case 512-byte blocks are used.  @xref{Size}, for how to find files based
1040 on their size.
1041
1042 @item
1043 The file's type and permissions.  The type is shown as a dash for a
1044 regular file; for other file types, a letter like for @samp{-type} is
1045 used (@pxref{Type}).  The permissions are read, write, and execute for
1046 the file's owner, its group, and other users, respectively; a dash means
1047 the permission is not granted.  @xref{File Permissions}, for more details
1048 about file permissions.  @xref{Permissions}, for how to find files based
1049 on their permissions.
1050
1051 @item
1052 The number of hard links to the file.
1053
1054 @item
1055 The user who owns the file.
1056
1057 @item
1058 The file's group.
1059
1060 @item
1061 The file's size in bytes.
1062
1063 @item
1064 The date the file was last modified.
1065
1066 @item
1067 The file's name.  @samp{-ls} quotes non-printable characters in the file
1068 names using C-like backslash escapes.
1069 @end enumerate
1070 @end deffn
1071
1072 @deffn Action -fls file
1073 True; like @samp{-ls} but write to @var{file} like @samp{-fprint}
1074 (@pxref{Print File Name}).
1075 @end deffn
1076
1077 @deffn Action -printf format
1078 True; print @var{format} on the standard output, interpreting @samp{\}
1079 escapes and @samp{%} directives.  Field widths and precisions can be
1080 specified as with the @code{printf} C function.  Unlike @samp{-print},
1081 @samp{-printf} does not add a newline at the end of the string.
1082 @end deffn
1083
1084 @deffn Action -fprintf file format
1085 True; like @samp{-printf} but write to @var{file} like @samp{-fprint}
1086 (@pxref{Print File Name}).
1087 @end deffn
1088
1089 @menu
1090 * Escapes::
1091 * Format Directives::
1092 * Time Formats::
1093 @end menu
1094
1095 @node Escapes
1096 @subsection Escapes
1097
1098 The escapes that @samp{-printf} and @samp{-fprintf} recognize are:
1099
1100 @table @code
1101 @item \a
1102 Alarm bell.
1103 @item \b
1104 Backspace.
1105 @item \c
1106 Stop printing from this format immediately and flush the output.
1107 @item \f
1108 Form feed.
1109 @item \n
1110 Newline.
1111 @item \r
1112 Carriage return.
1113 @item \t
1114 Horizontal tab.
1115 @item \v
1116 Vertical tab.
1117 @item \\
1118 A literal backslash (@samp{\}).
1119 @item \NNN
1120 The character whose ASCII code is NNN (octal).
1121 @end table
1122
1123 A @samp{\} character followed by any other character is treated as an
1124 ordinary character, so they both are printed, and a warning message is
1125 printed to the standard error output (because it was probably a typo).
1126
1127 @node Format Directives
1128 @subsection Format Directives
1129
1130 @samp{-printf} and @samp{-fprintf} support the following format
1131 directives to print information about the file being processed.
1132 The C @code{printf} function, field width and precision specifiers
1133 are supported, as applied to string (%s) types. I.E. you can specify
1134 "minimum field width"."maximum field width" for each directive.
1135
1136 @samp{%%} is a literal percent sign.  A @samp{%} character followed by
1137 an unrecognised character (i.e. not a known directive or printf field
1138 width and precision specifier), is discarded (but the unrecognised character
1139 is printed), and a warning message is printed to the standard error output
1140 (because it was probably a typo).
1141
1142 @menu
1143 * Name Directives::
1144 * Ownership Directives::
1145 * Size Directives::
1146 * Location Directives::
1147 * Time Directives::
1148 @end menu
1149
1150 @node Name Directives
1151 @subsubsection Name Directives
1152
1153 @table @code
1154 @item %p
1155 File's name.
1156 @item %f
1157 File's name with any leading directories removed (only the last element).
1158 @item %h
1159 Leading directories of file's name (all but the last element and the
1160 slash before it).
1161 @item %P
1162 File's name with the name of the command line argument under which
1163 it was found removed from the beginning.
1164 @item %H
1165 Command line argument under which file was found.
1166 @end table
1167
1168 @node Ownership Directives
1169 @subsubsection Ownership Directives
1170
1171 @table @code
1172 @item %g
1173 File's group name, or numeric group ID if the group has no name.
1174 @item %G
1175 File's numeric group ID.
1176 @item %u
1177 File's user name, or numeric user ID if the user has no name.
1178 @item %U
1179 File's numeric user ID.
1180 @item %m
1181 File's permissions (in octal).
1182 @end table
1183
1184 @node Size Directives
1185 @subsubsection Size Directives
1186
1187 @table @code
1188 @item %k
1189 Amount of disk space occupied by the file, measured in 1K blocks
1190 (rounded up).  This can be less than the length of the file if
1191 it is a sparse file (that is, it has ``holes'').
1192 @item %b
1193 File's size in 512-byte blocks (rounded up).  This also can be less
1194 than the length of the file, if the file is sparse.
1195 @item %s
1196 File's size in bytes.
1197 @end table
1198
1199 @node Location Directives
1200 @subsubsection Location Directives
1201
1202 @table @code
1203 @item %d
1204 File's depth in the directory tree; files named on the command line
1205 have a depth of 0.
1206 @item %F
1207 Type of the filesystem the file is on; this value can be used for
1208 @samp{-fstype} (@pxref{Directories}).
1209 @item %l
1210 Object of symbolic link (empty string if file is not a symbolic link).
1211 @item %i
1212 File's inode number (in decimal).
1213 @item %n
1214 Number of hard links to file.
1215 @end table
1216
1217 @node Time Directives
1218 @subsubsection Time Directives
1219
1220 Some of these directives use the C @code{ctime} function.  Its output
1221 depends on the current locale, but it typically looks like
1222
1223 @example
1224 Wed Nov  2 00:42:36 1994
1225 @end example
1226
1227 @table @code
1228 @item %a
1229 File's last access time in the format returned by the C @code{ctime} function.
1230 @item %A@var{k}
1231 File's last access time in the format specified by @var{k}
1232 (@pxref{Time Formats}).
1233 @item %c
1234 File's last status change time in the format returned by the C @code{ctime}
1235 function.
1236 @item %C@var{k}
1237 File's last status change time in the format specified by @var{k}
1238 (@pxref{Time Formats}).
1239 @item %t
1240 File's last modification time in the format returned by the C @code{ctime}
1241 function.
1242 @item %T@var{k}
1243 File's last modification time in the format specified by @var{k}
1244 (@pxref{Time Formats}).
1245 @end table
1246
1247 @node Time Formats
1248 @subsection Time Formats
1249
1250 Below are the formats for the directives @samp{%A}, @samp{%C}, and
1251 @samp{%T}, which print the file's timestamps.  Some of these formats
1252 might not be available on all systems, due to differences in the C
1253 @code{strftime} function between systems.
1254
1255 @menu
1256 * Time Components::
1257 * Date Components::
1258 * Combined Time Formats::
1259 @end menu
1260
1261 @node Time Components
1262 @subsubsection Time Components
1263
1264 The following format directives print single components of the time.
1265
1266 @table @code
1267 @item H
1268 hour (00..23)
1269 @item I
1270 hour (01..12)
1271 @item k
1272 hour ( 0..23)
1273 @item l
1274 hour ( 1..12)
1275 @item p
1276 locale's AM or PM
1277 @item Z
1278 time zone (e.g., EDT), or nothing if no time zone is determinable
1279 @item M
1280 minute (00..59)
1281 @item S
1282 second (00..61)
1283 @item @@
1284 seconds since Jan. 1, 1970, 00:00 GMT.
1285 @end table
1286
1287 @node Date Components
1288 @subsubsection Date Components
1289
1290 The following format directives print single components of the date.
1291
1292 @table @code
1293 @item a
1294 locale's abbreviated weekday name (Sun..Sat)
1295 @item A
1296 locale's full weekday name, variable length (Sunday..Saturday)
1297 @item b
1298 @itemx h
1299 locale's abbreviated month name (Jan..Dec)
1300 @item B
1301 locale's full month name, variable length (January..December)
1302 @item m
1303 month (01..12)
1304 @item d
1305 day of month (01..31)
1306 @item w
1307 day of week (0..6)
1308 @item j
1309 day of year (001..366)
1310 @item U
1311 week number of year with Sunday as first day of week (00..53)
1312 @item W
1313 week number of year with Monday as first day of week (00..53)
1314 @item Y
1315 year (1970@dots{})
1316 @item y
1317 last two digits of year (00..99)
1318 @end table
1319
1320 @node Combined Time Formats
1321 @subsubsection Combined Time Formats
1322
1323 The following format directives print combinations of time and date
1324 components.
1325
1326 @table @code
1327 @item r
1328 time, 12-hour (hh:mm:ss [AP]M)
1329 @item T
1330 time, 24-hour (hh:mm:ss)
1331 @item X
1332 locale's time representation (H:M:S)
1333 @item c
1334 locale's date and time (Sat Nov 04 12:02:33 EST 1989)
1335 @item D
1336 date (mm/dd/yy)
1337 @item x
1338 locale's date representation (mm/dd/yy)
1339 @end table
1340
1341 @node Run Commands
1342 @section Run Commands
1343
1344 You can use the list of file names created by @code{find} or
1345 @code{locate} as arguments to other commands.  In this way you can
1346 perform arbitrary actions on the files.
1347
1348 @menu
1349 * Single File::
1350 * Multiple Files::
1351 * Querying::
1352 @end menu
1353
1354 @node Single File
1355 @subsection Single File
1356
1357 Here is how to run a command on one file at a time.
1358
1359 @deffn Action -exec command ;
1360 Execute @var{command}; true if 0 status is returned.  @code{find} takes
1361 all arguments after @samp{-exec} to be part of the command until an
1362 argument consisting of @samp{;} is reached.  It replaces the string
1363 @samp{@{@}} by the current file name being processed everywhere it
1364 occurs in the command.  Both of these constructions need to be escaped
1365 (with a @samp{\}) or quoted to protect them from expansion by the shell.
1366 The command is executed in the directory in which @code{find} was run.
1367
1368 For example, to compare each C header file in the current directory with
1369 the file @file{/tmp/master}:
1370
1371 @example
1372 find . -name '*.h' -exec diff -u '@{@}' /tmp/master ';'
1373 @end example
1374 @end deffn
1375
1376
1377 @deffn Action -exec command +
1378 This variant of @samp{-exec} not yet supported.   Once it has been
1379 implemented as specified in the POSIX standard, a @samp{@{@}} will expand to
1380 a list of names of matching files.  This expansion is done in such a
1381 way as to avoid exceeding the maximum command line length available on
1382 the system.
1383 @end deffn
1384
1385
1386 @node Multiple Files
1387 @subsection Multiple Files
1388
1389 Sometimes you need to process files alone.  But when you
1390 don't, it is faster to run a command on as many files as possible at a
1391 time, rather than once per file.  Doing this saves on the time it takes
1392 to start up the command each time.
1393
1394 To run a command on more than one file at once, use the @code{xargs}
1395 command, which is invoked like this:
1396
1397 @example
1398 xargs @r{[}@var{option}@dots{}@r{]} @r{[}@var{command} @r{[}@var{initial-arguments}@r{]}@r{]}
1399 @end example
1400
1401 @code{xargs} reads arguments from the standard input, delimited by
1402 blanks (which can be protected with double or single quotes or a
1403 backslash) or newlines.  It executes the @var{command} (default is
1404 @file{/bin/echo}) one or more times with any @var{initial-arguments}
1405 followed by arguments read from standard input.  Blank lines on the
1406 standard input are ignored.
1407
1408 Instead of blank-delimited names, it is safer to use @samp{find -print0}
1409 or @samp{find -fprint0} and process the output by giving the @samp{-0}
1410 or @samp{--null} option to GNU @code{xargs}, GNU @code{tar}, GNU
1411 @code{cpio}, or @code{perl}.
1412
1413 You can use shell command substitution (backquotes) to process a list of
1414 arguments, like this:
1415
1416 @example
1417 grep -l sprintf `find $HOME -name '*.c' -print`
1418 @end example
1419
1420 However, that method produces an error if the length of the @samp{.c}
1421 file names exceeds the operating system's command-line length limit.
1422 @code{xargs} avoids that problem by running the command as many times as
1423 necessary without exceeding the limit:
1424
1425 @example
1426 find $HOME -name '*.c' -print | xargs grep -l sprintf
1427 @end example
1428
1429 However, if the command needs to have its standard input be a terminal
1430 (@code{less}, for example), you have to use the shell command
1431 substitution method.
1432
1433 The @code{xargs} command will process all its input, building command
1434 lines and executing them, unless one of the commands exits with a
1435 status of 255 (this will cause xargs to issue an error message and
1436 stop) or it reads a line contains the end of file string specified
1437 with the @samp{--eof} option.
1438
1439 @menu
1440 * Unsafe File Name Handling::
1441 * Safe File Name Handling::
1442 * Limiting Command Size::
1443 * Interspersing File Names::
1444 @end menu
1445
1446 @node Unsafe File Name Handling
1447 @subsubsection Unsafe File Name Handling
1448
1449 Because file names can contain quotes, backslashes, blank characters,
1450 and even newlines, it is not safe to process them using @code{xargs} in its
1451 default mode of operation.  But since most files' names do not contain
1452 blanks, this problem occurs only infrequently.  If you are only
1453 searching through files that you know have safe names, then you need not
1454 be concerned about it.
1455
1456 @c This example is adapted from:
1457 @c From: pfalstad@stone.Princeton.EDU (Paul John Falstad)
1458 @c Newsgroups: comp.unix.shell
1459 @c Subject: Re: Beware xargs security holes
1460 @c Date: 16 Oct 90 19:12:06 GMT
1461 @c
1462 In many applications, if @code{xargs} botches processing a file because
1463 its name contains special characters, some data might be lost.  The
1464 importance of this problem depends on the importance of the data and
1465 whether anyone notices the loss soon enough to correct it.  However,
1466 here is an extreme example of the problems that using blank-delimited
1467 names can cause.  If the following command is run daily from
1468 @code{cron}, then any user can remove any file on the system:
1469
1470 @example
1471 find / -name '#*' -atime +7 -print | xargs rm
1472 @end example
1473
1474 For example, you could do something like this:
1475
1476 @example
1477 eg$ echo > '#
1478 vmunix'
1479 @end example
1480
1481 @noindent
1482 and then @code{cron} would delete @file{/vmunix}, if it ran
1483 @code{xargs} with @file{/} as its current directory.
1484
1485 To delete other files, for example @file{/u/joeuser/.plan}, you could do
1486 this:
1487
1488 @example
1489 eg$ mkdir '#
1490 '
1491 eg$ cd '#
1492 '
1493 eg$ mkdir u u/joeuser u/joeuser/.plan'
1494 '
1495 eg$ echo > u/joeuser/.plan'
1496 /#foo'
1497 eg$ cd ..
1498 eg$ find . -name '#*' -print | xargs echo
1499 ./# ./# /u/joeuser/.plan /#foo
1500 @end example
1501
1502 @node Safe File Name Handling
1503 @subsubsection Safe File Name Handling
1504
1505 Here is how to make @code{find} output file names so that they can be
1506 used by other programs without being mangled or misinterpreted.  You can
1507 process file names generated this way by giving the @samp{-0} or
1508 @samp{--null} option to GNU @code{xargs}, GNU @code{tar}, GNU
1509 @code{cpio}, or @code{perl}.
1510
1511 @deffn Action -print0
1512 True; print the full file name on the standard output, followed by a
1513 null character.
1514 @end deffn
1515
1516 @deffn Action -fprint0 file
1517 True; like @samp{-print0} but write to @var{file} like @samp{-fprint}
1518 (@pxref{Print File Name}).
1519 @end deffn
1520
1521 @node Limiting Command Size
1522 @subsubsection Limiting Command Size
1523
1524 @code{xargs} gives you control over how many arguments it passes to the
1525 command each time it executes it.  By default, it uses up to
1526 @code{ARG_MAX} - 2k, or 20k, whichever is smaller, characters per
1527 command.  It uses as many lines and arguments as fit within that limit.
1528 The following options modify those values.
1529
1530 @table @code
1531 @item --no-run-if-empty
1532 @itemx -r
1533 If the standard input does not contain any nonblanks, do not run the
1534 command.  By default, the command is run once even if there is no input.
1535
1536 @item --max-lines@r{[}=@var{max-lines}@r{]}
1537 @itemx -l@r{[}@var{max-lines}@r{]}
1538 Use at most @var{max-lines} nonblank input lines per command line;
1539 @var{max-lines} defaults to 1 if omitted.  Trailing blanks cause an
1540 input line to be logically continued on the next input line, for the
1541 purpose of counting the lines.  Implies @samp{-x}.
1542
1543 @item --max-args=@var{max-args}
1544 @itemx -n @var{max-args}
1545 Use at most @var{max-args} arguments per command line.  Fewer than
1546 @var{max-args} arguments will be used if the size (see the @samp{-s}
1547 option) is exceeded, unless the @samp{-x} option is given, in which case
1548 @code{xargs} will exit.
1549
1550 @item --max-chars=@var{max-chars}
1551 @itemx -s @var{max-chars}
1552 Use at most @var{max-chars} characters per command line, including the
1553 command and initial arguments and the terminating nulls at the ends of
1554 the argument strings.
1555
1556 @item --max-procs=@var{max-procs}
1557 @itemx -P @var{max-procs}
1558 Run up to @var{max-procs} processes at a time; the default is 1.  If
1559 @var{max-procs} is 0, @code{xargs} will run as many processes as
1560 possible at a time.  Use the @samp{-n}, @samp{-s}, or @samp{-l} option
1561 with @samp{-P}; otherwise chances are that the command will be run only
1562 once.
1563 @end table
1564
1565 @node Interspersing File Names
1566 @subsubsection Interspersing File Names
1567
1568 @code{xargs} can insert the name of the file it is processing between
1569 arguments you give for the command.  Unless you also give options to
1570 limit the command size (@pxref{Limiting Command Size}), this mode of
1571 operation is equivalent to @samp{find -exec} (@pxref{Single File}).
1572
1573 @table @code
1574 @item --replace@r{[}=@var{replace-str}@r{]}
1575 @itemx -i@r{[}@var{replace-str}@r{]}
1576 Replace occurrences of @var{replace-str} in the initial arguments with
1577 names read from standard input.  Also, unquoted blanks do not terminate
1578 arguments.  If @var{replace-str} is omitted, it defaults to @samp{@{@}}
1579 (like for @samp{find -exec}).  Implies @samp{-x} and @samp{-l 1}.  As an
1580 example, to sort each file the @file{bills} directory, leaving the
1581 output in that file name with @file{.sorted} appended, you could do:
1582
1583 @example
1584 find bills -type f | xargs -iXX sort -o XX.sorted XX
1585 @end example
1586
1587 @noindent
1588 The equivalent command using @samp{find -exec} is:
1589
1590 @example
1591 find bills -type f -exec sort -o '@{@}.sorted' '@{@}' ';'
1592 @end example
1593 @end table
1594
1595 @node Querying
1596 @subsection Querying
1597
1598 To ask the user whether to execute a command on a single file, you can
1599 use the @code{find} primary @samp{-ok} instead of @samp{-exec}:
1600
1601 @deffn Action -ok command ;
1602 Like @samp{-exec} (@pxref{Single File}), but ask the user first (on
1603 the standard input); if the response does not start with @samp{y} or
1604 @samp{Y}, do not run the command, and return false.
1605 @end deffn
1606
1607 When processing multiple files with a single command, to query the user
1608 you give @code{xargs} the following option.  When using this option, you
1609 might find it useful to control the number of files processed per
1610 invocation of the command (@pxref{Limiting Command Size}).
1611
1612 @table @code
1613 @item --interactive
1614 @itemx -p
1615 Prompt the user about whether to run each command line and read a line
1616 from the terminal.  Only run the command line if the response starts
1617 with @samp{y} or @samp{Y}.  Implies @samp{-t}.
1618 @end table
1619
1620 @node Adding Tests
1621 @section Adding Tests
1622
1623 You can test for file attributes that none of the @code{find} builtin
1624 tests check.  To do this, use @code{xargs} to run a program that filters
1625 a list of files printed by @code{find}.  If possible, use @code{find}
1626 builtin tests to pare down the list, so the program run by @code{xargs}
1627 has less work to do.  The tests builtin to @code{find} will likely run
1628 faster than tests that other programs perform.
1629
1630 For example, here is a way to print the names of all of the unstripped
1631 binaries in the @file{/usr/local} directory tree.  Builtin tests avoid
1632 running @code{file} on files that are not regular files or are not
1633 executable.
1634
1635 @example
1636 find /usr/local -type f -perm +a=x | xargs file |
1637   grep 'not stripped' | cut -d: -f1
1638 @end example
1639
1640 @noindent
1641 The @code{cut} program removes everything after the file name from the
1642 output of @code{file}.
1643
1644 @c Idea from Martin Weitzel.
1645 If you want to place a special test somewhere in the middle of a
1646 @code{find} expression, you can use @samp{-exec} to run a program that
1647 performs the test.  Because @samp{-exec} evaluates to the exit status of
1648 the executed program, you can write a program (which can be a shell
1649 script) that tests for a special attribute and make it exit with a true
1650 (zero) or false (non-zero) status.  It is a good idea to place such a
1651 special test @emph{after} the builtin tests, because it starts a new
1652 process which could be avoided if a builtin test evaluates to false.
1653 Use this method only when @code{xargs} is not flexible enough, because
1654 starting one or more new processes to test each file is slower than
1655 using @code{xargs} to start one process that tests many files.
1656
1657 Here is a shell script called @code{unstripped} that checks whether its
1658 argument is an unstripped binary file:
1659
1660 @example
1661 #! /bin/sh
1662 file $1 | grep 'not stripped' > /dev/null
1663 @end example
1664
1665 This script relies on the fact that the shell exits with the status of
1666 the last program it executed, in this case @code{grep}.  @code{grep}
1667 exits with a true status if it found any matches, false if not.  Here is
1668 an example of using the script (assuming it is in your search path).  It
1669 lists the stripped executables in the file @file{sbins} and the
1670 unstripped ones in @file{ubins}.
1671
1672 @example
1673 find /usr/local -type f -perm +a=x \
1674   \( -exec unstripped '@{@}' \; -fprint ubins -o -fprint sbins \)
1675 @end example
1676
1677 @node Common Tasks, Databases, Actions, Top
1678 @chapter Common Tasks
1679
1680 The sections that follow contain some extended examples that both give a
1681 good idea of the power of these programs, and show you how to solve
1682 common real-world problems.
1683
1684 @menu
1685 * Viewing And Editing::
1686 * Archiving::
1687 * Cleaning Up::
1688 * Strange File Names::
1689 * Fixing Permissions::
1690 * Classifying Files::
1691 @end menu
1692
1693 @node Viewing And Editing
1694 @section Viewing And Editing
1695
1696 To view a list of files that meet certain criteria, simply run your file
1697 viewing program with the file names as arguments.  Shells substitute a
1698 command enclosed in backquotes with its output, so the whole command
1699 looks like this:
1700
1701 @example
1702 less `find /usr/include -name '*.h' | xargs grep -l mode_t`
1703 @end example
1704
1705 @noindent
1706 You can edit those files by giving an editor name instead of a file
1707 viewing program.
1708
1709 @node Archiving
1710 @section Archiving
1711
1712 You can pass a list of files produced by @code{find} to a file archiving
1713 program.  GNU @code{tar} and @code{cpio} can both read lists of file
1714 names from the standard input---either delimited by nulls (the safe way)
1715 or by blanks (the lazy, risky default way).  To use null-delimited
1716 names, give them the @samp{--null} option.  You can store a file archive
1717 in a file, write it on a tape, or send it over a network to extract on
1718 another machine.
1719
1720 One common use of @code{find} to archive files is to send a list of the
1721 files in a directory tree to @code{cpio}.  Use @samp{-depth} so if a
1722 directory does not have write permission for its owner, its contents can
1723 still be restored from the archive since the directory's permissions are
1724 restored after its contents.  Here is an example of doing this using
1725 @code{cpio}; you could use a more complex @code{find} expression to
1726 archive only certain files.
1727
1728 @example
1729 find . -depth -print0 |
1730   cpio --create --null --format=crc --file=/dev/nrst0
1731 @end example
1732
1733 You could restore that archive using this command:
1734
1735 @example
1736 cpio --extract --null --make-dir --unconditional \
1737   --preserve --file=/dev/nrst0
1738 @end example
1739
1740 Here are the commands to do the same things using @code{tar}:
1741
1742 @example
1743 find . -depth -print0 |
1744   tar --create --null --files-from=- --file=/dev/nrst0
1745
1746 tar --extract --null --preserve-perm --same-owner \
1747   --file=/dev/nrst0
1748 @end example
1749
1750 @c Idea from Rick Sladkey.
1751 Here is an example of copying a directory from one machine to another:
1752
1753 @example
1754 find . -depth -print0 | cpio -0o -Hnewc |
1755   rsh @var{other-machine} "cd `pwd` && cpio -i0dum"
1756 @end example
1757
1758 @node Cleaning Up
1759 @section Cleaning Up
1760
1761 @c Idea from Jim Meyering.
1762 This section gives examples of removing unwanted files in various situations.
1763 Here is a command to remove the CVS backup files created when an update
1764 requires a merge:
1765
1766 @example
1767 find . -name '.#*' -print0 | xargs -0r rm -f
1768 @end example
1769
1770 @c Idea from Franc,ois Pinard.
1771 You can run this command to clean out your clutter in @file{/tmp}.  You
1772 might place it in the file your shell runs when you log out
1773 (@file{.bash_logout}, @file{.logout}, or @file{.zlogout}, depending on
1774 which shell you use).
1775
1776 @example
1777 find /tmp -user $LOGNAME -type f -print0 | xargs -0 -r rm -f
1778 @end example
1779
1780 If your @code{find} command removes directories, you may find that
1781 you get a spurious error message when @code{find} tries to recurse
1782 into a directory that has now been removed.  Using the @samp{-depth}
1783 option will normally resolve this problem.
1784
1785 @c Idea from Noah Friedman.
1786 To remove old Emacs backup and auto-save files, you can use a command
1787 like the following.  It is especially important in this case to use
1788 null-terminated file names because Emacs packages like the VM mailer
1789 often create temporary file names with spaces in them, like @file{#reply
1790 to David J. MacKenzie<1>#}.
1791
1792 @example
1793 find ~ \( -name '*~' -o -name '#*#' \) -print0 |
1794   xargs --no-run-if-empty --null rm -vf
1795 @end example
1796
1797 Removing old files from @file{/tmp} is commonly done from @code{cron}:
1798
1799 @c Idea from Kaveh Ghazi.
1800 @example
1801 find /tmp /var/tmp -not -type d -mtime +3 -print0 |
1802   xargs --null --no-run-if-empty rm -f
1803
1804 find /tmp /var/tmp -depth -mindepth 1 -type d -empty -exec rmdir @{@} \;
1805 @end example
1806
1807 The second @code{find} command above uses @samp{-depth} so it cleans out
1808 empty directories depth-first, hoping that the parents become empty and
1809 can be removed too.  It uses @samp{-mindepth} to avoid removing
1810 @file{/tmp} itself if it becomes totally empty.
1811
1812 @node Strange File Names
1813 @section Strange File Names
1814
1815 @c Idea from:
1816 @c From: tmatimar@isgtec.com (Ted Timar)
1817 @c Newsgroups: comp.unix.questions,comp.unix.shell,comp.answers,news.answers
1818 @c Subject: Unix - Frequently Asked Questions (2/7) [Frequent posting]
1819 @c Subject: How do I remove a file with funny characters in the filename ?
1820 @c Date: Thu Mar 18 17:16:55 EST 1993
1821 @code{find} can help you remove or rename a file with strange characters
1822 in its name.  People are sometimes stymied by files whose names contain
1823 characters such as spaces, tabs, control characters, or characters with
1824 the high bit set.  The simplest way to remove such files is:
1825
1826 @example
1827 rm -i @var{some*pattern*that*matches*the*problem*file}
1828 @end example
1829
1830 @code{rm} asks you whether to remove each file matching the given
1831 pattern.  If you are using an old shell, this approach might not work if
1832 the file name contains a character with the high bit set; the shell may
1833 strip it off.  A more reliable way is:
1834
1835 @example
1836 find . -maxdepth 1 @var{tests} -ok rm '@{@}' \;
1837 @end example
1838
1839 @noindent
1840 where @var{tests} uniquely identify the file.  The @samp{-maxdepth 1}
1841 option prevents @code{find} from wasting time searching for the file in
1842 any subdirectories; if there are no subdirectories, you may omit it.  A
1843 good way to uniquely identify the problem file is to figure out its
1844 inode number; use
1845
1846 @example
1847 ls -i
1848 @end example
1849
1850 Suppose you have a file whose name contains control characters, and you
1851 have found that its inode number is 12345.  This command prompts you for
1852 whether to remove it:
1853
1854 @example
1855 find . -maxdepth 1 -inum 12345 -ok rm -f '@{@}' \;
1856 @end example
1857
1858 If you don't want to be asked, perhaps because the file name may contain
1859 a strange character sequence that will mess up your screen when printed,
1860 then use @samp{-exec} instead of @samp{-ok}.
1861
1862 If you want to rename the file instead, you can use @code{mv} instead of
1863 @code{rm}:
1864
1865 @example
1866 find . -maxdepth 1 -inum 12345 -ok mv '@{@}' @var{new-file-name} \;
1867 @end example
1868
1869 @node Fixing Permissions
1870 @section Fixing Permissions
1871
1872 Suppose you want to make sure that everyone can write to the directories in a
1873 certain directory tree.  Here is a way to find directories lacking either
1874 user or group write permission (or both), and fix their permissions:
1875
1876 @example
1877 find . -type d -not -perm -ug=w | xargs chmod ug+w
1878 @end example
1879
1880 @noindent
1881 You could also reverse the operations, if you want to make sure that
1882 directories do @emph{not} have world write permission.
1883
1884 @node Classifying Files
1885 @section Classifying Files
1886
1887 @c Idea from:
1888 @c From: martin@mwtech.UUCP (Martin Weitzel)
1889 @c Newsgroups: comp.unix.wizards,comp.unix.questions
1890 @c Subject: Advanced usage of 'find' (Re: Unix security automating script)
1891 @c Date: 22 Mar 90 15:05:19 GMT
1892 If you want to classify a set of files into several groups based on
1893 different criteria, you can use the comma operator to perform multiple
1894 independent tests on the files.  Here is an example:
1895
1896 @example
1897 find / -type d \( -perm -o=w -fprint allwrite , \
1898   -perm -o=x -fprint allexec \)
1899
1900 echo "Directories that can be written to by everyone:"
1901 cat allwrite
1902 echo ""
1903 echo "Directories with search permissions for everyone:"
1904 cat allexec
1905 @end example
1906
1907 @code{find} has only to make one scan through the directory tree (which
1908 is one of the most time consuming parts of its work).
1909
1910 @node Databases, File Permissions, Common Tasks, Top
1911 @chapter File Name Databases
1912
1913 The file name databases used by @code{locate} contain lists of files
1914 that were in particular directory trees when the databases were last
1915 updated.  The file name of the default database is determined when
1916 @code{locate} and @code{updatedb} are configured and installed.  The
1917 frequency with which the databases are updated and the directories for
1918 which they contain entries depend on how often @code{updatedb} is run,
1919 and with which arguments.
1920
1921 @menu
1922 * Database Locations::
1923 * Database Formats::
1924 @end menu
1925
1926 @node Database Locations
1927 @section Database Locations
1928
1929 There can be multiple file name databases.  Users can select which
1930 databases @code{locate} searches using an environment variable or a
1931 command line option.  The system administrator can choose the file name
1932 of the default database, the frequency with which the databases are
1933 updated, and the directories for which they contain entries.  File name
1934 databases are updated by running the @code{updatedb} program, typically
1935 nightly.
1936
1937 In networked environments, it often makes sense to build a database at
1938 the root of each filesystem, containing the entries for that filesystem.
1939 @code{updatedb} is then run for each filesystem on the fileserver where
1940 that filesystem is on a local disk, to prevent thrashing the network.
1941 Here are the options to @code{updatedb} to select which directories each
1942 database contains entries for:
1943
1944 @table @code
1945 @item --localpaths='@var{path}@dots{}'
1946 Non-network directories to put in the database.
1947 Default is @file{/}.
1948
1949 @item --netpaths='@var{path}@dots{}'
1950 Network (NFS, AFS, RFS, etc.) directories to put in the database.
1951 The environment variable @code{NETPATHS} also sets this value.
1952 Default is none.
1953
1954
1955 @item --prunepaths='@var{path}@dots{}'
1956 Directories to not put in the database, which would otherwise be.
1957 The environment variable @code{PRUNEPATHS} also sets this value.
1958 Default is @file{/tmp /usr/tmp /var/tmp /afs}.
1959
1960 @item --prunefs='@var{path}@dots{}'
1961 File systems to not put in the database, which would otherwise be.
1962 Note that files are pruned when a file system is reached;
1963 Any file system mounted under an undesired file system will be
1964 ignored.
1965 The environment variable @code{PRUNEFS} also sets this value.
1966 Default is @file{nfs NFS proc}.
1967
1968 @item --output=@var{dbfile}
1969 The database file to build.
1970 Default is system-dependent, but typically @file{/usr/local/var/locatedb}.
1971
1972 @item --localuser=@var{user}
1973 The user to search the non-network directories as, using @code{su}.
1974 Default is to search the non-network directories as the current user.
1975 You can also use the environment variable @code{LOCALUSER} to set this user.
1976
1977 @item --netuser=@var{user}
1978 The user to search network directories as, using @code{su}.
1979 Default is @code{daemon}.
1980 You can also use the environment variable @code{NETUSER} to set this user.
1981 @end table
1982
1983 @node Database Formats
1984 @section Database Formats
1985
1986 The file name databases contain lists of files that were in particular
1987 directory trees when the databases were last updated.  The file name
1988 database format changed starting with GNU @code{locate} version 4.0 to
1989 allow machines with different byte orderings to share the databases.  The
1990 new GNU @code{locate} can read both the old and new database formats.
1991 However, old versions of @code{locate} and @code{find} produce incorrect
1992 results if given a new-format database.
1993
1994 @menu
1995 * New Database Format::
1996 * Sample Database::
1997 * Old Database Format::
1998 @end menu
1999
2000 @node New Database Format
2001 @subsection New Database Format
2002
2003 @code{updatedb} runs a program called @code{frcode} to
2004 @dfn{front-compress} the list of file names, which reduces the database
2005 size by a factor of 4 to 5.  Front-compression (also known as
2006 incremental encoding) works as follows.
2007
2008 The database entries are a sorted list (case-insensitively, for users'
2009 convenience).  Since the list is sorted, each entry is likely to share a
2010 prefix (initial string) with the previous entry.  Each database entry
2011 begins with an offset-differential count byte, which is the additional
2012 number of characters of prefix of the preceding entry to use beyond the
2013 number that the preceding entry is using of its predecessor.  (The
2014 counts can be negative.)  Following the count is a null-terminated ASCII
2015 remainder---the part of the name that follows the shared prefix.
2016
2017 If the offset-differential count is larger than can be stored in a byte
2018 (+/-127), the byte has the value 0x80 and the count follows in a 2-byte
2019 word, with the high byte first (network byte order).
2020
2021 Every database begins with a dummy entry for a file called
2022 @file{LOCATE02}, which @code{locate} checks for to ensure that the
2023 database file has the correct format; it ignores the entry in doing the
2024 search.
2025
2026 Databases can not be concatenated together, even if the first (dummy)
2027 entry is trimmed from all but the first database.  This is because the
2028 offset-differential count in the first entry of the second and following
2029 databases will be wrong.
2030
2031 @node Sample Database
2032 @subsection Sample Database
2033
2034 Sample input to @code{frcode}:
2035 @c with nulls changed to newlines:
2036
2037 @example
2038 /usr/src
2039 /usr/src/cmd/aardvark.c
2040 /usr/src/cmd/armadillo.c
2041 /usr/tmp/zoo
2042 @end example
2043
2044 Length of the longest prefix of the preceding entry to share:
2045
2046 @example
2047 0 /usr/src
2048 8 /cmd/aardvark.c
2049 14 rmadillo.c
2050 5 tmp/zoo
2051 @end example
2052
2053 Output from @code{frcode}, with trailing nulls changed to newlines
2054 and count bytes made printable:
2055
2056 @example
2057 0 LOCATE02
2058 0 /usr/src
2059 8 /cmd/aardvark.c
2060 6 rmadillo.c
2061 -9 tmp/zoo
2062 @end example
2063
2064 (6 = 14 - 8, and -9 = 5 - 14)
2065
2066 @node Old Database Format
2067 @subsection Old Database Format
2068
2069 The old database format is used by Unix @code{locate} and @code{find}
2070 programs and earlier releases of the GNU ones.  @code{updatedb} produces
2071 this format if given the @samp{--old-format} option.
2072
2073 @code{updatedb} runs programs called @code{bigram} and @code{code} to
2074 produce old-format databases.  The old format differs from the new one
2075 in the following ways.  Instead of each entry starting with an
2076 offset-differential count byte and ending with a null, byte values from
2077 0 through 28 indicate offset-differential counts from -14 through 14.
2078 The byte value indicating that a long offset-differential count follows
2079 is 0x1e (30), not 0x80.  The long counts are stored in host byte order,
2080 which is not necessarily network byte order, and host integer word size,
2081 which is usually 4 bytes.  They also represent a count 14 less than
2082 their value.  The database lines have no termination byte; the start of
2083 the next line is indicated by its first byte having a value <= 30.
2084
2085 In addition, instead of starting with a dummy entry, the old database
2086 format starts with a 256 byte table containing the 128 most common
2087 bigrams in the file list.  A bigram is a pair of adjacent bytes.  Bytes
2088 in the database that have the high bit set are indexes (with the high
2089 bit cleared) into the bigram table.  The bigram and offset-differential
2090 count coding makes these databases 20-25% smaller than the new format,
2091 but makes them not 8-bit clean.  Any byte in a file name that is in the
2092 ranges used for the special codes is replaced in the database by a
2093 question mark, which not coincidentally is the shell wildcard to match a
2094 single character.
2095
2096 The old format therefore can not faithfully store entries with non-ASCII
2097 characters. It therefore should not be used in internationalized
2098 environments.
2099
2100 @node File Permissions, Reference, Databases, Top
2101 @chapter File Permissions
2102
2103 @include perm.texi
2104
2105 @node Reference, Primary Index, File Permissions, Top
2106 @chapter Reference
2107
2108 Below are summaries of the command line syntax for the programs
2109 discussed in this manual.
2110
2111 @menu
2112 * Invoking find::
2113 * Invoking locate::
2114 * Invoking updatedb::
2115 * Invoking xargs::
2116 @end menu
2117
2118 @node Invoking find, Invoking locate, , Reference
2119 @section Invoking @code{find}
2120
2121 @example
2122 find @r{[}@var{file}@dots{}@r{]} @r{[}@var{expression}@r{]}
2123 @end example
2124
2125 @code{find} searches the directory tree rooted at each file name
2126 @var{file} by evaluating the @var{expression} on each file it finds in
2127 the tree.
2128
2129 @code{find} considers the first argument that begins with @samp{-},
2130 @samp{(}, @samp{)}, @samp{,}, or @samp{!} to be the beginning of the
2131 expression; any arguments before it are paths to search, and any
2132 arguments after it are the rest of the expression.  If no paths are
2133 given, the current directory is used.  If no expression is given, the
2134 expression @samp{-print} is used.
2135
2136 @code{find} exits with status 0 if all files are processed successfully,
2137 greater than 0 if errors occur.
2138
2139 @xref{Primary Index}, for a summary of all of the tests, actions, and
2140 options that the expression can contain.
2141
2142 @code{find} also recognizes two options for administrative use:
2143
2144 @table @code
2145 @item --help
2146 Print a summary of the command-line argument format and exit.
2147 @item --version
2148 Print the version number of @code{find} and exit.
2149 @end table
2150
2151 @node Invoking locate, Invoking updatedb, Invoking find, Reference
2152 @section Invoking @code{locate}
2153
2154 @example
2155 locate @r{[}@var{option}@dots{}@r{]} @var{pattern}@dots{}
2156 @end example
2157
2158 @table @code
2159 @item --database=@var{path}
2160 @itemx -d @var{path}
2161 Instead of searching the default file name database, search the file
2162 name databases in @var{path}, which is a colon-separated list of
2163 database file names.  You can also use the environment variable
2164 @code{LOCATE_PATH} to set the list of database files to search.  The
2165 option overrides the environment variable if both are used.
2166
2167 @item --existing
2168 @itemx -e
2169 Only print out such names which currently exist (instead of such names
2170 which existed when the database was created).
2171 Note that this may slow down the program a lot, if there are many matches
2172 in the database.
2173
2174 @item --ignore-case
2175 @itemx -i
2176 Ignore case distinctions in both the pattern and the file names.
2177
2178 @item --help
2179 Print a summary of the options to @code{locate} and exit.
2180
2181 @item --version
2182 Print the version number of @code{locate} and exit.
2183 @end table
2184
2185 @node Invoking updatedb, Invoking xargs, Invoking locate, Reference
2186 @section Invoking @code{updatedb}
2187
2188 @example
2189 updatedb @r{[}@var{option}@dots{}@r{]}
2190 @end example
2191
2192 @table @code
2193 @item --localpaths='@var{path}@dots{}'
2194 Non-network directories to put in the database.
2195 Default is @file{/}.
2196
2197 @item --netpaths='@var{path}@dots{}'
2198 Network (NFS, AFS, RFS, etc.) directories to put in the database.
2199 The environment variable @code{NETPATHS} also sets this value.
2200 Default is none.
2201
2202 @item --prunepaths='@var{path}@dots{}'
2203 Directories to not put in the database, which would otherwise be.
2204 The environment variable @code{PRUNEPATHS} also sets this value.
2205 Default is @file{/tmp /usr/tmp /var/tmp /afs}.
2206
2207 @item --prunefs='@var{path}@dots{}'
2208 File systems to not put in the database, which would otherwise be.
2209 Note that files are pruned when a file system is reached;
2210 Any file system mounted under an undesired file system will be
2211 ignored.
2212 The environment variable @code{PRUNEFS} also sets this value.
2213 Default is @file{nfs NFS proc}.
2214
2215 @item --output=@var{dbfile}
2216 The database file to build.
2217 Default is system-dependent, but typically @file{/usr/local/var/locatedb}.
2218
2219 @item --localuser=@var{user}
2220 The user to search the non-network directories as, using @code{su}.
2221 Default is to search the non-network directories as the current user.
2222 You can also use the environment variable @code{LOCALUSER} to set this user.
2223
2224 @item --netuser=@var{user}
2225 The user to search network directories as, using @code{su}(1).
2226 Default is @code{daemon}.
2227 You can also use the environment variable @code{NETUSER} to set this user.
2228 @end table
2229
2230 @node Invoking xargs, ,  Invoking updatedb, Reference
2231 @section Invoking @code{xargs}
2232
2233 @example
2234 xargs @r{[}@var{option}@dots{}@r{]} @r{[}@var{command} @r{[}@var{initial-arguments}@r{]}@r{]}
2235 @end example
2236
2237 @code{xargs} exits with the following status:
2238
2239 @table @asis
2240 @item 0
2241 if it succeeds
2242 @item 123
2243 if any invocation of the command exited with status 1-125
2244 @item 124
2245 if the command exited with status 255
2246 @item 125
2247 if the command is killed by a signal
2248 @item 126
2249 if the command cannot be run
2250 @item 127
2251 if the command is not found
2252 @item 1
2253 if some other error occurred.
2254 @end table
2255
2256 @table @code
2257 @item --null
2258 @itemx -0
2259 Input filenames are terminated by a null character instead of by
2260 whitespace, and the quotes and backslash are not special (every
2261 character is taken literally).  Disables the end of file string, which
2262 is treated like any other argument.
2263
2264 @item --eof@r{[}=@var{eof-str}@r{]}
2265 @itemx -e@r{[}@var{eof-str}@r{]}
2266 Set the end of file string to @var{eof-str}.  If the end of file string
2267 occurs as a line of input, the rest of the input is ignored.  If
2268 @var{eof-str} is omitted, there is no end of file string.  If this
2269 option is not given, the end of file string defaults to @samp{_}.
2270
2271 @item --help
2272 Print a summary of the options to @code{xargs} and exit.
2273
2274 @item --replace@r{[}=@var{replace-str}@r{]}
2275 @itemx -i@r{[}@var{replace-str}@r{]}
2276 Replace occurrences of @var{replace-str} in the initial arguments with
2277 names read from standard input.  Also, unquoted blanks do not terminate
2278 arguments.  If @var{replace-str} is omitted, it defaults to @samp{@{@}}
2279 (like for @samp{find -exec}).  Implies @samp{-x} and @samp{-l 1}.
2280
2281 @item --max-lines@r{[}=@var{max-lines}@r{]}
2282 @itemx -l@r{[}@var{max-lines}@r{]}
2283 Use at most @var{max-lines} nonblank input lines per command line;
2284 @var{max-lines} defaults to 1 if omitted.  Trailing blanks cause an
2285 input line to be logically continued on the next input line, for the
2286 purpose of counting the lines.  Implies @samp{-x}.
2287
2288 @item --max-args=@var{max-args}
2289 @itemx -n @var{max-args}
2290 Use at most @var{max-args} arguments per command line.  Fewer than
2291 @var{max-args} arguments will be used if the size (see the @samp{-s}
2292 option) is exceeded, unless the @samp{-x} option is given, in which case
2293 @code{xargs} will exit.
2294
2295 @item --interactive
2296 @itemx -p
2297 Prompt the user about whether to run each command line and read a line
2298 from the terminal.  Only run the command line if the response starts
2299 with @samp{y} or @samp{Y}.  Implies @samp{-t}.
2300
2301 @item --no-run-if-empty
2302 @itemx -r
2303 If the standard input does not contain any nonblanks, do not run the
2304 command.  By default, the command is run once even if there is no input.
2305
2306 @item --max-chars=@var{max-chars}
2307 @itemx -s @var{max-chars}
2308 Use at most @var{max-chars} characters per command line, including the
2309 command and initial arguments and the terminating nulls at the ends of
2310 the argument strings.
2311
2312 @item --verbose
2313 @itemx -t
2314 Print the command line on the standard error output before executing
2315 it.
2316
2317 @item --version
2318 Print the version number of @code{xargs} and exit.
2319
2320 @item --exit
2321 @itemx -x
2322 Exit if the size (see the @samp{-s} option) is exceeded.
2323
2324
2325 @item --max-procs=@var{max-procs}
2326 @itemx -P @var{max-procs}
2327 Run up to @var{max-procs} processes at a time; the default is 1.  If
2328 @var{max-procs} is 0, @code{xargs} will run as many processes as
2329 possible at a time.
2330 @end table
2331
2332 @node Primary Index, , Reference, Top
2333 @unnumbered @code{find} Primary Index
2334
2335 This is a list of all of the primaries (tests, actions, and options)
2336 that make up @code{find} expressions for selecting files.  @xref{find
2337 Expressions}, for more information on expressions.
2338
2339 @printindex fn
2340
2341 @contents
2342 @bye
2343
2344 @comment texi related words used by Emacs' spell checker ispell.el
2345
2346 @comment LocalWords: texinfo setfilename settitle setchapternewpage
2347 @comment LocalWords: iftex finalout ifinfo DIR titlepage vskip pt
2348 @comment LocalWords: filll dir samp dfn noindent xref pxref
2349 @comment LocalWords: var deffn texi deffnx itemx emph asis
2350 @comment LocalWords: findex smallexample subsubsection cindex
2351
2352 @comment other words used by Emacs' spell checker ispell.el
2353 @comment LocalWords: README fred updatedb xargs Plett Rendell akefile
2354 @comment LocalWords: args grep Filesystems fo foo fOo wildcards iname
2355 @comment LocalWords: ipath regex iregex expr fubar regexps
2356 @comment LocalWords: metacharacters macs sr sc inode lname ilname
2357 @comment LocalWords: sysdep noleaf ls inum xdev filesystems usr atime
2358 @comment LocalWords: ctime mtime amin cmin mmin al daystart Sladkey rm
2359 @comment LocalWords: anewer cnewer bckw rf xtype uname gname uid gid
2360 @comment LocalWords: nouser nogroup chown chgrp perm ch maxdepth
2361 @comment LocalWords: mindepth cpio src CD AFS statted stat fstype ufs
2362 @comment LocalWords: nfs tmp mfs printf fprint dils rw djm Nov lwall
2363 @comment LocalWords: POSIXLY fls fprintf strftime locale's EDT GMT AP
2364 @comment LocalWords: EST diff perl backquotes sprintf Falstad Oct cron
2365 @comment LocalWords: eg vmunix mkdir afs allexec allwrite ARG bigram
2366 @comment LocalWords: bigrams cd chmod comp crc CVS dbfile dum eof
2367 @comment LocalWords: fileserver filesystem fn frcode Ghazi Hnewc iXX
2368 @comment LocalWords: joeuser Kaveh localpaths localuser LOGNAME
2369 @comment LocalWords: Meyering mv netpaths netuser nonblank nonblanks
2370 @comment LocalWords: ois ok Pinard printindex proc procs prunefs
2371 @comment LocalWords: prunepaths pwd RFS rmadillo rmdir rsh sbins str
2372 @comment LocalWords: su Timar ubins ug unstripped vf VM Weitzel
2373 @comment LocalWords: wildcard zlogout