3 @setfilename coreutils.info
4 @settitle GNU Coreutils
9 @include constants.texi
11 @c Define new indices.
15 @c Put everything in one index (arbitrarily chosen to be the concept index).
25 * Coreutils: (coreutils). Core GNU (file, text, shell) utilities.
26 * Common options: (coreutils)Common options.
27 * File permissions: (coreutils)File permissions. Access modes.
28 * Date input formats: (coreutils)Date input formats.
31 @c FIXME: the following need documentation
32 @c * [: (coreutils)[ invocation. File/string tests.
33 @c * pinky: (coreutils)pinky invocation. FIXME.
35 @dircategory Individual utilities
37 * arch: (coreutils)arch invocation. Print machine hardware name.
38 * base64: (coreutils)base64 invocation. Base64 encode/decode data.
39 * basename: (coreutils)basename invocation. Strip directory and suffix.
40 * cat: (coreutils)cat invocation. Concatenate and write files.
41 * chcon: (coreutils)chcon invocation. Change SELinux CTX of files.
42 * chgrp: (coreutils)chgrp invocation. Change file groups.
43 * chmod: (coreutils)chmod invocation. Change access permissions.
44 * chown: (coreutils)chown invocation. Change file owners and groups.
45 * chroot: (coreutils)chroot invocation. Specify the root directory.
46 * cksum: (coreutils)cksum invocation. Print POSIX CRC checksum.
47 * comm: (coreutils)comm invocation. Compare sorted files by line.
48 * cp: (coreutils)cp invocation. Copy files.
49 * csplit: (coreutils)csplit invocation. Split by context.
50 * cut: (coreutils)cut invocation. Print selected parts of lines.
51 * date: (coreutils)date invocation. Print/set system date and time.
52 * dd: (coreutils)dd invocation. Copy and convert a file.
53 * df: (coreutils)df invocation. Report file system disk usage.
54 * dir: (coreutils)dir invocation. List directories briefly.
55 * dircolors: (coreutils)dircolors invocation. Color setup for ls.
56 * dirname: (coreutils)dirname invocation. Strip last file name component.
57 * du: (coreutils)du invocation. Report on disk usage.
58 * echo: (coreutils)echo invocation. Print a line of text.
59 * env: (coreutils)env invocation. Modify the environment.
60 * expand: (coreutils)expand invocation. Convert tabs to spaces.
61 * expr: (coreutils)expr invocation. Evaluate expressions.
62 * factor: (coreutils)factor invocation. Print prime factors
63 * false: (coreutils)false invocation. Do nothing, unsuccessfully.
64 * fmt: (coreutils)fmt invocation. Reformat paragraph text.
65 * fold: (coreutils)fold invocation. Wrap long input lines.
66 * groups: (coreutils)groups invocation. Print group names a user is in.
67 * head: (coreutils)head invocation. Output the first part of files.
68 * hostid: (coreutils)hostid invocation. Print numeric host identifier.
69 * hostname: (coreutils)hostname invocation. Print or set system name.
70 * id: (coreutils)id invocation. Print user identity.
71 * install: (coreutils)install invocation. Copy and change attributes.
72 * join: (coreutils)join invocation. Join lines on a common field.
73 * kill: (coreutils)kill invocation. Send a signal to processes.
74 * link: (coreutils)link invocation. Make hard links between files.
75 * ln: (coreutils)ln invocation. Make links between files.
76 * logname: (coreutils)logname invocation. Print current login name.
77 * ls: (coreutils)ls invocation. List directory contents.
78 * md5sum: (coreutils)md5sum invocation. Print or check MD5 digests.
79 * mkdir: (coreutils)mkdir invocation. Create directories.
80 * mkfifo: (coreutils)mkfifo invocation. Create FIFOs (named pipes).
81 * mknod: (coreutils)mknod invocation. Create special files.
82 * mktemp: (coreutils)mktemp invocation. Create temporary files.
83 * mv: (coreutils)mv invocation. Rename files.
84 * nice: (coreutils)nice invocation. Modify niceness.
85 * nl: (coreutils)nl invocation. Number lines and write files.
86 * nohup: (coreutils)nohup invocation. Immunize to hangups.
87 * nproc: (coreutils)nproc invocation. Print the number of processors.
88 * numfmt: (coreutils)numfmt invocation. Reformat numbers.
89 * od: (coreutils)od invocation. Dump files in octal, etc.
90 * paste: (coreutils)paste invocation. Merge lines of files.
91 * pathchk: (coreutils)pathchk invocation. Check file name portability.
92 * pr: (coreutils)pr invocation. Paginate or columnate files.
93 * printenv: (coreutils)printenv invocation. Print environment variables.
94 * printf: (coreutils)printf invocation. Format and print data.
95 * ptx: (coreutils)ptx invocation. Produce permuted indexes.
96 * pwd: (coreutils)pwd invocation. Print working directory.
97 * readlink: (coreutils)readlink invocation. Print referent of a symlink.
98 * realpath: (coreutils)readpath invocation. Print resolved file names.
99 * rm: (coreutils)rm invocation. Remove files.
100 * rmdir: (coreutils)rmdir invocation. Remove empty directories.
101 * runcon: (coreutils)runcon invocation. Run in specified SELinux CTX.
102 * seq: (coreutils)seq invocation. Print numeric sequences
103 * sha1sum: (coreutils)sha1sum invocation. Print or check SHA-1 digests.
104 * sha2: (coreutils)sha2 utilities. Print or check SHA-2 digests.
105 * shred: (coreutils)shred invocation. Remove files more securely.
106 * shuf: (coreutils)shuf invocation. Shuffling text files.
107 * sleep: (coreutils)sleep invocation. Delay for a specified time.
108 * sort: (coreutils)sort invocation. Sort text files.
109 * split: (coreutils)split invocation. Split into pieces.
110 * stat: (coreutils)stat invocation. Report file(system) status.
111 * stdbuf: (coreutils)stdbuf invocation. Modify stdio buffering.
112 * stty: (coreutils)stty invocation. Print/change terminal settings.
113 * sum: (coreutils)sum invocation. Print traditional checksum.
114 * sync: (coreutils)sync invocation. Synchronize memory and disk.
115 * tac: (coreutils)tac invocation. Reverse files.
116 * tail: (coreutils)tail invocation. Output the last part of files.
117 * tee: (coreutils)tee invocation. Redirect to multiple files.
118 * test: (coreutils)test invocation. File/string tests.
119 * timeout: (coreutils)timeout invocation. Run with time limit.
120 * touch: (coreutils)touch invocation. Change file timestamps.
121 * tr: (coreutils)tr invocation. Translate characters.
122 * true: (coreutils)true invocation. Do nothing, successfully.
123 * truncate: (coreutils)truncate invocation. Shrink/extend size of a file.
124 * tsort: (coreutils)tsort invocation. Topological sort.
125 * tty: (coreutils)tty invocation. Print terminal name.
126 * uname: (coreutils)uname invocation. Print system information.
127 * unexpand: (coreutils)unexpand invocation. Convert spaces to tabs.
128 * uniq: (coreutils)uniq invocation. Uniquify files.
129 * unlink: (coreutils)unlink invocation. Removal via unlink(2).
130 * uptime: (coreutils)uptime invocation. Print uptime and load.
131 * users: (coreutils)users invocation. Print current user names.
132 * vdir: (coreutils)vdir invocation. List directories verbosely.
133 * wc: (coreutils)wc invocation. Line, word, and byte counts.
134 * who: (coreutils)who invocation. Print who is logged in.
135 * whoami: (coreutils)whoami invocation. Print effective user ID.
136 * yes: (coreutils)yes invocation. Print a string indefinitely.
140 This manual documents version @value{VERSION} of the GNU core
141 utilities, including the standard programs for text and file manipulation.
143 Copyright @copyright{} 1994-2013 Free Software Foundation, Inc.
146 Permission is granted to copy, distribute and/or modify this document
147 under the terms of the GNU Free Documentation License, Version 1.3 or
148 any later version published by the Free Software Foundation; with no
149 Invariant Sections, with no Front-Cover Texts, and with no Back-Cover
150 Texts. A copy of the license is included in the section entitled ``GNU
151 Free Documentation License''.
156 @title GNU @code{Coreutils}
157 @subtitle Core GNU utilities
158 @subtitle for version @value{VERSION}, @value{UPDATED}
159 @author David MacKenzie et al.
162 @vskip 0pt plus 1filll
175 @cindex core utilities
176 @cindex text utilities
177 @cindex shell utilities
178 @cindex file utilities
181 * Introduction:: Caveats, overview, and authors
182 * Common options:: Common options
183 * Output of entire files:: cat tac nl od base64
184 * Formatting file contents:: fmt pr fold
185 * Output of parts of files:: head tail split csplit
186 * Summarizing files:: wc sum cksum md5sum sha1sum sha2
187 * Operating on sorted files:: sort shuf uniq comm ptx tsort
188 * Operating on fields:: cut paste join
189 * Operating on characters:: tr expand unexpand
190 * Directory listing:: ls dir vdir dircolors
191 * Basic operations:: cp dd install mv rm shred
192 * Special file types:: mkdir rmdir unlink mkfifo mknod ln link readlink
193 * Changing file attributes:: chgrp chmod chown touch
194 * Disk usage:: df du stat sync truncate
195 * Printing text:: echo printf yes
196 * Conditions:: false true test expr
198 * File name manipulation:: dirname basename pathchk mktemp realpath
199 * Working context:: pwd stty printenv tty
200 * User information:: id logname whoami groups users who
201 * System context:: date arch nproc uname hostname hostid uptime
202 * SELinux context:: chcon runcon
203 * Modified command invocation:: chroot env nice nohup stdbuf timeout
204 * Process control:: kill
206 * Numeric operations:: factor seq
207 * File permissions:: Access modes
208 * Date input formats:: Specifying date strings
209 * Opening the software toolbox:: The software tools philosophy
210 * GNU Free Documentation License:: Copying and sharing this manual
211 * Concept index:: General index
214 --- The Detailed Node Listing ---
218 * Exit status:: Indicating program success or failure
219 * Backup options:: Backup options
220 * Block size:: Block size
221 * Floating point:: Floating point number representation
222 * Signal specifications:: Specifying signals
223 * Disambiguating names and IDs:: chgrp and chown owner and group syntax
224 * Random sources:: Sources of random data
225 * Target directory:: Target directory
226 * Trailing slashes:: Trailing slashes
227 * Traversing symlinks:: Traversing symlinks to directories
228 * Treating / specially:: Treating / specially
229 * Standards conformance:: Standards conformance
231 Output of entire files
233 * cat invocation:: Concatenate and write files
234 * tac invocation:: Concatenate and write files in reverse
235 * nl invocation:: Number lines and write files
236 * od invocation:: Write files in octal or other formats
237 * base64 invocation:: Transform data into printable data
239 Formatting file contents
241 * fmt invocation:: Reformat paragraph text
242 * numfmt invocation:: Reformat numbers
243 * pr invocation:: Paginate or columnate files for printing
244 * fold invocation:: Wrap input lines to fit in specified width
246 Output of parts of files
248 * head invocation:: Output the first part of files
249 * tail invocation:: Output the last part of files
250 * split invocation:: Split a file into fixed-size pieces
251 * csplit invocation:: Split a file into context-determined pieces
255 * wc invocation:: Print newline, word, and byte counts
256 * sum invocation:: Print checksum and block counts
257 * cksum invocation:: Print CRC checksum and byte counts
258 * md5sum invocation:: Print or check MD5 digests
259 * sha1sum invocation:: Print or check SHA-1 digests
260 * sha2 utilities:: Print or check SHA-2 digests
262 Operating on sorted files
264 * sort invocation:: Sort text files
265 * shuf invocation:: Shuffle text files
266 * uniq invocation:: Uniquify files
267 * comm invocation:: Compare two sorted files line by line
268 * ptx invocation:: Produce a permuted index of file contents
269 * tsort invocation:: Topological sort
271 @command{ptx}: Produce permuted indexes
273 * General options in ptx:: Options which affect general program behavior
274 * Charset selection in ptx:: Underlying character set considerations
275 * Input processing in ptx:: Input fields, contexts, and keyword selection
276 * Output formatting in ptx:: Types of output format, and sizing the fields
277 * Compatibility in ptx:: The GNU extensions to @command{ptx}
281 * cut invocation:: Print selected parts of lines
282 * paste invocation:: Merge lines of files
283 * join invocation:: Join lines on a common field
285 Operating on characters
287 * tr invocation:: Translate, squeeze, and/or delete characters
288 * expand invocation:: Convert tabs to spaces
289 * unexpand invocation:: Convert spaces to tabs
291 @command{tr}: Translate, squeeze, and/or delete characters
293 * Character sets:: Specifying sets of characters
294 * Translating:: Changing one set of characters to another
295 * Squeezing:: Squeezing repeats and deleting
299 * ls invocation:: List directory contents
300 * dir invocation:: Briefly list directory contents
301 * vdir invocation:: Verbosely list directory contents
302 * dircolors invocation:: Color setup for @command{ls}
304 @command{ls}: List directory contents
306 * Which files are listed:: Which files are listed
307 * What information is listed:: What information is listed
308 * Sorting the output:: Sorting the output
309 * Details about version sort:: More details about version sort
310 * General output formatting:: General output formatting
311 * Formatting the file names:: Formatting the file names
315 * cp invocation:: Copy files and directories
316 * dd invocation:: Convert and copy a file
317 * install invocation:: Copy files and set attributes
318 * mv invocation:: Move (rename) files
319 * rm invocation:: Remove files or directories
320 * shred invocation:: Remove files more securely
324 * link invocation:: Make a hard link via the link syscall
325 * ln invocation:: Make links between files
326 * mkdir invocation:: Make directories
327 * mkfifo invocation:: Make FIFOs (named pipes)
328 * mknod invocation:: Make block or character special files
329 * readlink invocation:: Print value of a symlink or canonical file name
330 * rmdir invocation:: Remove empty directories
331 * unlink invocation:: Remove files via unlink syscall
333 Changing file attributes
335 * chown invocation:: Change file owner and group
336 * chgrp invocation:: Change group ownership
337 * chmod invocation:: Change access permissions
338 * touch invocation:: Change file timestamps
342 * df invocation:: Report file system disk space usage
343 * du invocation:: Estimate file space usage
344 * stat invocation:: Report file or file system status
345 * sync invocation:: Synchronize data on disk with memory
346 * truncate invocation:: Shrink or extend the size of a file
350 * echo invocation:: Print a line of text
351 * printf invocation:: Format and print data
352 * yes invocation:: Print a string until interrupted
356 * false invocation:: Do nothing, unsuccessfully
357 * true invocation:: Do nothing, successfully
358 * test invocation:: Check file types and compare values
359 * expr invocation:: Evaluate expressions
361 @command{test}: Check file types and compare values
363 * File type tests:: File type tests
364 * Access permission tests:: Access permission tests
365 * File characteristic tests:: File characteristic tests
366 * String tests:: String tests
367 * Numeric tests:: Numeric tests
369 @command{expr}: Evaluate expression
371 * String expressions:: + : match substr index length
372 * Numeric expressions:: + - * / %
373 * Relations for expr:: | & < <= = == != >= >
374 * Examples of expr:: Examples of using @command{expr}
378 * tee invocation:: Redirect output to multiple files or processes
380 File name manipulation
382 * basename invocation:: Strip directory and suffix from a file name
383 * dirname invocation:: Strip last file name component
384 * pathchk invocation:: Check file name validity and portability
385 * mktemp invocation:: Create temporary file or directory
386 * realpath invocation:: Print resolved file names
390 * pwd invocation:: Print working directory
391 * stty invocation:: Print or change terminal characteristics
392 * printenv invocation:: Print all or some environment variables
393 * tty invocation:: Print file name of terminal on standard input
395 @command{stty}: Print or change terminal characteristics
397 * Control:: Control settings
398 * Input:: Input settings
399 * Output:: Output settings
400 * Local:: Local settings
401 * Combination:: Combination settings
402 * Characters:: Special characters
403 * Special:: Special settings
407 * id invocation:: Print user identity
408 * logname invocation:: Print current login name
409 * whoami invocation:: Print effective user ID
410 * groups invocation:: Print group names a user is in
411 * users invocation:: Print login names of users currently logged in
412 * who invocation:: Print who is currently logged in
416 * arch invocation:: Print machine hardware name
417 * date invocation:: Print or set system date and time
418 * nproc invocation:: Print the number of processors
419 * uname invocation:: Print system information
420 * hostname invocation:: Print or set system name
421 * hostid invocation:: Print numeric host identifier
422 * uptime invocation:: Print system uptime and load
424 @command{date}: Print or set system date and time
426 * Time conversion specifiers:: %[HIklMNpPrRsSTXzZ]
427 * Date conversion specifiers:: %[aAbBcCdDeFgGhjmuUVwWxyY]
428 * Literal conversion specifiers:: %[%nt]
429 * Padding and other flags:: Pad with zeros, spaces, etc.
430 * Setting the time:: Changing the system clock
431 * Options for date:: Instead of the current time
432 * Date input formats:: Specifying date strings
433 * Examples of date:: Examples
437 * chcon invocation:: Change SELinux context of file
438 * runcon invocation:: Run a command in specified SELinux context
440 Modified command invocation
442 * chroot invocation:: Run a command with a different root directory
443 * env invocation:: Run a command in a modified environment
444 * nice invocation:: Run a command with modified niceness
445 * nohup invocation:: Run a command immune to hangups
446 * stdbuf invocation:: Run a command with modified I/O buffering
447 * timeout invocation:: Run a command with a time limit
451 * kill invocation:: Sending a signal to processes.
455 * sleep invocation:: Delay for a specified time
459 * factor invocation:: Print prime factors
460 * seq invocation:: Print numeric sequences
464 * Mode Structure:: Structure of file mode bits
465 * Symbolic Modes:: Mnemonic representation of file mode bits
466 * Numeric Modes:: File mode bits as octal numbers
467 * Directory Setuid and Setgid:: Set-user-ID and set-group-ID on directories
471 * General date syntax:: Common rules
472 * Calendar date items:: 19 Dec 1994
473 * Time of day items:: 9:20pm
474 * Time zone items:: EST, PDT, UTC, @dots{}
475 * Combined date and time of day items:: 1972-09-24T20:02:00,000000-0500
476 * Day of week items:: Monday and others
477 * Relative items in date strings:: next tuesday, 2 years ago
478 * Pure numbers in date strings:: 19931219, 1440
479 * Seconds since the Epoch:: @@1078100502
480 * Specifying time zone rules:: TZ="America/New_York", TZ="UTC0"
481 * Authors of parse_datetime:: Bellovin, Eggert, Salz, Berets, et al.
483 Opening the software toolbox
485 * Toolbox introduction:: Toolbox introduction
486 * I/O redirection:: I/O redirection
487 * The who command:: The @command{who} command
488 * The cut command:: The @command{cut} command
489 * The sort command:: The @command{sort} command
490 * The uniq command:: The @command{uniq} command
491 * Putting the tools together:: Putting the tools together
495 * GNU Free Documentation License:: Copying and sharing this manual
502 @chapter Introduction
504 This manual is a work in progress: many sections make no attempt to explain
505 basic concepts in a way suitable for novices. Thus, if you are interested,
506 please get involved in improving this manual. The entire GNU community
510 The GNU utilities documented here are mostly compatible with the
512 @cindex bugs, reporting
513 Please report bugs to @email{bug-coreutils@@gnu.org}. Remember
514 to include the version number, machine architecture, input files, and
515 any other information needed to reproduce the bug: your input, what you
516 expected, what you got, and why it is wrong. Diffs are welcome, but
517 please include a description of the problem as well, since this is
518 sometimes difficult to infer. @xref{Bugs, , , gcc, Using and Porting GNU CC}.
524 @cindex MacKenzie, D.
527 This manual was originally derived from the Unix man pages in the
528 distributions, which were written by David MacKenzie and updated by Jim
529 Meyering. What you are reading now is the authoritative documentation
530 for these utilities; the man pages are no longer being maintained. The
531 original @command{fmt} man page was written by Ross Paterson. Fran@,{c}ois
532 Pinard did the initial conversion to Texinfo format. Karl Berry did the
533 indexing, some reorganization, and editing of the results. Brian
534 Youmans of the Free Software Foundation office staff combined the
535 manuals for textutils, fileutils, and sh-utils to produce the present
536 omnibus manual. Richard Stallman contributed his usual invaluable
537 insights to the overall process.
540 @chapter Common options
544 @itemx @w{@kbd{--backup}[=@var{method}]}
547 @vindex VERSION_CONTROL
548 @cindex backups, making
549 @xref{Backup options}.
550 Make a backup of each file that would otherwise be overwritten or removed.
553 @macro optBackupSuffix
554 @item -S @var{suffix}
555 @itemx --suffix=@var{suffix}
558 Append @var{suffix} to each backup file made with @option{-b}.
559 @xref{Backup options}.
562 @macro optTargetDirectory
563 @item -t @var{directory}
564 @itemx @w{@kbd{--target-directory}=@var{directory}}
566 @opindex --target-directory
567 @cindex target directory
568 @cindex destination directory
569 Specify the destination @var{directory}.
570 @xref{Target directory}.
573 @macro optNoTargetDirectory
575 @itemx --no-target-directory
577 @opindex --no-target-directory
578 @cindex target directory
579 @cindex destination directory
580 Do not treat the last operand specially when it is a directory or a
581 symbolic link to a directory. @xref{Target directory}.
589 @cindex output NUL-byte-terminated lines
590 Output a zero byte (ASCII NUL) at the end of each line,
591 rather than a newline. This option enables other programs to parse the
592 output of @command{\cmd\} even when that output would contain data
593 with embedded newlines.
600 Append an SI-style abbreviation to each size, such as @samp{M} for
601 megabytes. Powers of 1000 are used, not 1024; @samp{M} stands for
602 1,000,000 bytes. This option is equivalent to
603 @option{--block-size=si}. Use the @option{-h} or
604 @option{--human-readable} option if
605 you prefer powers of 1024.
608 @macro optHumanReadable
610 @itemx --human-readable
612 @opindex --human-readable
613 @cindex human-readable output
614 Append a size letter to each size, such as @samp{M} for mebibytes.
615 Powers of 1024 are used, not 1000; @samp{M} stands for 1,048,576 bytes.
616 This option is equivalent to @option{--block-size=human-readable}.
617 Use the @option{--si} option if you prefer powers of 1000.
620 @macro optStripTrailingSlashes
621 @item @w{@kbd{--strip-trailing-slashes}}
622 @opindex --strip-trailing-slashes
623 @cindex stripping trailing slashes
624 Remove any trailing slashes from each @var{source} argument.
625 @xref{Trailing slashes}.
628 @macro mayConflictWithShellBuiltIn{cmd}
629 @cindex conflicts with shell built-ins
630 @cindex built-in shell commands, conflicts with
631 Due to shell aliases and built-in @command{\cmd\} functions, using an
632 unadorned @command{\cmd\} interactively or in a script may get you
633 different functionality than that described here. Invoke it via
634 @command{env} (i.e., @code{env \cmd\ @dots{}}) to avoid interference
639 @macro multiplierSuffixes{varName}
640 @var{\varName\} may be, or may be an integer optionally followed by,
641 one of the following multiplicative suffixes:
643 @samp{b} => 512 ("blocks")
644 @samp{KB} => 1000 (KiloBytes)
645 @samp{K} => 1024 (KibiBytes)
646 @samp{MB} => 1000*1000 (MegaBytes)
647 @samp{M} => 1024*1024 (MebiBytes)
648 @samp{GB} => 1000*1000*1000 (GigaBytes)
649 @samp{G} => 1024*1024*1024 (GibiBytes)
651 and so on for @samp{T}, @samp{P}, @samp{E}, @samp{Z}, and @samp{Y}.
654 @c FIXME: same as above, but no ``blocks'' line.
655 @macro multiplierSuffixesNoBlocks{varName}
656 @var{\varName\} may be, or may be an integer optionally followed by,
657 one of the following multiplicative suffixes:
659 @samp{KB} => 1000 (KiloBytes)
660 @samp{K} => 1024 (KibiBytes)
661 @samp{MB} => 1000*1000 (MegaBytes)
662 @samp{M} => 1024*1024 (MebiBytes)
663 @samp{GB} => 1000*1000*1000 (GigaBytes)
664 @samp{G} => 1024*1024*1024 (GibiBytes)
666 and so on for @samp{T}, @samp{P}, @samp{E}, @samp{Z}, and @samp{Y}.
669 @cindex common options
671 Certain options are available in all of these programs. Rather than
672 writing identical descriptions for each of the programs, they are
673 described here. (In fact, every GNU program accepts (or should accept)
676 @vindex POSIXLY_CORRECT
677 Normally options and operands can appear in any order, and programs act
678 as if all the options appear before any operands. For example,
679 @samp{sort -r passwd -t :} acts like @samp{sort -r -t : passwd}, since
680 @samp{:} is an option-argument of @option{-t}. However, if the
681 @env{POSIXLY_CORRECT} environment variable is set, options must appear
682 before operands, unless otherwise specified for a particular command.
684 A few programs can usefully have trailing operands with leading
685 @samp{-}. With such a program, options must precede operands even if
686 @env{POSIXLY_CORRECT} is not set, and this fact is noted in the
687 program description. For example, the @command{env} command's options
688 must appear before its operands, since in some cases the operands
689 specify a command that itself contains options.
691 Most programs that accept long options recognize unambiguous
692 abbreviations of those options. For example, @samp{rmdir
693 --ignore-fail-on-non-empty} can be invoked as @samp{rmdir
694 --ignore-fail} or even @samp{rmdir --i}. Ambiguous options, such as
695 @samp{ls --h}, are identified as such.
697 Some of these programs recognize the @option{--help} and @option{--version}
698 options only when one of them is the sole command line argument. For
699 these programs, abbreviations of the long options are not always recognized.
706 Print a usage message listing all available options, then exit successfully.
710 @cindex version number, finding
711 Print the version number, then exit successfully.
715 @cindex option delimiter
716 Delimit the option list. Later arguments, if any, are treated as
717 operands even if they begin with @samp{-}. For example, @samp{sort --
718 -r} reads from the file named @file{-r}.
722 @cindex standard input
723 @cindex standard output
724 A single @samp{-} operand is not really an option, though it looks like one. It
725 stands for standard input, or for standard output if that is clear from
726 the context. For example, @samp{sort -} reads from standard input,
727 and is equivalent to plain @samp{sort}, and @samp{tee -} writes an
728 extra copy of its input to standard output. Unless otherwise
729 specified, @samp{-} can appear as any operand that requires a file
733 * Exit status:: Indicating program success or failure.
734 * Backup options:: -b -S, in some programs.
735 * Block size:: BLOCK_SIZE and --block-size, in some programs.
736 * Floating point:: Floating point number representation.
737 * Signal specifications:: Specifying signals using the --signal option.
738 * Disambiguating names and IDs:: chgrp and chown owner and group syntax
739 * Random sources:: --random-source, in some programs.
740 * Target directory:: Specifying a target directory, in some programs.
741 * Trailing slashes:: --strip-trailing-slashes, in some programs.
742 * Traversing symlinks:: -H, -L, or -P, in some programs.
743 * Treating / specially:: --preserve-root and --no-preserve-root.
744 * Special built-in utilities:: @command{break}, @command{:}, @dots{}
745 * Standards conformance:: Conformance to the POSIX standard.
753 An exit status of zero indicates success,
754 and a nonzero value indicates failure.
757 Nearly every command invocation yields an integral @dfn{exit status}
758 that can be used to change how other commands work.
759 For the vast majority of commands, an exit status of zero indicates
760 success. Failure is indicated by a nonzero value---typically
761 @samp{1}, though it may differ on unusual platforms as POSIX
762 requires only that it be nonzero.
764 However, some of the programs documented here do produce
765 other exit status values and a few associate different
766 meanings with the values @samp{0} and @samp{1}.
767 Here are some of the exceptions:
768 @command{chroot}, @command{env}, @command{expr}, @command{nice},
769 @command{nohup}, @command{numfmt}, @command{printenv}, @command{sort},
770 @command{stdbuf}, @command{test}, @command{timeout}, @command{tty}.
774 @section Backup options
776 @cindex backup options
778 Some GNU programs (at least @command{cp}, @command{install},
779 @command{ln}, and @command{mv}) optionally make backups of files
780 before writing new versions.
781 These options control the details of these backups. The options are also
782 briefly mentioned in the descriptions of the particular programs.
787 @itemx @w{@kbd{--backup}[=@var{method}]}
790 @vindex VERSION_CONTROL
791 @cindex backups, making
792 Make a backup of each file that would otherwise be overwritten or removed.
793 Without this option, the original versions are destroyed.
794 Use @var{method} to determine the type of backups to make.
795 When this option is used but @var{method} is not specified,
796 then the value of the @env{VERSION_CONTROL}
797 environment variable is used. And if @env{VERSION_CONTROL} is not set,
798 the default backup type is @samp{existing}.
800 Note that the short form of this option, @option{-b} does not accept any
801 argument. Using @option{-b} is equivalent to using @option{--backup=existing}.
803 @vindex version-control @r{Emacs variable}
804 This option corresponds to the Emacs variable @samp{version-control};
805 the values for @var{method} are the same as those used in Emacs.
806 This option also accepts more descriptive names.
807 The valid @var{method}s are (unique abbreviations are accepted):
812 @opindex none @r{backup method}
817 @opindex numbered @r{backup method}
818 Always make numbered backups.
822 @opindex existing @r{backup method}
823 Make numbered backups of files that already have them, simple backups
828 @opindex simple @r{backup method}
829 Always make simple backups. Please note @samp{never} is not to be
830 confused with @samp{none}.
834 @item -S @var{suffix}
835 @itemx --suffix=@var{suffix}
838 @cindex backup suffix
839 @vindex SIMPLE_BACKUP_SUFFIX
840 Append @var{suffix} to each backup file made with @option{-b}. If this
841 option is not specified, the value of the @env{SIMPLE_BACKUP_SUFFIX}
842 environment variable is used. And if @env{SIMPLE_BACKUP_SUFFIX} is not
843 set, the default is @samp{~}, just as in Emacs.
852 Some GNU programs (at least @command{df}, @command{du}, and
853 @command{ls}) display sizes in ``blocks''. You can adjust the block size
854 and method of display to make sizes easier to read. The block size
855 used for display is independent of any file system block size.
856 Fractional block counts are rounded up to the nearest integer.
858 @opindex --block-size=@var{size}
861 @vindex DF_BLOCK_SIZE
862 @vindex DU_BLOCK_SIZE
863 @vindex LS_BLOCK_SIZE
864 @vindex POSIXLY_CORRECT@r{, and block size}
866 The default block size is chosen by examining the following environment
867 variables in turn; the first one that is set determines the block size.
872 This specifies the default block size for the @command{df} command.
873 Similarly, @env{DU_BLOCK_SIZE} specifies the default for @command{du} and
874 @env{LS_BLOCK_SIZE} for @command{ls}.
877 This specifies the default block size for all three commands, if the
878 above command-specific environment variables are not set.
881 This specifies the default block size for all values that are normally
882 printed as blocks, if neither @env{BLOCK_SIZE} nor the above
883 command-specific environment variables are set. Unlike the other
884 environment variables, @env{BLOCKSIZE} does not affect values that are
885 normally printed as byte counts, e.g., the file sizes contained in
888 @item POSIXLY_CORRECT
889 If neither @env{@var{command}_BLOCK_SIZE}, nor @env{BLOCK_SIZE}, nor
890 @env{BLOCKSIZE} is set, but this variable is set, the block size
895 If none of the above environment variables are set, the block size
896 currently defaults to 1024 bytes in most contexts, but this number may
897 change in the future. For @command{ls} file sizes, the block size
900 @cindex human-readable output
903 A block size specification can be a positive integer specifying the number
904 of bytes per block, or it can be @code{human-readable} or @code{si} to
905 select a human-readable format. Integers may be followed by suffixes
906 that are upward compatible with the
907 @uref{http://www.bipm.org/en/si/si_brochure/chapter3/prefixes.html, SI prefixes}
908 for decimal multiples and with the
909 @uref{http://physics.nist.gov/cuu/Units/binary.html, ISO/IEC 80000-13
910 (formerly IEC 60027-2) prefixes} for binary multiples.
912 With human-readable formats, output sizes are followed by a size letter
913 such as @samp{M} for megabytes. @code{BLOCK_SIZE=human-readable} uses
914 powers of 1024; @samp{M} stands for 1,048,576 bytes.
915 @code{BLOCK_SIZE=si} is similar, but uses powers of 1000 and appends
916 @samp{B}; @samp{MB} stands for 1,000,000 bytes.
919 A block size specification preceded by @samp{'} causes output sizes to
920 be displayed with thousands separators. The @env{LC_NUMERIC} locale
921 specifies the thousands separator and grouping. For example, in an
922 American English locale, @samp{--block-size="'1kB"} would cause a size
923 of 1234000 bytes to be displayed as @samp{1,234}. In the default C
924 locale, there is no thousands separator so a leading @samp{'} has no
927 An integer block size can be followed by a suffix to specify a
928 multiple of that size. A bare size letter,
929 or one followed by @samp{iB}, specifies
930 a multiple using powers of 1024. A size letter followed by @samp{B}
931 specifies powers of 1000 instead. For example, @samp{1M} and
932 @samp{1MiB} are equivalent to @samp{1048576}, whereas @samp{1MB} is
933 equivalent to @samp{1000000}.
935 A plain suffix without a preceding integer acts as if @samp{1} were
936 prepended, except that it causes a size indication to be appended to
937 the output. For example, @samp{--block-size="kB"} displays 3000 as
940 The following suffixes are defined. Large sizes like @code{1Y}
941 may be rejected by your computer due to limitations of its arithmetic.
945 @cindex kilobyte, definition of
946 kilobyte: @math{10^3 = 1000}.
950 @cindex kibibyte, definition of
951 kibibyte: @math{2^{10} = 1024}. @samp{K} is special: the SI prefix is
952 @samp{k} and the ISO/IEC 80000-13 prefix is @samp{Ki}, but tradition and
953 POSIX use @samp{k} to mean @samp{KiB}.
955 @cindex megabyte, definition of
956 megabyte: @math{10^6 = 1,000,000}.
959 @cindex mebibyte, definition of
960 mebibyte: @math{2^{20} = 1,048,576}.
962 @cindex gigabyte, definition of
963 gigabyte: @math{10^9 = 1,000,000,000}.
966 @cindex gibibyte, definition of
967 gibibyte: @math{2^{30} = 1,073,741,824}.
969 @cindex terabyte, definition of
970 terabyte: @math{10^{12} = 1,000,000,000,000}.
973 @cindex tebibyte, definition of
974 tebibyte: @math{2^{40} = 1,099,511,627,776}.
976 @cindex petabyte, definition of
977 petabyte: @math{10^{15} = 1,000,000,000,000,000}.
980 @cindex pebibyte, definition of
981 pebibyte: @math{2^{50} = 1,125,899,906,842,624}.
983 @cindex exabyte, definition of
984 exabyte: @math{10^{18} = 1,000,000,000,000,000,000}.
987 @cindex exbibyte, definition of
988 exbibyte: @math{2^{60} = 1,152,921,504,606,846,976}.
990 @cindex zettabyte, definition of
991 zettabyte: @math{10^{21} = 1,000,000,000,000,000,000,000}
994 @math{2^{70} = 1,180,591,620,717,411,303,424}.
996 @cindex yottabyte, definition of
997 yottabyte: @math{10^{24} = 1,000,000,000,000,000,000,000,000}.
1000 @math{2^{80} = 1,208,925,819,614,629,174,706,176}.
1005 @opindex --block-size
1006 @opindex --human-readable
1009 Block size defaults can be overridden by an explicit
1010 @option{--block-size=@var{size}} option. The @option{-k}
1011 option is equivalent to @option{--block-size=1K}, which
1012 is the default unless the @env{POSIXLY_CORRECT} environment variable is
1013 set. The @option{-h} or @option{--human-readable} option is equivalent to
1014 @option{--block-size=human-readable}. The @option{--si} option is
1015 equivalent to @option{--block-size=si}.
1017 @node Floating point
1018 @section Floating point numbers
1019 @cindex floating point
1020 @cindex IEEE floating point
1022 Commands that accept or produce floating point numbers employ the
1023 floating point representation of the underlying system, and suffer
1024 from rounding error, overflow, and similar floating-point issues.
1025 Almost all modern systems use IEEE-754 floating point, and it is
1026 typically portable to assume IEEE-754 behavior these days. IEEE-754
1027 has positive and negative infinity, distinguishes positive from
1028 negative zero, and uses special values called NaNs to represent
1029 invalid computations such as dividing zero by itself. For more
1030 information, please see David Goldberg's paper
1031 @uref{http://@/www.validlab.com/@/goldberg/@/paper.pdf, What Every
1032 Computer Scientist Should Know About Floating-Point Arithmetic}.
1035 Commands that accept floating point numbers as options, operands or
1036 input use the standard C functions @code{strtod} and @code{strtold} to
1037 convert from text to floating point numbers. These floating point
1038 numbers therefore can use scientific notation like @code{1.0e-34} and
1039 @code{-10e100}. Modern C implementations also accept hexadecimal
1040 floating point numbers such as @code{-0x.ep-3}, which stands for
1041 @minus{}14/16 times @math{2^-3}, which equals @minus{}0.109375. The
1042 @env{LC_NUMERIC} locale determines the decimal-point character.
1043 @xref{Parsing of Floats,,, libc, The GNU C Library Reference Manual}.
1045 @node Signal specifications
1046 @section Signal specifications
1047 @cindex signals, specifying
1049 A @var{signal} may be a signal name like @samp{HUP}, or a signal
1050 number like @samp{1}, or an exit status of a process terminated by the
1051 signal. A signal name can be given in canonical form or prefixed by
1052 @samp{SIG}@. The case of the letters is ignored. The following signal names
1053 and numbers are supported on all POSIX compliant systems:
1059 2. Terminal interrupt.
1065 9. Kill (cannot be caught or ignored).
1073 Other supported signal names have system-dependent corresponding
1074 numbers. All systems conforming to POSIX 1003.1-2001 also
1075 support the following signals:
1079 Access to an undefined portion of a memory object.
1081 Child process terminated, stopped, or continued.
1083 Continue executing, if stopped.
1085 Erroneous arithmetic operation.
1087 Illegal Instruction.
1089 Write on a pipe with no one to read it.
1091 Invalid memory reference.
1093 Stop executing (cannot be caught or ignored).
1097 Background process attempting read.
1099 Background process attempting write.
1101 High bandwidth data is available at a socket.
1103 User-defined signal 1.
1105 User-defined signal 2.
1109 POSIX 1003.1-2001 systems that support the XSI extension
1110 also support the following signals:
1116 Profiling timer expired.
1120 Trace/breakpoint trap.
1122 Virtual timer expired.
1124 CPU time limit exceeded.
1126 File size limit exceeded.
1130 POSIX 1003.1-2001 systems that support the XRT extension
1131 also support at least eight real-time signals called @samp{RTMIN},
1132 @samp{RTMIN+1}, @dots{}, @samp{RTMAX-1}, @samp{RTMAX}.
1134 @node Disambiguating names and IDs
1135 @section chown and chgrp: Disambiguating user names and IDs
1136 @cindex user names, disambiguating
1137 @cindex user IDs, disambiguating
1138 @cindex group names, disambiguating
1139 @cindex group IDs, disambiguating
1140 @cindex disambiguating group names and IDs
1142 Since the @var{owner} and @var{group} arguments to @command{chown} and
1143 @command{chgrp} may be specified as names or numeric IDs, there is an
1145 What if a user or group @emph{name} is a string of digits?
1146 @footnote{Using a number as a user name is common in some environments.}
1147 Should the command interpret it as a user name or as an ID@?
1148 POSIX requires that @command{chown} and @command{chgrp}
1149 first attempt to resolve the specified string as a name, and
1150 only once that fails, then try to interpret it as an ID@.
1151 This is troublesome when you want to specify a numeric ID, say 42,
1152 and it must work even in a pathological situation where
1153 @samp{42} is a user name that maps to some other user ID, say 1000.
1154 Simply invoking @code{chown 42 F}, will set @file{F}s owner ID to
1155 1000---not what you intended.
1157 GNU @command{chown} and @command{chgrp} provide a way to work around this,
1158 that at the same time may result in a significant performance improvement
1159 by eliminating a database look-up.
1160 Simply precede each numeric user ID and/or group ID with a @samp{+},
1161 in order to force its interpretation as an integer:
1165 chgrp +$numeric_group_id another-file
1169 GNU @command{chown} and @command{chgrp}
1170 skip the name look-up process for each @samp{+}-prefixed string,
1171 because a string containing @samp{+} is never a valid user or group name.
1172 This syntax is accepted on most common Unix systems, but not on Solaris 10.
1174 @node Random sources
1175 @section Sources of random data
1177 @cindex random sources
1179 The @command{shuf}, @command{shred}, and @command{sort} commands
1180 sometimes need random data to do their work. For example, @samp{sort
1181 -R} must choose a hash function at random, and it needs random data to
1182 make this selection.
1184 By default these commands use an internal pseudorandom generator
1185 initialized by a small amount of entropy, but can be directed to use
1186 an external source with the @option{--random-source=@var{file}} option.
1187 An error is reported if @var{file} does not contain enough bytes.
1189 For example, the device file @file{/dev/urandom} could be used as the
1190 source of random data. Typically, this device gathers environmental
1191 noise from device drivers and other sources into an entropy pool, and
1192 uses the pool to generate random bits. If the pool is short of data,
1193 the device reuses the internal pool to produce more bits, using a
1194 cryptographically secure pseudorandom number generator. But be aware
1195 that this device is not designed for bulk random data generation
1196 and is relatively slow.
1198 @file{/dev/urandom} suffices for most practical uses, but applications
1199 requiring high-value or long-term protection of private data may
1200 require an alternate data source like @file{/dev/random} or
1201 @file{/dev/arandom}. The set of available sources depends on your
1204 To reproduce the results of an earlier invocation of a command, you
1205 can save some random data into a file and then use that file as the
1206 random source in earlier and later invocations of the command.
1208 @node Target directory
1209 @section Target directory
1211 @cindex target directory
1213 The @command{cp}, @command{install}, @command{ln}, and @command{mv}
1214 commands normally treat the last operand specially when it is a
1215 directory or a symbolic link to a directory. For example, @samp{cp
1216 source dest} is equivalent to @samp{cp source dest/source} if
1217 @file{dest} is a directory. Sometimes this behavior is not exactly
1218 what is wanted, so these commands support the following options to
1219 allow more fine-grained control:
1224 @itemx --no-target-directory
1225 @opindex --no-target-directory
1226 @cindex target directory
1227 @cindex destination directory
1228 Do not treat the last operand specially when it is a directory or a
1229 symbolic link to a directory. This can help avoid race conditions in
1230 programs that operate in a shared area. For example, when the command
1231 @samp{mv /tmp/source /tmp/dest} succeeds, there is no guarantee that
1232 @file{/tmp/source} was renamed to @file{/tmp/dest}: it could have been
1233 renamed to @file{/tmp/dest/source} instead, if some other process
1234 created @file{/tmp/dest} as a directory. However, if @file{mv
1235 -T /tmp/source /tmp/dest} succeeds, there is no
1236 question that @file{/tmp/source} was renamed to @file{/tmp/dest}.
1238 In the opposite situation, where you want the last operand to be
1239 treated as a directory and want a diagnostic otherwise, you can use
1240 the @option{--target-directory} (@option{-t}) option.
1242 @item -t @var{directory}
1243 @itemx @w{@kbd{--target-directory}=@var{directory}}
1244 @opindex --target-directory
1245 @cindex target directory
1246 @cindex destination directory
1247 Use @var{directory} as the directory component of each destination
1250 The interface for most programs is that after processing options and a
1251 finite (possibly zero) number of fixed-position arguments, the remaining
1252 argument list is either expected to be empty, or is a list of items
1253 (usually files) that will all be handled identically. The @command{xargs}
1254 program is designed to work well with this convention.
1256 The commands in the @command{mv}-family are unusual in that they take
1257 a variable number of arguments with a special case at the @emph{end}
1258 (namely, the target directory). This makes it nontrivial to perform some
1259 operations, e.g., ``move all files from here to ../d/'', because
1260 @code{mv * ../d/} might exhaust the argument space, and @code{ls | xargs ...}
1261 doesn't have a clean way to specify an extra final argument for each
1262 invocation of the subject command. (It can be done by going through a
1263 shell command, but that requires more human labor and brain power than
1266 The @w{@kbd{--target-directory}} (@option{-t}) option allows the @command{cp},
1267 @command{install}, @command{ln}, and @command{mv} programs to be used
1268 conveniently with @command{xargs}. For example, you can move the files
1269 from the current directory to a sibling directory, @code{d} like this:
1272 ls | xargs mv -t ../d --
1275 However, this doesn't move files whose names begin with @samp{.}.
1276 If you use the GNU @command{find} program, you can move those
1277 files too, with this command:
1280 find . -mindepth 1 -maxdepth 1 \
1284 But both of the above approaches fail if there are no files in the
1285 current directory, or if any file has a name containing a blank or
1286 some other special characters.
1287 The following example removes those limitations and requires both
1288 GNU @command{find} and GNU @command{xargs}:
1291 find . -mindepth 1 -maxdepth 1 -print0 \
1292 | xargs --null --no-run-if-empty \
1299 The @option{--target-directory} (@option{-t}) and
1300 @option{--no-target-directory} (@option{-T})
1301 options cannot be combined.
1303 @node Trailing slashes
1304 @section Trailing slashes
1306 @cindex trailing slashes
1308 Some GNU programs (at least @command{cp} and @command{mv}) allow you to
1309 remove any trailing slashes from each @var{source} argument before
1310 operating on it. The @w{@kbd{--strip-trailing-slashes}} option enables
1313 This is useful when a @var{source} argument may have a trailing slash and
1314 @c FIXME: mv's behavior in this case is system-dependent
1315 specify a symbolic link to a directory. This scenario is in fact rather
1316 common because some shells can automatically append a trailing slash when
1317 performing file name completion on such symbolic links. Without this
1318 option, @command{mv}, for example, (via the system's rename function) must
1319 interpret a trailing slash as a request to dereference the symbolic link
1320 and so must rename the indirectly referenced @emph{directory} and not
1321 the symbolic link. Although it may seem surprising that such behavior
1322 be the default, it is required by POSIX and is consistent with
1323 other parts of that standard.
1325 @node Traversing symlinks
1326 @section Traversing symlinks
1328 @cindex symbolic link to directory, controlling traversal of
1330 The following options modify how @command{chown} and @command{chgrp}
1331 @c FIXME: note that 'du' has these options, too, but they have slightly
1332 @c different meaning.
1333 traverse a hierarchy when the @option{--recursive} (@option{-R})
1334 option is also specified.
1335 If more than one of the following options is specified, only the final
1337 These options specify whether processing a symbolic link to a directory
1338 entails operating on just the symbolic link or on all files in the
1339 hierarchy rooted at that directory.
1341 These options are independent of @option{--dereference} and
1342 @option{--no-dereference} (@option{-h}), which control whether to modify
1343 a symlink or its referent.
1350 @cindex symbolic link to directory, traverse if on the command line
1351 If @option{--recursive} (@option{-R}) is specified and
1352 a command line argument is a symbolic link to a directory, traverse it.
1359 @cindex symbolic link to directory, traverse each that is encountered
1360 In a recursive traversal, traverse every symbolic link to a directory
1361 that is encountered.
1368 @cindex symbolic link to directory, never traverse
1369 Do not traverse any symbolic links.
1370 This is the default if none of @option{-H}, @option{-L},
1371 or @option{-P} is specified.
1378 @node Treating / specially
1379 @section Treating @file{/} specially
1381 Certain commands can operate destructively on entire hierarchies.
1382 For example, if a user with appropriate privileges mistakenly runs
1383 @samp{rm -rf / tmp/junk}, that may remove
1384 all files on the entire system. Since there are so few
1385 legitimate uses for such a command,
1386 GNU @command{rm} normally declines to operate on any directory
1387 that resolves to @file{/}. If you really want to try to remove all
1388 the files on your system, you can use the @option{--no-preserve-root}
1389 option, but the default behavior, specified by the
1390 @option{--preserve-root} option, is safer for most purposes.
1392 The commands @command{chgrp}, @command{chmod} and @command{chown}
1393 can also operate destructively on entire hierarchies, so they too
1394 support these options. Although, unlike @command{rm}, they don't
1395 actually unlink files, these commands are arguably more dangerous
1396 when operating recursively on @file{/}, since they often work much
1397 more quickly, and hence damage more files before an alert user can
1398 interrupt them. Tradition and POSIX require these commands
1399 to operate recursively on @file{/}, so they default to
1400 @option{--no-preserve-root}, but using the @option{--preserve-root}
1401 option makes them safer for most purposes. For convenience you can
1402 specify @option{--preserve-root} in an alias or in a shell function.
1404 Note that the @option{--preserve-root} option also ensures
1405 that @command{chgrp} and @command{chown} do not modify @file{/}
1406 even when dereferencing a symlink pointing to @file{/}.
1408 @node Special built-in utilities
1409 @section Special built-in utilities
1411 Some programs like @command{nice} can invoke other programs; for
1412 example, the command @samp{nice cat file} invokes the program
1413 @command{cat} by executing the command @samp{cat file}. However,
1414 @dfn{special built-in utilities} like @command{exit} cannot be invoked
1415 this way. For example, the command @samp{nice exit} does not have a
1416 well-defined behavior: it may generate an error message instead of
1419 Here is a list of the special built-in utilities that are standardized
1420 by POSIX 1003.1-2004.
1423 @t{.@: : break continue eval exec exit export readonly
1424 return set shift times trap unset}
1427 For example, because @samp{.}, @samp{:}, and @samp{exec} are special,
1428 the commands @samp{nice . foo.sh}, @samp{nice :}, and @samp{nice exec
1429 pwd} do not work as you might expect.
1431 Many shells extend this list. For example, Bash has several extra
1432 special built-in utilities like @command{history}, and
1433 @command{suspend}, and with Bash the command @samp{nice suspend}
1434 generates an error message instead of suspending.
1436 @node Standards conformance
1437 @section Standards conformance
1439 @vindex POSIXLY_CORRECT
1440 In a few cases, the GNU utilities' default behavior is
1441 incompatible with the POSIX standard. To suppress these
1442 incompatibilities, define the @env{POSIXLY_CORRECT} environment
1443 variable. Unless you are checking for POSIX conformance, you
1444 probably do not need to define @env{POSIXLY_CORRECT}.
1446 Newer versions of POSIX are occasionally incompatible with older
1447 versions. For example, older versions of POSIX required the
1448 command @samp{sort +1} to sort based on the second and succeeding
1449 fields in each input line, but starting with POSIX 1003.1-2001
1450 the same command is required to sort the file named @file{+1}, and you
1451 must instead use the command @samp{sort -k 2} to get the field-based
1454 @vindex _POSIX2_VERSION
1455 The GNU utilities normally conform to the version of POSIX
1456 that is standard for your system. To cause them to conform to a
1457 different version of POSIX, define the @env{_POSIX2_VERSION}
1458 environment variable to a value of the form @var{yyyymm} specifying
1459 the year and month the standard was adopted. Three values are currently
1460 supported for @env{_POSIX2_VERSION}: @samp{199209} stands for
1461 POSIX 1003.2-1992, @samp{200112} stands for POSIX
1462 1003.1-2001, and @samp{200809} stands for POSIX 1003.1-2008.
1463 For example, if you have a newer system but are running software
1464 that assumes an older version of POSIX and uses @samp{sort +1}
1465 or @samp{tail +10}, you can work around any compatibility problems by setting
1466 @samp{_POSIX2_VERSION=199209} in your environment.
1468 @node Output of entire files
1469 @chapter Output of entire files
1471 @cindex output of entire files
1472 @cindex entire files, output of
1474 These commands read and write entire files, possibly transforming them
1478 * cat invocation:: Concatenate and write files.
1479 * tac invocation:: Concatenate and write files in reverse.
1480 * nl invocation:: Number lines and write files.
1481 * od invocation:: Write files in octal or other formats.
1482 * base64 invocation:: Transform data into printable data.
1485 @node cat invocation
1486 @section @command{cat}: Concatenate and write files
1489 @cindex concatenate and write files
1490 @cindex copying files
1492 @command{cat} copies each @var{file} (@samp{-} means standard input), or
1493 standard input if none are given, to standard output. Synopsis:
1496 cat [@var{option}] [@var{file}]@dots{}
1499 The program accepts the following options. Also see @ref{Common options}.
1507 Equivalent to @option{-vET}.
1510 @itemx --number-nonblank
1512 @opindex --number-nonblank
1513 Number all nonempty output lines, starting with 1.
1517 Equivalent to @option{-vE}.
1522 @opindex --show-ends
1523 Display a @samp{$} after the end of each line.
1529 Number all output lines, starting with 1. This option is ignored
1530 if @option{-b} is in effect.
1533 @itemx --squeeze-blank
1535 @opindex --squeeze-blank
1536 @cindex squeezing empty lines
1537 Suppress repeated adjacent empty lines; output just one empty line
1542 Equivalent to @option{-vT}.
1547 @opindex --show-tabs
1548 Display TAB characters as @samp{^I}.
1552 Ignored; for POSIX compatibility.
1555 @itemx --show-nonprinting
1557 @opindex --show-nonprinting
1558 Display control characters except for LFD and TAB using
1559 @samp{^} notation and precede characters that have the high bit set with
1564 On systems like MS-DOS that distinguish between text and binary files,
1565 @command{cat} normally reads and writes in binary mode. However,
1566 @command{cat} reads in text mode if one of the options
1567 @option{-bensAE} is used or if @command{cat} is reading from standard
1568 input and standard input is a terminal. Similarly, @command{cat}
1569 writes in text mode if one of the options @option{-bensAE} is used or
1570 if standard output is a terminal.
1577 # Output f's contents, then standard input, then g's contents.
1580 # Copy standard input to standard output.
1585 @node tac invocation
1586 @section @command{tac}: Concatenate and write files in reverse
1589 @cindex reversing files
1591 @command{tac} copies each @var{file} (@samp{-} means standard input), or
1592 standard input if none are given, to standard output, reversing the
1593 records (lines by default) in each separately. Synopsis:
1596 tac [@var{option}]@dots{} [@var{file}]@dots{}
1599 @dfn{Records} are separated by instances of a string (newline by
1600 default). By default, this separator string is attached to the end of
1601 the record that it follows in the file.
1603 The program accepts the following options. Also see @ref{Common options}.
1611 The separator is attached to the beginning of the record that it
1612 precedes in the file.
1618 Treat the separator string as a regular expression.
1620 @item -s @var{separator}
1621 @itemx --separator=@var{separator}
1623 @opindex --separator
1624 Use @var{separator} as the record separator, instead of newline.
1628 On systems like MS-DOS that distinguish between text and binary files,
1629 @command{tac} reads and writes in binary mode.
1636 # Reverse a file character by character.
1642 @section @command{nl}: Number lines and write files
1645 @cindex numbering lines
1646 @cindex line numbering
1648 @command{nl} writes each @var{file} (@samp{-} means standard input), or
1649 standard input if none are given, to standard output, with line numbers
1650 added to some or all of the lines. Synopsis:
1653 nl [@var{option}]@dots{} [@var{file}]@dots{}
1656 @cindex logical pages, numbering on
1657 @command{nl} decomposes its input into (logical) pages; by default, the
1658 line number is reset to 1 at the top of each logical page. @command{nl}
1659 treats all of the input files as a single document; it does not reset
1660 line numbers or logical pages between files.
1662 @cindex headers, numbering
1663 @cindex body, numbering
1664 @cindex footers, numbering
1665 A logical page consists of three sections: header, body, and footer.
1666 Any of the sections can be empty. Each can be numbered in a different
1667 style from the others.
1669 The beginnings of the sections of logical pages are indicated in the
1670 input file by a line containing exactly one of these delimiter strings:
1681 The two characters from which these strings are made can be changed from
1682 @samp{\} and @samp{:} via options (see below), but the pattern and
1683 length of each string cannot be changed.
1685 A section delimiter is replaced by an empty line on output. Any text
1686 that comes before the first section delimiter string in the input file
1687 is considered to be part of a body section, so @command{nl} treats a
1688 file that contains no section delimiters as a single body section.
1690 The program accepts the following options. Also see @ref{Common options}.
1694 @item -b @var{style}
1695 @itemx --body-numbering=@var{style}
1697 @opindex --body-numbering
1698 Select the numbering style for lines in the body section of each
1699 logical page. When a line is not numbered, the current line number
1700 is not incremented, but the line number separator character is still
1701 prepended to the line. The styles are:
1707 number only nonempty lines (default for body),
1709 do not number lines (default for header and footer),
1711 number only lines that contain a match for the basic regular
1712 expression @var{bre}.
1713 @xref{Regular Expressions, , Regular Expressions, grep, The GNU Grep Manual}.
1717 @itemx --section-delimiter=@var{cd}
1719 @opindex --section-delimiter
1720 @cindex section delimiters of pages
1721 Set the section delimiter characters to @var{cd}; default is
1722 @samp{\:}. If only @var{c} is given, the second remains @samp{:}.
1723 (Remember to protect @samp{\} or other metacharacters from shell
1724 expansion with quotes or extra backslashes.)
1726 @item -f @var{style}
1727 @itemx --footer-numbering=@var{style}
1729 @opindex --footer-numbering
1730 Analogous to @option{--body-numbering}.
1732 @item -h @var{style}
1733 @itemx --header-numbering=@var{style}
1735 @opindex --header-numbering
1736 Analogous to @option{--body-numbering}.
1738 @item -i @var{number}
1739 @itemx --line-increment=@var{number}
1741 @opindex --line-increment
1742 Increment line numbers by @var{number} (default 1).
1744 @item -l @var{number}
1745 @itemx --join-blank-lines=@var{number}
1747 @opindex --join-blank-lines
1748 @cindex empty lines, numbering
1749 @cindex blank lines, numbering
1750 Consider @var{number} (default 1) consecutive empty lines to be one
1751 logical line for numbering, and only number the last one. Where fewer
1752 than @var{number} consecutive empty lines occur, do not number them.
1753 An empty line is one that contains no characters, not even spaces
1756 @item -n @var{format}
1757 @itemx --number-format=@var{format}
1759 @opindex --number-format
1760 Select the line numbering format (default is @code{rn}):
1764 @opindex ln @r{format for @command{nl}}
1765 left justified, no leading zeros;
1767 @opindex rn @r{format for @command{nl}}
1768 right justified, no leading zeros;
1770 @opindex rz @r{format for @command{nl}}
1771 right justified, leading zeros.
1775 @itemx --no-renumber
1777 @opindex --no-renumber
1778 Do not reset the line number at the start of a logical page.
1780 @item -s @var{string}
1781 @itemx --number-separator=@var{string}
1783 @opindex --number-separator
1784 Separate the line number from the text line in the output with
1785 @var{string} (default is the TAB character).
1787 @item -v @var{number}
1788 @itemx --starting-line-number=@var{number}
1790 @opindex --starting-line-number
1791 Set the initial line number on each logical page to @var{number} (default 1).
1793 @item -w @var{number}
1794 @itemx --number-width=@var{number}
1796 @opindex --number-width
1797 Use @var{number} characters for line numbers (default 6).
1805 @section @command{od}: Write files in octal or other formats
1808 @cindex octal dump of files
1809 @cindex hex dump of files
1810 @cindex ASCII dump of files
1811 @cindex file contents, dumping unambiguously
1813 @command{od} writes an unambiguous representation of each @var{file}
1814 (@samp{-} means standard input), or standard input if none are given.
1818 od [@var{option}]@dots{} [@var{file}]@dots{}
1819 od [-abcdfilosx]@dots{} [@var{file}] [[+]@var{offset}[.][b]]
1820 od [@var{option}]@dots{} --traditional [@var{file}]@c
1821 [[+]@var{offset}[.][b] [[+]@var{label}[.][b]]]
1824 Each line of output consists of the offset in the input, followed by
1825 groups of data from the file. By default, @command{od} prints the offset in
1826 octal, and each group of file data is a C @code{short int}'s worth of input
1827 printed as a single octal number.
1829 If @var{offset} is given, it specifies how many input bytes to skip
1830 before formatting and writing. By default, it is interpreted as an
1831 octal number, but the optional trailing decimal point causes it to be
1832 interpreted as decimal. If no decimal is specified and the offset
1833 begins with @samp{0x} or @samp{0X} it is interpreted as a hexadecimal
1834 number. If there is a trailing @samp{b}, the number of bytes skipped
1835 will be @var{offset} multiplied by 512.
1837 If a command is of both the first and second forms, the second form is
1838 assumed if the last operand begins with @samp{+} or (if there are two
1839 operands) a digit. For example, in @samp{od foo 10} and @samp{od +10}
1840 the @samp{10} is an offset, whereas in @samp{od 10} the @samp{10} is a
1843 The program accepts the following options. Also see @ref{Common options}.
1847 @item -A @var{radix}
1848 @itemx --address-radix=@var{radix}
1850 @opindex --address-radix
1851 @cindex radix for file offsets
1852 @cindex file offset radix
1853 Select the base in which file offsets are printed. @var{radix} can
1854 be one of the following:
1864 none (do not print offsets).
1867 The default is octal.
1869 @item -j @var{bytes}
1870 @itemx --skip-bytes=@var{bytes}
1872 @opindex --skip-bytes
1873 Skip @var{bytes} input bytes before formatting and writing. If
1874 @var{bytes} begins with @samp{0x} or @samp{0X}, it is interpreted in
1875 hexadecimal; otherwise, if it begins with @samp{0}, in octal; otherwise,
1877 @multiplierSuffixes{bytes}
1879 @item -N @var{bytes}
1880 @itemx --read-bytes=@var{bytes}
1882 @opindex --read-bytes
1883 Output at most @var{bytes} bytes of the input. Prefixes and suffixes on
1884 @code{bytes} are interpreted as for the @option{-j} option.
1886 @item -S @var{bytes}
1887 @itemx --strings[=@var{bytes}]
1890 @cindex string constants, outputting
1891 Instead of the normal output, output only @dfn{string constants}: at
1892 least @var{bytes} consecutive ASCII graphic characters,
1893 followed by a zero byte (ASCII NUL).
1894 Prefixes and suffixes on @var{bytes} are interpreted as for the
1897 If @var{bytes} is omitted with @option{--strings}, the default is 3.
1900 @itemx --format=@var{type}
1903 Select the format in which to output the file data. @var{type} is a
1904 string of one or more of the below type indicator characters. If you
1905 include more than one type indicator character in a single @var{type}
1906 string, or use this option more than once, @command{od} writes one copy
1907 of each output line using each of the data types that you specified,
1908 in the order that you specified.
1910 Adding a trailing ``z'' to any type specification appends a display
1911 of the single byte character representation of the printable characters
1912 to the output line generated by the type specification.
1916 named character, ignoring high-order bit
1918 printable single byte character, C backslash escape
1919 or a 3 digit octal sequence
1923 floating point (@pxref{Floating point})
1932 The type @code{a} outputs things like @samp{sp} for space, @samp{nl} for
1933 newline, and @samp{nul} for a zero byte. Only the least significant
1934 seven bits of each byte is used; the high-order bit is ignored.
1935 Type @code{c} outputs
1936 @samp{ }, @samp{\n}, and @code{\0}, respectively.
1939 Except for types @samp{a} and @samp{c}, you can specify the number
1940 of bytes to use in interpreting each number in the given data type
1941 by following the type indicator character with a decimal integer.
1942 Alternately, you can specify the size of one of the C compiler's
1943 built-in data types by following the type indicator character with
1944 one of the following characters. For integers (@samp{d}, @samp{o},
1945 @samp{u}, @samp{x}):
1958 For floating point (@code{f}):
1970 @itemx --output-duplicates
1972 @opindex --output-duplicates
1973 Output consecutive lines that are identical. By default, when two or
1974 more consecutive output lines would be identical, @command{od} outputs only
1975 the first line, and puts just an asterisk on the following line to
1976 indicate the elision.
1979 @itemx --width[=@var{n}]
1982 Dump @code{n} input bytes per output line. This must be a multiple of
1983 the least common multiple of the sizes associated with the specified
1986 If this option is not given at all, the default is 16. If @var{n} is
1987 omitted, the default is 32.
1991 The next several options are shorthands for format specifications.
1992 GNU @command{od} accepts any combination of shorthands and format
1993 specification options. These options accumulate.
1999 Output as named characters. Equivalent to @samp{-t a}.
2003 Output as octal bytes. Equivalent to @samp{-t o1}.
2007 Output as printable single byte characters, C backslash escapes
2008 or 3 digit octal sequences. Equivalent to @samp{-t c}.
2012 Output as unsigned decimal two-byte units. Equivalent to @samp{-t u2}.
2016 Output as floats. Equivalent to @samp{-t fF}.
2020 Output as decimal ints. Equivalent to @samp{-t dI}.
2024 Output as decimal long ints. Equivalent to @samp{-t dL}.
2028 Output as octal two-byte units. Equivalent to @option{-t o2}.
2032 Output as decimal two-byte units. Equivalent to @option{-t d2}.
2036 Output as hexadecimal two-byte units. Equivalent to @samp{-t x2}.
2039 @opindex --traditional
2040 Recognize the non-option label argument that traditional @command{od}
2041 accepted. The following syntax:
2044 od --traditional [@var{file}] [[+]@var{offset}[.][b] [[+]@var{label}[.][b]]]
2048 can be used to specify at most one file and optional arguments
2049 specifying an offset and a pseudo-start address, @var{label}.
2050 The @var{label} argument is interpreted
2051 just like @var{offset}, but it specifies an initial pseudo-address. The
2052 pseudo-addresses are displayed in parentheses following any normal
2059 @node base64 invocation
2060 @section @command{base64}: Transform data into printable data
2063 @cindex base64 encoding
2065 @command{base64} transforms data read from a file, or standard input,
2066 into (or from) base64 encoded form. The base64 encoded form uses
2067 printable ASCII characters to represent binary data.
2071 base64 [@var{option}]@dots{} [@var{file}]
2072 base64 --decode [@var{option}]@dots{} [@var{file}]
2075 The base64 encoding expands data to roughly 133% of the original.
2076 The format conforms to
2077 @uref{ftp://ftp.rfc-editor.org/in-notes/rfc4648.txt, RFC 4648}.
2079 The program accepts the following options. Also see @ref{Common options}.
2084 @itemx --wrap=@var{cols}
2088 @cindex column to wrap data after
2089 During encoding, wrap lines after @var{cols} characters. This must be
2092 The default is to wrap after 76 characters. Use the value 0 to
2093 disable line wrapping altogether.
2099 @cindex Decode base64 data
2100 @cindex Base64 decoding
2101 Change the mode of operation, from the default of encoding data, to
2102 decoding data. Input is expected to be base64 encoded data, and the
2103 output will be the original data.
2106 @itemx --ignore-garbage
2108 @opindex --ignore-garbage
2109 @cindex Ignore garbage in base64 stream
2110 When decoding, newlines are always accepted.
2111 During decoding, ignore unrecognized bytes,
2112 to permit distorted data to be decoded.
2119 @node Formatting file contents
2120 @chapter Formatting file contents
2122 @cindex formatting file contents
2124 These commands reformat the contents of files.
2127 * fmt invocation:: Reformat paragraph text.
2128 * numfmt invocation:: Reformat numbers.
2129 * pr invocation:: Paginate or columnate files for printing.
2130 * fold invocation:: Wrap input lines to fit in specified width.
2134 @node fmt invocation
2135 @section @command{fmt}: Reformat paragraph text
2138 @cindex reformatting paragraph text
2139 @cindex paragraphs, reformatting
2140 @cindex text, reformatting
2142 @command{fmt} fills and joins lines to produce output lines of (at most)
2143 a given number of characters (75 by default). Synopsis:
2146 fmt [@var{option}]@dots{} [@var{file}]@dots{}
2149 @command{fmt} reads from the specified @var{file} arguments (or standard
2150 input if none are given), and writes to standard output.
2152 By default, blank lines, spaces between words, and indentation are
2153 preserved in the output; successive input lines with different
2154 indentation are not joined; tabs are expanded on input and introduced on
2157 @cindex line-breaking
2158 @cindex sentences and line-breaking
2159 @cindex Knuth, Donald E.
2160 @cindex Plass, Michael F.
2161 @command{fmt} prefers breaking lines at the end of a sentence, and tries to
2162 avoid line breaks after the first word of a sentence or before the last
2163 word of a sentence. A @dfn{sentence break} is defined as either the end
2164 of a paragraph or a word ending in any of @samp{.?!}, followed by two
2165 spaces or end of line, ignoring any intervening parentheses or quotes.
2166 Like @TeX{}, @command{fmt} reads entire ``paragraphs'' before choosing line
2167 breaks; the algorithm is a variant of that given by Donald E. Knuth
2168 and Michael F. Plass in ``Breaking Paragraphs Into Lines'',
2169 @cite{Software---Practice & Experience} @b{11}, 11 (November 1981),
2172 The program accepts the following options. Also see @ref{Common options}.
2177 @itemx --crown-margin
2179 @opindex --crown-margin
2180 @cindex crown margin
2181 @dfn{Crown margin} mode: preserve the indentation of the first two
2182 lines within a paragraph, and align the left margin of each subsequent
2183 line with that of the second line.
2186 @itemx --tagged-paragraph
2188 @opindex --tagged-paragraph
2189 @cindex tagged paragraphs
2190 @dfn{Tagged paragraph} mode: like crown margin mode, except that if
2191 indentation of the first line of a paragraph is the same as the
2192 indentation of the second, the first line is treated as a one-line
2198 @opindex --split-only
2199 Split lines only. Do not join short lines to form longer ones. This
2200 prevents sample lines of code, and other such ``formatted'' text from
2201 being unduly combined.
2204 @itemx --uniform-spacing
2206 @opindex --uniform-spacing
2207 Uniform spacing. Reduce spacing between words to one space, and spacing
2208 between sentences to two spaces.
2211 @itemx -w @var{width}
2212 @itemx --width=@var{width}
2213 @opindex -@var{width}
2216 Fill output lines up to @var{width} characters (default 75 or @var{goal}
2217 plus 10, if @var{goal} is provided).
2220 @itemx --goal=@var{goal}
2223 @command{fmt} initially tries to make lines @var{goal} characters wide.
2224 By default, this is 7% shorter than @var{width}.
2226 @item -p @var{prefix}
2227 @itemx --prefix=@var{prefix}
2228 Only lines beginning with @var{prefix} (possibly preceded by whitespace)
2229 are subject to formatting. The prefix and any preceding whitespace are
2230 stripped for the formatting and then re-attached to each formatted output
2231 line. One use is to format certain kinds of program comments, while
2232 leaving the code unchanged.
2238 @node numfmt invocation
2239 @section @command{numfmt}: Reformat numbers
2243 @command{numfmt} reads numbers in various representations and reformats them
2244 as requested. The most common usage is converting numbers to/from @emph{human}
2245 representation (e.g. @samp{4G} @expansion{} @samp{4,000,000,000}).
2248 numfmt [@var{option}]@dots{} [@var{number}]
2251 @command{numfmt} converts each @var{number} on the command-line according to the
2252 specified options (see below). If no @var{number}s are given, it reads numbers
2253 from standard input. @command{numfmt} can optionally extract numbers from
2254 specific columns, maintaining proper line padding and alignment.
2258 See @option{--invalid} for additional information regarding exit status.
2260 @subsection General options
2262 The program accepts the following options. Also see @ref{Common options}.
2268 Print (to standard error) warning messages about possible erroneous usage.
2271 @itemx --delimiter=@var{d}
2273 @opindex --delimiter
2274 Use the character @var{d} as input field separator (default: whitespace).
2275 @emph{Note}: Using non-default delimiter turns off automatic padding.
2277 @item --field=@var{n}
2279 Convert the number in input field @var{n} (default: 1).
2281 @item --format=@var{format}
2283 Use printf-style floating FORMAT string. The @var{format} string must contain
2284 one @samp{%f} directive, optionally with @samp{'}, @samp{-}, or width
2285 modifiers. The @samp{'} modifier will enable @option{--grouping}, the @samp{-}
2286 modifier will enable left-aligned @option{--padding} and the width modifier will
2287 enable right-aligned @option{--padding}.
2289 @item --from=@var{unit}
2291 Auto-scales input numbers according to @var{unit}. See UNITS below.
2292 The default is no scaling, meaning suffixes (e.g. @samp{M}, @samp{G}) will
2295 @item --from-unit=@var{n}
2296 @opindex --from-unit
2297 Specify the input unit size (instead of the default 1). Use this option when
2298 the input numbers represent other units (e.g. if the input number @samp{10}
2299 represents 10 units of 512 bytes, use @samp{--from=unit=512}).
2303 Group digits in output numbers according to the current locale's grouping rules
2304 (e.g @emph{Thousands Separator} character, commonly @samp{.} (dot) or @samp{,}
2305 comma). This option has no effect in @samp{POSIX/C} locale.
2307 @item --header[=@var{n}]
2310 Print the first @var{n} (default: 1) lines without any conversion.
2312 @item --invalid=@var{mode}
2314 The default action on input errors is to exit immediately with status code 2.
2315 @option{--invalid=@samp{abort}} explicitly specifies this default mode.
2316 With a @var{mode} of @samp{fail}, print a warning for @emph{each} conversion
2317 error, and exit with status 2. With a @var{mode} of @samp{warn}, exit with
2318 status 0, even in the presence of conversion errors, and with a @var{mode} of
2319 @samp{ignore} do not even print diagnostics.
2321 @item --padding=@var{n}
2323 Pad the output numbers to @var{n} characters, by adding spaces. If @var{n} is
2324 a positive number, numbers will be right-aligned. If @var{n} is a negative
2325 number, numbers will be left-aligned. By default, numbers are automatically
2326 aligned based on the input line's width (only with the default delimiter).
2328 @item --round=@var{method}
2331 @opindex --round=down
2332 @opindex --round=from-zero
2333 @opindex --round=towards-zero
2334 @opindex --round=nearest
2335 When converting number representations, round the number according to
2336 @var{method}, which can be @samp{up}, @samp{down},
2337 @samp{from-zero} (the default), @samp{towards-zero}, @samp{nearest}.
2339 @item --suffix=@var{suffix}
2341 Add @samp{SUFFIX} to the output numbers, and accept optional @samp{SUFFIX} in
2344 @item --to=@var{unit}
2346 Auto-scales output numbers according to @var{unit}. See @emph{Units} below.
2347 The default is no scaling, meaning all the digits of the number are printed.
2349 @item --to-unit=@var{n}
2351 Specify the output unit size (instead of the default 1). Use this option when
2352 the output numbers represent other units (e.g. to represent @samp{4,000,000}
2353 bytes in blocks of 1KB, use @samp{--to=si --to=units=1000}).
2357 @subsection Possible @var{unit}s:
2359 The following are the possible @var{unit} options with @option{--from=UNITS} and
2360 @option{--to=UNITS}:
2365 No scaling is performed. For input numbers, no suffixes are accepted, and any
2366 trailing characters following the number will trigger an error. For output
2367 numbers, all digits of the numbers will be printed.
2370 Auto-scale numbers according to the @emph{International System of Units (SI)}
2372 For input numbers, accept one of the following suffixes.
2373 For output numbers, values larger than 1000 will be rounded, and printed with
2374 one of the following suffixes:
2377 @samp{K} => @math{1000^1 = 10^3} (Kilo)
2378 @samp{M} => @math{1000^2 = 10^6} (Mega)
2379 @samp{G} => @math{1000^3 = 10^9} (Giga)
2380 @samp{T} => @math{1000^4 = 10^{12}} (Tera)
2381 @samp{P} => @math{1000^5 = 10^{15}} (Peta)
2382 @samp{E} => @math{1000^6 = 10^{18}} (Exa)
2383 @samp{Z} => @math{1000^7 = 10^{21}} (Zetta)
2384 @samp{Y} => @math{1000^8 = 10^{24}} (Yotta)
2388 Auto-scale numbers according to the @emph{International Electronical
2389 Commission (IEC)} standard.
2390 For input numbers, accept one of the following suffixes.
2391 For output numbers, values larger than 1024 will be rounded, and printed with
2392 one of the following suffixes:
2395 @samp{K} => @math{1024^1 = 2^{10}} (Kibi)
2396 @samp{M} => @math{1024^2 = 2^{20}} (Mebi)
2397 @samp{G} => @math{1024^3 = 2^{30}} (Gibi)
2398 @samp{T} => @math{1024^4 = 2^{40}} (Tebi)
2399 @samp{P} => @math{1024^5 = 2^{50}} (Pebi)
2400 @samp{E} => @math{1024^6 = 2^{60}} (Exbi)
2401 @samp{Z} => @math{1024^7 = 2^{70}} (Zebi)
2402 @samp{Y} => @math{1024^8 = 2^{80}} (Yobi)
2405 The @option{iec} option uses a single letter suffix (e.g. @samp{G}), which is
2406 not fully standard, as the @emph{iec} standard recommends a two-letter symbol
2407 (e.g @samp{Gi}) - but in practice, this method common. Compare with
2408 the @option{iec-i} option.
2411 Auto-scale numbers according to the @emph{International Electronical
2412 Commission (IEC)} standard.
2413 For input numbers, accept one of the following suffixes.
2414 For output numbers, values larger than 1024 will be rounded, and printed with
2415 one of the following suffixes:
2418 @samp{Ki} => @math{1024^1 = 2^{10}} (Kibi)
2419 @samp{Mi} => @math{1024^2 = 2^{20}} (Mebi)
2420 @samp{Gi} => @math{1024^3 = 2^{30}} (Gibi)
2421 @samp{Ti} => @math{1024^4 = 2^{40}} (Tebi)
2422 @samp{Pi} => @math{1024^5 = 2^{50}} (Pebi)
2423 @samp{Ei} => @math{1024^6 = 2^{60}} (Exbi)
2424 @samp{Zi} => @math{1024^7 = 2^{70}} (Zebi)
2425 @samp{Yi} => @math{1024^8 = 2^{80}} (Yobi)
2428 The @option{iec-i} option uses a two-letter suffix symbol (e.g. @samp{Gi}),
2429 as the @emph{iec} standard recommends, but this is not always common in
2430 practice. Compare with the @option{iec} option.
2433 @samp{auto} can only be used with @option{--from}. With this method, numbers
2434 with @samp{K},@samp{M},@samp{G},@samp{T},@samp{P},@samp{E},@samp{Z},@samp{Y}
2435 suffixes are interpreted as @emph{SI} values, and numbers with @samp{Ki},
2436 @samp{Mi},@samp{Gi},@samp{Ti},@samp{Pi},@samp{Ei},@samp{Zi},@samp{Yi} suffixes
2437 are interpreted as @emph{IEC} values.
2441 @subsection Examples of using @command{numfmt}
2443 Converting a single number from/to @emph{human} representation:
2445 $ nunfmt --to=si 500000
2448 $ numfmt --to=iec 500000
2451 $ numfmt --to=iec-i 500000
2454 $ numfmt --from=si 1M
2457 $ numfmt --from=iec 1M
2460 # with '--from=auto', M=Mega, Mi=Mebi
2461 $ numfmt --from=auto 1M
2463 $ numfmt --from=auto 1Mi
2467 Converting from @samp{SI} to @samp{IEC} scales (e.g. when a harddisk capacity is
2468 advertised as @samp{1TB}, while checking the drive's capacity gives lower
2472 $ numfmt --from=si --to=iec 1T
2477 Converting a single field from an input file / piped input (these contrived
2478 examples are for demonstration purposes only, as both @command{ls} and
2479 @command{df} support the @option{--human-readable} option to
2480 output sizes in human-readable format):
2483 # Third field (file size) will be shown in SI representation
2484 $ ls -log | numfmt --field 3 --header --to=si | head -n4
2485 -rw-r--r-- 1 94K Aug 23 2011 ABOUT-NLS
2486 -rw-r--r-- 1 3.7K Jan 7 16:15 AUTHORS
2487 -rw-r--r-- 1 36K Jun 1 2011 COPYING
2488 -rw-r--r-- 1 0 Jan 7 15:15 ChangeLog
2490 # Second field (size) will be shown in IEC representation
2491 $ df --block-size=1 | numfmt --field 2 --header --to=iec | head -n4
2492 File system 1B-blocks Used Available Use% Mounted on
2493 rootfs 132G 104741408 26554036 80% /
2494 tmpfs 794M 7580 804960 1% /run/shm
2495 /dev/sdb1 694G 651424756 46074696 94% /home
2499 Output can be tweaked using @option{--padding} or @option{--format}:
2502 # Pad to 10 characters, right-aligned
2503 $ du -s * | numfmt --to=si --padding=10
2509 # Pad to 10 characters, left-aligned
2510 $ du -s * | numfmt --to=si --padding=-10
2516 # Pad to 10 characters, left-aligned, using 'format'
2517 $ du -s * | numfmt --to=si --format="%10f"
2523 # Pad to 10 characters, left-aligned, using 'format'
2524 $ du -s * | numfmt --to=si --padding="%-10f"
2531 With locales that support grouping digits, using @option{--grouping} or
2532 @option{--format} enables grouping. In @samp{POSIX} locale, grouping is
2536 $ LC_ALL=C numfmt --from=iec --grouping 2G
2539 $ LC_ALL=en_US.utf8 numfmt --from=iec --grouping 2G
2542 $ LC_ALL=ta_IN numfmt --from=iec --grouping 2G
2545 $ LC_ALL=C ./src/numfmt --from=iec --format="==%'15f==" 2G
2548 $ LC_ALL=en_US.utf8 ./src/numfmt --from=iec --format="==%'15f==" 2G
2551 $ LC_ALL=en_US.utf8 ./src/numfmt --from=iec --format="==%'-15f==" 2G
2554 $ LC_ALL=ta_IN ./src/numfmt --from=iec --format="==%'15f==" 2G
2559 @section @command{pr}: Paginate or columnate files for printing
2562 @cindex printing, preparing files for
2563 @cindex multicolumn output, generating
2564 @cindex merging files in parallel
2566 @command{pr} writes each @var{file} (@samp{-} means standard input), or
2567 standard input if none are given, to standard output, paginating and
2568 optionally outputting in multicolumn format; optionally merges all
2569 @var{file}s, printing all in parallel, one per column. Synopsis:
2572 pr [@var{option}]@dots{} [@var{file}]@dots{}
2576 By default, a 5-line header is printed at each page: two blank lines;
2577 a line with the date, the file name, and the page count; and two more
2578 blank lines. A footer of five blank lines is also printed.
2579 The default @var{page_length} is 66
2580 lines. The default number of text lines is therefore 56.
2581 The text line of the header takes the form
2582 @samp{@var{date} @var{string} @var{page}}, with spaces inserted around
2583 @var{string} so that the line takes up the full @var{page_width}. Here,
2584 @var{date} is the date (see the @option{-D} or @option{--date-format}
2585 option for details), @var{string} is the centered header string, and
2586 @var{page} identifies the page number. The @env{LC_MESSAGES} locale
2587 category affects the spelling of @var{page}; in the default C locale, it
2588 is @samp{Page @var{number}} where @var{number} is the decimal page
2591 Form feeds in the input cause page breaks in the output. Multiple form
2592 feeds produce empty pages.
2594 Columns are of equal width, separated by an optional string (default
2595 is @samp{space}). For multicolumn output, lines will always be truncated to
2596 @var{page_width} (default 72), unless you use the @option{-J} option.
2598 column output no line truncation occurs by default. Use @option{-W} option to
2599 truncate lines in that case.
2601 The program accepts the following options. Also see @ref{Common options}.
2605 @item +@var{first_page}[:@var{last_page}]
2606 @itemx --pages=@var{first_page}[:@var{last_page}]
2607 @c The two following @opindex lines evoke warnings because they contain ':'
2608 @c The 'info' spec does not permit that. If we use those lines, we end
2609 @c up with truncated index entries that don't work.
2610 @c @opindex +@var{first_page}[:@var{last_page}]
2611 @c @opindex --pages=@var{first_page}[:@var{last_page}]
2612 @opindex +@var{page_range}
2613 @opindex --pages=@var{page_range}
2614 Begin printing with page @var{first_page} and stop with @var{last_page}.
2615 Missing @samp{:@var{last_page}} implies end of file. While estimating
2616 the number of skipped pages each form feed in the input file results
2617 in a new page. Page counting with and without @samp{+@var{first_page}}
2618 is identical. By default, counting starts with the first page of input
2619 file (not first page printed). Line numbering may be altered by @option{-N}
2623 @itemx --columns=@var{column}
2624 @opindex -@var{column}
2626 @cindex down columns
2627 With each single @var{file}, produce @var{column} columns of output
2628 (default is 1) and print columns down, unless @option{-a} is used. The
2629 column width is automatically decreased as @var{column} increases; unless
2630 you use the @option{-W/-w} option to increase @var{page_width} as well.
2631 This option might well cause some lines to be truncated. The number of
2632 lines in the columns on each page are balanced. The options @option{-e}
2633 and @option{-i} are on for multiple text-column output. Together with
2634 @option{-J} option column alignment and line truncation is turned off.
2635 Lines of full length are joined in a free field format and @option{-S}
2636 option may set field separators. @option{-@var{column}} may not be used
2637 with @option{-m} option.
2643 @cindex across columns
2644 With each single @var{file}, print columns across rather than down. The
2645 @option{-@var{column}} option must be given with @var{column} greater than one.
2646 If a line is too long to fit in a column, it is truncated.
2649 @itemx --show-control-chars
2651 @opindex --show-control-chars
2652 Print control characters using hat notation (e.g., @samp{^G}); print
2653 other nonprinting characters in octal backslash notation. By default,
2654 nonprinting characters are not changed.
2657 @itemx --double-space
2659 @opindex --double-space
2660 @cindex double spacing
2661 Double space the output.
2663 @item -D @var{format}
2664 @itemx --date-format=@var{format}
2665 @cindex time formats
2666 @cindex formatting times
2667 Format header dates using @var{format}, using the same conventions as
2668 for the command @samp{date +@var{format}}. @xref{date invocation}.
2669 Except for directives, which start with
2670 @samp{%}, characters in @var{format} are printed unchanged. You can use
2671 this option to specify an arbitrary string in place of the header date,
2672 e.g., @option{--date-format="Monday morning"}.
2674 @vindex POSIXLY_CORRECT
2676 The default date format is @samp{%Y-%m-%d %H:%M} (for example,
2677 @samp{2001-12-04 23:59});
2678 but if the @env{POSIXLY_CORRECT} environment variable is set
2679 and the @env{LC_TIME} locale category specifies the POSIX
2680 locale, the default is @samp{%b %e %H:%M %Y} (for example,
2681 @samp{Dec@ @ 4 23:59 2001}.
2684 Time stamps are listed according to the time zone rules specified by
2685 the @env{TZ} environment variable, or by the system default rules if
2686 @env{TZ} is not set. @xref{TZ Variable,, Specifying the Time Zone
2687 with @env{TZ}, libc, The GNU C Library Reference Manual}.
2689 @item -e[@var{in-tabchar}[@var{in-tabwidth}]]
2690 @itemx --expand-tabs[=@var{in-tabchar}[@var{in-tabwidth}]]
2692 @opindex --expand-tabs
2694 Expand @var{tab}s to spaces on input. Optional argument @var{in-tabchar} is
2695 the input tab character (default is the TAB character). Second optional
2696 argument @var{in-tabwidth} is the input tab character's width (default
2704 @opindex --form-feed
2705 Use a form feed instead of newlines to separate output pages. This does
2706 not alter the default page length of 66 lines.
2708 @item -h @var{header}
2709 @itemx --header=@var{header}
2712 Replace the file name in the header with the centered string @var{header}.
2713 When using the shell, @var{header} should be quoted and should be
2714 separated from @option{-h} by a space.
2716 @item -i[@var{out-tabchar}[@var{out-tabwidth}]]
2717 @itemx --output-tabs[=@var{out-tabchar}[@var{out-tabwidth}]]
2719 @opindex --output-tabs
2721 Replace spaces with @var{tab}s on output. Optional argument @var{out-tabchar}
2722 is the output tab character (default is the TAB character). Second optional
2723 argument @var{out-tabwidth} is the output tab character's width (default
2729 @opindex --join-lines
2730 Merge lines of full length. Used together with the column options
2731 @option{-@var{column}}, @option{-a -@var{column}} or @option{-m}. Turns off
2732 @option{-W/-w} line truncation;
2733 no column alignment used; may be used with
2734 @option{--sep-string[=@var{string}]}. @option{-J} has been introduced
2735 (together with @option{-W} and @option{--sep-string})
2736 to disentangle the old (POSIX-compliant) options @option{-w} and
2737 @option{-s} along with the three column options.
2740 @item -l @var{page_length}
2741 @itemx --length=@var{page_length}
2744 Set the page length to @var{page_length} (default 66) lines, including
2745 the lines of the header [and the footer]. If @var{page_length} is less
2746 than or equal to 10, the header and footer are omitted, as if the
2747 @option{-t} option had been given.
2753 Merge and print all @var{file}s in parallel, one in each column. If a
2754 line is too long to fit in a column, it is truncated, unless the @option{-J}
2755 option is used. @option{--sep-string[=@var{string}]} may be used.
2757 some @var{file}s (form feeds set) produce empty columns, still marked
2758 by @var{string}. The result is a continuous line numbering and column
2759 marking throughout the whole merged file. Completely empty merged pages
2760 show no separators or line numbers. The default header becomes
2761 @samp{@var{date} @var{page}} with spaces inserted in the middle; this
2762 may be used with the @option{-h} or @option{--header} option to fill up
2763 the middle blank part.
2765 @item -n[@var{number-separator}[@var{digits}]]
2766 @itemx --number-lines[=@var{number-separator}[@var{digits}]]
2768 @opindex --number-lines
2769 Provide @var{digits} digit line numbering (default for @var{digits} is
2770 5). With multicolumn output the number occupies the first @var{digits}
2771 column positions of each text column or only each line of @option{-m}
2772 output. With single column output the number precedes each line just as
2773 @option{-m} does. Default counting of the line numbers starts with the
2774 first line of the input file (not the first line printed, compare the
2775 @option{--page} option and @option{-N} option).
2776 Optional argument @var{number-separator} is the character appended to
2777 the line number to separate it from the text followed. The default
2778 separator is the TAB character. In a strict sense a TAB is always
2779 printed with single column output only. The TAB width varies
2780 with the TAB position, e.g., with the left @var{margin} specified
2781 by @option{-o} option. With multicolumn output priority is given to
2782 @samp{equal width of output columns} (a POSIX specification).
2783 The TAB width is fixed to the value of the first column and does
2784 not change with different values of left @var{margin}. That means a
2785 fixed number of spaces is always printed in the place of the
2786 @var{number-separator} TAB@. The tabification depends upon the output
2789 @item -N @var{line_number}
2790 @itemx --first-line-number=@var{line_number}
2792 @opindex --first-line-number
2793 Start line counting with the number @var{line_number} at first line of
2794 first page printed (in most cases not the first line of the input file).
2796 @item -o @var{margin}
2797 @itemx --indent=@var{margin}
2800 @cindex indenting lines
2802 Indent each line with a margin @var{margin} spaces wide (default is zero).
2803 The total page width is the size of the margin plus the @var{page_width}
2804 set with the @option{-W/-w} option. A limited overflow may occur with
2805 numbered single column output (compare @option{-n} option).
2808 @itemx --no-file-warnings
2810 @opindex --no-file-warnings
2811 Do not print a warning message when an argument @var{file} cannot be
2812 opened. (The exit status will still be nonzero, however.)
2814 @item -s[@var{char}]
2815 @itemx --separator[=@var{char}]
2817 @opindex --separator
2818 Separate columns by a single character @var{char}. The default for
2819 @var{char} is the TAB character without @option{-w} and @samp{no
2820 character} with @option{-w}. Without @option{-s} the default separator
2821 @samp{space} is set. @option{-s[char]} turns off line truncation of all
2822 three column options (@option{-COLUMN}|@option{-a -COLUMN}|@option{-m}) unless
2823 @option{-w} is set. This is a POSIX-compliant formulation.
2826 @item -S[@var{string}]
2827 @itemx --sep-string[=@var{string}]
2829 @opindex --sep-string
2830 Use @var{string} to separate output columns. The @option{-S} option doesn't
2831 affect the @option{-W/-w} option, unlike the @option{-s} option which does. It
2832 does not affect line truncation or column alignment.
2833 Without @option{-S}, and with @option{-J}, @command{pr} uses the default output
2835 Without @option{-S} or @option{-J}, @command{pr} uses a @samp{space}
2836 (same as @option{-S"@w{ }"}).
2837 If no @samp{@var{string}} argument is specified, @samp{""} is assumed.
2840 @itemx --omit-header
2842 @opindex --omit-header
2843 Do not print the usual header [and footer] on each page, and do not fill
2844 out the bottom of pages (with blank lines or a form feed). No page
2845 structure is produced, but form feeds set in the input files are retained.
2846 The predefined pagination is not changed. @option{-t} or @option{-T} may be
2847 useful together with other options; e.g.: @option{-t -e4}, expand TAB characters
2848 in the input file to 4 spaces but don't make any other changes. Use of
2849 @option{-t} overrides @option{-h}.
2852 @itemx --omit-pagination
2854 @opindex --omit-pagination
2855 Do not print header [and footer]. In addition eliminate all form feeds
2856 set in the input files.
2859 @itemx --show-nonprinting
2861 @opindex --show-nonprinting
2862 Print nonprinting characters in octal backslash notation.
2864 @item -w @var{page_width}
2865 @itemx --width=@var{page_width}
2868 Set page width to @var{page_width} characters for multiple text-column
2869 output only (default for @var{page_width} is 72). @option{-s[CHAR]} turns
2870 off the default page width and any line truncation and column alignment.
2871 Lines of full length are merged, regardless of the column options
2872 set. No @var{page_width} setting is possible with single column output.
2873 A POSIX-compliant formulation.
2875 @item -W @var{page_width}
2876 @itemx --page_width=@var{page_width}
2878 @opindex --page_width
2879 Set the page width to @var{page_width} characters. That's valid with and
2880 without a column option. Text lines are truncated, unless @option{-J}
2881 is used. Together with one of the three column options
2882 (@option{-@var{column}}, @option{-a -@var{column}} or @option{-m}) column
2883 alignment is always used. The separator options @option{-S} or @option{-s}
2884 don't affect the @option{-W} option. Default is 72 characters. Without
2885 @option{-W @var{page_width}} and without any of the column options NO line
2886 truncation is used (defined to keep downward compatibility and to meet
2887 most frequent tasks). That's equivalent to @option{-W 72 -J}@. The header
2888 line is never truncated.
2895 @node fold invocation
2896 @section @command{fold}: Wrap input lines to fit in specified width
2899 @cindex wrapping long input lines
2900 @cindex folding long input lines
2902 @command{fold} writes each @var{file} (@option{-} means standard input), or
2903 standard input if none are given, to standard output, breaking long
2907 fold [@var{option}]@dots{} [@var{file}]@dots{}
2910 By default, @command{fold} breaks lines wider than 80 columns. The output
2911 is split into as many lines as necessary.
2913 @cindex screen columns
2914 @command{fold} counts screen columns by default; thus, a tab may count more
2915 than one column, backspace decreases the column count, and carriage
2916 return sets the column to zero.
2918 The program accepts the following options. Also see @ref{Common options}.
2926 Count bytes rather than columns, so that tabs, backspaces, and carriage
2927 returns are each counted as taking up one column, just like other
2934 Break at word boundaries: the line is broken after the last blank before
2935 the maximum line length. If the line contains no such blanks, the line
2936 is broken at the maximum line length as usual.
2938 @item -w @var{width}
2939 @itemx --width=@var{width}
2942 Use a maximum line length of @var{width} columns instead of 80.
2944 For compatibility @command{fold} supports an obsolete option syntax
2945 @option{-@var{width}}. New scripts should use @option{-w @var{width}}
2953 @node Output of parts of files
2954 @chapter Output of parts of files
2956 @cindex output of parts of files
2957 @cindex parts of files, output of
2959 These commands output pieces of the input.
2962 * head invocation:: Output the first part of files.
2963 * tail invocation:: Output the last part of files.
2964 * split invocation:: Split a file into pieces.
2965 * csplit invocation:: Split a file into context-determined pieces.
2968 @node head invocation
2969 @section @command{head}: Output the first part of files
2972 @cindex initial part of files, outputting
2973 @cindex first part of files, outputting
2975 @command{head} prints the first part (10 lines by default) of each
2976 @var{file}; it reads from standard input if no files are given or
2977 when given a @var{file} of @option{-}. Synopsis:
2980 head [@var{option}]@dots{} [@var{file}]@dots{}
2983 If more than one @var{file} is specified, @command{head} prints a
2984 one-line header consisting of:
2987 ==> @var{file name} <==
2991 before the output for each @var{file}.
2993 The program accepts the following options. Also see @ref{Common options}.
2998 @itemx --bytes=@var{k}
3001 Print the first @var{k} bytes, instead of initial lines.
3002 However, if @var{k} starts with a @samp{-},
3003 print all but the last @var{k} bytes of each file.
3004 @multiplierSuffixes{k}
3007 @itemx --lines=@var{k}
3010 Output the first @var{k} lines.
3011 However, if @var{k} starts with a @samp{-},
3012 print all but the last @var{k} lines of each file.
3013 Size multiplier suffixes are the same as with the @option{-c} option.
3021 Never print file name headers.
3027 Always print file name headers.
3031 For compatibility @command{head} also supports an obsolete option syntax
3032 @option{-@var{count}@var{options}}, which is recognized only if it is
3033 specified first. @var{count} is a decimal number optionally followed
3034 by a size letter (@samp{b}, @samp{k}, @samp{m}) as in @option{-c}, or
3035 @samp{l} to mean count by lines, or other option letters (@samp{cqv}).
3036 Scripts intended for standard hosts should use @option{-c @var{count}}
3037 or @option{-n @var{count}} instead. If your script must also run on
3038 hosts that support only the obsolete syntax, it is usually simpler to
3039 avoid @command{head}, e.g., by using @samp{sed 5q} instead of
3045 @node tail invocation
3046 @section @command{tail}: Output the last part of files
3049 @cindex last part of files, outputting
3051 @command{tail} prints the last part (10 lines by default) of each
3052 @var{file}; it reads from standard input if no files are given or
3053 when given a @var{file} of @samp{-}. Synopsis:
3056 tail [@var{option}]@dots{} [@var{file}]@dots{}
3059 If more than one @var{file} is specified, @command{tail} prints a
3060 one-line header consisting of:
3063 ==> @var{file name} <==
3067 before the output for each @var{file}.
3069 @cindex BSD @command{tail}
3070 GNU @command{tail} can output any amount of data (some other versions of
3071 @command{tail} cannot). It also has no @option{-r} option (print in
3072 reverse), since reversing a file is really a different job from printing
3073 the end of a file; BSD @command{tail} (which is the one with @option{-r}) can
3074 only reverse files that are at most as large as its buffer, which is
3075 typically 32 KiB@. A more reliable and versatile way to reverse files is
3076 the GNU @command{tac} command.
3078 The program accepts the following options. Also see @ref{Common options}.
3083 @itemx --bytes=@var{k}
3086 Output the last @var{k} bytes, instead of final lines.
3087 However, if @var{k} starts with a @samp{+}, start printing with the
3088 @var{k}th byte from the start of each file, instead of from the end.
3089 @multiplierSuffixes{k}
3092 @itemx --follow[=@var{how}]
3095 @cindex growing files
3096 @vindex name @r{follow option}
3097 @vindex descriptor @r{follow option}
3098 Loop forever trying to read more characters at the end of the file,
3099 presumably because the file is growing.
3100 If more than one file is given, @command{tail} prints a header whenever it
3101 gets output from a different file, to indicate which file that output is
3104 There are two ways to specify how you'd like to track files with this option,
3105 but that difference is noticeable only when a followed file is removed or
3107 If you'd like to continue to track the end of a growing file even after
3108 it has been unlinked, use @option{--follow=descriptor}. This is the default
3109 behavior, but it is not useful if you're tracking a log file that may be
3110 rotated (removed or renamed, then reopened). In that case, use
3111 @option{--follow=name} to track the named file, perhaps by reopening it
3112 periodically to see if it has been removed and recreated by some other program.
3113 Note that the inotify-based implementation handles this case without
3114 the need for any periodic reopening.
3116 No matter which method you use, if the tracked file is determined to have
3117 shrunk, @command{tail} prints a message saying the file has been truncated
3118 and resumes tracking the end of the file from the newly-determined endpoint.
3120 When a file is removed, @command{tail}'s behavior depends on whether it is
3121 following the name or the descriptor. When following by name, tail can
3122 detect that a file has been removed and gives a message to that effect,
3123 and if @option{--retry} has been specified it will continue checking
3124 periodically to see if the file reappears.
3125 When following a descriptor, tail does not detect that the file has
3126 been unlinked or renamed and issues no message; even though the file
3127 may no longer be accessible via its original name, it may still be
3130 The option values @samp{descriptor} and @samp{name} may be specified only
3131 with the long form of the option, not with @option{-f}.
3133 The @option{-f} option is ignored if
3134 no @var{file} operand is specified and standard input is a FIFO or a pipe.
3135 Likewise, the @option{-f} option has no effect for any
3136 operand specified as @samp{-}, when standard input is a FIFO or a pipe.
3138 With kernel inotify support, output is triggered by file changes
3139 and is generally very prompt.
3140 Otherwise, @command{tail} sleeps for one second between checks---
3141 use @option{--sleep-interval=@var{n}} to change that default---which can
3142 make the output appear slightly less responsive or bursty.
3143 When using tail without inotify support, you can make it more responsive
3144 by using a sub-second sleep interval, e.g., via an alias like this:
3147 alias tail='tail -s.1'
3152 This option is the same as @option{--follow=name --retry}. That is, tail
3153 will attempt to reopen a file when it is removed. Should this fail, tail
3154 will keep trying until it becomes accessible again.
3158 Indefinitely try to open the specified file.
3159 This option is useful mainly when following (and otherwise issues a warning).
3161 When following by file descriptor (i.e., with @option{--follow=descriptor}),
3162 this option only affects the initial open of the file, as after a successful
3163 open, @command{tail} will start following the file descriptor.
3165 When following by name (i.e., with @option{--follow=name}), @command{tail}
3166 infinitely retries to re-open the given files until killed.
3168 Without this option, when @command{tail} encounters a file that doesn't
3169 exist or is otherwise inaccessible, it reports that fact and
3170 never checks it again.
3172 @item --sleep-interval=@var{number}
3173 @opindex --sleep-interval
3174 Change the number of seconds to wait between iterations (the default is 1.0).
3175 During one iteration, every specified file is checked to see if it has
3177 Historical implementations of @command{tail} have required that
3178 @var{number} be an integer. However, GNU @command{tail} accepts
3179 an arbitrary floating point number. @xref{Floating point}.
3180 When @command{tail} uses inotify, this polling-related option
3181 is usually ignored. However, if you also specify @option{--pid=@var{p}},
3182 @command{tail} checks whether process @var{p} is alive at least
3183 every @var{number} seconds.
3185 @item --pid=@var{pid}
3187 When following by name or by descriptor, you may specify the process ID,
3188 @var{pid}, of the sole writer of all @var{file} arguments. Then, shortly
3189 after that process terminates, tail will also terminate. This will
3190 work properly only if the writer and the tailing process are running on
3191 the same machine. For example, to save the output of a build in a file
3192 and to watch the file grow, if you invoke @command{make} and @command{tail}
3193 like this then the tail process will stop when your build completes.
3194 Without this option, you would have had to kill the @code{tail -f}
3198 $ make >& makerr & tail --pid=$! -f makerr
3201 If you specify a @var{pid} that is not in use or that does not correspond
3202 to the process that is writing to the tailed files, then @command{tail}
3203 may terminate long before any @var{file}s stop growing or it may not
3204 terminate until long after the real writer has terminated.
3205 Note that @option{--pid} cannot be supported on some systems; @command{tail}
3206 will print a warning if this is the case.
3208 @item --max-unchanged-stats=@var{n}
3209 @opindex --max-unchanged-stats
3210 When tailing a file by name, if there have been @var{n} (default
3211 n=@value{DEFAULT_MAX_N_UNCHANGED_STATS_BETWEEN_OPENS}) consecutive
3212 iterations for which the file has not changed, then
3213 @code{open}/@code{fstat} the file to determine if that file name is
3214 still associated with the same device/inode-number pair as before.
3215 When following a log file that is rotated, this is approximately the
3216 number of seconds between when tail prints the last pre-rotation lines
3217 and when it prints the lines that have accumulated in the new log file.
3218 This option is meaningful only when polling (i.e., without inotify)
3219 and when following by name.
3222 @itemx --lines=@var{k}
3225 Output the last @var{k} lines.
3226 However, if @var{k} starts with a @samp{+}, start printing with the
3227 @var{k}th line from the start of each file, instead of from the end.
3228 Size multiplier suffixes are the same as with the @option{-c} option.
3236 Never print file name headers.
3242 Always print file name headers.
3246 For compatibility @command{tail} also supports an obsolete usage
3247 @samp{tail -[@var{count}][bcl][f] [@var{file}]}, which is recognized
3248 only if it does not conflict with the usage described
3249 above. This obsolete form uses exactly one option and at most one
3250 file. In the option, @var{count} is an optional decimal number optionally
3251 followed by a size letter (@samp{b}, @samp{c}, @samp{l}) to mean count
3252 by 512-byte blocks, bytes, or lines, optionally followed by @samp{f}
3253 which has the same meaning as @option{-f}.
3255 @vindex _POSIX2_VERSION
3256 On older systems, the leading @samp{-} can be replaced by @samp{+} in
3257 the obsolete option syntax with the same meaning as in counts, and
3258 obsolete usage overrides normal usage when the two conflict.
3259 This obsolete behavior can be enabled or disabled with the
3260 @env{_POSIX2_VERSION} environment variable (@pxref{Standards
3263 Scripts intended for use on standard hosts should avoid obsolete
3264 syntax and should use @option{-c @var{count}[b]}, @option{-n
3265 @var{count}}, and/or @option{-f} instead. If your script must also
3266 run on hosts that support only the obsolete syntax, you can often
3267 rewrite it to avoid problematic usages, e.g., by using @samp{sed -n
3268 '$p'} rather than @samp{tail -1}. If that's not possible, the script
3269 can use a test like @samp{if tail -c +1 </dev/null >/dev/null 2>&1;
3270 then @dots{}} to decide which syntax to use.
3272 Even if your script assumes the standard behavior, you should still
3273 beware usages whose behaviors differ depending on the POSIX
3274 version. For example, avoid @samp{tail - main.c}, since it might be
3275 interpreted as either @samp{tail main.c} or as @samp{tail -- -
3276 main.c}; avoid @samp{tail -c 4}, since it might mean either @samp{tail
3277 -c4} or @samp{tail -c 10 4}; and avoid @samp{tail +4}, since it might
3278 mean either @samp{tail ./+4} or @samp{tail -n +4}.
3283 @node split invocation
3284 @section @command{split}: Split a file into pieces.
3287 @cindex splitting a file into pieces
3288 @cindex pieces, splitting a file into
3290 @command{split} creates output files containing consecutive or interleaved
3291 sections of @var{input} (standard input if none is given or @var{input}
3292 is @samp{-}). Synopsis:
3295 split [@var{option}] [@var{input} [@var{prefix}]]
3298 By default, @command{split} puts 1000 lines of @var{input} (or whatever is
3299 left over for the last section), into each output file.
3301 @cindex output file name prefix
3302 The output files' names consist of @var{prefix} (@samp{x} by default)
3303 followed by a group of characters (@samp{aa}, @samp{ab}, @dots{} by
3304 default), such that concatenating the output files in traditional
3305 sorted order by file name produces the original input file (except
3306 @option{-nr/@var{n}}). By default split will initially create files
3307 with two generated suffix characters, and will increase this width by two
3308 when the next most significant position reaches the last character.
3309 (@samp{yz}, @samp{zaaa}, @samp{zaab}, @dots{}). In this way an arbitrary
3310 number of output files are supported, which sort as described above,
3311 even in the presence of an @option{--additional-suffix} option.
3312 If the @option{-a} option is specified and the output file names are
3313 exhausted, @command{split} reports an error without deleting the
3314 output files that it did create.
3316 The program accepts the following options. Also see @ref{Common options}.
3320 @item -l @var{lines}
3321 @itemx --lines=@var{lines}
3324 Put @var{lines} lines of @var{input} into each output file.
3326 For compatibility @command{split} also supports an obsolete
3327 option syntax @option{-@var{lines}}. New scripts should use
3328 @option{-l @var{lines}} instead.
3331 @itemx --bytes=@var{size}
3334 Put @var{size} bytes of @var{input} into each output file.
3335 @multiplierSuffixes{size}
3338 @itemx --line-bytes=@var{size}
3340 @opindex --line-bytes
3341 Put into each output file as many complete lines of @var{input} as
3342 possible without exceeding @var{size} bytes. Individual lines longer than
3343 @var{size} bytes are broken into multiple files.
3344 @var{size} has the same format as for the @option{--bytes} option.
3346 @item --filter=@var{command}
3348 With this option, rather than simply writing to each output file,
3349 write through a pipe to the specified shell @var{command} for each output file.
3350 @var{command} should use the $FILE environment variable, which is set
3351 to a different output file name for each invocation of the command.
3352 For example, imagine that you have a 1TiB compressed file
3353 that, if uncompressed, would be too large to reside on disk,
3354 yet you must split it into individually-compressed pieces
3355 of a more manageable size.
3356 To do that, you might run this command:
3359 xz -dc BIG.xz | split -b200G --filter='xz > $FILE.xz' - big-
3362 Assuming a 10:1 compression ratio, that would create about fifty 20GiB files
3363 with names @file{big-aa.xz}, @file{big-ab.xz}, @file{big-ac.xz}, etc.
3365 @item -n @var{chunks}
3366 @itemx --number=@var{chunks}
3370 Split @var{input} to @var{chunks} output files where @var{chunks} may be:
3373 @var{n} generate @var{n} files based on current size of @var{input}
3374 @var{k}/@var{n} only output @var{k}th of @var{n} to stdout
3375 l/@var{n} generate @var{n} files without splitting lines
3376 l/@var{k}/@var{n} likewise but only output @var{k}th of @var{n} to stdout
3377 r/@var{n} like @samp{l} but use round robin distribution
3378 r/@var{k}/@var{n} likewise but only output @var{k}th of @var{n} to stdout
3381 Any excess bytes remaining after dividing the @var{input}
3382 into @var{n} chunks, are assigned to the last chunk.
3383 Any excess bytes appearing after the initial calculation are discarded
3384 (except when using @samp{r} mode).
3386 All @var{n} files are created even if there are fewer than @var{n} lines,
3387 or the @var{input} is truncated.
3389 For @samp{l} mode, chunks are approximately @var{input} size / @var{n}.
3390 The @var{input} is partitioned into @var{n} equal sized portions, with
3391 the last assigned any excess. If a line @emph{starts} within a partition
3392 it is written completely to the corresponding file. Since lines
3393 are not split even if they overlap a partition, the files written
3394 can be larger or smaller than the partition size, and even empty
3395 if a line is so long as to completely overlap the partition.
3397 For @samp{r} mode, the size of @var{input} is irrelevant,
3398 and so can be a pipe for example.
3400 @item -a @var{length}
3401 @itemx --suffix-length=@var{length}
3403 @opindex --suffix-length
3404 Use suffixes of length @var{length}. If a @var{length} of 0 is specified,
3405 this is the same as if (any previous) @option{-a} was not specified, and
3406 thus enables the default behavior, which starts the suffix length at 2,
3407 and unless @option{-n} or @option{--numeric-suffixes=@var{from}} is
3408 specified, will auto increase the length by 2 as required.
3411 @itemx --numeric-suffixes[=@var{from}]
3413 @opindex --numeric-suffixes
3414 Use digits in suffixes rather than lower-case letters. The numerical
3415 suffix counts from @var{from} if specified, 0 otherwise.
3416 Note specifying a @var{from} value also disables the default
3417 auto suffix length expansion described above, and so you may also
3418 want to specify @option{-a} to allow suffixes beyond @samp{99}.
3420 @item --additional-suffix=@var{suffix}
3421 @opindex --additional-suffix
3422 Append an additional @var{suffix} to output file names. @var{suffix}
3423 must not contain slash.
3426 @itemx --elide-empty-files
3428 @opindex --elide-empty-files
3429 Suppress the generation of zero-length output files. This can happen
3430 with the @option{--number} option if a file is (truncated to be) shorter
3431 than the number requested, or if a line is so long as to completely
3432 span a chunk. The output file sequence numbers, always run consecutively
3433 even when this option is specified.
3438 @opindex --unbuffered
3439 Immediately copy input to output in @option{--number r/@dots{}} mode,
3440 which is a much slower mode of operation.
3444 Write a diagnostic just before each output file is opened.
3450 Here are a few examples to illustrate how the
3451 @option{--number} (@option{-n}) option works:
3453 Notice how, by default, one line may be split onto two or more:
3456 $ seq -w 6 10 > k; split -n3 k; head xa?
3469 Use the "l/" modifier to suppress that:
3472 $ seq -w 6 10 > k; split -nl/3 k; head xa?
3485 Use the "r/" modifier to distribute lines in a round-robin fashion:
3488 $ seq -w 6 10 > k; split -nr/3 k; head xa?
3501 You can also extract just the Kth chunk.
3502 This extracts and prints just the 7th "chunk" of 33:
3505 $ seq 100 > k; split -nl/7/33 k
3512 @node csplit invocation
3513 @section @command{csplit}: Split a file into context-determined pieces
3516 @cindex context splitting
3517 @cindex splitting a file into pieces by context
3519 @command{csplit} creates zero or more output files containing sections of
3520 @var{input} (standard input if @var{input} is @samp{-}). Synopsis:
3523 csplit [@var{option}]@dots{} @var{input} @var{pattern}@dots{}
3526 The contents of the output files are determined by the @var{pattern}
3527 arguments, as detailed below. An error occurs if a @var{pattern}
3528 argument refers to a nonexistent line of the input file (e.g., if no
3529 remaining line matches a given regular expression). After every
3530 @var{pattern} has been matched, any remaining input is copied into one
3533 By default, @command{csplit} prints the number of bytes written to each
3534 output file after it has been created.
3536 The types of pattern arguments are:
3541 Create an output file containing the input up to but not including line
3542 @var{n} (a positive integer). If followed by a repeat count, also
3543 create an output file containing the next @var{n} lines of the input
3544 file once for each repeat.
3546 @item /@var{regexp}/[@var{offset}]
3547 Create an output file containing the current line up to (but not
3548 including) the next line of the input file that contains a match for
3549 @var{regexp}. The optional @var{offset} is an integer.
3550 If it is given, the input up to (but not including) the
3551 matching line plus or minus @var{offset} is put into the output file,
3552 and the line after that begins the next section of input.
3554 @item %@var{regexp}%[@var{offset}]
3555 Like the previous type, except that it does not create an output
3556 file, so that section of the input file is effectively ignored.
3558 @item @{@var{repeat-count}@}
3559 Repeat the previous pattern @var{repeat-count} additional
3560 times. The @var{repeat-count} can either be a positive integer or an
3561 asterisk, meaning repeat as many times as necessary until the input is
3566 The output files' names consist of a prefix (@samp{xx} by default)
3567 followed by a suffix. By default, the suffix is an ascending sequence
3568 of two-digit decimal numbers from @samp{00} to @samp{99}. In any case,
3569 concatenating the output files in sorted order by file name produces the
3570 original input file.
3572 By default, if @command{csplit} encounters an error or receives a hangup,
3573 interrupt, quit, or terminate signal, it removes any output files
3574 that it has created so far before it exits.
3576 The program accepts the following options. Also see @ref{Common options}.
3580 @item -f @var{prefix}
3581 @itemx --prefix=@var{prefix}
3584 @cindex output file name prefix
3585 Use @var{prefix} as the output file name prefix.
3587 @item -b @var{suffix}
3588 @itemx --suffix=@var{suffix}
3591 @cindex output file name suffix
3592 Use @var{suffix} as the output file name suffix. When this option is
3593 specified, the suffix string must include exactly one
3594 @code{printf(3)}-style conversion specification, possibly including
3595 format specification flags, a field width, a precision specifications,
3596 or all of these kinds of modifiers. The format letter must convert a
3597 binary unsigned integer argument to readable form. The format letters
3598 @samp{d} and @samp{i} are aliases for @samp{u}, and the
3599 @samp{u}, @samp{o}, @samp{x}, and @samp{X} conversions are allowed. The
3600 entire @var{suffix} is given (with the current output file number) to
3601 @code{sprintf(3)} to form the file name suffixes for each of the
3602 individual output files in turn. If this option is used, the
3603 @option{--digits} option is ignored.
3605 @item -n @var{digits}
3606 @itemx --digits=@var{digits}
3609 Use output file names containing numbers that are @var{digits} digits
3610 long instead of the default 2.
3615 @opindex --keep-files
3616 Do not remove output files when errors are encountered.
3618 @item --suppress-matched
3619 @opindex --suppress-matched
3620 Do not output lines matching the specified @var{pattern}.
3621 I.E. suppress the boundary line from the start of the second
3622 and subsequent splits.
3625 @itemx --elide-empty-files
3627 @opindex --elide-empty-files
3628 Suppress the generation of zero-length output files. (In cases where
3629 the section delimiters of the input file are supposed to mark the first
3630 lines of each of the sections, the first output file will generally be a
3631 zero-length file unless you use this option.) The output file sequence
3632 numbers always run consecutively starting from 0, even when this option
3643 Do not print counts of output file sizes.
3649 Here is an example of its usage.
3650 First, create an empty directory for the exercise,
3657 Now, split the sequence of 1..14 on lines that end with 0 or 5:
3660 $ seq 14 | csplit - '/[05]$/' '@{*@}'
3666 Each number printed above is the size of an output
3667 file that csplit has just created.
3668 List the names of those output files:
3675 Use @command{head} to show their contents:
3700 Example of splitting input by empty lines:
3703 $ csplit --suppress-matched @var{input.txt} '/^$/' '@{*@}'
3707 @c TODO: "uniq" already supports "--group".
3708 @c when it gets the "--key" option, uncomment this example.
3710 @c Example of splitting input file, based on the value of column 2:
3713 @c $ cat @var{input.txt} |
3715 @c uniq --group -k2,2 |
3716 @c csplit -m '/^$/' '@{*@}'
3719 @node Summarizing files
3720 @chapter Summarizing files
3722 @cindex summarizing files
3724 These commands generate just a few numbers representing entire
3728 * wc invocation:: Print newline, word, and byte counts.
3729 * sum invocation:: Print checksum and block counts.
3730 * cksum invocation:: Print CRC checksum and byte counts.
3731 * md5sum invocation:: Print or check MD5 digests.
3732 * sha1sum invocation:: Print or check SHA-1 digests.
3733 * sha2 utilities:: Print or check SHA-2 digests.
3738 @section @command{wc}: Print newline, word, and byte counts
3742 @cindex character count
3746 @command{wc} counts the number of bytes, characters, whitespace-separated
3747 words, and newlines in each given @var{file}, or standard input if none
3748 are given or for a @var{file} of @samp{-}. Synopsis:
3751 wc [@var{option}]@dots{} [@var{file}]@dots{}
3754 @cindex total counts
3755 @command{wc} prints one line of counts for each file, and if the file was
3756 given as an argument, it prints the file name following the counts. If
3757 more than one @var{file} is given, @command{wc} prints a final line
3758 containing the cumulative counts, with the file name @file{total}. The
3759 counts are printed in this order: newlines, words, characters, bytes,
3760 maximum line length.
3761 Each count is printed right-justified in a field with at least one
3762 space between fields so that the numbers and file names normally line
3763 up nicely in columns. The width of the count fields varies depending
3764 on the inputs, so you should not depend on a particular field width.
3765 However, as a GNU extension, if only one count is printed,
3766 it is guaranteed to be printed without leading spaces.
3768 By default, @command{wc} prints three counts: the newline, words, and byte
3769 counts. Options can specify that only certain counts be printed.
3770 Options do not undo others previously given, so
3777 prints both the byte counts and the word counts.
3779 With the @option{--max-line-length} option, @command{wc} prints the length
3780 of the longest line per file, and if there is more than one file it
3781 prints the maximum (not the sum) of those lengths. The line lengths here
3782 are measured in screen columns, according to the current locale and
3783 assuming tab positions in every 8th column.
3785 The program accepts the following options. Also see @ref{Common options}.
3793 Print only the byte counts.
3799 Print only the character counts.
3805 Print only the word counts.
3811 Print only the newline counts.
3814 @itemx --max-line-length
3816 @opindex --max-line-length
3817 Print only the maximum line lengths.
3819 @macro filesZeroFromOption{cmd,withTotalOption,subListOutput}
3820 @item --files0-from=@var{file}
3821 @opindex --files0-from=@var{file}
3822 @c This is commented out to avoid a texi2dvi failure.
3823 @c texi2dvi (GNU Texinfo 4.11) 1.104
3824 @c @cindex including files from @command{\cmd\}
3825 Disallow processing files named on the command line, and instead process
3826 those named in file @var{file}; each name being terminated by a zero byte
3828 This is useful \withTotalOption\
3829 when the list of file names is so long that it may exceed a command line
3831 In such cases, running @command{\cmd\} via @command{xargs} is undesirable
3832 because it splits the list into pieces and makes @command{\cmd\} print
3833 \subListOutput\ for each sublist rather than for the entire list.
3834 One way to produce a list of ASCII NUL terminated file
3836 @command{find}, using its @option{-print0} predicate.
3837 If @var{file} is @samp{-} then the ASCII NUL terminated
3838 file names are read from standard input.
3840 @filesZeroFromOption{wc,,a total}
3842 For example, to find the length of the longest line in any @file{.c} or
3843 @file{.h} file in the current hierarchy, do this:
3846 find . -name '*.[ch]' -print0 |
3847 wc -L --files0-from=- | tail -n1
3855 @node sum invocation
3856 @section @command{sum}: Print checksum and block counts
3859 @cindex 16-bit checksum
3860 @cindex checksum, 16-bit
3862 @command{sum} computes a 16-bit checksum for each given @var{file}, or
3863 standard input if none are given or for a @var{file} of @samp{-}. Synopsis:
3866 sum [@var{option}]@dots{} [@var{file}]@dots{}
3869 @command{sum} prints the checksum for each @var{file} followed by the
3870 number of blocks in the file (rounded up). If more than one @var{file}
3871 is given, file names are also printed (by default). (With the
3872 @option{--sysv} option, corresponding file names are printed when there is
3873 at least one file argument.)
3875 By default, GNU @command{sum} computes checksums using an algorithm
3876 compatible with BSD @command{sum} and prints file sizes in units of
3879 The program accepts the following options. Also see @ref{Common options}.
3885 @cindex BSD @command{sum}
3886 Use the default (BSD compatible) algorithm. This option is included for
3887 compatibility with the System V @command{sum}. Unless @option{-s} was also
3888 given, it has no effect.
3894 @cindex System V @command{sum}
3895 Compute checksums using an algorithm compatible with System V
3896 @command{sum}'s default, and print file sizes in units of 512-byte blocks.
3900 @command{sum} is provided for compatibility; the @command{cksum} program (see
3901 next section) is preferable in new applications.
3906 @node cksum invocation
3907 @section @command{cksum}: Print CRC checksum and byte counts
3910 @cindex cyclic redundancy check
3911 @cindex CRC checksum
3913 @command{cksum} computes a cyclic redundancy check (CRC) checksum for each
3914 given @var{file}, or standard input if none are given or for a
3915 @var{file} of @samp{-}. Synopsis:
3918 cksum [@var{option}]@dots{} [@var{file}]@dots{}
3921 @command{cksum} prints the CRC checksum for each file along with the number
3922 of bytes in the file, and the file name unless no arguments were given.
3924 @command{cksum} is typically used to ensure that files
3925 transferred by unreliable means (e.g., netnews) have not been corrupted,
3926 by comparing the @command{cksum} output for the received files with the
3927 @command{cksum} output for the original files (typically given in the
3930 The CRC algorithm is specified by the POSIX standard. It is not
3931 compatible with the BSD or System V @command{sum} algorithms (see the
3932 previous section); it is more robust.
3934 The only options are @option{--help} and @option{--version}. @xref{Common
3940 @node md5sum invocation
3941 @section @command{md5sum}: Print or check MD5 digests
3945 @cindex 128-bit checksum
3946 @cindex checksum, 128-bit
3947 @cindex fingerprint, 128-bit
3948 @cindex message-digest, 128-bit
3950 @command{md5sum} computes a 128-bit checksum (or @dfn{fingerprint} or
3951 @dfn{message-digest}) for each specified @var{file}.
3953 Note: The MD5 digest is more reliable than a simple CRC (provided by
3954 the @command{cksum} command) for detecting accidental file corruption,
3955 as the chances of accidentally having two files with identical MD5
3956 are vanishingly small. However, it should not be considered secure
3957 against malicious tampering: although finding a file with a given MD5
3958 fingerprint is considered infeasible at the moment, it is known how
3959 to modify certain files, including digital certificates, so that they
3960 appear valid when signed with an MD5 digest.
3961 For more secure hashes, consider using SHA-2. @xref{sha2 utilities}.
3963 If a @var{file} is specified as @samp{-} or if no files are given
3964 @command{md5sum} computes the checksum for the standard input.
3965 @command{md5sum} can also determine whether a file and checksum are
3966 consistent. Synopsis:
3969 md5sum [@var{option}]@dots{} [@var{file}]@dots{}
3972 For each @var{file}, @samp{md5sum} outputs the MD5 checksum, a flag
3973 indicating binary or text input mode, and the file name.
3974 If @var{file} contains a backslash or newline, the
3975 line is started with a backslash, and each problematic character in
3976 the file name is escaped with a backslash, making the output
3977 unambiguous even in the presence of arbitrary file names.
3978 If @var{file} is omitted or specified as @samp{-}, standard input is read.
3980 The program accepts the following options. Also see @ref{Common options}.
3988 @cindex binary input files
3989 Treat each input file as binary, by reading it in binary mode and
3990 outputting a @samp{*} flag. This is the inverse of @option{--text}.
3991 On systems like GNU that do not distinguish between binary
3992 and text files, this option merely flags each input mode as binary:
3993 the MD5 checksum is unaffected. This option is the default on systems
3994 like MS-DOS that distinguish between binary and text files, except
3995 for reading standard input when standard input is a terminal.
3999 Read file names and checksum information (not data) from each
4000 @var{file} (or from stdin if no @var{file} was specified) and report
4001 whether the checksums match the contents of the named files.
4002 The input to this mode of @command{md5sum} is usually the output of
4003 a prior, checksum-generating run of @samp{md5sum}.
4004 Each valid line of input consists of an MD5 checksum, a binary/text
4005 flag, and then a file name.
4006 Binary mode is indicated with @samp{*}, text with @samp{ } (space).
4007 For each such line, @command{md5sum} reads the named file and computes its
4008 MD5 checksum. Then, if the computed message digest does not match the
4009 one on the line with the file name, the file is noted as having
4010 failed the test. Otherwise, the file passes the test.
4011 By default, for each valid line, one line is written to standard
4012 output indicating whether the named file passed the test.
4013 After all checks have been performed, if there were any failures,
4014 a warning is issued to standard error.
4015 Use the @option{--status} option to inhibit that output.
4016 If any listed file cannot be opened or read, if any valid line has
4017 an MD5 checksum inconsistent with the associated file, or if no valid
4018 line is found, @command{md5sum} exits with nonzero status. Otherwise,
4019 it exits successfully.
4023 @cindex verifying MD5 checksums
4024 This option is useful only when verifying checksums.
4025 When verifying checksums, don't generate an 'OK' message per successfully
4026 checked file. Files that fail the verification are reported in the
4027 default one-line-per-file format. If there is any checksum mismatch,
4028 print a warning summarizing the failures to standard error.
4032 @cindex verifying MD5 checksums
4033 This option is useful only when verifying checksums.
4034 When verifying checksums, don't generate the default one-line-per-file
4035 diagnostic and don't output the warning summarizing any failures.
4036 Failures to open or read a file still evoke individual diagnostics to
4038 If all listed files are readable and are consistent with the associated
4039 MD5 checksums, exit successfully. Otherwise exit with a status code
4040 indicating there was a failure.
4045 Output BSD style checksums, which indicate the checksum algorithm used.
4046 As a GNU extension, file names with problematic characters
4047 are escaped as described above, with the same escaping indicator of @samp{\}
4048 at the start of the line, being used.
4049 The @option{--tag} option implies binary mode, and is disallowed with
4050 @option{--text} mode as supporting that would unnecessarily complicate
4051 the output format, while providing little benefit.
4057 @cindex text input files
4058 Treat each input file as text, by reading it in text mode and
4059 outputting a @samp{ } flag. This is the inverse of @option{--binary}.
4060 This option is the default on systems like GNU that do not
4061 distinguish between binary and text files. On other systems, it is
4062 the default for reading standard input when standard input is a
4063 terminal. This mode is never defaulted to if @option{--tag} is used.
4069 @cindex verifying MD5 checksums
4070 When verifying checksums, warn about improperly formatted MD5 checksum lines.
4071 This option is useful only if all but a few lines in the checked input
4076 @cindex verifying MD5 checksums
4077 When verifying checksums,
4078 if one or more input line is invalid,
4079 exit nonzero after all warnings have been issued.
4086 @node sha1sum invocation
4087 @section @command{sha1sum}: Print or check SHA-1 digests
4091 @cindex 160-bit checksum
4092 @cindex checksum, 160-bit
4093 @cindex fingerprint, 160-bit
4094 @cindex message-digest, 160-bit
4096 @command{sha1sum} computes a 160-bit checksum for each specified
4097 @var{file}. The usage and options of this command are precisely the
4098 same as for @command{md5sum}. @xref{md5sum invocation}.
4100 Note: The SHA-1 digest is more secure than MD5, and no collisions of
4101 it are known (different files having the same fingerprint). However,
4102 it is known that they can be produced with considerable, but not
4103 unreasonable, resources. For this reason, it is generally considered
4104 that SHA-1 should be gradually phased out in favor of the more secure
4105 SHA-2 hash algorithms. @xref{sha2 utilities}.
4108 @node sha2 utilities
4109 @section sha2 utilities: Print or check SHA-2 digests
4116 @cindex 224-bit checksum
4117 @cindex 256-bit checksum
4118 @cindex 384-bit checksum
4119 @cindex 512-bit checksum
4120 @cindex checksum, 224-bit
4121 @cindex checksum, 256-bit
4122 @cindex checksum, 384-bit
4123 @cindex checksum, 512-bit
4124 @cindex fingerprint, 224-bit
4125 @cindex fingerprint, 256-bit
4126 @cindex fingerprint, 384-bit
4127 @cindex fingerprint, 512-bit
4128 @cindex message-digest, 224-bit
4129 @cindex message-digest, 256-bit
4130 @cindex message-digest, 384-bit
4131 @cindex message-digest, 512-bit
4133 The commands @command{sha224sum}, @command{sha256sum},
4134 @command{sha384sum} and @command{sha512sum} compute checksums of
4135 various lengths (respectively 224, 256, 384 and 512 bits),
4136 collectively known as the SHA-2 hashes. The usage and options of
4137 these commands are precisely the same as for @command{md5sum}.
4138 @xref{md5sum invocation}.
4140 Note: The SHA384 and SHA512 digests are considerably slower to
4141 compute, especially on 32-bit computers, than SHA224 or SHA256.
4144 @node Operating on sorted files
4145 @chapter Operating on sorted files
4147 @cindex operating on sorted files
4148 @cindex sorted files, operations on
4150 These commands work with (or produce) sorted files.
4153 * sort invocation:: Sort text files.
4154 * shuf invocation:: Shuffle text files.
4155 * uniq invocation:: Uniquify files.
4156 * comm invocation:: Compare two sorted files line by line.
4157 * ptx invocation:: Produce a permuted index of file contents.
4158 * tsort invocation:: Topological sort.
4162 @node sort invocation
4163 @section @command{sort}: Sort text files
4166 @cindex sorting files
4168 @command{sort} sorts, merges, or compares all the lines from the given
4169 files, or standard input if none are given or for a @var{file} of
4170 @samp{-}. By default, @command{sort} writes the results to standard
4174 sort [@var{option}]@dots{} [@var{file}]@dots{}
4177 @command{sort} has three modes of operation: sort (the default), merge,
4178 and check for sortedness. The following options change the operation
4185 @itemx --check=diagnose-first
4188 @cindex checking for sortedness
4189 Check whether the given file is already sorted: if it is not all
4190 sorted, print a diagnostic containing the first out-of-order line and
4191 exit with a status of 1.
4192 Otherwise, exit successfully.
4193 At most one input file can be given.
4196 @itemx --check=quiet
4197 @itemx --check=silent
4200 @cindex checking for sortedness
4201 Exit successfully if the given file is already sorted, and
4202 exit with status 1 otherwise.
4203 At most one input file can be given.
4204 This is like @option{-c}, except it does not print a diagnostic.
4210 @cindex merging sorted files
4211 Merge the given files by sorting them as a group. Each input file must
4212 always be individually sorted. It always works to sort instead of
4213 merge; merging is provided because it is faster, in the case where it
4218 @cindex sort stability
4219 @cindex sort's last-resort comparison
4220 A pair of lines is compared as follows:
4221 @command{sort} compares each pair of fields, in the
4222 order specified on the command line, according to the associated
4223 ordering options, until a difference is found or no fields are left.
4224 If no key fields are specified, @command{sort} uses a default key of
4225 the entire line. Finally, as a last resort when all keys compare
4226 equal, @command{sort} compares entire lines as if no ordering options
4227 other than @option{--reverse} (@option{-r}) were specified. The
4228 @option{--stable} (@option{-s}) option disables this @dfn{last-resort
4229 comparison} so that lines in which all fields compare equal are left
4230 in their original relative order. The @option{--unique}
4231 (@option{-u}) option also disables the last-resort comparison.
4235 Unless otherwise specified, all comparisons use the character collating
4236 sequence specified by the @env{LC_COLLATE} locale.@footnote{If you
4237 use a non-POSIX locale (e.g., by setting @env{LC_ALL}
4238 to @samp{en_US}), then @command{sort} may produce output that is sorted
4239 differently than you're accustomed to. In that case, set the @env{LC_ALL}
4240 environment variable to @samp{C}@. Note that setting only @env{LC_COLLATE}
4241 has two problems. First, it is ineffective if @env{LC_ALL} is also set.
4242 Second, it has undefined behavior if @env{LC_CTYPE} (or @env{LANG}, if
4243 @env{LC_CTYPE} is unset) is set to an incompatible value. For example,
4244 you get undefined behavior if @env{LC_CTYPE} is @code{ja_JP.PCK} but
4245 @env{LC_COLLATE} is @code{en_US.UTF-8}.}
4247 GNU @command{sort} (as specified for all GNU utilities) has no
4248 limit on input line length or restrictions on bytes allowed within lines.
4249 In addition, if the final byte of an input file is not a newline, GNU
4250 @command{sort} silently supplies one. A line's trailing newline is not
4251 part of the line for comparison purposes.
4253 @cindex exit status of @command{sort}
4257 0 if no error occurred
4258 1 if invoked with @option{-c} or @option{-C} and the input is not sorted
4259 2 if an error occurred
4263 If the environment variable @env{TMPDIR} is set, @command{sort} uses its
4264 value as the directory for temporary files instead of @file{/tmp}. The
4265 @option{--temporary-directory} (@option{-T}) option in turn overrides
4266 the environment variable.
4268 The following options affect the ordering of output lines. They may be
4269 specified globally or as part of a specific key field. If no key
4270 fields are specified, global options apply to comparison of entire
4271 lines; otherwise the global options are inherited by key fields that do
4272 not specify any special options of their own. In pre-POSIX
4273 versions of @command{sort}, global options affect only later key fields,
4274 so portable shell scripts should specify global options first.
4279 @itemx --ignore-leading-blanks
4281 @opindex --ignore-leading-blanks
4282 @cindex blanks, ignoring leading
4284 Ignore leading blanks when finding sort keys in each line.
4285 By default a blank is a space or a tab, but the @env{LC_CTYPE} locale
4286 can change this. Note blanks may be ignored by your locale's collating
4287 rules, but without this option they will be significant for character
4288 positions specified in keys with the @option{-k} option.
4291 @itemx --dictionary-order
4293 @opindex --dictionary-order
4294 @cindex dictionary order
4295 @cindex phone directory order
4296 @cindex telephone directory order
4298 Sort in @dfn{phone directory} order: ignore all characters except
4299 letters, digits and blanks when sorting.
4300 By default letters and digits are those of ASCII and a blank
4301 is a space or a tab, but the @env{LC_CTYPE} locale can change this.
4304 @itemx --ignore-case
4306 @opindex --ignore-case
4307 @cindex ignoring case
4308 @cindex case folding
4310 Fold lowercase characters into the equivalent uppercase characters when
4311 comparing so that, for example, @samp{b} and @samp{B} sort as equal.
4312 The @env{LC_CTYPE} locale determines character types.
4313 When used with @option{--unique} those lower case equivalent lines are
4314 thrown away. (There is currently no way to throw away the upper case
4315 equivalent instead. (Any @option{--reverse} given would only affect
4316 the final result, after the throwing away.))
4319 @itemx --general-numeric-sort
4320 @itemx --sort=general-numeric
4322 @opindex --general-numeric-sort
4324 @cindex general numeric sort
4326 Sort numerically, converting a prefix of each line to a long
4327 double-precision floating point number. @xref{Floating point}.
4328 Do not report overflow, underflow, or conversion errors.
4329 Use the following collating sequence:
4333 Lines that do not start with numbers (all considered to be equal).
4335 NaNs (``Not a Number'' values, in IEEE floating point arithmetic)
4336 in a consistent but machine-dependent order.
4340 Finite numbers in ascending numeric order (with @math{-0} and @math{+0} equal).
4345 Use this option only if there is no alternative; it is much slower than
4346 @option{--numeric-sort} (@option{-n}) and it can lose information when
4347 converting to floating point.
4350 @itemx --human-numeric-sort
4351 @itemx --sort=human-numeric
4353 @opindex --human-numeric-sort
4355 @cindex human numeric sort
4357 Sort numerically, first by numeric sign (negative, zero, or positive);
4358 then by SI suffix (either empty, or @samp{k} or @samp{K}, or
4359 one of @samp{MGTPEZY}, in that order; @pxref{Block size}); and finally
4360 by numeric value. For example, @samp{1023M} sorts before @samp{1G}
4361 because @samp{M} (mega) precedes @samp{G} (giga) as an SI
4362 suffix. This option sorts values that are consistently scaled to the
4363 nearest suffix, regardless of whether suffixes denote powers of 1000
4364 or 1024, and it therefore sorts the output of any single invocation of
4365 the @command{df}, @command{du}, or @command{ls} commands that are
4366 invoked with their @option{--human-readable} or @option{--si} options.
4367 The syntax for numbers is the same as for the @option{--numeric-sort}
4368 option; the SI suffix must immediately follow the number.
4369 Note also the @command{numfmt} command, which can be used to reformat
4370 numbers to human format @emph{after} the sort, thus often allowing
4371 sort to operate on more accurate numbers.
4374 @itemx --ignore-nonprinting
4376 @opindex --ignore-nonprinting
4377 @cindex nonprinting characters, ignoring
4378 @cindex unprintable characters, ignoring
4380 Ignore nonprinting characters.
4381 The @env{LC_CTYPE} locale determines character types.
4382 This option has no effect if the stronger @option{--dictionary-order}
4383 (@option{-d}) option is also given.
4389 @opindex --month-sort
4391 @cindex months, sorting by
4393 An initial string, consisting of any amount of blanks, followed
4394 by a month name abbreviation, is folded to UPPER case and
4395 compared in the order @samp{JAN} < @samp{FEB} < @dots{} < @samp{DEC}@.
4396 Invalid names compare low to valid names. The @env{LC_TIME} locale
4397 category determines the month spellings.
4398 By default a blank is a space or a tab, but the @env{LC_CTYPE} locale
4402 @itemx --numeric-sort
4403 @itemx --sort=numeric
4405 @opindex --numeric-sort
4407 @cindex numeric sort
4409 Sort numerically. The number begins each line and consists
4410 of optional blanks, an optional @samp{-} sign, and zero or more
4411 digits possibly separated by thousands separators, optionally followed
4412 by a decimal-point character and zero or more digits. An empty
4413 number is treated as @samp{0}. The @env{LC_NUMERIC}
4414 locale specifies the decimal-point character and thousands separator.
4415 By default a blank is a space or a tab, but the @env{LC_CTYPE} locale
4418 Comparison is exact; there is no rounding error.
4420 Neither a leading @samp{+} nor exponential notation is recognized.
4421 To compare such strings numerically, use the
4422 @option{--general-numeric-sort} (@option{-g}) option.
4425 @itemx --version-sort
4427 @opindex --version-sort
4428 @cindex version number sort
4429 Sort by version name and number. It behaves like a standard sort,
4430 except that each sequence of decimal digits is treated numerically
4431 as an index/version number. (@xref{Details about version sort}.)
4437 @cindex reverse sorting
4438 Reverse the result of comparison, so that lines with greater key values
4439 appear earlier in the output instead of later.
4442 @itemx --random-sort
4443 @itemx --sort=random
4445 @opindex --random-sort
4448 Sort by hashing the input keys and then sorting the hash values.
4449 Choose the hash function at random, ensuring that it is free of
4450 collisions so that differing keys have differing hash values. This is
4451 like a random permutation of the inputs (@pxref{shuf invocation}),
4452 except that keys with the same value sort together.
4454 If multiple random sort fields are specified, the same random hash
4455 function is used for all fields. To use different random hash
4456 functions for different fields, you can invoke @command{sort} more
4459 The choice of hash function is affected by the
4460 @option{--random-source} option.
4468 @item --compress-program=@var{prog}
4469 Compress any temporary files with the program @var{prog}.
4471 With no arguments, @var{prog} must compress standard input to standard
4472 output, and when given the @option{-d} option it must decompress
4473 standard input to standard output.
4475 Terminate with an error if @var{prog} exits with nonzero status.
4477 White space and the backslash character should not appear in
4478 @var{prog}; they are reserved for future use.
4480 @filesZeroFromOption{sort,,sorted output}
4482 @item -k @var{pos1}[,@var{pos2}]
4483 @itemx --key=@var{pos1}[,@var{pos2}]
4487 Specify a sort field that consists of the part of the line between
4488 @var{pos1} and @var{pos2} (or the end of the line, if @var{pos2} is
4489 omitted), @emph{inclusive}.
4491 Each @var{pos} has the form @samp{@var{f}[.@var{c}][@var{opts}]},
4492 where @var{f} is the number of the field to use, and @var{c} is the number
4493 of the first character from the beginning of the field. Fields and character
4494 positions are numbered starting with 1; a character position of zero in
4495 @var{pos2} indicates the field's last character. If @samp{.@var{c}} is
4496 omitted from @var{pos1}, it defaults to 1 (the beginning of the field);
4497 if omitted from @var{pos2}, it defaults to 0 (the end of the field).
4498 @var{opts} are ordering options, allowing individual keys to be sorted
4499 according to different rules; see below for details. Keys can span
4502 Example: To sort on the second field, use @option{--key=2,2}
4503 (@option{-k 2,2}). See below for more notes on keys and more examples.
4504 See also the @option{--debug} option to help determine the part
4505 of the line being used in the sort.
4508 Highlight the portion of each line used for sorting.
4509 Also issue warnings about questionable usage to stderr.
4511 @item --batch-size=@var{nmerge}
4512 @opindex --batch-size
4513 @cindex number of inputs to merge, nmerge
4514 Merge at most @var{nmerge} inputs at once.
4516 When @command{sort} has to merge more than @var{nmerge} inputs,
4517 it merges them in groups of @var{nmerge}, saving the result in
4518 a temporary file, which is then used as an input in a subsequent merge.
4520 A large value of @var{nmerge} may improve merge performance and decrease
4521 temporary storage utilization at the expense of increased memory usage
4522 and I/O@. Conversely a small value of @var{nmerge} may reduce memory
4523 requirements and I/O at the expense of temporary storage consumption and
4526 The value of @var{nmerge} must be at least 2. The default value is
4527 currently 16, but this is implementation-dependent and may change in
4530 The value of @var{nmerge} may be bounded by a resource limit for open
4531 file descriptors. The commands @samp{ulimit -n} or @samp{getconf
4532 OPEN_MAX} may display limits for your systems; these limits may be
4533 modified further if your program already has some files open, or if
4534 the operating system has other limits on the number of open files. If
4535 the value of @var{nmerge} exceeds the resource limit, @command{sort}
4536 silently uses a smaller value.
4538 @item -o @var{output-file}
4539 @itemx --output=@var{output-file}
4542 @cindex overwriting of input, allowed
4543 Write output to @var{output-file} instead of standard output.
4544 Normally, @command{sort} reads all input before opening
4545 @var{output-file}, so you can safely sort a file in place by using
4546 commands like @code{sort -o F F} and @code{cat F | sort -o F}@.
4547 However, @command{sort} with @option{--merge} (@option{-m}) can open
4548 the output file before reading all input, so a command like @code{cat
4549 F | sort -m -o F - G} is not safe as @command{sort} might start
4550 writing @file{F} before @command{cat} is done reading it.
4552 @vindex POSIXLY_CORRECT
4553 On newer systems, @option{-o} cannot appear after an input file if
4554 @env{POSIXLY_CORRECT} is set, e.g., @samp{sort F -o F}@. Portable
4555 scripts should specify @option{-o @var{output-file}} before any input
4558 @item --random-source=@var{file}
4559 @opindex --random-source
4560 @cindex random source for sorting
4561 Use @var{file} as a source of random data used to determine which
4562 random hash function to use with the @option{-R} option. @xref{Random
4569 @cindex sort stability
4570 @cindex sort's last-resort comparison
4572 Make @command{sort} stable by disabling its last-resort comparison.
4573 This option has no effect if no fields or global ordering options
4574 other than @option{--reverse} (@option{-r}) are specified.
4577 @itemx --buffer-size=@var{size}
4579 @opindex --buffer-size
4580 @cindex size for main memory sorting
4581 Use a main-memory sort buffer of the given @var{size}. By default,
4582 @var{size} is in units of 1024 bytes. Appending @samp{%} causes
4583 @var{size} to be interpreted as a percentage of physical memory.
4584 Appending @samp{K} multiplies @var{size} by 1024 (the default),
4585 @samp{M} by 1,048,576, @samp{G} by 1,073,741,824, and so on for
4586 @samp{T}, @samp{P}, @samp{E}, @samp{Z}, and @samp{Y}@. Appending
4587 @samp{b} causes @var{size} to be interpreted as a byte count, with no
4590 This option can improve the performance of @command{sort} by causing it
4591 to start with a larger or smaller sort buffer than the default.
4592 However, this option affects only the initial buffer size. The buffer
4593 grows beyond @var{size} if @command{sort} encounters input lines larger
4596 @item -t @var{separator}
4597 @itemx --field-separator=@var{separator}
4599 @opindex --field-separator
4600 @cindex field separator character
4601 Use character @var{separator} as the field separator when finding the
4602 sort keys in each line. By default, fields are separated by the empty
4603 string between a non-blank character and a blank character.
4604 By default a blank is a space or a tab, but the @env{LC_CTYPE} locale
4607 That is, given the input line @w{@samp{ foo bar}}, @command{sort} breaks it
4608 into fields @w{@samp{ foo}} and @w{@samp{ bar}}. The field separator is
4609 not considered to be part of either the field preceding or the field
4610 following, so with @samp{sort @w{-t " "}} the same input line has
4611 three fields: an empty field, @samp{foo}, and @samp{bar}.
4612 However, fields that extend to the end of the line,
4613 as @option{-k 2}, or fields consisting of a range, as @option{-k 2,3},
4614 retain the field separators present between the endpoints of the range.
4616 To specify ASCII NUL as the field separator,
4617 use the two-character string @samp{\0}, e.g., @samp{sort -t '\0'}.
4619 @item -T @var{tempdir}
4620 @itemx --temporary-directory=@var{tempdir}
4622 @opindex --temporary-directory
4623 @cindex temporary directory
4625 Use directory @var{tempdir} to store temporary files, overriding the
4626 @env{TMPDIR} environment variable. If this option is given more than
4627 once, temporary files are stored in all the directories given. If you
4628 have a large sort or merge that is I/O-bound, you can often improve
4629 performance by using this option to specify directories on different
4630 disks and controllers.
4632 @item --parallel=@var{n}
4634 @cindex multithreaded sort
4635 Set the number of sorts run in parallel to @var{n}. By default,
4636 @var{n} is set to the number of available processors, but limited
4637 to 8, as there are diminishing performance gains after that.
4638 Note also that using @var{n} threads increases the memory usage by
4639 a factor of log @var{n}. Also see @ref{nproc invocation}.
4645 @cindex uniquifying output
4647 Normally, output only the first of a sequence of lines that compare
4648 equal. For the @option{--check} (@option{-c} or @option{-C}) option,
4649 check that no pair of consecutive lines compares equal.
4651 This option also disables the default last-resort comparison.
4653 The commands @code{sort -u} and @code{sort | uniq} are equivalent, but
4654 this equivalence does not extend to arbitrary @command{sort} options.
4655 For example, @code{sort -n -u} inspects only the value of the initial
4656 numeric string when checking for uniqueness, whereas @code{sort -n |
4657 uniq} inspects the entire line. @xref{uniq invocation}.
4659 @macro zeroTerminatedOption
4661 @itemx --zero-terminated
4663 @opindex --zero-terminated
4664 @cindex process zero-terminated items
4665 Delimit items with a zero byte rather than a newline (ASCII LF).
4666 I.e., treat input as items separated by ASCII NUL
4667 and terminate output items with ASCII NUL.
4668 This option can be useful in conjunction with @samp{perl -0} or
4669 @samp{find -print0} and @samp{xargs -0} which do the same in order to
4670 reliably handle arbitrary file names (even those containing blanks
4671 or other special characters).
4673 @zeroTerminatedOption
4677 Historical (BSD and System V) implementations of @command{sort} have
4678 differed in their interpretation of some options, particularly
4679 @option{-b}, @option{-f}, and @option{-n}.
4680 GNU sort follows the POSIX
4681 behavior, which is usually (but not always!) like the System V behavior.
4682 According to POSIX, @option{-n} no longer implies @option{-b}. For
4683 consistency, @option{-M} has been changed in the same way. This may
4684 affect the meaning of character positions in field specifications in
4685 obscure cases. The only fix is to add an explicit @option{-b}.
4687 A position in a sort field specified with @option{-k} may have any
4688 of the option letters @samp{MbdfghinRrV} appended to it, in which case no
4689 global ordering options are inherited by that particular field. The
4690 @option{-b} option may be independently attached to either or both of
4691 the start and end positions of a field specification, and if it is
4692 inherited from the global options it will be attached to both.
4693 If input lines can contain leading or adjacent blanks and @option{-t}
4694 is not used, then @option{-k} is typically combined with @option{-b} or
4695 an option that implicitly ignores leading blanks (@samp{Mghn}) as otherwise
4696 the varying numbers of leading blanks in fields can cause confusing results.
4698 If the start position in a sort field specifier falls after the end of
4699 the line or after the end field, the field is empty. If the @option{-b}
4700 option was specified, the @samp{.@var{c}} part of a field specification
4701 is counted from the first nonblank character of the field.
4703 @vindex _POSIX2_VERSION
4704 @vindex POSIXLY_CORRECT
4705 On older systems, @command{sort} supports an obsolete origin-zero
4706 syntax @samp{+@var{pos1} [-@var{pos2}]} for specifying sort keys.
4707 The obsolete sequence @samp{sort +@var{a}.@var{x} -@var{b}.@var{y}}
4708 is equivalent to @samp{sort -k @var{a+1}.@var{x+1},@var{b}} if @var{y}
4709 is @samp{0} or absent, otherwise it is equivalent to @samp{sort -k
4710 @var{a+1}.@var{x+1},@var{b+1}.@var{y}}.
4712 This obsolete behavior can be enabled or disabled with the
4713 @env{_POSIX2_VERSION} environment variable (@pxref{Standards
4714 conformance}); it can also be enabled when @env{POSIXLY_CORRECT} is
4715 not set by using the obsolete syntax with @samp{-@var{pos2}} present.
4717 Scripts intended for use on standard hosts should avoid obsolete
4718 syntax and should use @option{-k} instead. For example, avoid
4719 @samp{sort +2}, since it might be interpreted as either @samp{sort
4720 ./+2} or @samp{sort -k 3}. If your script must also run on hosts that
4721 support only the obsolete syntax, it can use a test like @samp{if sort
4722 -k 1 </dev/null >/dev/null 2>&1; then @dots{}} to decide which syntax
4725 Here are some examples to illustrate various combinations of options.
4730 Sort in descending (reverse) numeric order.
4737 Run no more than 4 sorts concurrently, using a buffer size of 10M.
4740 sort --parallel=4 -S 10M
4744 Sort alphabetically, omitting the first and second fields
4745 and the blanks at the start of the third field.
4746 This uses a single key composed of the characters beginning
4747 at the start of the first nonblank character in field three
4748 and extending to the end of each line.
4755 Sort numerically on the second field and resolve ties by sorting
4756 alphabetically on the third and fourth characters of field five.
4757 Use @samp{:} as the field delimiter.
4760 sort -t : -k 2,2n -k 5.3,5.4
4763 Note that if you had written @option{-k 2n} instead of @option{-k 2,2n}
4764 @command{sort} would have used all characters beginning in the second field
4765 and extending to the end of the line as the primary @emph{numeric}
4766 key. For the large majority of applications, treating keys spanning
4767 more than one field as numeric will not do what you expect.
4769 Also note that the @samp{n} modifier was applied to the field-end
4770 specifier for the first key. It would have been equivalent to
4771 specify @option{-k 2n,2} or @option{-k 2n,2n}. All modifiers except
4772 @samp{b} apply to the associated @emph{field}, regardless of whether
4773 the modifier character is attached to the field-start and/or the
4774 field-end part of the key specifier.
4777 Sort the password file on the fifth field and ignore any
4778 leading blanks. Sort lines with equal values in field five
4779 on the numeric user ID in field three. Fields are separated
4783 sort -t : -k 5b,5 -k 3,3n /etc/passwd
4784 sort -t : -n -k 5b,5 -k 3,3 /etc/passwd
4785 sort -t : -b -k 5,5 -k 3,3n /etc/passwd
4788 These three commands have equivalent effect. The first specifies that
4789 the first key's start position ignores leading blanks and the second
4790 key is sorted numerically. The other two commands rely on global
4791 options being inherited by sort keys that lack modifiers. The inheritance
4792 works in this case because @option{-k 5b,5b} and @option{-k 5b,5} are
4793 equivalent, as the location of a field-end lacking a @samp{.@var{c}}
4794 character position is not affected by whether initial blanks are
4798 Sort a set of log files, primarily by IPv4 address and secondarily by
4799 time stamp. If two lines' primary and secondary keys are identical,
4800 output the lines in the same order that they were input. The log
4801 files contain lines that look like this:
4804 4.150.156.3 - - [01/Apr/2004:06:31:51 +0000] message 1
4805 211.24.3.231 - - [24/Apr/2004:20:17:39 +0000] message 2
4808 Fields are separated by exactly one space. Sort IPv4 addresses
4809 lexicographically, e.g., 212.61.52.2 sorts before 212.129.233.201
4810 because 61 is less than 129.
4813 sort -s -t ' ' -k 4.9n -k 4.5M -k 4.2n -k 4.14,4.21 file*.log |
4814 sort -s -t '.' -k 1,1n -k 2,2n -k 3,3n -k 4,4n
4817 This example cannot be done with a single @command{sort} invocation,
4818 since IPv4 address components are separated by @samp{.} while dates
4819 come just after a space. So it is broken down into two invocations of
4820 @command{sort}: the first sorts by time stamp and the second by IPv4
4821 address. The time stamp is sorted by year, then month, then day, and
4822 finally by hour-minute-second field, using @option{-k} to isolate each
4823 field. Except for hour-minute-second there's no need to specify the
4824 end of each key field, since the @samp{n} and @samp{M} modifiers sort
4825 based on leading prefixes that cannot cross field boundaries. The
4826 IPv4 addresses are sorted lexicographically. The second sort uses
4827 @samp{-s} so that ties in the primary key are broken by the secondary
4828 key; the first sort uses @samp{-s} so that the combination of the two
4832 Generate a tags file in case-insensitive sorted order.
4835 find src -type f -print0 | sort -z -f | xargs -0 etags --append
4838 The use of @option{-print0}, @option{-z}, and @option{-0} in this case means
4839 that file names that contain blanks or other special characters are
4841 by the sort operation.
4843 @c This example is a bit contrived and needs more explanation.
4845 @c Sort records separated by an arbitrary string by using a pipe to convert
4846 @c each record delimiter string to @samp{\0}, then using sort's -z option,
4847 @c and converting each @samp{\0} back to the original record delimiter.
4850 @c printf 'c\n\nb\n\na\n' |
4851 @c perl -0pe 's/\n\n/\n\0/g' |
4853 @c perl -0pe 's/\0/\n/g'
4857 Use the common DSU, Decorate Sort Undecorate idiom to
4858 sort lines according to their length.
4861 awk '@{print length, $0@}' /etc/passwd | sort -n | cut -f2- -d' '
4864 In general this technique can be used to sort data that the @command{sort}
4865 command does not support, or is inefficient at, sorting directly.
4868 Shuffle a list of directories, but preserve the order of files within
4869 each directory. For instance, one could use this to generate a music
4870 playlist in which albums are shuffled but the songs of each album are
4874 ls */* | sort -t / -k 1,1R -k 2,2
4880 @node shuf invocation
4881 @section @command{shuf}: Shuffling text
4884 @cindex shuffling files
4886 @command{shuf} shuffles its input by outputting a random permutation
4887 of its input lines. Each output permutation is equally likely.
4891 shuf [@var{option}]@dots{} [@var{file}]
4892 shuf -e [@var{option}]@dots{} [@var{arg}]@dots{}
4893 shuf -i @var{lo}-@var{hi} [@var{option}]@dots{}
4896 @command{shuf} has three modes of operation that affect where it
4897 obtains its input lines. By default, it reads lines from standard
4898 input. The following options change the operation mode:
4906 @cindex command-line operands to shuffle
4907 Treat each command-line operand as an input line.
4909 @item -i @var{lo}-@var{hi}
4910 @itemx --input-range=@var{lo}-@var{hi}
4912 @opindex --input-range
4913 @cindex input range to shuffle
4914 Act as if input came from a file containing the range of unsigned
4915 decimal integers @var{lo}@dots{}@var{hi}, one per line.
4919 @command{shuf}'s other options can affect its behavior in all
4924 @item -n @var{lines}
4925 @itemx --head-count=@var{count}
4927 @opindex --head-count
4928 @cindex head of output
4929 Output at most @var{count} lines. By default, all input lines are
4932 @item -o @var{output-file}
4933 @itemx --output=@var{output-file}
4936 @cindex overwriting of input, allowed
4937 Write output to @var{output-file} instead of standard output.
4938 @command{shuf} reads all input before opening
4939 @var{output-file}, so you can safely shuffle a file in place by using
4940 commands like @code{shuf -o F <F} and @code{cat F | shuf -o F}.
4942 @item --random-source=@var{file}
4943 @opindex --random-source
4944 @cindex random source for shuffling
4945 Use @var{file} as a source of random data used to determine which
4946 permutation to generate. @xref{Random sources}.
4949 @itemx --repetitions
4951 @opindex --repetitions
4952 @cindex allowing repetitions in output values
4953 Changes the default behaviour of @command{shuf}, allowing repetition in
4954 output values (in which case, @option{--head-count} can be larger
4955 than the number of input values). If @option{--head-count} is not
4956 specified, output a single random value.
4958 @zeroTerminatedOption
4974 might produce the output
4984 Similarly, the command:
4987 shuf -e clubs hearts diamonds spades
5001 and the command @samp{shuf -i 1-4} might output:
5011 These examples all have four input lines, so @command{shuf} might
5012 produce any of the twenty-four possible permutations of the input. In
5013 general, if there are @var{n} input lines, there are @var{n}! (i.e.,
5014 @var{n} factorial, or @var{n} * (@var{n} - 1) * @dots{} * 1) possible
5015 output permutations.
5018 To output 50 random numbers between 0 and 9, use:
5021 shuf --repetitions --input-range 0-9 --head-count 50
5025 or (using short options):
5032 To simulate 100 coin flips, use:
5035 shuf -r -n100 -e Head Tail
5041 printf '%s\n' Head Tail | shuf -r -n100
5047 @node uniq invocation
5048 @section @command{uniq}: Uniquify files
5051 @cindex uniquify files
5053 @command{uniq} writes the unique lines in the given @file{input}, or
5054 standard input if nothing is given or for an @var{input} name of
5058 uniq [@var{option}]@dots{} [@var{input} [@var{output}]]
5061 By default, @command{uniq} prints its input lines, except that
5062 it discards all but the first of adjacent repeated lines, so that
5063 no output lines are repeated. Optionally, it can instead discard
5064 lines that are not repeated, or all repeated lines.
5066 The input need not be sorted, but repeated input lines are detected
5067 only if they are adjacent. If you want to discard non-adjacent
5068 duplicate lines, perhaps you want to use @code{sort -u}.
5069 @xref{sort invocation}.
5072 Comparisons honor the rules specified by the @env{LC_COLLATE}
5075 If no @var{output} file is specified, @command{uniq} writes to standard
5078 The program accepts the following options. Also see @ref{Common options}.
5083 @itemx --skip-fields=@var{n}
5085 @opindex --skip-fields
5086 Skip @var{n} fields on each line before checking for uniqueness. Use
5087 a null string for comparison if a line has fewer than @var{n} fields. Fields
5088 are sequences of non-space non-tab characters that are separated from
5089 each other by at least one space or tab.
5091 For compatibility @command{uniq} supports an obsolete option syntax
5092 @option{-@var{n}}. New scripts should use @option{-f @var{n}} instead.
5095 @itemx --skip-chars=@var{n}
5097 @opindex --skip-chars
5098 Skip @var{n} characters before checking for uniqueness. Use a null string
5099 for comparison if a line has fewer than @var{n} characters. If you use both
5100 the field and character skipping options, fields are skipped over first.
5102 @vindex _POSIX2_VERSION
5103 On older systems, @command{uniq} supports an obsolete option syntax
5105 This obsolete behavior can be enabled or disabled with the
5106 @env{_POSIX2_VERSION} environment variable (@pxref{Standards
5107 conformance}), but portable scripts should avoid commands whose
5108 behavior depends on this variable.
5109 For example, use @samp{uniq ./+10} or @samp{uniq -s 10} rather than
5110 the ambiguous @samp{uniq +10}.
5116 Print the number of times each line occurred along with the line.
5119 @itemx --ignore-case
5121 @opindex --ignore-case
5122 Ignore differences in case when comparing lines.
5128 @cindex repeated lines, outputting
5129 Discard lines that are not repeated. When used by itself, this option
5130 causes @command{uniq} to print the first copy of each repeated line,
5134 @itemx --all-repeated[=@var{delimit-method}]
5136 @opindex --all-repeated
5137 @cindex all repeated lines, outputting
5138 Do not discard the second and subsequent repeated input lines,
5139 but discard lines that are not repeated.
5140 This option is useful mainly in conjunction with other options e.g.,
5141 to ignore case or to compare only selected fields.
5142 The optional @var{delimit-method} specifies how to delimit
5143 groups of repeated lines, and must be one of the following:
5148 Do not delimit groups of repeated lines.
5149 This is equivalent to @option{--all-repeated} (@option{-D}).
5152 Output a newline before each group of repeated lines.
5153 @macro nulOutputNote
5154 With @option{--zero-terminated} (@option{-z}), use a zero
5155 byte (ASCII NUL) instead of a newline as the delimiter.
5160 Separate groups of repeated lines with a single newline.
5161 This is the same as using @samp{prepend}, except that
5162 no delimiter is inserted before the first group, and hence
5163 may be better suited for output direct to users.
5167 @macro ambiguousGroupNote
5168 Note that when groups are delimited and the input stream contains
5169 two or more consecutive blank lines, then the output is ambiguous.
5170 To avoid that, filter the input through @samp{tr -s '\\n'} to replace
5171 each sequence of consecutive newlines with a single newline.
5175 This is a GNU extension.
5176 @c FIXME: give an example showing *how* it's useful
5178 @item --group[=@var{delimit-method}]
5180 @cindex all lines, grouping
5181 Output all lines, and delimit each unique group.
5183 The optional @var{delimit-method} specifies how to delimit
5184 groups, and must be one of the following:
5189 Separate unique groups with a single delimiter.
5190 This is the default delimiting method if none is specified,
5191 and better suited for output direct to users.
5194 Output a delimiter before each group of unique items.
5197 Output a delimiter after each group of unique items.
5200 Output a delimiter around each group of unique items.
5205 This is a GNU extension.
5211 @cindex unique lines, outputting
5212 Discard the first repeated line. When used by itself, this option
5213 causes @command{uniq} to print unique lines, and nothing else.
5216 @itemx --check-chars=@var{n}
5218 @opindex --check-chars
5219 Compare at most @var{n} characters on each line (after skipping any specified
5220 fields and characters). By default the entire rest of the lines are
5223 @zeroTerminatedOption
5230 @node comm invocation
5231 @section @command{comm}: Compare two sorted files line by line
5234 @cindex line-by-line comparison
5235 @cindex comparing sorted files
5237 @command{comm} writes to standard output lines that are common, and lines
5238 that are unique, to two input files; a file name of @samp{-} means
5239 standard input. Synopsis:
5242 comm [@var{option}]@dots{} @var{file1} @var{file2}
5246 Before @command{comm} can be used, the input files must be sorted using the
5247 collating sequence specified by the @env{LC_COLLATE} locale.
5248 If an input file ends in a non-newline
5249 character, a newline is silently appended. The @command{sort} command with
5250 no options always outputs a file that is suitable input to @command{comm}.
5252 @cindex differing lines
5253 @cindex common lines
5254 With no options, @command{comm} produces three-column output. Column one
5255 contains lines unique to @var{file1}, column two contains lines unique
5256 to @var{file2}, and column three contains lines common to both files.
5257 Columns are separated by a single TAB character.
5258 @c FIXME: when there's an option to supply an alternative separator
5259 @c string, append "by default" to the above sentence.
5264 The options @option{-1}, @option{-2}, and @option{-3} suppress printing of
5265 the corresponding columns (and separators). Also see @ref{Common options}.
5267 Unlike some other comparison utilities, @command{comm} has an exit
5268 status that does not depend on the result of the comparison.
5269 Upon normal completion @command{comm} produces an exit code of zero.
5270 If there is an error it exits with nonzero status.
5272 @macro checkOrderOption{cmd}
5273 If the @option{--check-order} option is given, unsorted inputs will
5274 cause a fatal error message. If the option @option{--nocheck-order}
5275 is given, unsorted inputs will never cause an error message. If neither
5276 of these options is given, wrongly sorted inputs are diagnosed
5277 only if an input file is found to contain unpairable
5279 lines, and when both input files are non empty.
5281 @ifclear JOIN_COMMAND
5284 If an input file is diagnosed as being unsorted, the @command{\cmd\}
5285 command will exit with a nonzero status (and the output should not be used).
5287 Forcing @command{\cmd\} to process wrongly sorted input files
5288 containing unpairable lines by specifying @option{--nocheck-order} is
5289 not guaranteed to produce any particular output. The output will
5290 probably not correspond with whatever you hoped it would be.
5292 @checkOrderOption{comm}
5297 Fail with an error message if either input file is wrongly ordered.
5299 @item --nocheck-order
5300 Do not check that both input files are in sorted order.
5304 @item --output-delimiter=@var{str}
5305 Print @var{str} between adjacent output columns,
5306 rather than the default of a single TAB character.
5308 The delimiter @var{str} may not be empty.
5312 @node ptx invocation
5313 @section @command{ptx}: Produce permuted indexes
5317 @command{ptx} reads a text file and essentially produces a permuted index, with
5318 each keyword in its context. The calling sketch is either one of:
5321 ptx [@var{option} @dots{}] [@var{file} @dots{}]
5322 ptx -G [@var{option} @dots{}] [@var{input} [@var{output}]]
5325 The @option{-G} (or its equivalent: @option{--traditional}) option disables
5326 all GNU extensions and reverts to traditional mode, thus introducing some
5327 limitations and changing several of the program's default option values.
5328 When @option{-G} is not specified, GNU extensions are always enabled.
5329 GNU extensions to @command{ptx} are documented wherever appropriate in this
5330 document. @xref{Compatibility in ptx}, for the full list.
5332 Individual options are explained in the following sections.
5334 When GNU extensions are enabled, there may be zero, one or several
5335 @var{file}s after the options. If there is no @var{file}, the program
5336 reads the standard input. If there is one or several @var{file}s, they
5337 give the name of input files which are all read in turn, as if all the
5338 input files were concatenated. However, there is a full contextual
5339 break between each file and, when automatic referencing is requested,
5340 file names and line numbers refer to individual text input files. In
5341 all cases, the program outputs the permuted index to the standard
5344 When GNU extensions are @emph{not} enabled, that is, when the program
5345 operates in traditional mode, there may be zero, one or two parameters
5346 besides the options. If there are no parameters, the program reads the
5347 standard input and outputs the permuted index to the standard output.
5348 If there is only one parameter, it names the text @var{input} to be read
5349 instead of the standard input. If two parameters are given, they give
5350 respectively the name of the @var{input} file to read and the name of
5351 the @var{output} file to produce. @emph{Be very careful} to note that,
5352 in this case, the contents of file given by the second parameter is
5353 destroyed. This behavior is dictated by System V @command{ptx}
5354 compatibility; GNU Standards normally discourage output parameters not
5355 introduced by an option.
5357 Note that for @emph{any} file named as the value of an option or as an
5358 input text file, a single dash @kbd{-} may be used, in which case
5359 standard input is assumed. However, it would not make sense to use this
5360 convention more than once per program invocation.
5363 * General options in ptx:: Options which affect general program behavior.
5364 * Charset selection in ptx:: Underlying character set considerations.
5365 * Input processing in ptx:: Input fields, contexts, and keyword selection.
5366 * Output formatting in ptx:: Types of output format, and sizing the fields.
5367 * Compatibility in ptx::
5371 @node General options in ptx
5372 @subsection General options
5377 @itemx --traditional
5378 As already explained, this option disables all GNU extensions to
5379 @command{ptx} and switches to traditional mode.
5382 Print a short help on standard output, then exit without further
5386 Print the program version on standard output, then exit without further
5394 @node Charset selection in ptx
5395 @subsection Charset selection
5397 @c FIXME: People don't necessarily know what an IBM-PC was these days.
5398 As it is set up now, the program assumes that the input file is coded
5399 using 8-bit ISO 8859-1 code, also known as Latin-1 character set,
5400 @emph{unless} it is compiled for MS-DOS, in which case it uses the
5401 character set of the IBM-PC@. (GNU @command{ptx} is not known to work on
5402 smaller MS-DOS machines anymore.) Compared to 7-bit ASCII, the set
5403 of characters which are letters is different; this alters the behavior
5404 of regular expression matching. Thus, the default regular expression
5405 for a keyword allows foreign or diacriticized letters. Keyword sorting,
5406 however, is still crude; it obeys the underlying character set ordering
5412 @itemx --ignore-case
5413 Fold lower case letters to upper case for sorting.
5418 @node Input processing in ptx
5419 @subsection Word selection and input processing
5424 @itemx --break-file=@var{file}
5426 This option provides an alternative (to @option{-W}) method of describing
5427 which characters make up words. It introduces the name of a
5428 file which contains a list of characters which can@emph{not} be part of
5429 one word; this file is called the @dfn{Break file}. Any character which
5430 is not part of the Break file is a word constituent. If both options
5431 @option{-b} and @option{-W} are specified, then @option{-W} has precedence and
5432 @option{-b} is ignored.
5434 When GNU extensions are enabled, the only way to avoid newline as a
5435 break character is to write all the break characters in the file with no
5436 newline at all, not even at the end of the file. When GNU extensions
5437 are disabled, spaces, tabs and newlines are always considered as break
5438 characters even if not included in the Break file.
5441 @itemx --ignore-file=@var{file}
5443 The file associated with this option contains a list of words which will
5444 never be taken as keywords in concordance output. It is called the
5445 @dfn{Ignore file}. The file contains exactly one word in each line; the
5446 end of line separation of words is not subject to the value of the
5450 @itemx --only-file=@var{file}
5452 The file associated with this option contains a list of words which will
5453 be retained in concordance output; any word not mentioned in this file
5454 is ignored. The file is called the @dfn{Only file}. The file contains
5455 exactly one word in each line; the end of line separation of words is
5456 not subject to the value of the @option{-S} option.
5458 There is no default for the Only file. When both an Only file and an
5459 Ignore file are specified, a word is considered a keyword only
5460 if it is listed in the Only file and not in the Ignore file.
5465 On each input line, the leading sequence of non-white space characters will be
5466 taken to be a reference that has the purpose of identifying this input
5467 line in the resulting permuted index.
5468 @xref{Output formatting in ptx},
5469 for more information about reference production.
5470 Using this option changes the default value for option @option{-S}.
5472 Using this option, the program does not try very hard to remove
5473 references from contexts in output, but it succeeds in doing so
5474 @emph{when} the context ends exactly at the newline. If option
5475 @option{-r} is used with @option{-S} default value, or when GNU extensions
5476 are disabled, this condition is always met and references are completely
5477 excluded from the output contexts.
5479 @item -S @var{regexp}
5480 @itemx --sentence-regexp=@var{regexp}
5482 This option selects which regular expression will describe the end of a
5483 line or the end of a sentence. In fact, this regular expression is not
5484 the only distinction between end of lines or end of sentences, and input
5485 line boundaries have no special significance outside this option. By
5486 default, when GNU extensions are enabled and if @option{-r} option is not
5487 used, end of sentences are used. In this case, this @var{regex} is
5488 imported from GNU Emacs:
5491 [.?!][]\"')@}]*\\($\\|\t\\| \\)[ \t\n]*
5494 Whenever GNU extensions are disabled or if @option{-r} option is used, end
5495 of lines are used; in this case, the default @var{regexp} is just:
5501 Using an empty @var{regexp} is equivalent to completely disabling end of
5502 line or end of sentence recognition. In this case, the whole file is
5503 considered to be a single big line or sentence. The user might want to
5504 disallow all truncation flag generation as well, through option @option{-F
5505 ""}. @xref{Regexps, , Syntax of Regular Expressions, emacs, The GNU Emacs
5508 When the keywords happen to be near the beginning of the input line or
5509 sentence, this often creates an unused area at the beginning of the
5510 output context line; when the keywords happen to be near the end of the
5511 input line or sentence, this often creates an unused area at the end of
5512 the output context line. The program tries to fill those unused areas
5513 by wrapping around context in them; the tail of the input line or
5514 sentence is used to fill the unused area on the left of the output line;
5515 the head of the input line or sentence is used to fill the unused area
5516 on the right of the output line.
5518 As a matter of convenience to the user, many usual backslashed escape
5519 sequences from the C language are recognized and converted to the
5520 corresponding characters by @command{ptx} itself.
5522 @item -W @var{regexp}
5523 @itemx --word-regexp=@var{regexp}
5525 This option selects which regular expression will describe each keyword.
5526 By default, if GNU extensions are enabled, a word is a sequence of
5527 letters; the @var{regexp} used is @samp{\w+}. When GNU extensions are
5528 disabled, a word is by default anything which ends with a space, a tab
5529 or a newline; the @var{regexp} used is @samp{[^ \t\n]+}.
5531 An empty @var{regexp} is equivalent to not using this option.
5532 @xref{Regexps, , Syntax of Regular Expressions, emacs, The GNU Emacs
5535 As a matter of convenience to the user, many usual backslashed escape
5536 sequences, as found in the C language, are recognized and converted to
5537 the corresponding characters by @command{ptx} itself.
5542 @node Output formatting in ptx
5543 @subsection Output formatting
5545 Output format is mainly controlled by the @option{-O} and @option{-T} options
5546 described in the table below. When neither @option{-O} nor @option{-T} are
5547 selected, and if GNU extensions are enabled, the program chooses an
5548 output format suitable for a dumb terminal. Each keyword occurrence is
5549 output to the center of one line, surrounded by its left and right
5550 contexts. Each field is properly justified, so the concordance output
5551 can be readily observed. As a special feature, if automatic
5552 references are selected by option @option{-A} and are output before the
5553 left context, that is, if option @option{-R} is @emph{not} selected, then
5554 a colon is added after the reference; this nicely interfaces with GNU
5555 Emacs @code{next-error} processing. In this default output format, each
5556 white space character, like newline and tab, is merely changed to
5557 exactly one space, with no special attempt to compress consecutive
5558 spaces. This might change in the future. Except for those white space
5559 characters, every other character of the underlying set of 256
5560 characters is transmitted verbatim.
5562 Output format is further controlled by the following options.
5566 @item -g @var{number}
5567 @itemx --gap-size=@var{number}
5569 Select the size of the minimum white space gap between the fields on the
5572 @item -w @var{number}
5573 @itemx --width=@var{number}
5575 Select the maximum output width of each final line. If references are
5576 used, they are included or excluded from the maximum output width
5577 depending on the value of option @option{-R}@. If this option is not
5578 selected, that is, when references are output before the left context,
5579 the maximum output width takes into account the maximum length of all
5580 references. If this option is selected, that is, when references are
5581 output after the right context, the maximum output width does not take
5582 into account the space taken by references, nor the gap that precedes
5586 @itemx --auto-reference
5588 Select automatic references. Each input line will have an automatic
5589 reference made up of the file name and the line ordinal, with a single
5590 colon between them. However, the file name will be empty when standard
5591 input is being read. If both @option{-A} and @option{-r} are selected, then
5592 the input reference is still read and skipped, but the automatic
5593 reference is used at output time, overriding the input reference.
5596 @itemx --right-side-refs
5598 In the default output format, when option @option{-R} is not used, any
5599 references produced by the effect of options @option{-r} or @option{-A} are
5600 placed to the far right of output lines, after the right context. With
5601 default output format, when the @option{-R} option is specified, references
5602 are rather placed at the beginning of each output line, before the left
5603 context. For any other output format, option @option{-R} is
5604 ignored, with one exception: with @option{-R} the width of references
5605 is @emph{not} taken into account in total output width given by @option{-w}.
5607 This option is automatically selected whenever GNU extensions are
5610 @item -F @var{string}
5611 @itemx --flac-truncation=@var{string}
5613 This option will request that any truncation in the output be reported
5614 using the string @var{string}. Most output fields theoretically extend
5615 towards the beginning or the end of the current line, or current
5616 sentence, as selected with option @option{-S}@. But there is a maximum
5617 allowed output line width, changeable through option @option{-w}, which is
5618 further divided into space for various output fields. When a field has
5619 to be truncated because it cannot extend beyond the beginning or the end of
5620 the current line to fit in, then a truncation occurs. By default,
5621 the string used is a single slash, as in @option{-F /}.
5623 @var{string} may have more than one character, as in @option{-F @dots{}}.
5624 Also, in the particular case when @var{string} is empty (@option{-F ""}),
5625 truncation flagging is disabled, and no truncation marks are appended in
5628 As a matter of convenience to the user, many usual backslashed escape
5629 sequences, as found in the C language, are recognized and converted to
5630 the corresponding characters by @command{ptx} itself.
5632 @item -M @var{string}
5633 @itemx --macro-name=@var{string}
5635 Select another @var{string} to be used instead of @samp{xx}, while
5636 generating output suitable for @command{nroff}, @command{troff} or @TeX{}.
5639 @itemx --format=roff
5641 Choose an output format suitable for @command{nroff} or @command{troff}
5642 processing. Each output line will look like:
5645 .xx "@var{tail}" "@var{before}" "@var{keyword_and_after}"@c
5646 "@var{head}" "@var{ref}"
5649 so it will be possible to write a @samp{.xx} roff macro to take care of
5650 the output typesetting. This is the default output format when GNU
5651 extensions are disabled. Option @option{-M} can be used to change
5652 @samp{xx} to another macro name.
5654 In this output format, each non-graphical character, like newline and
5655 tab, is merely changed to exactly one space, with no special attempt to
5656 compress consecutive spaces. Each quote character: @kbd{"} is doubled
5657 so it will be correctly processed by @command{nroff} or @command{troff}.
5662 Choose an output format suitable for @TeX{} processing. Each output
5663 line will look like:
5666 \xx @{@var{tail}@}@{@var{before}@}@{@var{keyword}@}@c
5667 @{@var{after}@}@{@var{head}@}@{@var{ref}@}
5671 so it will be possible to write a @code{\xx} definition to take care of
5672 the output typesetting. Note that when references are not being
5673 produced, that is, neither option @option{-A} nor option @option{-r} is
5674 selected, the last parameter of each @code{\xx} call is inhibited.
5675 Option @option{-M} can be used to change @samp{xx} to another macro
5678 In this output format, some special characters, like @kbd{$}, @kbd{%},
5679 @kbd{&}, @kbd{#} and @kbd{_} are automatically protected with a
5680 backslash. Curly brackets @kbd{@{}, @kbd{@}} are protected with a
5681 backslash and a pair of dollar signs (to force mathematical mode). The
5682 backslash itself produces the sequence @code{\backslash@{@}}.
5683 Circumflex and tilde diacritical marks produce the sequence @code{^\@{ @}} and
5684 @code{~\@{ @}} respectively. Other diacriticized characters of the
5685 underlying character set produce an appropriate @TeX{} sequence as far
5686 as possible. The other non-graphical characters, like newline and tab,
5687 and all other characters which are not part of ASCII, are merely
5688 changed to exactly one space, with no special attempt to compress
5689 consecutive spaces. Let me know how to improve this special character
5690 processing for @TeX{}.
5695 @node Compatibility in ptx
5696 @subsection The GNU extensions to @command{ptx}
5698 This version of @command{ptx} contains a few features which do not exist in
5699 System V @command{ptx}. These extra features are suppressed by using the
5700 @option{-G} command line option, unless overridden by other command line
5701 options. Some GNU extensions cannot be recovered by overriding, so the
5702 simple rule is to avoid @option{-G} if you care about GNU extensions.
5703 Here are the differences between this program and System V @command{ptx}.
5708 This program can read many input files at once, it always writes the
5709 resulting concordance on standard output. On the other hand, System V
5710 @command{ptx} reads only one file and sends the result to standard output
5711 or, if a second @var{file} parameter is given on the command, to that
5714 Having output parameters not introduced by options is a dangerous
5715 practice which GNU avoids as far as possible. So, for using @command{ptx}
5716 portably between GNU and System V, you should always use it with a
5717 single input file, and always expect the result on standard output. You
5718 might also want to automatically configure in a @option{-G} option to
5719 @command{ptx} calls in products using @command{ptx}, if the configurator finds
5720 that the installed @command{ptx} accepts @option{-G}.
5723 The only options available in System V @command{ptx} are options @option{-b},
5724 @option{-f}, @option{-g}, @option{-i}, @option{-o}, @option{-r}, @option{-t} and
5725 @option{-w}. All other options are GNU extensions and are not repeated in
5726 this enumeration. Moreover, some options have a slightly different
5727 meaning when GNU extensions are enabled, as explained below.
5730 By default, concordance output is not formatted for @command{troff} or
5731 @command{nroff}. It is rather formatted for a dumb terminal. @command{troff}
5732 or @command{nroff} output may still be selected through option @option{-O}.
5735 Unless @option{-R} option is used, the maximum reference width is
5736 subtracted from the total output line width. With GNU extensions
5737 disabled, width of references is not taken into account in the output
5738 line width computations.
5741 All 256 bytes, even ASCII NUL bytes, are always read and
5742 processed from input file with no adverse effect, even if GNU extensions
5743 are disabled. However, System V @command{ptx} does not accept 8-bit
5744 characters, a few control characters are rejected, and the tilde
5745 @kbd{~} is also rejected.
5748 Input line length is only limited by available memory, even if GNU
5749 extensions are disabled. However, System V @command{ptx} processes only
5750 the first 200 characters in each line.
5753 The break (non-word) characters default to be every character except all
5754 letters of the underlying character set, diacriticized or not. When GNU
5755 extensions are disabled, the break characters default to space, tab and
5759 The program makes better use of output line width. If GNU extensions
5760 are disabled, the program rather tries to imitate System V @command{ptx},
5761 but still, there are some slight disposition glitches this program does
5762 not completely reproduce.
5765 The user can specify both an Ignore file and an Only file. This is not
5766 allowed with System V @command{ptx}.
5771 @node tsort invocation
5772 @section @command{tsort}: Topological sort
5775 @cindex topological sort
5777 @command{tsort} performs a topological sort on the given @var{file}, or
5778 standard input if no input file is given or for a @var{file} of
5779 @samp{-}. For more details and some history, see @ref{tsort background}.
5783 tsort [@var{option}] [@var{file}]
5786 @command{tsort} reads its input as pairs of strings, separated by blanks,
5787 indicating a partial ordering. The output is a total ordering that
5788 corresponds to the given partial ordering.
5802 will produce the output
5813 Consider a more realistic example.
5814 You have a large set of functions all in one file, and they may all be
5815 declared static except one. Currently that one (say @code{main}) is the
5816 first function defined in the file, and the ones it calls directly follow
5817 it, followed by those they call, etc. Let's say that you are determined
5818 to take advantage of prototypes, so you have to choose between declaring
5819 all of those functions (which means duplicating a lot of information from
5820 the definitions) and rearranging the functions so that as many as possible
5821 are defined before they are used. One way to automate the latter process
5822 is to get a list for each function of the functions it calls directly.
5823 Many programs can generate such lists. They describe a call graph.
5824 Consider the following list, in which a given line indicates that the
5825 function on the left calls the one on the right directly.
5831 tail_file pretty_name
5832 tail_file write_header
5834 tail_forever recheck
5835 tail_forever pretty_name
5836 tail_forever write_header
5837 tail_forever dump_remainder
5840 tail_lines start_lines
5841 tail_lines dump_remainder
5842 tail_lines file_lines
5843 tail_lines pipe_lines
5845 tail_bytes start_bytes
5846 tail_bytes dump_remainder
5847 tail_bytes pipe_bytes
5848 file_lines dump_remainder
5852 then you can use @command{tsort} to produce an ordering of those
5853 functions that satisfies your requirement.
5856 example$ tsort call-graph | tac
5876 @command{tsort} detects any cycles in the input and writes the first cycle
5877 encountered to standard error.
5879 Note that for a given partial ordering, generally there is no unique
5880 total ordering. In the context of the call graph above, the function
5881 @code{parse_options} may be placed anywhere in the list as long as it
5882 precedes @code{main}.
5884 The only options are @option{--help} and @option{--version}. @xref{Common
5890 * tsort background:: Where tsort came from.
5893 @node tsort background
5894 @subsection @command{tsort}: Background
5896 @command{tsort} exists because very early versions of the Unix linker processed
5897 an archive file exactly once, and in order. As @command{ld} read each object
5898 in the archive, it decided whether it was needed in the program based on
5899 whether it defined any symbols which were undefined at that point in
5902 This meant that dependencies within the archive had to be handled
5903 specially. For example, @code{scanf} probably calls @code{read}. That means
5904 that in a single pass through an archive, it was important for @code{scanf.o}
5905 to appear before read.o, because otherwise a program which calls
5906 @code{scanf} but not @code{read} might end up with an unexpected unresolved
5907 reference to @code{read}.
5909 The way to address this problem was to first generate a set of
5910 dependencies of one object file on another. This was done by a shell
5911 script called @command{lorder}. The GNU tools don't provide a version of
5912 lorder, as far as I know, but you can still find it in BSD
5915 Then you ran @command{tsort} over the @command{lorder} output, and you used the
5916 resulting sort to define the order in which you added objects to the archive.
5918 This whole procedure has been obsolete since about 1980, because
5919 Unix archives now contain a symbol table (traditionally built by
5920 @command{ranlib}, now generally built by @command{ar} itself), and the Unix
5921 linker uses the symbol table to effectively make multiple passes over
5924 Anyhow, that's where tsort came from. To solve an old problem with
5925 the way the linker handled archive files, which has since been solved
5929 @node Operating on fields
5930 @chapter Operating on fields
5933 * cut invocation:: Print selected parts of lines.
5934 * paste invocation:: Merge lines of files.
5935 * join invocation:: Join lines on a common field.
5939 @node cut invocation
5940 @section @command{cut}: Print selected parts of lines
5943 @command{cut} writes to standard output selected parts of each line of each
5944 input file, or standard input if no files are given or for a file name of
5948 cut @var{option}@dots{} [@var{file}]@dots{}
5951 In the table which follows, the @var{byte-list}, @var{character-list},
5952 and @var{field-list} are one or more numbers or ranges (two numbers
5953 separated by a dash) separated by commas. Bytes, characters, and
5954 fields are numbered starting at 1. Incomplete ranges may be
5955 given: @option{-@var{m}} means @samp{1-@var{m}}; @samp{@var{n}-} means
5956 @samp{@var{n}} through end of line or last field. The list elements
5957 can be repeated, can overlap, and can be specified in any order; but
5958 the selected input is written in the same order that it is read, and
5959 is written exactly once.
5961 The program accepts the following options. Also see @ref{Common
5966 @item -b @var{byte-list}
5967 @itemx --bytes=@var{byte-list}
5970 Select for printing only the bytes in positions listed in
5971 @var{byte-list}. Tabs and backspaces are treated like any other
5972 character; they take up 1 byte. If an output delimiter is specified,
5973 (see the description of @option{--output-delimiter}), then output that
5974 string between ranges of selected bytes.
5976 @item -c @var{character-list}
5977 @itemx --characters=@var{character-list}
5979 @opindex --characters
5980 Select for printing only the characters in positions listed in
5981 @var{character-list}. The same as @option{-b} for now, but
5982 internationalization will change that. Tabs and backspaces are
5983 treated like any other character; they take up 1 character. If an
5984 output delimiter is specified, (see the description of
5985 @option{--output-delimiter}), then output that string between ranges
5988 @item -f @var{field-list}
5989 @itemx --fields=@var{field-list}
5992 Select for printing only the fields listed in @var{field-list}.
5993 Fields are separated by a TAB character by default. Also print any
5994 line that contains no delimiter character, unless the
5995 @option{--only-delimited} (@option{-s}) option is specified.
5997 Note @command{awk} supports more sophisticated field processing,
5998 and by default will use (and discard) runs of blank characters to
5999 separate fields, and ignore leading and trailing blanks.
6002 awk '{print $2}' # print the second field
6003 awk '{print $NF-1}' # print the penultimate field
6004 awk '{print $2,$1}' # reorder the first two fields
6008 In the unlikely event that @command{awk} is unavailable,
6009 one can use the @command{join} command, to process blank
6010 characters as @command{awk} does above.
6013 join -a1 -o 1.2 - /dev/null # print the second field
6014 join -a1 -o 1.2,1.1 - /dev/null # reorder the first two fields
6018 @item -d @var{input_delim_byte}
6019 @itemx --delimiter=@var{input_delim_byte}
6021 @opindex --delimiter
6022 With @option{-f}, use the first byte of @var{input_delim_byte} as
6023 the input fields separator (default is TAB).
6027 Do not split multi-byte characters (no-op for now).
6030 @itemx --only-delimited
6032 @opindex --only-delimited
6033 For @option{-f}, do not print lines that do not contain the field separator
6034 character. Normally, any line without a field separator is printed verbatim.
6036 @item --output-delimiter=@var{output_delim_string}
6037 @opindex --output-delimiter
6038 With @option{-f}, output fields are separated by @var{output_delim_string}.
6039 The default with @option{-f} is to use the input delimiter.
6040 When using @option{-b} or @option{-c} to select ranges of byte or
6041 character offsets (as opposed to ranges of fields),
6042 output @var{output_delim_string} between non-overlapping
6043 ranges of selected bytes.
6046 @opindex --complement
6047 This option is a GNU extension.
6048 Select for printing the complement of the bytes, characters or fields
6049 selected with the @option{-b}, @option{-c} or @option{-f} options.
6050 In other words, do @emph{not} print the bytes, characters or fields
6051 specified via those options. This option is useful when you have
6052 many fields and want to print all but a few of them.
6059 @node paste invocation
6060 @section @command{paste}: Merge lines of files
6063 @cindex merging files
6065 @command{paste} writes to standard output lines consisting of sequentially
6066 corresponding lines of each given file, separated by a TAB character.
6067 Standard input is used for a file name of @samp{-} or if no input files
6089 paste [@var{option}]@dots{} [@var{file}]@dots{}
6092 The program accepts the following options. Also see @ref{Common options}.
6100 Paste the lines of one file at a time rather than one line from each
6101 file. Using the above example data:
6104 $ paste -s num2 let3
6109 @item -d @var{delim-list}
6110 @itemx --delimiters=@var{delim-list}
6112 @opindex --delimiters
6113 Consecutively use the characters in @var{delim-list} instead of
6114 TAB to separate merged lines. When @var{delim-list} is
6115 exhausted, start again at its beginning. Using the above example data:
6118 $ paste -d '%_' num2 let3 num2
6129 @node join invocation
6130 @section @command{join}: Join lines on a common field
6133 @cindex common field, joining on
6135 @command{join} writes to standard output a line for each pair of input
6136 lines that have identical join fields. Synopsis:
6139 join [@var{option}]@dots{} @var{file1} @var{file2}
6142 Either @var{file1} or @var{file2} (but not both) can be @samp{-},
6143 meaning standard input. @var{file1} and @var{file2} should be
6144 sorted on the join fields.
6147 Normally, the sort order is that of the
6148 collating sequence specified by the @env{LC_COLLATE} locale. Unless
6149 the @option{-t} option is given, the sort comparison ignores blanks at
6150 the start of the join field, as in @code{sort -b}. If the
6151 @option{--ignore-case} option is given, the sort comparison ignores
6152 the case of characters in the join field, as in @code{sort -f}.
6154 The @command{sort} and @command{join} commands should use consistent
6155 locales and options if the output of @command{sort} is fed to
6156 @command{join}. You can use a command like @samp{sort -k 1b,1} to
6157 sort a file on its default join field, but if you select a non-default
6158 locale, join field, separator, or comparison options, then you should
6159 do so consistently between @command{join} and @command{sort}.
6160 If @samp{join -t ''} is specified then the whole line is considered which
6161 matches the default operation of sort.
6163 If the input has no unpairable lines, a GNU extension is
6164 available; the sort order can be any order that considers two fields
6165 to be equal if and only if the sort comparison described above
6166 considers them to be equal. For example:
6184 @checkOrderOption{join}
6189 @item the join field is the first field in each line;
6190 @item fields in the input are separated by one or more blanks, with leading
6191 blanks on the line ignored;
6192 @item fields in the output are separated by a space;
6193 @item each output line consists of the join field, the remaining
6194 fields from @var{file1}, then the remaining fields from @var{file2}.
6197 The program accepts the following options. Also see @ref{Common options}.
6201 @item -a @var{file-number}
6203 Print a line for each unpairable line in file @var{file-number} (either
6204 @samp{1} or @samp{2}), in addition to the normal output.
6207 Fail with an error message if either input file is wrongly ordered.
6209 @item --nocheck-order
6210 Do not check that both input files are in sorted order. This is the default.
6212 @item -e @var{string}
6214 Replace those output fields that are missing in the input with @var{string}.
6215 I.e., missing fields specified with the @option{-12jo} options.
6219 Treat the first line of each input file as a header line. The header lines
6220 will be joined and printed as the first output line. If @option{-o} is used to
6221 specify output format, the header line will be printed according to the
6222 specified format. The header lines will not be checked for ordering even if
6223 @option{--check-order} is specified. Also if the header lines from each file
6224 do not match, the heading fields from the first file will be used.
6227 @itemx --ignore-case
6229 @opindex --ignore-case
6230 Ignore differences in case when comparing keys.
6231 With this option, the lines of the input files must be ordered in the same way.
6232 Use @samp{sort -f} to produce this ordering.
6234 @item -1 @var{field}
6236 Join on field @var{field} (a positive integer) of file 1.
6238 @item -2 @var{field}
6240 Join on field @var{field} (a positive integer) of file 2.
6242 @item -j @var{field}
6243 Equivalent to @option{-1 @var{field} -2 @var{field}}.
6245 @item -o @var{field-list}
6247 If the keyword @samp{auto} is specified, infer the output format from
6248 the first line in each file. This is the same as the default output format
6249 but also ensures the same number of fields are output for each line.
6250 Missing fields are replaced with the @option{-e} option and extra fields
6253 Otherwise, construct each output line according to the format in
6254 @var{field-list}. Each element in @var{field-list} is either the single
6255 character @samp{0} or has the form @var{m.n} where the file number, @var{m},
6256 is @samp{1} or @samp{2} and @var{n} is a positive field number.
6258 A field specification of @samp{0} denotes the join field.
6259 In most cases, the functionality of the @samp{0} field spec
6260 may be reproduced using the explicit @var{m.n} that corresponds
6261 to the join field. However, when printing unpairable lines
6262 (using either of the @option{-a} or @option{-v} options), there is no way
6263 to specify the join field using @var{m.n} in @var{field-list}
6264 if there are unpairable lines in both files.
6265 To give @command{join} that functionality, POSIX invented the @samp{0}
6266 field specification notation.
6268 The elements in @var{field-list}
6269 are separated by commas or blanks.
6270 Blank separators typically need to be quoted for the shell. For
6271 example, the commands @samp{join -o 1.2,2.2} and @samp{join -o '1.2
6272 2.2'} are equivalent.
6274 All output lines---including those printed because of any -a or -v
6275 option---are subject to the specified @var{field-list}.
6278 Use character @var{char} as the input and output field separator.
6279 Treat as significant each occurrence of @var{char} in the input file.
6280 Use @samp{sort -t @var{char}}, without the @option{-b} option of
6281 @samp{sort}, to produce this ordering. If @samp{join -t ''} is specified,
6282 the whole line is considered, matching the default operation of sort.
6283 If @samp{-t '\0'} is specified then the ASCII NUL
6284 character is used to delimit the fields.
6286 @item -v @var{file-number}
6287 Print a line for each unpairable line in file @var{file-number}
6288 (either @samp{1} or @samp{2}), instead of the normal output.
6290 @zeroTerminatedOption
6297 @node Operating on characters
6298 @chapter Operating on characters
6300 @cindex operating on characters
6302 This commands operate on individual characters.
6305 * tr invocation:: Translate, squeeze, and/or delete characters.
6306 * expand invocation:: Convert tabs to spaces.
6307 * unexpand invocation:: Convert spaces to tabs.
6312 @section @command{tr}: Translate, squeeze, and/or delete characters
6319 tr [@var{option}]@dots{} @var{set1} [@var{set2}]
6322 @command{tr} copies standard input to standard output, performing
6323 one of the following operations:
6327 translate, and optionally squeeze repeated characters in the result,
6329 squeeze repeated characters,
6333 delete characters, then squeeze repeated characters from the result.
6336 The @var{set1} and (if given) @var{set2} arguments define ordered
6337 sets of characters, referred to below as @var{set1} and @var{set2}. These
6338 sets are the characters of the input that @command{tr} operates on.
6339 The @option{--complement} (@option{-c}, @option{-C}) option replaces
6341 complement (all of the characters that are not in @var{set1}).
6343 Currently @command{tr} fully supports only single-byte characters.
6344 Eventually it will support multibyte characters; when it does, the
6345 @option{-C} option will cause it to complement the set of characters,
6346 whereas @option{-c} will cause it to complement the set of values.
6347 This distinction will matter only when some values are not characters,
6348 and this is possible only in locales using multibyte encodings when
6349 the input contains encoding errors.
6351 The program accepts the @option{--help} and @option{--version}
6352 options. @xref{Common options}. Options must precede operands.
6357 * Character sets:: Specifying sets of characters.
6358 * Translating:: Changing one set of characters to another.
6359 * Squeezing:: Squeezing repeats and deleting.
6363 @node Character sets
6364 @subsection Specifying sets of characters
6366 @cindex specifying sets of characters
6368 The format of the @var{set1} and @var{set2} arguments resembles
6369 the format of regular expressions; however, they are not regular
6370 expressions, only lists of characters. Most characters simply
6371 represent themselves in these strings, but the strings can contain
6372 the shorthands listed below, for convenience. Some of them can be
6373 used only in @var{set1} or @var{set2}, as noted below.
6377 @item Backslash escapes
6378 @cindex backslash escapes
6380 The following backslash escape sequences are recognized:
6398 The 8-bit character with the value given by @var{ooo}, which is 1 to 3
6399 octal digits. Note that @samp{\400} is interpreted as the two-byte
6400 sequence, @samp{\040} @samp{0}.
6405 While a backslash followed by a character not listed above is
6406 interpreted as that character, the backslash also effectively
6407 removes any special significance, so it is useful to escape
6408 @samp{[}, @samp{]}, @samp{*}, and @samp{-}.
6413 The notation @samp{@var{m}-@var{n}} expands to all of the characters
6414 from @var{m} through @var{n}, in ascending order. @var{m} should
6415 collate before @var{n}; if it doesn't, an error results. As an example,
6416 @samp{0-9} is the same as @samp{0123456789}.
6418 GNU @command{tr} does not support the System V syntax that uses square
6419 brackets to enclose ranges. Translations specified in that format
6420 sometimes work as expected, since the brackets are often transliterated
6421 to themselves. However, they should be avoided because they sometimes
6422 behave unexpectedly. For example, @samp{tr -d '[0-9]'} deletes brackets
6425 Many historically common and even accepted uses of ranges are not
6426 portable. For example, on EBCDIC hosts using the @samp{A-Z}
6427 range will not do what most would expect because @samp{A} through @samp{Z}
6428 are not contiguous as they are in ASCII@.
6429 If you can rely on a POSIX compliant version of @command{tr}, then
6430 the best way to work around this is to use character classes (see below).
6431 Otherwise, it is most portable (and most ugly) to enumerate the members
6434 @item Repeated characters
6435 @cindex repeated characters
6437 The notation @samp{[@var{c}*@var{n}]} in @var{set2} expands to @var{n}
6438 copies of character @var{c}. Thus, @samp{[y*6]} is the same as
6439 @samp{yyyyyy}. The notation @samp{[@var{c}*]} in @var{string2} expands
6440 to as many copies of @var{c} as are needed to make @var{set2} as long as
6441 @var{set1}. If @var{n} begins with @samp{0}, it is interpreted in
6442 octal, otherwise in decimal.
6444 @item Character classes
6445 @cindex character classes
6447 The notation @samp{[:@var{class}:]} expands to all of the characters in
6448 the (predefined) class @var{class}. The characters expand in no
6449 particular order, except for the @code{upper} and @code{lower} classes,
6450 which expand in ascending order. When the @option{--delete} (@option{-d})
6451 and @option{--squeeze-repeats} (@option{-s}) options are both given, any
6452 character class can be used in @var{set2}. Otherwise, only the
6453 character classes @code{lower} and @code{upper} are accepted in
6454 @var{set2}, and then only if the corresponding character class
6455 (@code{upper} and @code{lower}, respectively) is specified in the same
6456 relative position in @var{set1}. Doing this specifies case conversion.
6457 The class names are given below; an error results when an invalid class
6469 Horizontal whitespace.
6478 Printable characters, not including space.
6484 Printable characters, including space.
6487 Punctuation characters.
6490 Horizontal or vertical whitespace.
6499 @item Equivalence classes
6500 @cindex equivalence classes
6502 The syntax @samp{[=@var{c}=]} expands to all of the characters that are
6503 equivalent to @var{c}, in no particular order. Equivalence classes are
6504 a relatively recent invention intended to support non-English alphabets.
6505 But there seems to be no standard way to define them or determine their
6506 contents. Therefore, they are not fully implemented in GNU @command{tr};
6507 each character's equivalence class consists only of that character,
6508 which is of no particular use.
6514 @subsection Translating
6516 @cindex translating characters
6518 @command{tr} performs translation when @var{set1} and @var{set2} are
6519 both given and the @option{--delete} (@option{-d}) option is not given.
6520 @command{tr} translates each character of its input that is in @var{set1}
6521 to the corresponding character in @var{set2}. Characters not in
6522 @var{set1} are passed through unchanged. When a character appears more
6523 than once in @var{set1} and the corresponding characters in @var{set2}
6524 are not all the same, only the final one is used. For example, these
6525 two commands are equivalent:
6532 A common use of @command{tr} is to convert lowercase characters to
6533 uppercase. This can be done in many ways. Here are three of them:
6536 tr abcdefghijklmnopqrstuvwxyz ABCDEFGHIJKLMNOPQRSTUVWXYZ
6538 tr '[:lower:]' '[:upper:]'
6542 But note that using ranges like @code{a-z} above is not portable.
6544 When @command{tr} is performing translation, @var{set1} and @var{set2}
6545 typically have the same length. If @var{set1} is shorter than
6546 @var{set2}, the extra characters at the end of @var{set2} are ignored.
6548 On the other hand, making @var{set1} longer than @var{set2} is not
6549 portable; POSIX says that the result is undefined. In this situation,
6550 BSD @command{tr} pads @var{set2} to the length of @var{set1} by repeating
6551 the last character of @var{set2} as many times as necessary. System V
6552 @command{tr} truncates @var{set1} to the length of @var{set2}.
6554 By default, GNU @command{tr} handles this case like BSD @command{tr}.
6555 When the @option{--truncate-set1} (@option{-t}) option is given,
6556 GNU @command{tr} handles this case like the System V @command{tr}
6557 instead. This option is ignored for operations other than translation.
6559 Acting like System V @command{tr} in this case breaks the relatively common
6563 tr -cs A-Za-z0-9 '\012'
6567 because it converts only zero bytes (the first element in the
6568 complement of @var{set1}), rather than all non-alphanumerics, to
6572 By the way, the above idiom is not portable because it uses ranges, and
6573 it assumes that the octal code for newline is 012.
6574 Assuming a POSIX compliant @command{tr}, here is a better
6578 tr -cs '[:alnum:]' '[\n*]'
6583 @subsection Squeezing repeats and deleting
6585 @cindex squeezing repeat characters
6586 @cindex deleting characters
6588 When given just the @option{--delete} (@option{-d}) option, @command{tr}
6589 removes any input characters that are in @var{set1}.
6591 When given just the @option{--squeeze-repeats} (@option{-s}) option,
6592 @command{tr} replaces each input sequence of a repeated character that
6593 is in @var{set1} with a single occurrence of that character.
6595 When given both @option{--delete} and @option{--squeeze-repeats}, @command{tr}
6596 first performs any deletions using @var{set1}, then squeezes repeats
6597 from any remaining characters using @var{set2}.
6599 The @option{--squeeze-repeats} option may also be used when translating,
6600 in which case @command{tr} first performs translation, then squeezes
6601 repeats from any remaining characters using @var{set2}.
6603 Here are some examples to illustrate various combinations of options:
6608 Remove all zero bytes:
6615 Put all words on lines by themselves. This converts all
6616 non-alphanumeric characters to newlines, then squeezes each string
6617 of repeated newlines into a single newline:
6620 tr -cs '[:alnum:]' '[\n*]'
6624 Convert each sequence of repeated newlines to a single newline:
6631 Find doubled occurrences of words in a document.
6632 @c Separate the following two "the"s, so typo checkers don't complain.
6633 For example, people often write ``the @w{}the'' with the repeated words
6634 separated by a newline. The Bourne shell script below works first
6635 by converting each sequence of punctuation and blank characters to a
6636 single newline. That puts each ``word'' on a line by itself.
6637 Next it maps all uppercase characters to lower case, and finally it
6638 runs @command{uniq} with the @option{-d} option to print out only the words
6644 | tr -s '[:punct:][:blank:]' '[\n*]' \
6645 | tr '[:upper:]' '[:lower:]' \
6650 Deleting a small set of characters is usually straightforward. For example,
6651 to remove all @samp{a}s, @samp{x}s, and @samp{M}s you would do this:
6657 However, when @samp{-} is one of those characters, it can be tricky because
6658 @samp{-} has special meanings. Performing the same task as above but also
6659 removing all @samp{-} characters, we might try @code{tr -d -axM}, but
6660 that would fail because @command{tr} would try to interpret @option{-a} as
6661 a command-line option. Alternatively, we could try putting the hyphen
6662 inside the string, @code{tr -d a-xM}, but that wouldn't work either because
6663 it would make @command{tr} interpret @code{a-x} as the range of characters
6664 @samp{a}@dots{}@samp{x} rather than the three.
6665 One way to solve the problem is to put the hyphen at the end of the list
6672 Or you can use @samp{--} to terminate option processing:
6678 More generally, use the character class notation @code{[=c=]}
6679 with @samp{-} (or any other character) in place of the @samp{c}:
6685 Note how single quotes are used in the above example to protect the
6686 square brackets from interpretation by a shell.
6691 @node expand invocation
6692 @section @command{expand}: Convert tabs to spaces
6695 @cindex tabs to spaces, converting
6696 @cindex converting tabs to spaces
6698 @command{expand} writes the contents of each given @var{file}, or standard
6699 input if none are given or for a @var{file} of @samp{-}, to standard
6700 output, with tab characters converted to the appropriate number of
6704 expand [@var{option}]@dots{} [@var{file}]@dots{}
6707 By default, @command{expand} converts all tabs to spaces. It preserves
6708 backspace characters in the output; they decrement the column count for
6709 tab calculations. The default action is equivalent to @option{-t 8} (set
6710 tabs every 8 columns).
6712 The program accepts the following options. Also see @ref{Common options}.
6716 @item -t @var{tab1}[,@var{tab2}]@dots{}
6717 @itemx --tabs=@var{tab1}[,@var{tab2}]@dots{}
6720 @cindex tab stops, setting
6721 If only one tab stop is given, set the tabs @var{tab1} spaces apart
6722 (default is 8). Otherwise, set the tabs at columns @var{tab1},
6723 @var{tab2}, @dots{} (numbered from 0), and replace any tabs beyond the
6724 last tab stop given with single spaces. Tab stops can be separated by
6725 blanks as well as by commas.
6727 For compatibility, GNU @command{expand} also accepts the obsolete
6728 option syntax, @option{-@var{t1}[,@var{t2}]@dots{}}. New scripts
6729 should use @option{-t @var{t1}[,@var{t2}]@dots{}} instead.
6735 @cindex initial tabs, converting
6736 Only convert initial tabs (those that precede all non-space or non-tab
6737 characters) on each line to spaces.
6744 @node unexpand invocation
6745 @section @command{unexpand}: Convert spaces to tabs
6749 @command{unexpand} writes the contents of each given @var{file}, or
6750 standard input if none are given or for a @var{file} of @samp{-}, to
6751 standard output, converting blanks at the beginning of each line into
6752 as many tab characters as needed. In the default POSIX
6753 locale, a @dfn{blank} is a space or a tab; other locales may specify
6754 additional blank characters. Synopsis:
6757 unexpand [@var{option}]@dots{} [@var{file}]@dots{}
6760 By default, @command{unexpand} converts only initial blanks (those
6761 that precede all non-blank characters) on each line. It
6762 preserves backspace characters in the output; they decrement the column
6763 count for tab calculations. By default, tabs are set at every 8th
6766 The program accepts the following options. Also see @ref{Common options}.
6770 @item -t @var{tab1}[,@var{tab2}]@dots{}
6771 @itemx --tabs=@var{tab1}[,@var{tab2}]@dots{}
6774 If only one tab stop is given, set the tabs @var{tab1} columns apart
6775 instead of the default 8. Otherwise, set the tabs at columns
6776 @var{tab1}, @var{tab2}, @dots{} (numbered from 0), and leave blanks
6777 beyond the tab stops given unchanged. Tab stops can be separated by
6778 blanks as well as by commas. This option implies the @option{-a} option.
6780 For compatibility, GNU @command{unexpand} supports the obsolete option syntax,
6781 @option{-@var{tab1}[,@var{tab2}]@dots{}}, where tab stops must be
6782 separated by commas. (Unlike @option{-t}, this obsolete option does
6783 not imply @option{-a}.) New scripts should use @option{--first-only -t
6784 @var{tab1}[,@var{tab2}]@dots{}} instead.
6790 Also convert all sequences of two or more blanks just before a tab stop,
6791 even if they occur after non-blank characters in a line.
6798 @node Directory listing
6799 @chapter Directory listing
6801 This chapter describes the @command{ls} command and its variants @command{dir}
6802 and @command{vdir}, which list information about files.
6805 * ls invocation:: List directory contents.
6806 * dir invocation:: Briefly ls.
6807 * vdir invocation:: Verbosely ls.
6808 * dircolors invocation:: Color setup for ls, etc.
6813 @section @command{ls}: List directory contents
6816 @cindex directory listing
6818 The @command{ls} program lists information about files (of any type,
6819 including directories). Options and file arguments can be intermixed
6820 arbitrarily, as usual.
6822 For non-option command-line arguments that are directories, by default
6823 @command{ls} lists the contents of directories, not recursively, and
6824 omitting files with names beginning with @samp{.}. For other non-option
6825 arguments, by default @command{ls} lists just the file name. If no
6826 non-option argument is specified, @command{ls} operates on the current
6827 directory, acting as if it had been invoked with a single argument of @samp{.}.
6830 By default, the output is sorted alphabetically, according to the locale
6831 settings in effect.@footnote{If you use a non-POSIX
6832 locale (e.g., by setting @env{LC_ALL} to @samp{en_US}), then @command{ls} may
6833 produce output that is sorted differently than you're accustomed to.
6834 In that case, set the @env{LC_ALL} environment variable to @samp{C}.}
6835 If standard output is
6836 a terminal, the output is in columns (sorted vertically) and control
6837 characters are output as question marks; otherwise, the output is listed
6838 one per line and control characters are output as-is.
6840 Because @command{ls} is such a fundamental program, it has accumulated many
6841 options over the years. They are described in the subsections below;
6842 within each section, options are listed alphabetically (ignoring case).
6843 The division of options into the subsections is not absolute, since some
6844 options affect more than one aspect of @command{ls}'s operation.
6846 @cindex exit status of @command{ls}
6851 1 minor problems (e.g., failure to access a file or directory not
6852 specified as a command line argument. This happens when listing a
6853 directory in which entries are actively being removed or renamed.)
6854 2 serious trouble (e.g., memory exhausted, invalid option, failure
6855 to access a file or directory specified as a command line argument
6856 or a directory loop)
6859 Also see @ref{Common options}.
6862 * Which files are listed::
6863 * What information is listed::
6864 * Sorting the output::
6865 * Details about version sort::
6866 * General output formatting::
6867 * Formatting file timestamps::
6868 * Formatting the file names::
6872 @node Which files are listed
6873 @subsection Which files are listed
6875 These options determine which files @command{ls} lists information for.
6876 By default, @command{ls} lists files and the contents of any
6877 directories on the command line, except that in directories it ignores
6878 files whose names start with @samp{.}.
6886 In directories, do not ignore file names that start with @samp{.}.
6891 @opindex --almost-all
6892 In directories, do not ignore all file names that start with @samp{.};
6893 ignore only @file{.} and @file{..}. The @option{--all} (@option{-a})
6894 option overrides this option.
6897 @itemx --ignore-backups
6899 @opindex --ignore-backups
6900 @cindex backup files, ignoring
6901 In directories, ignore files that end with @samp{~}. This option is
6902 equivalent to @samp{--ignore='*~' --ignore='.*~'}.
6907 @opindex --directory
6908 List just the names of directories, as with other types of files, rather
6909 than listing their contents.
6910 @c The following sentence is the same as the one for -F.
6911 Do not follow symbolic links listed on the
6912 command line unless the @option{--dereference-command-line} (@option{-H}),
6913 @option{--dereference} (@option{-L}), or
6914 @option{--dereference-command-line-symlink-to-dir} options are specified.
6917 @itemx --dereference-command-line
6919 @opindex --dereference-command-line
6920 @cindex symbolic links, dereferencing
6921 If a command line argument specifies a symbolic link, show information
6922 for the file the link references rather than for the link itself.
6924 @item --dereference-command-line-symlink-to-dir
6925 @opindex --dereference-command-line-symlink-to-dir
6926 @cindex symbolic links, dereferencing
6927 Do not dereference symbolic links, with one exception:
6928 if a command line argument specifies a symbolic link that refers to
6929 a directory, show information for that directory rather than for the
6931 This is the default behavior when no other dereferencing-related
6932 option has been specified (@option{--classify} (@option{-F}),
6933 @option{--directory} (@option{-d}),
6935 @option{--dereference} (@option{-L}), or
6936 @option{--dereference-command-line} (@option{-H})).
6938 @item --group-directories-first
6939 @opindex --group-directories-first
6940 Group all the directories before the files and then sort the
6941 directories and the files separately using the selected sort key
6942 (see --sort option).
6943 That is, this option specifies a primary sort key,
6944 and the --sort option specifies a secondary key.
6945 However, any use of @option{--sort=none}
6946 (@option{-U}) disables this option altogether.
6948 @item --hide=PATTERN
6949 @opindex --hide=@var{pattern}
6950 In directories, ignore files whose names match the shell pattern
6951 @var{pattern}, unless the @option{--all} (@option{-a}) or
6952 @option{--almost-all} (@option{-A}) is also given. This
6953 option acts like @option{--ignore=@var{pattern}} except that it has no
6954 effect if @option{--all} (@option{-a}) or @option{--almost-all}
6955 (@option{-A}) is also given.
6957 This option can be useful in shell aliases. For example, if
6958 @command{lx} is an alias for @samp{ls --hide='*~'} and @command{ly} is
6959 an alias for @samp{ls --ignore='*~'}, then the command @samp{lx -A}
6960 lists the file @file{README~} even though @samp{ly -A} would not.
6962 @item -I @var{pattern}
6963 @itemx --ignore=@var{pattern}
6965 @opindex --ignore=@var{pattern}
6966 In directories, ignore files whose names match the shell pattern
6967 (not regular expression) @var{pattern}. As
6968 in the shell, an initial @samp{.} in a file name does not match a
6969 wildcard at the start of @var{pattern}. Sometimes it is useful
6970 to give this option several times. For example,
6973 $ ls --ignore='.??*' --ignore='.[^.]' --ignore='#*'
6976 The first option ignores names of length 3 or more that start with @samp{.},
6977 the second ignores all two-character names that start with @samp{.}
6978 except @samp{..}, and the third ignores names that start with @samp{#}.
6981 @itemx --dereference
6983 @opindex --dereference
6984 @cindex symbolic links, dereferencing
6985 When showing file information for a symbolic link, show information
6986 for the file the link references rather than the link itself.
6987 However, even with this option, @command{ls} still prints the name
6988 of the link itself, not the name of the file that the link points to.
6993 @opindex --recursive
6994 @cindex recursive directory listing
6995 @cindex directory listing, recursive
6996 List the contents of all directories recursively.
7001 @node What information is listed
7002 @subsection What information is listed
7004 These options affect the information that @command{ls} displays. By
7005 default, only file names are shown.
7011 @cindex hurd, author, printing
7012 List each file's author when producing long format directory listings.
7013 In GNU/Hurd, file authors can differ from their owners, but in other
7014 operating systems the two are the same.
7020 @cindex dired Emacs mode support
7021 With the long listing (@option{-l}) format, print an additional line after
7025 //DIRED// @var{beg1} @var{end1} @var{beg2} @var{end2} @dots{}
7029 The @var{begn} and @var{endn} are unsigned integers that record the
7030 byte position of the beginning and end of each file name in the output.
7031 This makes it easy for Emacs to find the names, even when they contain
7032 unusual characters such as space or newline, without fancy searching.
7034 If directories are being listed recursively (@option{-R}), output a similar
7035 line with offsets for each subdirectory name:
7038 //SUBDIRED// @var{beg1} @var{end1} @dots{}
7041 Finally, output a line of the form:
7044 //DIRED-OPTIONS// --quoting-style=@var{word}
7048 where @var{word} is the quoting style (@pxref{Formatting the file names}).
7050 Here is an actual example:
7053 $ mkdir -p a/sub/deeper a/sub2
7055 $ touch a/sub/deeper/file
7056 $ ls -gloRF --dired a
7059 -rw-r--r-- 1 0 Jun 10 12:27 f1
7060 -rw-r--r-- 1 0 Jun 10 12:27 f2
7061 drwxr-xr-x 3 4096 Jun 10 12:27 sub/
7062 drwxr-xr-x 2 4096 Jun 10 12:27 sub2/
7066 drwxr-xr-x 2 4096 Jun 10 12:27 deeper/
7070 -rw-r--r-- 1 0 Jun 10 12:27 file
7074 //DIRED// 48 50 84 86 120 123 158 162 217 223 282 286
7075 //SUBDIRED// 2 3 167 172 228 240 290 296
7076 //DIRED-OPTIONS// --quoting-style=literal
7079 Note that the pairs of offsets on the @samp{//DIRED//} line above delimit
7080 these names: @file{f1}, @file{f2}, @file{sub}, @file{sub2}, @file{deeper},
7082 The offsets on the @samp{//SUBDIRED//} line delimit the following
7083 directory names: @file{a}, @file{a/sub}, @file{a/sub/deeper}, @file{a/sub2}.
7085 Here is an example of how to extract the fifth entry name, @samp{deeper},
7086 corresponding to the pair of offsets, 222 and 228:
7089 $ ls -gloRF --dired a > out
7090 $ dd bs=1 skip=222 count=6 < out 2>/dev/null; echo
7094 Note that although the listing above includes a trailing slash
7095 for the @samp{deeper} entry, the offsets select the name without
7096 the trailing slash. However, if you invoke @command{ls} with @option{--dired}
7097 along with an option like @option{--escape} (aka @option{-b}) and operate
7098 on a file whose name contains special characters, notice that the backslash
7103 $ ls -blog --dired 'a b'
7104 -rw-r--r-- 1 0 Jun 10 12:28 a\ b
7106 //DIRED-OPTIONS// --quoting-style=escape
7109 If you use a quoting style that adds quote marks
7110 (e.g., @option{--quoting-style=c}), then the offsets include the quote marks.
7111 So beware that the user may select the quoting style via the environment
7112 variable @env{QUOTING_STYLE}@. Hence, applications using @option{--dired}
7113 should either specify an explicit @option{--quoting-style=literal} option
7114 (aka @option{-N} or @option{--literal}) on the command line, or else be
7115 prepared to parse the escaped names.
7118 @opindex --full-time
7119 Produce long format directory listings, and list times in full. It is
7120 equivalent to using @option{--format=long} with
7121 @option{--time-style=full-iso} (@pxref{Formatting file timestamps}).
7125 Produce long format directory listings, but don't display owner information.
7131 Inhibit display of group information in a long format directory listing.
7132 (This is the default in some non-GNU versions of @command{ls}, so we
7133 provide this option for compatibility.)
7141 @cindex inode number, printing
7142 Print the inode number (also called the file serial number and index
7143 number) of each file to the left of the file name. (This number
7144 uniquely identifies each file within a particular file system.)
7147 @itemx --format=long
7148 @itemx --format=verbose
7151 @opindex long ls @r{format}
7152 @opindex verbose ls @r{format}
7153 In addition to the name of each file, print the file type, file mode bits,
7154 number of hard links, owner name, group name, size, and
7155 timestamp (@pxref{Formatting file timestamps}), normally
7156 the modification time. Print question marks for information that
7157 cannot be determined.
7159 Normally the size is printed as a byte count without punctuation, but
7160 this can be overridden (@pxref{Block size}). For example, @option{-h}
7161 prints an abbreviated, human-readable count, and
7162 @samp{--block-size="'1"} prints a byte count with the thousands
7163 separator of the current locale.
7165 For each directory that is listed, preface the files with a line
7166 @samp{total @var{blocks}}, where @var{blocks} is the total disk allocation
7167 for all files in that directory. The block size currently defaults to 1024
7168 bytes, but this can be overridden (@pxref{Block size}).
7169 The @var{blocks} computed counts each hard link separately;
7170 this is arguably a deficiency.
7172 The file type is one of the following characters:
7174 @c The commented-out entries are ones we're not sure about.
7182 character special file
7184 high performance (``contiguous data'') file
7188 door (Solaris 2.5 and up)
7190 @c semaphore, if this is a distinct file type
7194 @c multiplexed file (7th edition Unix; obsolete)
7196 off-line (``migrated'') file (Cray DMF)
7198 network special file (HP-UX)
7202 port (Solaris 10 and up)
7204 @c message queue, if this is a distinct file type
7208 @c shared memory object, if this is a distinct file type
7210 @c typed memory object, if this is a distinct file type
7212 @c whiteout (4.4BSD; not implemented)
7214 some other file type
7217 @cindex permissions, output by @command{ls}
7218 The file mode bits listed are similar to symbolic mode specifications
7219 (@pxref{Symbolic Modes}). But @command{ls} combines multiple bits into the
7220 third character of each set of permissions as follows:
7224 If the set-user-ID or set-group-ID bit and the corresponding executable bit
7228 If the set-user-ID or set-group-ID bit is set but the corresponding
7229 executable bit is not set.
7232 If the restricted deletion flag or sticky bit, and the
7233 other-executable bit, are both set. The restricted deletion flag is
7234 another name for the sticky bit. @xref{Mode Structure}.
7237 If the restricted deletion flag or sticky bit is set but the
7238 other-executable bit is not set.
7241 If the executable bit is set and none of the above apply.
7247 Following the file mode bits is a single character that specifies
7248 whether an alternate access method such as an access control list
7249 applies to the file. When the character following the file mode bits is a
7250 space, there is no alternate access method. When it is a printing
7251 character, then there is such a method.
7253 GNU @command{ls} uses a @samp{.} character to indicate a file
7254 with an SELinux security context, but no other alternate access method.
7256 A file with any other combination of alternate access methods
7257 is marked with a @samp{+} character.
7260 @itemx --numeric-uid-gid
7262 @opindex --numeric-uid-gid
7263 @cindex numeric uid and gid
7264 @cindex numeric user and group IDs
7265 Produce long format directory listings, but
7266 display numeric user and group IDs instead of the owner and group names.
7270 Produce long format directory listings, but don't display group information.
7271 It is equivalent to using @option{--format=long} with @option{--no-group} .
7277 @cindex disk allocation
7278 @cindex size of files, reporting
7279 Print the disk allocation of each file to the left of the file name.
7280 This is the amount of disk space used by the file, which is usually a
7281 bit more than the file's size, but it can be less if the file has holes.
7283 Normally the disk allocation is printed in units of
7284 1024 bytes, but this can be overridden (@pxref{Block size}).
7286 @cindex NFS mounts from BSD to HP-UX
7287 For files that are NFS-mounted from an HP-UX system to a BSD system,
7288 this option reports sizes that are half the correct values. On HP-UX
7289 systems, it reports sizes that are twice the correct values for files
7290 that are NFS-mounted from BSD systems. This is due to a flaw in HP-UX;
7291 it also affects the HP-UX @command{ls} program.
7300 @cindex security context
7301 Display the SELinux security context or @samp{?} if none is found.
7302 When used with the @option{-l} option, print the security context
7303 to the left of the size column.
7308 @node Sorting the output
7309 @subsection Sorting the output
7311 @cindex sorting @command{ls} output
7312 These options change the order in which @command{ls} sorts the information
7313 it outputs. By default, sorting is done by character code
7314 (e.g., ASCII order).
7320 @itemx --time=status
7323 @opindex ctime@r{, printing or sorting by}
7324 @opindex status time@r{, printing or sorting by}
7325 @opindex use time@r{, printing or sorting files by}
7326 If the long listing format (e.g., @option{-l}, @option{-o}) is being used,
7327 print the status change time (the @samp{ctime} in the inode) instead of
7328 the modification time.
7329 When explicitly sorting by time (@option{--sort=time} or @option{-t})
7330 or when not using a long listing format,
7331 sort according to the status change time.
7335 @cindex unsorted directory listing
7336 @cindex directory order, listing by
7337 Primarily, like @option{-U}---do not sort; list the files in whatever
7338 order they are stored in the directory. But also enable @option{-a} (list
7339 all files) and disable @option{-l}, @option{--color}, and @option{-s} (if they
7340 were specified before the @option{-f}).
7346 @cindex reverse sorting
7347 Reverse whatever the sorting method is---e.g., list files in reverse
7348 alphabetical order, youngest first, smallest first, or whatever.
7354 @opindex size of files@r{, sorting files by}
7355 Sort by file size, largest first.
7361 @opindex modification time@r{, sorting files by}
7362 Sort by modification time (the @samp{mtime} in the inode), newest first.
7366 @itemx --time=access
7370 @opindex use time@r{, printing or sorting files by}
7371 @opindex atime@r{, printing or sorting files by}
7372 @opindex access time@r{, printing or sorting files by}
7373 If the long listing format (e.g., @option{--format=long}) is being used,
7374 print the last access time (the @samp{atime} in the inode).
7375 When explicitly sorting by time (@option{--sort=time} or @option{-t})
7376 or when not using a long listing format, sort according to the access time.
7382 @opindex none@r{, sorting option for @command{ls}}
7383 Do not sort; list the files in whatever order they are
7384 stored in the directory. (Do not do any of the other unrelated things
7385 that @option{-f} does.) This is especially useful when listing very large
7386 directories, since not doing any sorting can be noticeably faster.
7389 @itemx --sort=version
7392 @opindex version@r{, sorting option for @command{ls}}
7393 Sort by version name and number, lowest first. It behaves like a default
7394 sort, except that each sequence of decimal digits is treated numerically
7395 as an index/version number. (@xref{Details about version sort}.)
7398 @itemx --sort=extension
7401 @opindex extension@r{, sorting files by}
7402 Sort directory contents alphabetically by file extension (characters
7403 after the last @samp{.}); files with no extension are sorted first.
7408 @node Details about version sort
7409 @subsection Details about version sort
7411 Version sorting handles the fact that file names frequently include indices or
7412 version numbers. Standard sorting usually does not produce the order that one
7413 expects because comparisons are made on a character-by-character basis.
7414 Version sorting is especially useful when browsing directories that contain
7415 many files with indices/version numbers in their names:
7419 abc.zml-1.gz abc.zml-1.gz
7420 abc.zml-12.gz abc.zml-2.gz
7421 abc.zml-2.gz abc.zml-12.gz
7424 Version-sorted strings are compared such that if @var{ver1} and @var{ver2}
7425 are version numbers and @var{prefix} and @var{suffix} (@var{suffix} matching
7426 the regular expression @samp{(\.[A-Za-z~][A-Za-z0-9~]*)*}) are strings then
7427 @var{ver1} < @var{ver2} implies that the name composed of
7428 ``@var{prefix} @var{ver1} @var{suffix}'' sorts before
7429 ``@var{prefix} @var{ver2} @var{suffix}''.
7431 Note also that leading zeros of numeric parts are ignored:
7435 abc-1.007.tgz abc-1.01a.tgz
7436 abc-1.012b.tgz abc-1.007.tgz
7437 abc-1.01a.tgz abc-1.012b.tgz
7440 This functionality is implemented using gnulib's @code{filevercmp} function,
7441 which has some caveats worth noting.
7444 @item @env{LC_COLLATE} is ignored, which means @samp{ls -v} and @samp{sort -V}
7445 will sort non-numeric prefixes as if the @env{LC_COLLATE} locale category
7446 was set to @samp{C}@.
7447 @item Some suffixes will not be matched by the regular
7448 expression mentioned above. Consequently these examples may
7449 not sort as you expect:
7457 abc-1.2.3.4.x86_64.rpm
7458 abc-1.2.3.x86_64.rpm
7462 @node General output formatting
7463 @subsection General output formatting
7465 These options affect the appearance of the overall output.
7470 @itemx --format=single-column
7473 @opindex single-column @r{output of files}
7474 List one file per line. This is the default for @command{ls} when standard
7475 output is not a terminal.
7478 @itemx --format=vertical
7481 @opindex vertical @r{sorted files in columns}
7482 List files in columns, sorted vertically. This is the default for
7483 @command{ls} if standard output is a terminal. It is always the default
7484 for the @command{dir} program.
7485 GNU @command{ls} uses variable width columns to display as many files as
7486 possible in the fewest lines.
7488 @item --color [=@var{when}]
7490 @cindex color, distinguishing file types with
7491 Specify whether to use color for distinguishing file types. @var{when}
7492 may be omitted, or one of:
7495 @vindex none @r{color option}
7496 - Do not use color at all. This is the default.
7498 @vindex auto @r{color option}
7499 @cindex terminal, using color iff
7500 - Only use color if standard output is a terminal.
7502 @vindex always @r{color option}
7505 Specifying @option{--color} and no @var{when} is equivalent to
7506 @option{--color=always}.
7507 Piping a colorized listing through a pager like @command{more} or
7508 @command{less} usually produces unreadable results. However, using
7509 @code{more -f} does seem to work.
7512 @vindex SHELL @r{environment variable, and color}
7513 Note that using the @option{--color} option may incur a noticeable
7514 performance penalty when run in a directory with very many entries,
7515 because the default settings require that @command{ls} @code{stat} every
7516 single file it lists.
7517 However, if you would like most of the file-type coloring
7518 but can live without the other coloring options (e.g.,
7519 executable, orphan, sticky, other-writable, capability), use
7520 @command{dircolors} to set the @env{LS_COLORS} environment variable like this,
7522 eval $(dircolors -p | perl -pe \
7523 's/^((CAP|S[ET]|O[TR]|M|E)\w+).*/$1 00/' | dircolors -)
7525 and on a @code{dirent.d_type}-capable file system, @command{ls}
7526 will perform only one @code{stat} call per command line argument.
7530 @itemx --indicator-style=classify
7533 @opindex --indicator-style
7534 @cindex file type and executables, marking
7535 @cindex executables and file type, marking
7536 Append a character to each file name indicating the file type. Also,
7537 for regular files that are executable, append @samp{*}. The file type
7538 indicators are @samp{/} for directories, @samp{@@} for symbolic links,
7539 @samp{|} for FIFOs, @samp{=} for sockets, @samp{>} for doors,
7540 and nothing for regular files.
7541 @c The following sentence is the same as the one for -d.
7542 Do not follow symbolic links listed on the
7543 command line unless the @option{--dereference-command-line} (@option{-H}),
7544 @option{--dereference} (@option{-L}), or
7545 @option{--dereference-command-line-symlink-to-dir} options are specified.
7548 @itemx --indicator-style=file-type
7549 @opindex --file-type
7550 @opindex --indicator-style
7551 @cindex file type, marking
7552 Append a character to each file name indicating the file type. This is
7553 like @option{-F}, except that executables are not marked.
7555 @item --indicator-style=@var{word}
7556 @opindex --indicator-style
7557 Append a character indicator with style @var{word} to entry names,
7562 Do not append any character indicator; this is the default.
7564 Append @samp{/} for directories. This is the same as the @option{-p}
7567 Append @samp{/} for directories, @samp{@@} for symbolic links, @samp{|}
7568 for FIFOs, @samp{=} for sockets, and nothing for regular files. This is
7569 the same as the @option{--file-type} option.
7571 Append @samp{*} for executable regular files, otherwise behave as for
7572 @samp{file-type}. This is the same as the @option{-F} or
7573 @option{--classify} option.
7579 @opindex --kibibytes
7580 Set the default block size to its normal value of 1024 bytes,
7581 overriding any contrary specification in environment variables
7582 (@pxref{Block size}). This option is in turn overridden by the
7583 @option{--block-size}, @option{-h} or @option{--human-readable}, and
7584 @option{--si} options.
7586 The @option{-k} or @option{--kibibytes} option affects the
7587 per-directory block count written by the @option{-l} and similar
7588 options, and the size written by the @option{-s} or @option{--size}
7589 option. It does not affect the file size written by @option{-l}.
7592 @itemx --format=commas
7595 @opindex commas@r{, outputting between files}
7596 List files horizontally, with as many as will fit on each line,
7597 separated by @samp{, } (a comma and a space).
7600 @itemx --indicator-style=slash
7602 @opindex --indicator-style
7603 @cindex file type, marking
7604 Append a @samp{/} to directory names.
7607 @itemx --format=across
7608 @itemx --format=horizontal
7611 @opindex across@r{, listing files}
7612 @opindex horizontal@r{, listing files}
7613 List the files in columns, sorted horizontally.
7616 @itemx --tabsize=@var{cols}
7619 Assume that each tab stop is @var{cols} columns wide. The default is 8.
7620 @command{ls} uses tabs where possible in the output, for efficiency. If
7621 @var{cols} is zero, do not use tabs at all.
7623 @c FIXME: remove in 2009, if Apple Terminal has been fixed for long enough.
7624 Some terminal emulators (at least Apple Terminal 1.5 (133) from Mac OS X 10.4.8)
7625 do not properly align columns to the right of a TAB following a
7626 non-ASCII byte. If you use such a terminal emulator, use the
7627 @option{-T0} option or put @code{TABSIZE=0} in your environment to tell
7628 @command{ls} to align using spaces, not tabs.
7631 @itemx --width=@var{cols}
7635 Assume the screen is @var{cols} columns wide. The default is taken
7636 from the terminal settings if possible; otherwise the environment
7637 variable @env{COLUMNS} is used if it is set; otherwise the default
7643 @node Formatting file timestamps
7644 @subsection Formatting file timestamps
7646 By default, file timestamps are listed in abbreviated form, using
7647 a date like @samp{Mar 30@ @ 2002} for non-recent timestamps, and a
7648 date-without-year and time like @samp{Mar 30 23:45} for recent timestamps.
7649 This format can change depending on the current locale as detailed below.
7652 A timestamp is considered to be @dfn{recent} if it is less than six
7653 months old, and is not dated in the future. If a timestamp dated
7654 today is not listed in recent form, the timestamp is in the future,
7655 which means you probably have clock skew problems which may break
7656 programs like @command{make} that rely on file timestamps.
7659 Time stamps are listed according to the time zone rules specified by
7660 the @env{TZ} environment variable, or by the system default rules if
7661 @env{TZ} is not set. @xref{TZ Variable,, Specifying the Time Zone
7662 with @env{TZ}, libc, The GNU C Library Reference Manual}.
7664 The following option changes how file timestamps are printed.
7667 @item --time-style=@var{style}
7668 @opindex --time-style
7670 List timestamps in style @var{style}. The @var{style} should
7671 be one of the following:
7676 List timestamps using @var{format}, where @var{format} is interpreted
7677 like the format argument of @command{date} (@pxref{date invocation}).
7678 For example, @option{--time-style="+%Y-%m-%d %H:%M:%S"} causes
7679 @command{ls} to list timestamps like @samp{2002-03-30 23:45:56}. As
7680 with @command{date}, @var{format}'s interpretation is affected by the
7681 @env{LC_TIME} locale category.
7683 If @var{format} contains two format strings separated by a newline,
7684 the former is used for non-recent files and the latter for recent
7685 files; if you want output columns to line up, you may need to insert
7686 spaces in one of the two formats.
7689 List timestamps in full using ISO 8601 date, time, and time zone
7690 format with nanosecond precision, e.g., @samp{2002-03-30
7691 23:45:56.477817180 -0700}. This style is equivalent to
7692 @samp{+%Y-%m-%d %H:%M:%S.%N %z}.
7694 This is useful because the time output includes all the information that
7695 is available from the operating system. For example, this can help
7696 explain @command{make}'s behavior, since GNU @command{make}
7697 uses the full timestamp to determine whether a file is out of date.
7700 List ISO 8601 date and time in minutes, e.g.,
7701 @samp{2002-03-30 23:45}. These timestamps are shorter than
7702 @samp{full-iso} timestamps, and are usually good enough for everyday
7703 work. This style is equivalent to @samp{+%Y-%m-%d %H:%M}.
7706 List ISO 8601 dates for non-recent timestamps (e.g.,
7707 @samp{2002-03-30@ }), and ISO 8601 month, day, hour, and
7708 minute for recent timestamps (e.g., @samp{03-30 23:45}). These
7709 timestamps are uglier than @samp{long-iso} timestamps, but they carry
7710 nearly the same information in a smaller space and their brevity helps
7711 @command{ls} output fit within traditional 80-column output lines.
7712 The following two @command{ls} invocations are equivalent:
7717 ls -l --time-style="+%Y-%m-%d $newline%m-%d %H:%M"
7718 ls -l --time-style="iso"
7723 List timestamps in a locale-dependent form. For example, a Finnish
7724 locale might list non-recent timestamps like @samp{maalis 30@ @ 2002}
7725 and recent timestamps like @samp{maalis 30 23:45}. Locale-dependent
7726 timestamps typically consume more space than @samp{iso} timestamps and
7727 are harder for programs to parse because locale conventions vary so
7728 widely, but they are easier for many people to read.
7730 The @env{LC_TIME} locale category specifies the timestamp format. The
7731 default POSIX locale uses timestamps like @samp{Mar 30@
7732 @ 2002} and @samp{Mar 30 23:45}; in this locale, the following two
7733 @command{ls} invocations are equivalent:
7738 ls -l --time-style="+%b %e %Y$newline%b %e %H:%M"
7739 ls -l --time-style="locale"
7742 Other locales behave differently. For example, in a German locale,
7743 @option{--time-style="locale"} might be equivalent to
7744 @option{--time-style="+%e. %b %Y $newline%e. %b %H:%M"}
7745 and might generate timestamps like @samp{30. M@"ar 2002@ } and
7746 @samp{30. M@"ar 23:45}.
7748 @item posix-@var{style}
7750 List POSIX-locale timestamps if the @env{LC_TIME} locale
7751 category is POSIX, @var{style} timestamps otherwise. For
7752 example, the @samp{posix-long-iso} style lists
7753 timestamps like @samp{Mar 30@ @ 2002} and @samp{Mar 30 23:45} when in
7754 the POSIX locale, and like @samp{2002-03-30 23:45} otherwise.
7759 You can specify the default value of the @option{--time-style} option
7760 with the environment variable @env{TIME_STYLE}; if @env{TIME_STYLE} is not set
7761 the default style is @samp{locale}. GNU Emacs 21.3 and
7762 later use the @option{--dired} option and therefore can parse any date
7763 format, but if you are using Emacs 21.1 or 21.2 and specify a
7764 non-POSIX locale you may need to set
7765 @samp{TIME_STYLE="posix-long-iso"}.
7767 To avoid certain denial-of-service attacks, timestamps that would be
7768 longer than 1000 bytes may be treated as errors.
7771 @node Formatting the file names
7772 @subsection Formatting the file names
7774 These options change how file names themselves are printed.
7780 @itemx --quoting-style=escape
7783 @opindex --quoting-style
7784 @cindex backslash sequences for file names
7785 Quote nongraphic characters in file names using alphabetic and octal
7786 backslash sequences like those used in C.
7790 @itemx --quoting-style=literal
7793 @opindex --quoting-style
7794 Do not quote file names. However, with @command{ls} nongraphic
7795 characters are still printed as question marks if the output is a
7796 terminal and you do not specify the @option{--show-control-chars}
7800 @itemx --hide-control-chars
7802 @opindex --hide-control-chars
7803 Print question marks instead of nongraphic characters in file names.
7804 This is the default if the output is a terminal and the program is
7809 @itemx --quoting-style=c
7811 @opindex --quote-name
7812 @opindex --quoting-style
7813 Enclose file names in double quotes and quote nongraphic characters as
7816 @item --quoting-style=@var{word}
7817 @opindex --quoting-style
7818 @cindex quoting style
7819 Use style @var{word} to quote file names and other strings that may
7820 contain arbitrary characters. The @var{word} should
7821 be one of the following:
7825 Output strings as-is; this is the same as the @option{-N} or
7826 @option{--literal} option.
7828 Quote strings for the shell if they contain shell metacharacters or would
7829 cause ambiguous output.
7830 The quoting is suitable for POSIX-compatible shells like
7831 @command{bash}, but it does not always work for incompatible shells
7834 Quote strings for the shell, even if they would normally not require quoting.
7836 Quote strings as for C character string literals, including the
7837 surrounding double-quote characters; this is the same as the
7838 @option{-Q} or @option{--quote-name} option.
7840 Quote strings as for C character string literals, except omit the
7841 surrounding double-quote
7842 characters; this is the same as the @option{-b} or @option{--escape} option.
7844 Quote strings as for C character string literals, except use
7845 surrounding quotation marks appropriate for the
7848 @c Use @t instead of @samp to avoid duplicate quoting in some output styles.
7849 Quote strings as for C character string literals, except use
7850 surrounding quotation marks appropriate for the locale, and quote
7851 @t{'like this'} instead of @t{"like
7852 this"} in the default C locale. This looks nicer on many displays.
7855 You can specify the default value of the @option{--quoting-style} option
7856 with the environment variable @env{QUOTING_STYLE}@. If that environment
7857 variable is not set, the default value is @samp{literal}, but this
7858 default may change to @samp{shell} in a future version of this package.
7860 @item --show-control-chars
7861 @opindex --show-control-chars
7862 Print nongraphic characters as-is in file names.
7863 This is the default unless the output is a terminal and the program is
7869 @node dir invocation
7870 @section @command{dir}: Briefly list directory contents
7873 @cindex directory listing, brief
7875 @command{dir} is equivalent to @code{ls -C
7876 -b}; that is, by default files are listed in columns, sorted vertically,
7877 and special characters are represented by backslash escape sequences.
7879 @xref{ls invocation, @command{ls}}.
7882 @node vdir invocation
7883 @section @command{vdir}: Verbosely list directory contents
7886 @cindex directory listing, verbose
7888 @command{vdir} is equivalent to @code{ls -l
7889 -b}; that is, by default files are listed in long format and special
7890 characters are represented by backslash escape sequences.
7892 @node dircolors invocation
7893 @section @command{dircolors}: Color setup for @command{ls}
7897 @cindex setup for color
7899 @command{dircolors} outputs a sequence of shell commands to set up the
7900 terminal for color output from @command{ls} (and @command{dir}, etc.).
7904 eval "$(dircolors [@var{option}]@dots{} [@var{file}])"
7907 If @var{file} is specified, @command{dircolors} reads it to determine which
7908 colors to use for which file types and extensions. Otherwise, a
7909 precompiled database is used. For details on the format of these files,
7910 run @samp{dircolors --print-database}.
7912 To make @command{dircolors} read a @file{~/.dircolors} file if it
7913 exists, you can put the following lines in your @file{~/.bashrc} (or
7914 adapt them to your favorite shell):
7918 test -r $d && eval "$(dircolors $d)"
7922 @vindex SHELL @r{environment variable, and color}
7923 The output is a shell command to set the @env{LS_COLORS} environment
7924 variable. You can specify the shell syntax to use on the command line,
7925 or @command{dircolors} will guess it from the value of the @env{SHELL}
7926 environment variable.
7928 The program accepts the following options. Also see @ref{Common options}.
7933 @itemx --bourne-shell
7936 @opindex --bourne-shell
7937 @cindex Bourne shell syntax for color setup
7938 @cindex @command{sh} syntax for color setup
7939 Output Bourne shell commands. This is the default if the @env{SHELL}
7940 environment variable is set and does not end with @samp{csh} or
7949 @cindex C shell syntax for color setup
7950 @cindex @command{csh} syntax for color setup
7951 Output C shell commands. This is the default if @code{SHELL} ends with
7952 @command{csh} or @command{tcsh}.
7955 @itemx --print-database
7957 @opindex --print-database
7958 @cindex color database, printing
7959 @cindex database for color setup, printing
7960 @cindex printing color database
7961 Print the (compiled-in) default color configuration database. This
7962 output is itself a valid configuration file, and is fairly descriptive
7963 of the possibilities.
7970 @node Basic operations
7971 @chapter Basic operations
7973 @cindex manipulating files
7975 This chapter describes the commands for basic file manipulation:
7976 copying, moving (renaming), and deleting (removing).
7979 * cp invocation:: Copy files.
7980 * dd invocation:: Convert and copy a file.
7981 * install invocation:: Copy files and set attributes.
7982 * mv invocation:: Move (rename) files.
7983 * rm invocation:: Remove files or directories.
7984 * shred invocation:: Remove files more securely.
7989 @section @command{cp}: Copy files and directories
7992 @cindex copying files and directories
7993 @cindex files, copying
7994 @cindex directories, copying
7996 @command{cp} copies files (or, optionally, directories). The copy is
7997 completely independent of the original. You can either copy one file to
7998 another, or copy arbitrarily many files to a destination directory.
8002 cp [@var{option}]@dots{} [-T] @var{source} @var{dest}
8003 cp [@var{option}]@dots{} @var{source}@dots{} @var{directory}
8004 cp [@var{option}]@dots{} -t @var{directory} @var{source}@dots{}
8009 If two file names are given, @command{cp} copies the first file to the
8013 If the @option{--target-directory} (@option{-t}) option is given, or
8014 failing that if the last file is a directory and the
8015 @option{--no-target-directory} (@option{-T}) option is not given,
8016 @command{cp} copies each @var{source} file to the specified directory,
8017 using the @var{source}s' names.
8020 Generally, files are written just as they are read. For exceptions,
8021 see the @option{--sparse} option below.
8023 By default, @command{cp} does not copy directories. However, the
8024 @option{-R}, @option{-a}, and @option{-r} options cause @command{cp} to
8025 copy recursively by descending into source directories and copying files
8026 to corresponding destination directories.
8028 When copying from a symbolic link, @command{cp} normally follows the
8029 link only when not copying
8030 recursively. This default can be overridden with the
8031 @option{--archive} (@option{-a}), @option{-d}, @option{--dereference}
8032 (@option{-L}), @option{--no-dereference} (@option{-P}), and
8033 @option{-H} options. If more than one of these options is specified,
8034 the last one silently overrides the others.
8036 When copying to a symbolic link, @command{cp} follows the
8037 link only when it refers to an existing regular file.
8038 However, when copying to a dangling symbolic link, @command{cp}
8039 refuses by default, and fails with a diagnostic, since the operation
8040 is inherently dangerous. This behavior is contrary to historical
8041 practice and to POSIX@.
8042 Set @env{POSIXLY_CORRECT} to make @command{cp} attempt to create
8043 the target of a dangling destination symlink, in spite of the possible risk.
8044 Also, when an option like
8045 @option{--backup} or @option{--link} acts to rename or remove the
8046 destination before copying, @command{cp} renames or removes the
8047 symbolic link rather than the file it points to.
8049 By default, @command{cp} copies the contents of special files only
8050 when not copying recursively. This default can be overridden with the
8051 @option{--copy-contents} option.
8053 @cindex self-backups
8054 @cindex backups, making only
8055 @command{cp} generally refuses to copy a file onto itself, with the
8056 following exception: if @option{--force --backup} is specified with
8057 @var{source} and @var{dest} identical, and referring to a regular file,
8058 @command{cp} will make a backup file, either regular or numbered, as
8059 specified in the usual ways (@pxref{Backup options}). This is useful when
8060 you simply want to make a backup of an existing file before changing it.
8062 The program accepts the following options. Also see @ref{Common options}.
8069 Preserve as much as possible of the structure and attributes of the
8070 original files in the copy (but do not attempt to preserve internal
8071 directory structure; i.e., @samp{ls -U} may list the entries in a copied
8072 directory in a different order).
8073 Try to preserve SELinux security context and extended attributes (xattr),
8074 but ignore any failure to do that and print no corresponding diagnostic.
8075 Equivalent to @option{-dR --preserve=all} with the reduced diagnostics.
8077 @item --attributes-only
8078 @opindex --attributes-only
8079 Copy only the specified attributes of the source file to the destination.
8080 If the destination already exists, do not alter its contents.
8081 See the @option{--preserve} option for controlling which attributes to copy.
8084 @itemx @w{@kbd{--backup}[=@var{method}]}
8087 @vindex VERSION_CONTROL
8088 @cindex backups, making
8089 @xref{Backup options}.
8090 Make a backup of each file that would otherwise be overwritten or removed.
8091 As a special case, @command{cp} makes a backup of @var{source} when the force
8092 and backup options are given and @var{source} and @var{dest} are the same
8093 name for an existing, regular file. One useful application of this
8094 combination of options is this tiny Bourne shell script:
8098 # Usage: backup FILE...
8099 # Create a GNU-style backup of each listed FILE.
8102 cp --backup --force --preserve=all -- "$i" "$i" || fail=1
8107 @item --copy-contents
8108 @cindex directories, copying recursively
8109 @cindex copying directories recursively
8110 @cindex recursively copying directories
8111 @cindex non-directories, copying as special files
8112 If copying recursively, copy the contents of any special files (e.g.,
8113 FIFOs and device files) as if they were regular files. This means
8114 trying to read the data in each source file and writing it to the
8115 destination. It is usually a mistake to use this option, as it
8116 normally has undesirable effects on special files like FIFOs and the
8117 ones typically found in the @file{/dev} directory. In most cases,
8118 @code{cp -R --copy-contents} will hang indefinitely trying to read
8119 from FIFOs and special files like @file{/dev/console}, and it will
8120 fill up your destination disk if you use it to copy @file{/dev/zero}.
8121 This option has no effect unless copying recursively, and it does not
8122 affect the copying of symbolic links.
8126 @cindex symbolic links, copying
8127 @cindex hard links, preserving
8128 Copy symbolic links as symbolic links rather than copying the files that
8129 they point to, and preserve hard links between source files in the copies.
8130 Equivalent to @option{--no-dereference --preserve=links}.
8136 When copying without this option and an existing destination file cannot
8137 be opened for writing, the copy fails. However, with @option{--force},
8138 when a destination file cannot be opened, @command{cp} then removes it and
8139 tries to open it again. Contrast this behavior with that enabled by
8140 @option{--link} and @option{--symbolic-link}, whereby the destination file
8141 is never opened but rather is removed unconditionally. Also see the
8142 description of @option{--remove-destination}.
8144 This option is independent of the @option{--interactive} or
8145 @option{-i} option: neither cancels the effect of the other.
8147 This option is ignored when the @option{--no-clobber} or @option{-n} option
8152 If a command line argument specifies a symbolic link, then copy the
8153 file it points to rather than the symbolic link itself. However,
8154 copy (preserving its nature) any symbolic link that is encountered
8155 via recursive traversal.
8158 @itemx --interactive
8160 @opindex --interactive
8161 When copying a file other than a directory, prompt whether to
8162 overwrite an existing destination file. The @option{-i} option overrides
8163 a previous @option{-n} option.
8169 Make hard links instead of copies of non-directories.
8172 @itemx --dereference
8174 @opindex --dereference
8175 Follow symbolic links when copying from them.
8176 With this option, @command{cp} cannot create a symbolic link.
8177 For example, a symlink (to regular file) in the source tree will be copied to
8178 a regular file in the destination tree.
8183 @opindex --no-clobber
8184 Do not overwrite an existing file. The @option{-n} option overrides a previous
8185 @option{-i} option. This option is mutually exclusive with @option{-b} or
8186 @option{--backup} option.
8189 @itemx --no-dereference
8191 @opindex --no-dereference
8192 @cindex symbolic links, copying
8193 Copy symbolic links as symbolic links rather than copying the files that
8194 they point to. This option affects only symbolic links in the source;
8195 symbolic links in the destination are always followed if possible.
8198 @itemx @w{@kbd{--preserve}[=@var{attribute_list}]}
8201 @cindex file information, preserving, extended attributes, xattr
8202 Preserve the specified attributes of the original files.
8203 If specified, the @var{attribute_list} must be a comma-separated list
8204 of one or more of the following strings:
8208 Preserve the file mode bits and access control lists.
8210 Preserve the owner and group. On most modern systems,
8211 only users with appropriate privileges may change the owner of a file,
8213 may preserve the group ownership of a file only if they happen to be
8214 a member of the desired group.
8216 Preserve the times of last access and last modification, when possible.
8217 On older systems, it is not possible to preserve these attributes
8218 when the affected file is a symbolic link.
8219 However, many systems now provide the @code{utimensat} function,
8220 which makes it possible even for symbolic links.
8222 Preserve in the destination files
8223 any links between corresponding source files.
8224 Note that with @option{-L} or @option{-H}, this option can convert
8225 symbolic links to hard links. For example,
8227 $ mkdir c; : > a; ln -s a b; cp -aH a b c; ls -i1 c
8232 Note the inputs: @file{b} is a symlink to regular file @file{a},
8233 yet the files in destination directory, @file{c/}, are hard-linked.
8234 Since @option{-a} implies @option{--preserve=links}, and since @option{-H}
8235 tells @command{cp} to dereference command line arguments, it sees two files
8236 with the same inode number, and preserves the perceived hard link.
8238 Here is a similar example that exercises @command{cp}'s @option{-L} option:
8240 $ mkdir b c; (cd b; : > a; ln -s a b); cp -aL b c; ls -i1 c/b
8246 Preserve SELinux security context of the file, or fail with full diagnostics.
8248 Preserve extended attributes of the file, or fail with full diagnostics.
8249 If @command{cp} is built without xattr support, ignore this option.
8250 If SELinux context, ACLs or Capabilities are implemented using xattrs,
8251 they are preserved implicitly by this option as well, i.e., even without
8252 specifying @option{--preserve=mode} or @option{--preserve=context}.
8254 Preserve all file attributes.
8255 Equivalent to specifying all of the above, but with the difference
8256 that failure to preserve SELinux security context or extended attributes
8257 does not change @command{cp}'s exit status. In contrast to @option{-a},
8258 all but @samp{Operation not supported} warnings are output.
8261 Using @option{--preserve} with no @var{attribute_list} is equivalent
8262 to @option{--preserve=mode,ownership,timestamps}.
8264 In the absence of this option, the permissions of existing destination
8265 files are unchanged, while each new file is created with the
8266 mode bits of the corresponding source file, minus the bits set in the
8267 umask and minus the set-user-ID and set-group-ID bits.
8268 @xref{File permissions}.
8270 @item @w{@kbd{--no-preserve}=@var{attribute_list}}
8271 @cindex file information, preserving
8272 Do not preserve the specified attributes. The @var{attribute_list}
8273 has the same form as for @option{--preserve}.
8277 @cindex parent directories and @command{cp}
8278 Form the name of each destination file by appending to the target
8279 directory a slash and the specified name of the source file. The last
8280 argument given to @command{cp} must be the name of an existing directory.
8281 For example, the command:
8284 cp --parents a/b/c existing_dir
8288 copies the file @file{a/b/c} to @file{existing_dir/a/b/c}, creating
8289 any missing intermediate directories.
8296 @opindex --recursive
8297 @cindex directories, copying recursively
8298 @cindex copying directories recursively
8299 @cindex recursively copying directories
8300 @cindex non-directories, copying as special files
8301 Copy directories recursively. By default, do not follow symbolic
8302 links in the source; see the @option{--archive} (@option{-a}), @option{-d},
8303 @option{--dereference} (@option{-L}), @option{--no-dereference}
8304 (@option{-P}), and @option{-H} options. Special files are copied by
8305 creating a destination file of the same type as the source; see the
8306 @option{--copy-contents} option. It is not portable to use
8307 @option{-r} to copy symbolic links or special files. On some
8308 non-GNU systems, @option{-r} implies the equivalent of
8309 @option{-L} and @option{--copy-contents} for historical reasons.
8310 Also, it is not portable to use @option{-R} to copy symbolic links
8311 unless you also specify @option{-P}, as POSIX allows
8312 implementations that dereference symbolic links by default.
8314 @item --reflink[=@var{when}]
8315 @opindex --reflink[=@var{when}]
8318 @cindex copy on write
8319 Perform a lightweight, copy-on-write (COW) copy, if supported by the
8320 file system. Once it has succeeded, beware that the source and destination
8321 files share the same disk data blocks as long as they remain unmodified.
8322 Thus, if a disk I/O error affects data blocks of one of the files,
8323 the other suffers the same fate.
8325 The @var{when} value can be one of the following:
8329 The default behavior: if the copy-on-write operation is not supported
8330 then report the failure for each file and exit with a failure status.
8333 If the copy-on-write operation is not supported then fall back
8334 to the standard copy behaviour.
8337 This option is overridden by the @option{--link}, @option{--symbolic-link}
8338 and @option{--attributes-only} options, thus allowing it to be used
8339 to configure the default data copying behavior for @command{cp}.
8340 For example, with the following alias, @command{cp} will use the
8341 minimum amount of space supported by the file system.
8344 alias cp='cp --reflink=auto --sparse=always'
8347 @item --remove-destination
8348 @opindex --remove-destination
8349 Remove each existing destination file before attempting to open it
8350 (contrast with @option{-f} above).
8352 @item --sparse=@var{when}
8353 @opindex --sparse=@var{when}
8354 @cindex sparse files, copying
8355 @cindex holes, copying files with
8356 @findex read @r{system call, and holes}
8357 A @dfn{sparse file} contains @dfn{holes}---a sequence of zero bytes that
8358 does not occupy any physical disk blocks; the @samp{read} system call
8359 reads these as zeros. This can both save considerable disk space and
8360 increase speed, since many binary files contain lots of consecutive zero
8361 bytes. By default, @command{cp} detects holes in input source files via a crude
8362 heuristic and makes the corresponding output file sparse as well.
8363 Only regular files may be sparse.
8365 The @var{when} value can be one of the following:
8369 The default behavior: if the input file is sparse, attempt to make
8370 the output file sparse, too. However, if an output file exists but
8371 refers to a non-regular file, then do not attempt to make it sparse.
8374 For each sufficiently long sequence of zero bytes in the input file,
8375 attempt to create a corresponding hole in the output file, even if the
8376 input file does not appear to be sparse.
8377 This is useful when the input file resides on a file system
8378 that does not support sparse files
8379 (for example, @samp{efs} file systems in SGI IRIX 5.3 and earlier),
8380 but the output file is on a type of file system that does support them.
8381 Holes may be created only in regular files, so if the destination file
8382 is of some other type, @command{cp} does not even try to make it sparse.
8385 Never make the output file sparse.
8386 This is useful in creating a file for use with the @command{mkswap} command,
8387 since such a file must not have any holes.
8390 @optStripTrailingSlashes
8393 @itemx --symbolic-link
8395 @opindex --symbolic-link
8396 @cindex symbolic links, copying with
8397 Make symbolic links instead of copies of non-directories. All source
8398 file names must be absolute (starting with @samp{/}) unless the
8399 destination files are in the current directory. This option merely
8400 results in an error message on systems that do not support symbolic links.
8406 @optNoTargetDirectory
8412 @cindex newer files, copying only
8413 Do not copy a non-directory that has an existing destination with the
8414 same or newer modification time. If time stamps are being preserved,
8415 the comparison is to the source time stamp truncated to the
8416 resolutions of the destination file system and of the system calls
8417 used to update time stamps; this avoids duplicate work if several
8418 @samp{cp -pu} commands are executed with the same source and destination.
8419 If @option{--preserve=links} is also specified (like with @samp{cp -au}
8420 for example), that will take precedence. Consequently, depending on the
8421 order that files are processed from the source, newer files in the destination
8422 may be replaced, to mirror hard links in the source.
8428 Print the name of each file before copying it.
8431 @itemx --one-file-system
8433 @opindex --one-file-system
8434 @cindex file systems, omitting copying to different
8435 Skip subdirectories that are on different file systems from the one that
8436 the copy started on.
8437 However, mount point directories @emph{are} copied.
8445 @section @command{dd}: Convert and copy a file
8448 @cindex converting while copying a file
8450 @command{dd} copies a file (from standard input to standard output, by
8451 default) with a changeable I/O block size, while optionally performing
8452 conversions on it. Synopses:
8455 dd [@var{operand}]@dots{}
8459 The only options are @option{--help} and @option{--version}.
8460 @xref{Common options}. @command{dd} accepts the following operands,
8461 whose syntax was inspired by the DD (data definition) statement of
8468 Read from @var{file} instead of standard input.
8472 Write to @var{file} instead of standard output. Unless
8473 @samp{conv=notrunc} is given, @command{dd} truncates @var{file} to zero
8474 bytes (or the size specified with @samp{seek=}).
8476 @item ibs=@var{bytes}
8478 @cindex block size of input
8479 @cindex input block size
8480 Set the input block size to @var{bytes}.
8481 This makes @command{dd} read @var{bytes} per block.
8482 The default is 512 bytes.
8484 @item obs=@var{bytes}
8486 @cindex block size of output
8487 @cindex output block size
8488 Set the output block size to @var{bytes}.
8489 This makes @command{dd} write @var{bytes} per block.
8490 The default is 512 bytes.
8492 @item bs=@var{bytes}
8495 Set both input and output block sizes to @var{bytes}.
8496 This makes @command{dd} read and write @var{bytes} per block,
8497 overriding any @samp{ibs} and @samp{obs} settings.
8498 In addition, if no data-transforming @option{conv} option is specified,
8499 input is copied to the output as soon as it's read,
8500 even if it is smaller than the block size.
8502 @item cbs=@var{bytes}
8504 @cindex block size of conversion
8505 @cindex conversion block size
8506 @cindex fixed-length records, converting to variable-length
8507 @cindex variable-length records, converting to fixed-length
8508 Set the conversion block size to @var{bytes}.
8509 When converting variable-length records to fixed-length ones
8510 (@option{conv=block}) or the reverse (@option{conv=unblock}),
8511 use @var{bytes} as the fixed record length.
8515 Skip @var{n} @samp{ibs}-byte blocks in the input file before copying.
8516 If @samp{iflag=skip_bytes} is specified, @var{n} is interpreted
8517 as a byte count rather than a block count.
8521 Skip @var{n} @samp{obs}-byte blocks in the output file before copying.
8522 if @samp{oflag=seek_bytes} is specified, @var{n} is interpreted
8523 as a byte count rather than a block count.
8527 Copy @var{n} @samp{ibs}-byte blocks from the input file, instead
8528 of everything until the end of the file.
8529 if @samp{iflag=count_bytes} is specified, @var{n} is interpreted
8530 as a byte count rather than a block count.
8531 Note if the input may return short reads as could be the case
8532 when reading from a pipe for example, @samp{iflag=fullblock}
8533 will ensure that @samp{count=} corresponds to complete input blocks
8534 rather than the traditional POSIX specified behavior of counting
8535 input read operations.
8537 @item status=@var{which}
8539 Transfer information is normally output to stderr upon
8540 receipt of the @samp{INFO} signal or when @command{dd} exits.
8541 Specifying @var{which} will identify which information to suppress.
8546 @opindex noxfer @r{dd status=}
8547 Do not print the transfer rate and volume statistics
8548 that normally make up the last status line.
8551 @opindex none @r{dd status=}
8552 Do not print any informational or warning messages to stderr.
8553 Error messages are output as normal.
8557 @item conv=@var{conversion}[,@var{conversion}]@dots{}
8559 Convert the file as specified by the @var{conversion} argument(s).
8560 (No spaces around any comma(s).)
8567 @opindex ascii@r{, converting to}
8568 Convert EBCDIC to ASCII,
8569 using the conversion table specified by POSIX@.
8570 This provides a 1:1 translation for all 256 bytes.
8573 @opindex ebcdic@r{, converting to}
8574 Convert ASCII to EBCDIC@.
8575 This is the inverse of the @samp{ascii} conversion.
8578 @opindex alternate ebcdic@r{, converting to}
8579 Convert ASCII to alternate EBCDIC,
8580 using the alternate conversion table specified by POSIX@.
8581 This is not a 1:1 translation, but reflects common historical practice
8582 for @samp{~}, @samp{[}, and @samp{]}.
8584 The @samp{ascii}, @samp{ebcdic}, and @samp{ibm} conversions are
8588 @opindex block @r{(space-padding)}
8589 For each line in the input, output @samp{cbs} bytes, replacing the
8590 input newline with a space and padding with spaces as necessary.
8594 Remove any trailing spaces in each @samp{cbs}-sized input block,
8595 and append a newline.
8597 The @samp{block} and @samp{unblock} conversions are mutually exclusive.
8600 @opindex lcase@r{, converting to}
8601 Change uppercase letters to lowercase.
8604 @opindex ucase@r{, converting to}
8605 Change lowercase letters to uppercase.
8607 The @samp{lcase} and @samp{ucase} conversions are mutually exclusive.
8611 Try to seek rather than write NUL output blocks.
8612 On a file system that supports sparse files, this will create
8613 sparse output when extending the output file.
8614 Be careful when using this option in conjunction with
8615 @samp{conv=notrunc} or @samp{oflag=append}.
8616 With @samp{conv=notrunc}, existing data in the output file
8617 corresponding to NUL blocks from the input, will be untouched.
8618 With @samp{oflag=append} the seeks performed will be ineffective.
8619 Similarly, when the output is a device rather than a file,
8620 NUL input blocks are not copied, and therefore this option
8621 is most useful with virtual or pre zeroed devices.
8624 @opindex swab @r{(byte-swapping)}
8625 @cindex byte-swapping
8626 Swap every pair of input bytes. GNU @command{dd}, unlike others, works
8627 when an odd number of bytes are read---the last byte is simply copied
8628 (since there is nothing to swap it with).
8631 @opindex sync @r{(padding with ASCII NULs)}
8632 Pad every input block to size of @samp{ibs} with trailing zero bytes.
8633 When used with @samp{block} or @samp{unblock}, pad with spaces instead of
8638 The following ``conversions'' are really file flags
8639 and don't affect internal processing:
8644 @cindex creating output file, requiring
8645 Fail if the output file already exists; @command{dd} must create the
8650 @cindex creating output file, avoiding
8651 Do not create the output file; the output file must already exist.
8653 The @samp{excl} and @samp{nocreat} conversions are mutually exclusive.
8657 @cindex truncating output file, avoiding
8658 Do not truncate the output file.
8662 @cindex read errors, ignoring
8663 Continue after read errors.
8667 @cindex synchronized data writes, before finishing
8668 Synchronize output data just before finishing. This forces a physical
8669 write of output data.
8673 @cindex synchronized data and metadata writes, before finishing
8674 Synchronize output data and metadata just before finishing. This
8675 forces a physical write of output data and metadata.
8679 @item iflag=@var{flag}[,@var{flag}]@dots{}
8681 Access the input file using the flags specified by the @var{flag}
8682 argument(s). (No spaces around any comma(s).)
8684 @item oflag=@var{flag}[,@var{flag}]@dots{}
8686 Access the output file using the flags specified by the @var{flag}
8687 argument(s). (No spaces around any comma(s).)
8689 Here are the flags. Not every flag is supported on every operating
8696 @cindex appending to the output file
8697 Write in append mode, so that even if some other process is writing to
8698 this file, every @command{dd} write will append to the current
8699 contents of the file. This flag makes sense only for output.
8700 If you combine this flag with the @samp{of=@var{file}} operand,
8701 you should also specify @samp{conv=notrunc} unless you want the
8702 output file to be truncated before being appended to.
8706 @cindex concurrent I/O
8707 Use concurrent I/O mode for data. This mode performs direct I/O
8708 and drops the POSIX requirement to serialize all I/O to the same file.
8709 A file cannot be opened in CIO mode and with a standard open at the
8715 Use direct I/O for data, avoiding the buffer cache.
8716 Note that the kernel may impose restrictions on read or write buffer sizes.
8717 For example, with an ext4 destination file system and a linux-based kernel,
8718 using @samp{oflag=direct} will cause writes to fail with @code{EINVAL} if the
8719 output buffer size is not a multiple of 512.
8723 @cindex directory I/O
8725 Fail unless the file is a directory. Most operating systems do not
8726 allow I/O to a directory, so this flag has limited utility.
8730 @cindex synchronized data reads
8731 Use synchronized I/O for data. For the output file, this forces a
8732 physical write of output data on each write. For the input file,
8733 this flag can matter when reading from a remote file that has been
8734 written to synchronously by some other process. Metadata (e.g.,
8735 last-access and last-modified time) is not necessarily synchronized.
8739 @cindex synchronized data and metadata I/O
8740 Use synchronized I/O for both data and metadata.
8744 @cindex discarding file cache
8745 Discard the data cache for a file.
8746 When count=0 all cache is discarded,
8747 otherwise the cache is dropped for the processed
8748 portion of the file. Also when count=0
8749 failure to discard the cache is diagnosed
8750 and reflected in the exit status.
8751 Here as some usage examples:
8754 # Advise to drop cache for whole file
8755 dd if=ifile iflag=nocache count=0
8757 # Ensure drop cache for the whole file
8758 dd of=ofile oflag=nocache conv=notrunc,fdatasync count=0
8760 # Drop cache for part of file
8761 dd if=ifile iflag=nocache skip=10 count=10 of=/dev/null
8763 # Stream data using just the read-ahead cache
8764 dd if=ifile of=ofile iflag=nocache oflag=nocache
8769 @cindex nonblocking I/O
8770 Use non-blocking I/O.
8775 Do not update the file's access time.
8776 Some older file systems silently ignore this flag, so it is a good
8777 idea to test it on your files before relying on it.
8781 @cindex controlling terminal
8782 Do not assign the file to be a controlling terminal for @command{dd}.
8783 This has no effect when the file is not a terminal.
8784 On many hosts (e.g., GNU/Linux hosts), this option has no effect
8789 @cindex symbolic links, following
8790 Do not follow symbolic links.
8795 Fail if the file has multiple hard links.
8800 Use binary I/O@. This option has an effect only on nonstandard
8801 platforms that distinguish binary from text I/O.
8806 Use text I/O@. Like @samp{binary}, this option has no effect on
8811 Accumulate full blocks from input. The @code{read} system call
8812 may return early if a full block is not available.
8813 When that happens, continue calling @code{read} to fill the remainder
8815 This flag can be used only with @code{iflag}.
8816 This flag is useful with pipes for example
8817 as they may return short reads. In that case,
8818 this flag is needed to ensure that a @samp{count=} argument is
8819 interpreted as a block count rather than a count of read operations.
8822 @opindex count_bytes
8823 Interpret the @samp{count=} operand as a byte count,
8824 rather than a block count, which allows specifying
8825 a length that is not a multiple of the I/O block size.
8826 This flag can be used only with @code{iflag}.
8830 Interpret the @samp{skip=} operand as a byte count,
8831 rather than a block count, which allows specifying
8832 an offset that is not a multiple of the I/O block size.
8833 This flag can be used only with @code{iflag}.
8837 Interpret the @samp{seek=} operand as a byte count,
8838 rather than a block count, which allows specifying
8839 an offset that is not a multiple of the I/O block size.
8840 This flag can be used only with @code{oflag}.
8844 These flags are not supported on all systems, and @samp{dd} rejects
8845 attempts to use them when they are not supported. When reading from
8846 standard input or writing to standard output, the @samp{nofollow} and
8847 @samp{noctty} flags should not be specified, and the other flags
8848 (e.g., @samp{nonblock}) can affect how other processes behave with the
8849 affected file descriptors, even after @command{dd} exits.
8853 @cindex multipliers after numbers
8854 The numeric-valued strings above (@var{n} and @var{bytes})
8855 can be followed by a multiplier: @samp{b}=512, @samp{c}=1,
8856 @samp{w}=2, @samp{x@var{m}}=@var{m}, or any of the
8857 standard block size suffixes like @samp{k}=1024 (@pxref{Block size}).
8859 Any block size you specify via @samp{bs=}, @samp{ibs=}, @samp{obs=}, @samp{cbs=}
8860 should not be too large---values larger than a few megabytes
8861 are generally wasteful or (as in the gigabyte..exabyte case) downright
8862 counterproductive or error-inducing.
8864 To process data that is at an offset or size that is not a
8865 multiple of the I/O@ block size, you can use the @samp{skip_bytes},
8866 @samp{seek_bytes} and @samp{count_bytes} flags. Alternatively
8867 the traditional method of separate @command{dd} invocations can be used.
8868 For example, the following shell commands copy data
8869 in 512 KiB blocks between a disk and a tape, but do not save
8870 or restore a 4 KiB label at the start of the disk:
8873 disk=/dev/rdsk/c0t1d0s2
8876 # Copy all but the label from disk to tape.
8877 (dd bs=4k skip=1 count=0 && dd bs=512k) <$disk >$tape
8879 # Copy from tape back to disk, but leave the disk label alone.
8880 (dd bs=4k seek=1 count=0 && dd bs=512k) <$tape >$disk
8883 Sending an @samp{INFO} signal to a running @command{dd}
8884 process makes it print I/O statistics to standard error
8885 and then resume copying. In the example below,
8886 @command{dd} is run in the background to copy 10 million blocks.
8887 The @command{kill} command makes it output intermediate I/O statistics,
8888 and when @command{dd} completes normally or is killed by the
8889 @code{SIGINT} signal, it outputs the final statistics.
8892 $ dd if=/dev/zero of=/dev/null count=10MB & pid=$!
8893 $ kill -s INFO $pid; wait $pid
8894 3385223+0 records in
8895 3385223+0 records out
8896 1733234176 bytes (1.7 GB) copied, 6.42173 seconds, 270 MB/s
8897 10000000+0 records in
8898 10000000+0 records out
8899 5120000000 bytes (5.1 GB) copied, 18.913 seconds, 271 MB/s
8902 @vindex POSIXLY_CORRECT
8903 On systems lacking the @samp{INFO} signal @command{dd} responds to the
8904 @samp{USR1} signal instead, unless the @env{POSIXLY_CORRECT}
8905 environment variable is set.
8910 @node install invocation
8911 @section @command{install}: Copy files and set attributes
8914 @cindex copying files and setting attributes
8916 @command{install} copies files while setting their file mode bits and, if
8917 possible, their owner and group. Synopses:
8920 install [@var{option}]@dots{} [-T] @var{source} @var{dest}
8921 install [@var{option}]@dots{} @var{source}@dots{} @var{directory}
8922 install [@var{option}]@dots{} -t @var{directory} @var{source}@dots{}
8923 install [@var{option}]@dots{} -d @var{directory}@dots{}
8928 If two file names are given, @command{install} copies the first file to the
8932 If the @option{--target-directory} (@option{-t}) option is given, or
8933 failing that if the last file is a directory and the
8934 @option{--no-target-directory} (@option{-T}) option is not given,
8935 @command{install} copies each @var{source} file to the specified
8936 directory, using the @var{source}s' names.
8939 If the @option{--directory} (@option{-d}) option is given,
8940 @command{install} creates each @var{directory} and any missing parent
8941 directories. Parent directories are created with mode
8942 @samp{u=rwx,go=rx} (755), regardless of the @option{-m} option or the
8943 current umask. @xref{Directory Setuid and Setgid}, for how the
8944 set-user-ID and set-group-ID bits of parent directories are inherited.
8947 @cindex Makefiles, installing programs in
8948 @command{install} is similar to @command{cp}, but allows you to control the
8949 attributes of destination files. It is typically used in Makefiles to
8950 copy programs into their destination directories. It refuses to copy
8951 files onto themselves.
8953 @cindex extended attributes, xattr
8954 @command{install} never preserves extended attributes (xattr).
8956 The program accepts the following options. Also see @ref{Common options}.
8966 Compare each pair of source and destination files, and if the destination has
8967 identical content and any specified owner, group, permissions, and possibly
8968 SELinux context, then do not modify the destination at all.
8969 Note this option is best used in conjunction with @option{--user},
8970 @option{--group} and @option{--mode} options, lest @command{install}
8971 incorrectly determines the default attributes that installed files would have
8972 (as it doesn't consider setgid directories and POSIX default ACLs for example).
8973 This could result in redundant copies or attributes that are not reset to the
8978 Ignored; for compatibility with old Unix versions of @command{install}.
8982 Create any missing parent directories of @var{dest},
8983 then copy @var{source} to @var{dest}.
8984 This option is ignored if a destination directory is specified
8985 via @option{--target-directory=DIR}.
8990 @opindex --directory
8991 @cindex directories, creating with given attributes
8992 @cindex parent directories, creating missing
8993 @cindex leading directories, creating missing
8994 Create any missing parent directories, giving them the default
8995 attributes. Then create each given directory, setting their owner,
8996 group and mode as given on the command line or to the defaults.
8998 @item -g @var{group}
8999 @itemx --group=@var{group}
9002 @cindex group ownership of installed files, setting
9003 Set the group ownership of installed files or directories to
9004 @var{group}. The default is the process's current group. @var{group}
9005 may be either a group name or a numeric group ID.
9008 @itemx --mode=@var{mode}
9011 @cindex permissions of installed files, setting
9012 Set the file mode bits for the installed file or directory to @var{mode},
9013 which can be either an octal number, or a symbolic mode as in
9014 @command{chmod}, with @samp{a=} (no access allowed to anyone) as the
9015 point of departure (@pxref{File permissions}).
9016 The default mode is @samp{u=rwx,go=rx,a-s}---read, write, and
9017 execute for the owner, read and execute for group and other, and with
9018 set-user-ID and set-group-ID disabled.
9019 This default is not quite the same as @samp{755}, since it disables
9020 instead of preserving set-user-ID and set-group-ID on directories.
9021 @xref{Directory Setuid and Setgid}.
9023 @item -o @var{owner}
9024 @itemx --owner=@var{owner}
9027 @cindex ownership of installed files, setting
9028 @cindex appropriate privileges
9029 @vindex root @r{as default owner}
9030 If @command{install} has appropriate privileges (is run as root), set the
9031 ownership of installed files or directories to @var{owner}. The default
9032 is @code{root}. @var{owner} may be either a user name or a numeric user
9035 @item --preserve-context
9036 @opindex --preserve-context
9038 @cindex security context
9039 Preserve the SELinux security context of files and directories.
9040 Failure to preserve the context in all of the files or directories
9041 will result in an exit status of 1. If SELinux is disabled then
9042 print a warning and ignore the option.
9045 @itemx --preserve-timestamps
9047 @opindex --preserve-timestamps
9048 @cindex timestamps of installed files, preserving
9049 Set the time of last access and the time of last modification of each
9050 installed file to match those of each corresponding original file.
9051 When a file is installed without this option, its last access and
9052 last modification times are both set to the time of installation.
9053 This option is useful if you want to use the last modification times
9054 of installed files to keep track of when they were last built as opposed
9055 to when they were last installed.
9061 @cindex symbol table information, stripping
9062 @cindex stripping symbol table information
9063 Strip the symbol tables from installed binary executables.
9065 @item --strip-program=@var{program}
9066 @opindex --strip-program
9067 @cindex symbol table information, stripping, program
9068 Program used to strip binaries.
9074 @optNoTargetDirectory
9080 Print the name of each file before copying it.
9082 @item -Z @var{context}
9083 @itemx --context=@var{context}
9087 @cindex security context
9088 Set the default SELinux security context to be used for any
9089 created files and directories. If SELinux is disabled then
9090 print a warning and ignore the option.
9098 @section @command{mv}: Move (rename) files
9102 @command{mv} moves or renames files (or directories). Synopses:
9105 mv [@var{option}]@dots{} [-T] @var{source} @var{dest}
9106 mv [@var{option}]@dots{} @var{source}@dots{} @var{directory}
9107 mv [@var{option}]@dots{} -t @var{directory} @var{source}@dots{}
9112 If two file names are given, @command{mv} moves the first file to the
9116 If the @option{--target-directory} (@option{-t}) option is given, or
9117 failing that if the last file is a directory and the
9118 @option{--no-target-directory} (@option{-T}) option is not given,
9119 @command{mv} moves each @var{source} file to the specified
9120 directory, using the @var{source}s' names.
9123 @command{mv} can move any type of file from one file system to another.
9124 Prior to version @code{4.0} of the fileutils,
9125 @command{mv} could move only regular files between file systems.
9126 For example, now @command{mv} can move an entire directory hierarchy
9127 including special device files from one partition to another. It first
9128 uses some of the same code that's used by @code{cp -a} to copy the
9129 requested directories and files, then (assuming the copy succeeded)
9130 it removes the originals. If the copy fails, then the part that was
9131 copied to the destination partition is removed. If you were to copy
9132 three directories from one partition to another and the copy of the first
9133 directory succeeded, but the second didn't, the first would be left on
9134 the destination partition and the second and third would be left on the
9137 @cindex extended attributes, xattr
9138 @command{mv} always tries to copy extended attributes (xattr), which may
9139 include SELinux context, ACLs or Capabilities.
9140 Upon failure all but @samp{Operation not supported} warnings are output.
9142 @cindex prompting, and @command{mv}
9143 If a destination file exists but is normally unwritable, standard input
9144 is a terminal, and the @option{-f} or @option{--force} option is not given,
9145 @command{mv} prompts the user for whether to replace the file. (You might
9146 own the file, or have write permission on its directory.) If the
9147 response is not affirmative, the file is skipped.
9149 @emph{Warning}: Avoid specifying a source name with a trailing slash,
9150 when it might be a symlink to a directory.
9151 Otherwise, @command{mv} may do something very surprising, since
9152 its behavior depends on the underlying rename system call.
9153 On a system with a modern Linux-based kernel, it fails with
9154 @code{errno=ENOTDIR}@.
9155 However, on other systems (at least FreeBSD 6.1 and Solaris 10) it silently
9156 renames not the symlink but rather the directory referenced by the symlink.
9157 @xref{Trailing slashes}.
9159 The program accepts the following options. Also see @ref{Common options}.
9169 @cindex prompts, omitting
9170 Do not prompt the user before removing a destination file.
9172 If you specify more than one of the @option{-i}, @option{-f}, @option{-n}
9173 options, only the final one takes effect.
9178 @itemx --interactive
9180 @opindex --interactive
9181 @cindex prompts, forcing
9182 Prompt whether to overwrite each existing destination file, regardless
9184 If the response is not affirmative, the file is skipped.
9190 @opindex --no-clobber
9191 @cindex prompts, omitting
9192 Do not overwrite an existing file.
9194 This option is mutually exclusive with @option{-b} or @option{--backup} option.
9200 @cindex newer files, moving only
9201 Do not move a non-directory that has an existing destination with the
9202 same or newer modification time.
9203 If the move is across file system boundaries, the comparison is to the
9204 source time stamp truncated to the resolutions of the destination file
9205 system and of the system calls used to update time stamps; this avoids
9206 duplicate work if several @samp{mv -u} commands are executed with the
9207 same source and destination.
9213 Print the name of each file before moving it.
9215 @optStripTrailingSlashes
9221 @optNoTargetDirectory
9229 @section @command{rm}: Remove files or directories
9232 @cindex removing files or directories
9234 @command{rm} removes each given @var{file}. By default, it does not remove
9235 directories. Synopsis:
9238 rm [@var{option}]@dots{} [@var{file}]@dots{}
9241 @cindex prompting, and @command{rm}
9242 If the @option{-I} or @option{--interactive=once} option is given,
9243 and there are more than three files or the @option{-r}, @option{-R},
9244 or @option{--recursive} are given, then @command{rm} prompts the user
9245 for whether to proceed with the entire operation. If the response is
9246 not affirmative, the entire command is aborted.
9248 Otherwise, if a file is unwritable, standard input is a terminal, and
9249 the @option{-f} or @option{--force} option is not given, or the
9250 @option{-i} or @option{--interactive=always} option @emph{is} given,
9251 @command{rm} prompts the user for whether to remove the file.
9252 If the response is not affirmative, the file is skipped.
9254 Any attempt to remove a file whose last file name component is
9255 @file{.} or @file{..} is rejected without any prompting.
9257 @emph{Warning}: If you use @command{rm} to remove a file, it is usually
9258 possible to recover the contents of that file. If you want more assurance
9259 that the contents are truly unrecoverable, consider using @command{shred}.
9261 The program accepts the following options. Also see @ref{Common options}.
9269 @cindex directories, removing
9270 Remove the listed directories if they are empty.
9276 Ignore nonexistent files and missing operands, and never prompt the user.
9277 Ignore any previous @option{--interactive} (@option{-i}) option.
9281 Prompt whether to remove each file.
9282 If the response is not affirmative, the file is skipped.
9283 Ignore any previous @option{--force} (@option{-f}) option.
9284 Equivalent to @option{--interactive=always}.
9288 Prompt once whether to proceed with the command, if more than three
9289 files are named or if a recursive removal is requested. Ignore any
9290 previous @option{--force} (@option{-f}) option. Equivalent to
9291 @option{--interactive=once}.
9293 @item --interactive [=@var{when}]
9294 @opindex --interactive
9295 Specify when to issue an interactive prompt. @var{when} may be
9299 @vindex never @r{interactive option}
9300 - Do not prompt at all.
9302 @vindex once @r{interactive option}
9303 - Prompt once if more than three files are named or if a recursive
9304 removal is requested. Equivalent to @option{-I}.
9306 @vindex always @r{interactive option}
9307 - Prompt for every file being removed. Equivalent to @option{-i}.
9309 @option{--interactive} with no @var{when} is equivalent to
9310 @option{--interactive=always}.
9312 @item --one-file-system
9313 @opindex --one-file-system
9314 @cindex one file system, restricting @command{rm} to
9315 When removing a hierarchy recursively, skip any directory that is on a
9316 file system different from that of the corresponding command line argument.
9319 This option is useful when removing a build ``chroot'' hierarchy,
9320 which normally contains no valuable data. However, it is not uncommon
9321 to bind-mount @file{/home} into such a hierarchy, to make it easier to
9322 use one's start-up file. The catch is that it's easy to forget to
9323 unmount @file{/home}. Then, when you use @command{rm -rf} to remove
9324 your normally throw-away chroot, that command will remove everything
9325 under @file{/home}, too.
9326 Use the @option{--one-file-system} option, and it will
9327 warn about and skip directories on other file systems.
9328 Of course, this will not save your @file{/home} if it and your
9329 chroot happen to be on the same file system.
9331 @item --preserve-root
9332 @opindex --preserve-root
9333 @cindex root directory, disallow recursive destruction
9334 Fail upon any attempt to remove the root directory, @file{/},
9335 when used with the @option{--recursive} option.
9336 This is the default behavior.
9337 @xref{Treating / specially}.
9339 @item --no-preserve-root
9340 @opindex --no-preserve-root
9341 @cindex root directory, allow recursive destruction
9342 Do not treat @file{/} specially when removing recursively.
9343 This option is not recommended unless you really want to
9344 remove all the files on your computer.
9345 @xref{Treating / specially}.
9352 @opindex --recursive
9353 @cindex directories, removing (recursively)
9354 Remove the listed directories and their contents recursively.
9360 Print the name of each file before removing it.
9364 @cindex files beginning with @samp{-}, removing
9365 @cindex @samp{-}, removing files beginning with
9366 One common question is how to remove files whose names begin with a
9367 @samp{-}. GNU @command{rm}, like every program that uses the @code{getopt}
9368 function to parse its arguments, lets you use the @samp{--} option to
9369 indicate that all following arguments are non-options. To remove a file
9370 called @file{-f} in the current directory, you could type either:
9383 @opindex - @r{and Unix @command{rm}}
9384 The Unix @command{rm} program's use of a single @samp{-} for this purpose
9385 predates the development of the getopt standard syntax.
9390 @node shred invocation
9391 @section @command{shred}: Remove files more securely
9394 @cindex data, erasing
9395 @cindex erasing data
9397 @command{shred} overwrites devices or files, to help prevent even
9398 very expensive hardware from recovering the data.
9400 Ordinarily when you remove a file (@pxref{rm invocation}), the data is
9401 not actually destroyed. Only the index listing where the file is
9402 stored is destroyed, and the storage is made available for reuse.
9403 There are undelete utilities that will attempt to reconstruct the index
9404 and can bring the file back if the parts were not reused.
9406 On a busy system with a nearly-full drive, space can get reused in a few
9407 seconds. But there is no way to know for sure. If you have sensitive
9408 data, you may want to be sure that recovery is not possible by actually
9409 overwriting the file with non-sensitive data.
9411 However, even after doing that, it is possible to take the disk back
9412 to a laboratory and use a lot of sensitive (and expensive) equipment
9413 to look for the faint ``echoes'' of the original data underneath the
9414 overwritten data. If the data has only been overwritten once, it's not
9417 The best way to remove something irretrievably is to destroy the media
9418 it's on with acid, melt it down, or the like. For cheap removable media
9419 like floppy disks, this is the preferred method. However, hard drives
9420 are expensive and hard to melt, so the @command{shred} utility tries
9421 to achieve a similar effect non-destructively.
9423 This uses many overwrite passes, with the data patterns chosen to
9424 maximize the damage they do to the old data. While this will work on
9425 floppies, the patterns are designed for best effect on hard drives.
9426 For more details, see the source code and Peter Gutmann's paper
9427 @uref{http://www.cs.auckland.ac.nz/~pgut001/pubs/secure_del.html,
9428 @cite{Secure Deletion of Data from Magnetic and Solid-State Memory}},
9429 from the proceedings of the Sixth USENIX Security Symposium (San Jose,
9430 California, July 22--25, 1996).
9432 @strong{Please note} that @command{shred} relies on a very important assumption:
9433 that the file system overwrites data in place. This is the traditional
9434 way to do things, but many modern file system designs do not satisfy this
9435 assumption. Exceptions include:
9440 Log-structured or journaled file systems, such as those supplied with
9441 AIX and Solaris, and JFS, ReiserFS, XFS, Ext3 (in @code{data=journal} mode),
9442 BFS, NTFS, etc., when they are configured to journal @emph{data}.
9445 File systems that write redundant data and carry on even if some writes
9446 fail, such as RAID-based file systems.
9449 File systems that make snapshots, such as Network Appliance's NFS server.
9452 File systems that cache in temporary locations, such as NFS version 3
9456 Compressed file systems.
9459 In the particular case of ext3 file systems, the above disclaimer applies (and
9460 @command{shred} is thus of limited effectiveness) only in @code{data=journal}
9461 mode, which journals file data in addition to just metadata. In both
9462 the @code{data=ordered} (default) and @code{data=writeback} modes,
9463 @command{shred} works as usual. Ext3 journaling modes can be changed
9464 by adding the @code{data=something} option to the mount options for a
9465 particular file system in the @file{/etc/fstab} file, as documented in
9466 the mount man page (man mount).
9468 If you are not sure how your file system operates, then you should assume
9469 that it does not overwrite data in place, which means that shred cannot
9470 reliably operate on regular files in your file system.
9472 Generally speaking, it is more reliable to shred a device than a file,
9473 since this bypasses the problem of file system design mentioned above.
9474 However, even shredding devices is not always completely reliable. For
9475 example, most disks map out bad sectors invisibly to the application; if
9476 the bad sectors contain sensitive data, @command{shred} won't be able to
9479 @command{shred} makes no attempt to detect or report this problem, just as
9480 it makes no attempt to do anything about backups. However, since it is
9481 more reliable to shred devices than files, @command{shred} by default does
9482 not truncate or remove the output file. This default is more suitable
9483 for devices, which typically cannot be truncated and should not be
9486 Finally, consider the risk of backups and mirrors.
9487 File system backups and remote mirrors may contain copies of the
9488 file that cannot be removed, and that will allow a shredded file
9489 to be recovered later. So if you keep any data you may later want
9490 to destroy using @command{shred}, be sure that it is not backed up or mirrored.
9493 shred [@var{option}]@dots{} @var{file}[@dots{}]
9496 The program accepts the following options. Also see @ref{Common options}.
9504 @cindex force deletion
9505 Override file permissions if necessary to allow overwriting.
9508 @itemx -n @var{number}
9509 @itemx --iterations=@var{number}
9510 @opindex -n @var{number}
9511 @opindex --iterations=@var{number}
9512 @cindex iterations, selecting the number of
9513 By default, @command{shred} uses @value{SHRED_DEFAULT_PASSES} passes of
9514 overwrite. You can reduce this to save time, or increase it if you think it's
9515 appropriate. After 25 passes all of the internal overwrite patterns will have
9516 been used at least once.
9518 @item --random-source=@var{file}
9519 @opindex --random-source
9520 @cindex random source for shredding
9521 Use @var{file} as a source of random data used to overwrite and to
9522 choose pass ordering. @xref{Random sources}.
9524 @item -s @var{bytes}
9525 @itemx --size=@var{bytes}
9526 @opindex -s @var{bytes}
9527 @opindex --size=@var{bytes}
9528 @cindex size of file to shred
9529 Shred the first @var{bytes} bytes of the file. The default is to shred
9530 the whole file. @var{bytes} can be followed by a size specification like
9531 @samp{K}, @samp{M}, or @samp{G} to specify a multiple. @xref{Block size}.
9537 @cindex removing files after shredding
9538 After shredding a file, truncate it (if possible) and then remove it.
9539 If a file has multiple links, only the named links will be removed.
9545 Display to standard error all status updates as sterilization proceeds.
9551 By default, @command{shred} rounds the size of a regular file up to the next
9552 multiple of the file system block size to fully erase the last block
9554 Use @option{--exact} to suppress that behavior.
9555 Thus, by default if you shred a 10-byte regular file on a system with 512-byte
9556 blocks, the resulting file will be 512 bytes long. With this option,
9557 shred does not increase the apparent size of the file.
9563 Normally, the last pass that @command{shred} writes is made up of
9564 random data. If this would be conspicuous on your hard drive (for
9565 example, because it looks like encrypted data), or you just think
9566 it's tidier, the @option{--zero} option adds an additional overwrite pass with
9567 all zero bits. This is in addition to the number of passes specified
9568 by the @option{--iterations} option.
9572 You might use the following command to erase all trace of the
9573 file system you'd created on the floppy disk in your first drive.
9574 That command takes about 20 minutes to erase a ``1.44MB'' (actually
9578 shred --verbose /dev/fd0
9581 Similarly, to erase all data on a selected partition of
9582 your hard disk, you could give a command like this:
9585 shred --verbose /dev/sda5
9588 On modern disks, a single pass should be adequate,
9589 and it will take one third the time of the default three-pass approach.
9592 # 1 pass, write pseudo-random data; 3x faster than the default
9593 shred --verbose -n1 /dev/sda5
9596 To be on the safe side, use at least one pass that overwrites using
9597 pseudo-random data. I.e., don't be tempted to use @samp{-n0 --zero},
9598 in case some disk controller optimizes the process of writing blocks
9599 of all zeros, and thereby does not clear all bytes in a block.
9600 Some SSDs may do just that.
9602 A @var{file} of @samp{-} denotes standard output.
9603 The intended use of this is to shred a removed temporary file.
9610 echo "Hello, world" >&3
9615 However, the command @samp{shred - >file} does not shred the contents
9616 of @var{file}, since the shell truncates @var{file} before invoking
9617 @command{shred}. Use the command @samp{shred file} or (if using a
9618 Bourne-compatible shell) the command @samp{shred - 1<>file} instead.
9623 @node Special file types
9624 @chapter Special file types
9626 @cindex special file types
9627 @cindex file types, special
9629 This chapter describes commands which create special types of files (and
9630 @command{rmdir}, which removes directories, one special file type).
9632 @cindex special file types
9634 Although Unix-like operating systems have markedly fewer special file
9635 types than others, not @emph{everything} can be treated only as the
9636 undifferentiated byte stream of @dfn{normal files}. For example, when a
9637 file is created or removed, the system must record this information,
9638 which it does in a @dfn{directory}---a special type of file. Although
9639 you can read directories as normal files, if you're curious, in order
9640 for the system to do its job it must impose a structure, a certain
9641 order, on the bytes of the file. Thus it is a ``special'' type of file.
9643 Besides directories, other special file types include named pipes
9644 (FIFOs), symbolic links, sockets, and so-called @dfn{special files}.
9647 * link invocation:: Make a hard link via the link syscall
9648 * ln invocation:: Make links between files.
9649 * mkdir invocation:: Make directories.
9650 * mkfifo invocation:: Make FIFOs (named pipes).
9651 * mknod invocation:: Make block or character special files.
9652 * readlink invocation:: Print value of a symlink or canonical file name.
9653 * rmdir invocation:: Remove empty directories.
9654 * unlink invocation:: Remove files via the unlink syscall
9658 @node link invocation
9659 @section @command{link}: Make a hard link via the link syscall
9662 @cindex links, creating
9663 @cindex hard links, creating
9664 @cindex creating links (hard only)
9666 @command{link} creates a single hard link at a time.
9667 It is a minimalist interface to the system-provided
9668 @code{link} function. @xref{Hard Links, , , libc,
9669 The GNU C Library Reference Manual}.
9670 It avoids the bells and whistles of the more commonly-used
9671 @command{ln} command (@pxref{ln invocation}).
9675 link @var{filename} @var{linkname}
9678 @var{filename} must specify an existing file, and @var{linkname}
9679 must specify a nonexistent entry in an existing directory.
9680 @command{link} simply calls @code{link (@var{filename}, @var{linkname})}
9683 On a GNU system, this command acts like @samp{ln --directory
9684 --no-target-directory @var{filename} @var{linkname}}. However, the
9685 @option{--directory} and @option{--no-target-directory} options are
9686 not specified by POSIX, and the @command{link} command is
9687 more portable in practice.
9689 If @var{filename} is a symbolic link, it is unspecified whether
9690 @var{linkname} will be a hard link to the symbolic link or to the
9691 target of the symbolic link. Use @command{ln -P} or @command{ln -L}
9692 to specify which behavior is desired.
9698 @section @command{ln}: Make links between files
9701 @cindex links, creating
9702 @cindex hard links, creating
9703 @cindex symbolic (soft) links, creating
9704 @cindex creating links (hard or soft)
9706 @cindex file systems and hard links
9707 @command{ln} makes links between files. By default, it makes hard links;
9708 with the @option{-s} option, it makes symbolic (or @dfn{soft}) links.
9712 ln [@var{option}]@dots{} [-T] @var{target} @var{linkname}
9713 ln [@var{option}]@dots{} @var{target}
9714 ln [@var{option}]@dots{} @var{target}@dots{} @var{directory}
9715 ln [@var{option}]@dots{} -t @var{directory} @var{target}@dots{}
9721 If two file names are given, @command{ln} creates a link to the first
9722 file from the second.
9725 If one @var{target} is given, @command{ln} creates a link to that file
9726 in the current directory.
9729 If the @option{--target-directory} (@option{-t}) option is given, or
9730 failing that if the last file is a directory and the
9731 @option{--no-target-directory} (@option{-T}) option is not given,
9732 @command{ln} creates a link to each @var{target} file in the specified
9733 directory, using the @var{target}s' names.
9737 Normally @command{ln} does not remove existing files. Use the
9738 @option{--force} (@option{-f}) option to remove them unconditionally,
9739 the @option{--interactive} (@option{-i}) option to remove them
9740 conditionally, and the @option{--backup} (@option{-b}) option to
9743 @cindex hard link, defined
9744 @cindex inode, and hard links
9745 A @dfn{hard link} is another name for an existing file; the link and the
9746 original are indistinguishable. Technically speaking, they share the
9747 same inode, and the inode contains all the information about a
9748 file---indeed, it is not incorrect to say that the inode @emph{is} the
9749 file. Most systems prohibit making a hard link to
9750 a directory; on those where it is allowed, only the super-user can do
9751 so (and with caution, since creating a cycle will cause problems to many
9752 other utilities). Hard links cannot cross file system boundaries. (These
9753 restrictions are not mandated by POSIX, however.)
9755 @cindex dereferencing symbolic links
9756 @cindex symbolic link, defined
9757 @dfn{Symbolic links} (@dfn{symlinks} for short), on the other hand, are
9758 a special file type (which not all kernels support: System V release 3
9759 (and older) systems lack symlinks) in which the link file actually
9760 refers to a different file, by name. When most operations (opening,
9761 reading, writing, and so on) are passed the symbolic link file, the
9762 kernel automatically @dfn{dereferences} the link and operates on the
9763 target of the link. But some operations (e.g., removing) work on the
9764 link file itself, rather than on its target. The owner and group of a
9765 symlink are not significant to file access performed through
9766 the link, but do have implications on deleting a symbolic link from a
9767 directory with the restricted deletion bit set. On the GNU system,
9768 the mode of a symlink has no significance and cannot be changed, but
9769 on some BSD systems, the mode can be changed and will affect whether
9770 the symlink will be traversed in file name resolution. @xref{Symbolic Links,,,
9771 libc, The GNU C Library Reference Manual}.
9773 Symbolic links can contain arbitrary strings; a @dfn{dangling symlink}
9774 occurs when the string in the symlink does not resolve to a file.
9775 There are no restrictions against creating dangling symbolic links.
9776 There are trade-offs to using absolute or relative symlinks. An
9777 absolute symlink always points to the same file, even if the directory
9778 containing the link is moved. However, if the symlink is visible from
9779 more than one machine (such as on a networked file system), the file
9780 pointed to might not always be the same. A relative symbolic link is
9781 resolved in relation to the directory that contains the link, and is
9782 often useful in referring to files on the same device without regards
9783 to what name that device is mounted on when accessed via networked
9786 When creating a relative symlink in a different location than the
9787 current directory, the resolution of the symlink will be different
9788 than the resolution of the same string from the current directory.
9789 Therefore, many users prefer to first change directories to the
9790 location where the relative symlink will be created, so that
9791 tab-completion or other file resolution will find the same target as
9792 what will be placed in the symlink.
9794 The program accepts the following options. Also see @ref{Common options}.
9805 @opindex --directory
9806 @cindex hard links to directories
9807 Allow users with appropriate privileges to attempt to make hard links
9809 However, note that this will probably fail due to
9810 system restrictions, even for the super-user.
9816 Remove existing destination files.
9819 @itemx --interactive
9821 @opindex --interactive
9822 @cindex prompting, and @command{ln}
9823 Prompt whether to remove existing destination files.
9829 If @option{-s} is not in effect, and the source file is a symbolic
9830 link, create the hard link to the file referred to by the symbolic
9831 link, rather than the symbolic link itself.
9834 @itemx --no-dereference
9836 @opindex --no-dereference
9837 Do not treat the last operand specially when it is a symbolic link to
9838 a directory. Instead, treat it as if it were a normal file.
9840 When the destination is an actual directory (not a symlink to one),
9841 there is no ambiguity. The link is created in that directory.
9842 But when the specified destination is a symlink to a directory,
9843 there are two ways to treat the user's request. @command{ln} can
9844 treat the destination just as it would a normal directory and create
9845 the link in it. On the other hand, the destination can be viewed as a
9846 non-directory---as the symlink itself. In that case, @command{ln}
9847 must delete or backup that symlink before creating the new link.
9848 The default is to treat a destination that is a symlink to a directory
9849 just like a directory.
9851 This option is weaker than the @option{--no-target-directory}
9852 (@option{-T}) option, so it has no effect if both options are given.
9858 If @option{-s} is not in effect, and the source file is a symbolic
9859 link, create the hard link to the symbolic link itself. On platforms
9860 where this is not supported by the kernel, this option creates a
9861 symbolic link with identical contents; since symbolic link contents
9862 cannot be edited, any file name resolution performed through either
9863 link will be the same as if a hard link had been created.
9869 Make symbolic links relative to the link location.
9874 ln -srv /a/file /tmp
9875 '/tmp/file' -> '../a/file'
9878 Relative symbolic links are generated based on their canonicalized
9879 containing directory, and canonicalized targets. I.E. all symbolic
9880 links in these file names will be resolved.
9881 @xref{realpath invocation}, which gives greater control
9882 over relative file name generation, as demonstrated in the following example:
9887 test "$1" = --no-symlinks && { nosym=$1; shift; }
9889 test -d "$2" && link="$2/." || link="$2"
9890 rtarget="$(realpath $nosym -m "$target" \
9891 --relative-to "$(dirname "$link")")"
9892 ln -s -v "$rtarget" "$link"
9901 Make symbolic links instead of hard links. This option merely produces
9902 an error message on systems that do not support symbolic links.
9908 @optNoTargetDirectory
9914 Print the name of each file after linking it successfully.
9918 @cindex hard links to symbolic links
9919 @cindex symbolic links and @command{ln}
9920 If @option{-L} and @option{-P} are both given, the last one takes
9921 precedence. If @option{-s} is also given, @option{-L} and @option{-P}
9922 are silently ignored. If neither option is given, then this
9923 implementation defaults to @option{-P} if the system @code{link} supports
9924 hard links to symbolic links (such as the GNU system), and @option{-L}
9925 if @code{link} follows symbolic links (such as on BSD).
9934 # Create link ../a pointing to a in that directory.
9935 # Not really useful because it points to itself.
9940 # Change to the target before creating symlinks to avoid being confused.
9946 # Hard coded file names don't move well.
9947 ln -s $(pwd)/a /some/dir/
9951 # Relative file names survive directory moves and also
9952 # work across networked file systems.
9953 ln -s afile anotherfile
9954 ln -s ../adir/afile yetanotherfile
9958 @node mkdir invocation
9959 @section @command{mkdir}: Make directories
9962 @cindex directories, creating
9963 @cindex creating directories
9965 @command{mkdir} creates directories with the specified names. Synopsis:
9968 mkdir [@var{option}]@dots{} @var{name}@dots{}
9971 @command{mkdir} creates each directory @var{name} in the order given.
9972 It reports an error if @var{name} already exists, unless the
9973 @option{-p} option is given and @var{name} is a directory.
9975 The program accepts the following options. Also see @ref{Common options}.
9980 @itemx --mode=@var{mode}
9983 @cindex modes of created directories, setting
9984 Set the file permission bits of created directories to @var{mode},
9985 which uses the same syntax as
9986 in @command{chmod} and uses @samp{a=rwx} (read, write and execute allowed for
9987 everyone) for the point of the departure. @xref{File permissions}.
9989 Normally the directory has the desired file mode bits at the moment it
9990 is created. As a GNU extension, @var{mode} may also mention
9991 special mode bits, but in this case there may be a temporary window
9992 during which the directory exists but its special mode bits are
9993 incorrect. @xref{Directory Setuid and Setgid}, for how the
9994 set-user-ID and set-group-ID bits of directories are inherited unless
9995 overridden in this way.
10001 @cindex parent directories, creating
10002 Make any missing parent directories for each argument, setting their
10003 file permission bits to the umask modified by @samp{u+wx}. Ignore
10004 existing parent directories, and do not change their file permission
10007 To set the file permission bits of any newly-created parent
10008 directories to a value that includes @samp{u+wx}, you can set the
10009 umask before invoking @command{mkdir}. For example, if the shell
10010 command @samp{(umask u=rwx,go=rx; mkdir -p P/Q)} creates the parent
10011 @file{P} it sets the parent's permission bits to @samp{u=rwx,go=rx}.
10012 To set a parent's special mode bits as well, you can invoke
10013 @command{chmod} after @command{mkdir}. @xref{Directory Setuid and
10014 Setgid}, for how the set-user-ID and set-group-ID bits of
10015 newly-created parent directories are inherited.
10021 Print a message for each created directory. This is most useful with
10022 @option{--parents}.
10024 @item -Z @var{context}
10025 @itemx --context=@var{context}
10029 @cindex security context
10030 Set the default SELinux security context to be used for created directories.
10037 @node mkfifo invocation
10038 @section @command{mkfifo}: Make FIFOs (named pipes)
10041 @cindex FIFOs, creating
10042 @cindex named pipes, creating
10043 @cindex creating FIFOs (named pipes)
10045 @command{mkfifo} creates FIFOs (also called @dfn{named pipes}) with the
10046 specified names. Synopsis:
10049 mkfifo [@var{option}] @var{name}@dots{}
10052 A @dfn{FIFO} is a special file type that permits independent processes
10053 to communicate. One process opens the FIFO file for writing, and
10054 another for reading, after which data can flow as with the usual
10055 anonymous pipe in shells or elsewhere.
10057 The program accepts the following option. Also see @ref{Common options}.
10061 @item -m @var{mode}
10062 @itemx --mode=@var{mode}
10065 @cindex modes of created FIFOs, setting
10066 Set the mode of created FIFOs to @var{mode}, which is symbolic as in
10067 @command{chmod} and uses @samp{a=rw} (read and write allowed for everyone)
10068 for the point of departure. @var{mode} should specify only file
10069 permission bits. @xref{File permissions}.
10071 @item -Z @var{context}
10072 @itemx --context=@var{context}
10076 @cindex security context
10077 Set the default SELinux security context to be used for created FIFOs.
10084 @node mknod invocation
10085 @section @command{mknod}: Make block or character special files
10088 @cindex block special files, creating
10089 @cindex character special files, creating
10091 @command{mknod} creates a FIFO, character special file, or block special
10092 file with the specified name. Synopsis:
10095 mknod [@var{option}]@dots{} @var{name} @var{type} [@var{major} @var{minor}]
10098 @cindex special files
10099 @cindex block special files
10100 @cindex character special files
10101 Unlike the phrase ``special file type'' above, the term @dfn{special
10102 file} has a technical meaning on Unix: something that can generate or
10103 receive data. Usually this corresponds to a physical piece of hardware,
10104 e.g., a printer or a disk. (These files are typically created at
10105 system-configuration time.) The @command{mknod} command is what creates
10106 files of this type. Such devices can be read either a character at a
10107 time or a ``block'' (many characters) at a time, hence we say there are
10108 @dfn{block special} files and @dfn{character special} files.
10110 @c mknod is a shell built-in at least with OpenBSD's /bin/sh
10111 @mayConflictWithShellBuiltIn{mknod}
10113 The arguments after @var{name} specify the type of file to make:
10118 @opindex p @r{for FIFO file}
10122 @opindex b @r{for block special file}
10123 for a block special file
10126 @c Don't document the 'u' option -- it's just a synonym for 'c'.
10127 @c Do *any* versions of mknod still use it?
10129 @opindex c @r{for character special file}
10130 @c @opindex u @r{for character special file}
10131 for a character special file
10135 When making a block or character special file, the major and minor
10136 device numbers must be given after the file type.
10137 If a major or minor device number begins with @samp{0x} or @samp{0X},
10138 it is interpreted as hexadecimal; otherwise, if it begins with @samp{0},
10139 as octal; otherwise, as decimal.
10141 The program accepts the following option. Also see @ref{Common options}.
10145 @item -m @var{mode}
10146 @itemx --mode=@var{mode}
10149 Set the mode of created files to @var{mode}, which is symbolic as in
10150 @command{chmod} and uses @samp{a=rw} as the point of departure.
10151 @var{mode} should specify only file permission bits.
10152 @xref{File permissions}.
10154 @item -Z @var{context}
10155 @itemx --context=@var{context}
10159 @cindex security context
10160 Set the default SELinux security context to be used for created files.
10167 @node readlink invocation
10168 @section @command{readlink}: Print value of a symlink or canonical file name
10171 @cindex displaying value of a symbolic link
10172 @cindex canonical file name
10173 @cindex canonicalize a file name
10176 @command{readlink} may work in one of two supported modes:
10180 @item Readlink mode
10182 @command{readlink} outputs the value of the given symbolic links.
10183 If @command{readlink} is invoked with an argument other than the name
10184 of a symbolic link, it produces no output and exits with a nonzero exit code.
10186 @item Canonicalize mode
10188 @command{readlink} outputs the absolute name of the given files which contain
10189 no @file{.}, @file{..} components nor any repeated separators
10190 (@file{/}) or symbolic links.
10195 readlink [@var{option}]@dots{} @var{file}@dots{}
10198 By default, @command{readlink} operates in readlink mode.
10200 The program accepts the following options. Also see @ref{Common options}.
10205 @itemx --canonicalize
10207 @opindex --canonicalize
10208 Activate canonicalize mode.
10209 If any component of the file name except the last one is missing or unavailable,
10210 @command{readlink} produces no output and exits with a nonzero exit
10211 code. A trailing slash is ignored.
10214 @itemx --canonicalize-existing
10216 @opindex --canonicalize-existing
10217 Activate canonicalize mode.
10218 If any component is missing or unavailable, @command{readlink} produces
10219 no output and exits with a nonzero exit code. A trailing slash
10220 requires that the name resolve to a directory.
10223 @itemx --canonicalize-missing
10225 @opindex --canonicalize-missing
10226 Activate canonicalize mode.
10227 If any component is missing or unavailable, @command{readlink} treats it
10231 @itemx --no-newline
10233 @opindex --no-newline
10234 Do not print the output delimiter, when a single @var{file} is specified.
10235 Print a warning if specified along with multiple @var{file}s.
10245 Suppress most error messages.
10251 Report error messages.
10257 Separate output items with NUL characters.
10261 The @command{readlink} utility first appeared in OpenBSD 2.1.
10263 The @command{realpath} command without options, operates like
10264 @command{readlink} in canonicalize mode.
10269 @node rmdir invocation
10270 @section @command{rmdir}: Remove empty directories
10273 @cindex removing empty directories
10274 @cindex directories, removing empty
10276 @command{rmdir} removes empty directories. Synopsis:
10279 rmdir [@var{option}]@dots{} @var{directory}@dots{}
10282 If any @var{directory} argument does not refer to an existing empty
10283 directory, it is an error.
10285 The program accepts the following options. Also see @ref{Common options}.
10289 @item --ignore-fail-on-non-empty
10290 @opindex --ignore-fail-on-non-empty
10291 @cindex directory deletion, ignoring failures
10292 Ignore each failure to remove a directory that is solely because
10293 the directory is non-empty.
10299 @cindex parent directories, removing
10300 Remove @var{directory}, then try to remove each component of @var{directory}.
10301 So, for example, @samp{rmdir -p a/b/c} is similar to @samp{rmdir a/b/c a/b a}.
10302 As such, it fails if any of those directories turns out not to be empty.
10303 Use the @option{--ignore-fail-on-non-empty} option to make it so such
10304 a failure does not evoke a diagnostic and does not cause @command{rmdir} to
10305 exit unsuccessfully.
10311 @cindex directory deletion, reporting
10312 Give a diagnostic for each successful removal.
10313 @var{directory} is removed.
10317 @xref{rm invocation}, for how to remove non-empty directories (recursively).
10322 @node unlink invocation
10323 @section @command{unlink}: Remove files via the unlink syscall
10326 @cindex removing files or directories (via the unlink syscall)
10328 @command{unlink} deletes a single specified file name.
10329 It is a minimalist interface to the system-provided
10330 @code{unlink} function. @xref{Deleting Files, , , libc,
10331 The GNU C Library Reference Manual}. Synopsis:
10332 It avoids the bells and whistles of the more commonly-used
10333 @command{rm} command (@pxref{rm invocation}).
10336 unlink @var{filename}
10339 On some systems @code{unlink} can be used to delete the name of a
10340 directory. On others, it can be used that way only by a privileged user.
10341 In the GNU system @code{unlink} can never delete the name of a directory.
10343 The @command{unlink} command honors the @option{--help} and
10344 @option{--version} options. To remove a file whose name begins with
10345 @samp{-}, prefix the name with @samp{./}, e.g., @samp{unlink ./--help}.
10350 @node Changing file attributes
10351 @chapter Changing file attributes
10353 @cindex changing file attributes
10354 @cindex file attributes, changing
10355 @cindex attributes, file
10357 A file is not merely its contents, a name, and a file type
10358 (@pxref{Special file types}). A file also has an owner (a user ID), a
10359 group (a group ID), permissions (what the owner can do with the file,
10360 what people in the group can do, and what everyone else can do), various
10361 timestamps, and other information. Collectively, we call these a file's
10364 These commands change file attributes.
10367 * chown invocation:: Change file owners and groups.
10368 * chgrp invocation:: Change file groups.
10369 * chmod invocation:: Change access permissions.
10370 * touch invocation:: Change file timestamps.
10374 @node chown invocation
10375 @section @command{chown}: Change file owner and group
10378 @cindex file ownership, changing
10379 @cindex group ownership, changing
10380 @cindex changing file ownership
10381 @cindex changing group ownership
10383 @command{chown} changes the user and/or group ownership of each given @var{file}
10384 to @var{new-owner} or to the user and group of an existing reference file.
10388 chown [@var{option}]@dots{} @{@var{new-owner} | --reference=@var{ref_file}@}@c
10392 If used, @var{new-owner} specifies the new owner and/or group as follows
10393 (with no embedded white space):
10396 [@var{owner}] [ : [@var{group}] ]
10403 If only an @var{owner} (a user name or numeric user ID) is given, that
10404 user is made the owner of each given file, and the files' group is not
10407 @item owner@samp{:}group
10408 If the @var{owner} is followed by a colon and a @var{group} (a
10409 group name or numeric group ID), with no spaces between them, the group
10410 ownership of the files is changed as well (to @var{group}).
10412 @item owner@samp{:}
10413 If a colon but no group name follows @var{owner}, that user is
10414 made the owner of the files and the group of the files is changed to
10415 @var{owner}'s login group.
10417 @item @samp{:}group
10418 If the colon and following @var{group} are given, but the owner
10419 is omitted, only the group of the files is changed; in this case,
10420 @command{chown} performs the same function as @command{chgrp}.
10423 If only a colon is given, or if @var{new-owner} is empty, neither the
10424 owner nor the group is changed.
10428 If @var{owner} or @var{group} is intended to represent a numeric user
10429 or group ID, then you may specify it with a leading @samp{+}.
10430 @xref{Disambiguating names and IDs}.
10432 Some older scripts may still use @samp{.} in place of the @samp{:} separator.
10433 POSIX 1003.1-2001 (@pxref{Standards conformance}) does not
10434 require support for that, but for backward compatibility GNU
10435 @command{chown} supports @samp{.} so long as no ambiguity results.
10436 New scripts should avoid the use of @samp{.} because it is not
10437 portable, and because it has undesirable results if the entire
10438 @var{owner@samp{.}group} happens to identify a user whose name
10441 The @command{chown} command sometimes clears the set-user-ID or
10442 set-group-ID permission bits. This behavior depends on the policy and
10443 functionality of the underlying @code{chown} system call, which may
10444 make system-dependent file mode modifications outside the control of
10445 the @command{chown} command. For example, the @command{chown} command
10446 might not affect those bits when invoked by a user with appropriate
10447 privileges, or when the
10448 bits signify some function other than executable permission (e.g.,
10449 mandatory locking).
10450 When in doubt, check the underlying system behavior.
10452 The program accepts the following options. Also see @ref{Common options}.
10460 @cindex changed owners, verbosely describing
10461 Verbosely describe the action for each @var{file} whose ownership
10470 @cindex error messages, omitting
10471 Do not print error messages about files whose ownership cannot be
10474 @item @w{@kbd{--from}=@var{old-owner}}
10476 @cindex symbolic links, changing owner
10477 Change a @var{file}'s ownership only if it has current attributes specified
10478 by @var{old-owner}. @var{old-owner} has the same form as @var{new-owner}
10480 This option is useful primarily from a security standpoint in that
10481 it narrows considerably the window of potential abuse.
10482 For example, to reflect a user ID numbering change for one user's files
10483 without an option like this, @code{root} might run
10486 find / -owner OLDUSER -print0 | xargs -0 chown -h NEWUSER
10489 But that is dangerous because the interval between when the @command{find}
10490 tests the existing file's owner and when the @command{chown} is actually run
10491 may be quite large.
10492 One way to narrow the gap would be to invoke chown for each file
10496 find / -owner OLDUSER -exec chown -h NEWUSER @{@} \;
10499 But that is very slow if there are many affected files.
10500 With this option, it is safer (the gap is narrower still)
10501 though still not perfect:
10504 chown -h -R --from=OLDUSER NEWUSER /
10507 @item --dereference
10508 @opindex --dereference
10509 @cindex symbolic links, changing owner
10511 Do not act on symbolic links themselves but rather on what they point to.
10512 This is the default.
10515 @itemx --no-dereference
10517 @opindex --no-dereference
10518 @cindex symbolic links, changing owner
10520 Act on symbolic links themselves instead of what they point to.
10521 This mode relies on the @code{lchown} system call.
10522 On systems that do not provide the @code{lchown} system call,
10523 @command{chown} fails when a file specified on the command line
10524 is a symbolic link.
10525 By default, no diagnostic is issued for symbolic links encountered
10526 during a recursive traversal, but see @option{--verbose}.
10528 @item --preserve-root
10529 @opindex --preserve-root
10530 @cindex root directory, disallow recursive modification
10531 Fail upon any attempt to recursively change the root directory, @file{/}.
10532 Without @option{--recursive}, this option has no effect.
10533 @xref{Treating / specially}.
10535 @item --no-preserve-root
10536 @opindex --no-preserve-root
10537 @cindex root directory, allow recursive modification
10538 Cancel the effect of any preceding @option{--preserve-root} option.
10539 @xref{Treating / specially}.
10541 @item --reference=@var{ref_file}
10542 @opindex --reference
10543 Change the user and group of each @var{file} to be the same as those of
10544 @var{ref_file}. If @var{ref_file} is a symbolic link, do not use the
10545 user and group of the symbolic link, but rather those of the file it
10552 Output a diagnostic for every file processed.
10553 If a symbolic link is encountered during a recursive traversal
10554 on a system without the @code{lchown} system call, and @option{--no-dereference}
10555 is in effect, then issue a diagnostic saying neither the symbolic link nor
10556 its referent is being changed.
10561 @opindex --recursive
10562 @cindex recursively changing file ownership
10563 Recursively change ownership of directories and their contents.
10566 @xref{Traversing symlinks}.
10569 @xref{Traversing symlinks}.
10572 @xref{Traversing symlinks}.
10581 # Change the owner of /u to "root".
10584 # Likewise, but also change its group to "staff".
10585 chown root:staff /u
10587 # Change the owner of /u and subfiles to "root".
10592 @node chgrp invocation
10593 @section @command{chgrp}: Change group ownership
10596 @cindex group ownership, changing
10597 @cindex changing group ownership
10599 @command{chgrp} changes the group ownership of each given @var{file}
10600 to @var{group} (which can be either a group name or a numeric group ID)
10601 or to the group of an existing reference file. Synopsis:
10604 chgrp [@var{option}]@dots{} @{@var{group} | --reference=@var{ref_file}@}@c
10608 If @var{group} is intended to represent a
10609 numeric group ID, then you may specify it with a leading @samp{+}.
10610 @xref{Disambiguating names and IDs}.
10612 The program accepts the following options. Also see @ref{Common options}.
10620 @cindex changed files, verbosely describing
10621 Verbosely describe the action for each @var{file} whose group actually
10630 @cindex error messages, omitting
10631 Do not print error messages about files whose group cannot be
10634 @item --dereference
10635 @opindex --dereference
10636 @cindex symbolic links, changing owner
10638 Do not act on symbolic links themselves but rather on what they point to.
10639 This is the default.
10642 @itemx --no-dereference
10644 @opindex --no-dereference
10645 @cindex symbolic links, changing group
10647 Act on symbolic links themselves instead of what they point to.
10648 This mode relies on the @code{lchown} system call.
10649 On systems that do not provide the @code{lchown} system call,
10650 @command{chgrp} fails when a file specified on the command line
10651 is a symbolic link.
10652 By default, no diagnostic is issued for symbolic links encountered
10653 during a recursive traversal, but see @option{--verbose}.
10655 @item --preserve-root
10656 @opindex --preserve-root
10657 @cindex root directory, disallow recursive modification
10658 Fail upon any attempt to recursively change the root directory, @file{/}.
10659 Without @option{--recursive}, this option has no effect.
10660 @xref{Treating / specially}.
10662 @item --no-preserve-root
10663 @opindex --no-preserve-root
10664 @cindex root directory, allow recursive modification
10665 Cancel the effect of any preceding @option{--preserve-root} option.
10666 @xref{Treating / specially}.
10668 @item --reference=@var{ref_file}
10669 @opindex --reference
10670 Change the group of each @var{file} to be the same as that of
10671 @var{ref_file}. If @var{ref_file} is a symbolic link, do not use the
10672 group of the symbolic link, but rather that of the file it refers to.
10678 Output a diagnostic for every file processed.
10679 If a symbolic link is encountered during a recursive traversal
10680 on a system without the @code{lchown} system call, and @option{--no-dereference}
10681 is in effect, then issue a diagnostic saying neither the symbolic link nor
10682 its referent is being changed.
10687 @opindex --recursive
10688 @cindex recursively changing group ownership
10689 Recursively change the group ownership of directories and their contents.
10692 @xref{Traversing symlinks}.
10695 @xref{Traversing symlinks}.
10698 @xref{Traversing symlinks}.
10707 # Change the group of /u to "staff".
10710 # Change the group of /u and subfiles to "staff".
10715 @node chmod invocation
10716 @section @command{chmod}: Change access permissions
10719 @cindex changing access permissions
10720 @cindex access permissions, changing
10721 @cindex permissions, changing access
10723 @command{chmod} changes the access permissions of the named files. Synopsis:
10726 chmod [@var{option}]@dots{} @{@var{mode} | --reference=@var{ref_file}@}@c
10730 @cindex symbolic links, permissions of
10731 @command{chmod} never changes the permissions of symbolic links, since
10732 the @command{chmod} system call cannot change their permissions.
10733 This is not a problem since the permissions of symbolic links are
10734 never used. However, for each symbolic link listed on the command
10735 line, @command{chmod} changes the permissions of the pointed-to file.
10736 In contrast, @command{chmod} ignores symbolic links encountered during
10737 recursive directory traversals.
10739 A successful use of @command{chmod} clears the set-group-ID bit of a
10740 regular file if the file's group ID does not match the user's
10741 effective group ID or one of the user's supplementary group IDs,
10742 unless the user has appropriate privileges. Additional restrictions
10743 may cause the set-user-ID and set-group-ID bits of @var{mode} or
10744 @var{ref_file} to be ignored. This behavior depends on the policy and
10745 functionality of the underlying @code{chmod} system call. When in
10746 doubt, check the underlying system behavior.
10748 If used, @var{mode} specifies the new file mode bits.
10749 For details, see the section on @ref{File permissions}.
10750 If you really want @var{mode} to have a leading @samp{-}, you should
10751 use @option{--} first, e.g., @samp{chmod -- -w file}. Typically,
10752 though, @samp{chmod a-w file} is preferable, and @command{chmod -w
10753 file} (without the @option{--}) complains if it behaves differently
10754 from what @samp{chmod a-w file} would do.
10756 The program accepts the following options. Also see @ref{Common options}.
10764 Verbosely describe the action for each @var{file} whose permissions
10773 @cindex error messages, omitting
10774 Do not print error messages about files whose permissions cannot be
10777 @item --preserve-root
10778 @opindex --preserve-root
10779 @cindex root directory, disallow recursive modification
10780 Fail upon any attempt to recursively change the root directory, @file{/}.
10781 Without @option{--recursive}, this option has no effect.
10782 @xref{Treating / specially}.
10784 @item --no-preserve-root
10785 @opindex --no-preserve-root
10786 @cindex root directory, allow recursive modification
10787 Cancel the effect of any preceding @option{--preserve-root} option.
10788 @xref{Treating / specially}.
10794 Verbosely describe the action or non-action taken for every @var{file}.
10796 @item --reference=@var{ref_file}
10797 @opindex --reference
10798 Change the mode of each @var{file} to be the same as that of @var{ref_file}.
10799 @xref{File permissions}.
10800 If @var{ref_file} is a symbolic link, do not use the mode
10801 of the symbolic link, but rather that of the file it refers to.
10806 @opindex --recursive
10807 @cindex recursively changing access permissions
10808 Recursively change permissions of directories and their contents.
10815 @node touch invocation
10816 @section @command{touch}: Change file timestamps
10819 @cindex changing file timestamps
10820 @cindex file timestamps, changing
10821 @cindex timestamps, changing file
10823 @command{touch} changes the access and/or modification times of the
10824 specified files. Synopsis:
10827 touch [@var{option}]@dots{} @var{file}@dots{}
10830 @cindex empty files, creating
10831 Any @var{file} argument that does not exist is created empty, unless
10832 option @option{--no-create} (@option{-c}) or @option{--no-dereference}
10833 (@option{-h}) was in effect.
10835 A @var{file} argument string of @samp{-} is handled specially and
10836 causes @command{touch} to change the times of the file associated with
10840 By default, @command{touch} sets file timestamps to the current time.
10841 Because @command{touch} acts on its operands left to right, the
10842 resulting timestamps of earlier and later operands may disagree.
10843 Also, the determination of what time is ``current'' depends on the
10844 platform. Platforms with network file systems often use different
10845 clocks for the operating system and for file systems; because
10846 @command{touch} typically uses file systems' clocks by default, clock
10847 skew can cause the resulting file timestamps to appear to be in a
10848 program's ``future'' or ``past''.
10850 @cindex file timestamp resolution
10851 The @command{touch} command sets the file's timestamp to the greatest
10852 representable value that is not greater than the requested time. This
10853 can differ from the requested time for several reasons. First, the
10854 requested time may have a higher resolution than supported. Second, a
10855 file system may use different resolutions for different types of
10856 times. Third, file timestamps may use a different resolution than
10857 operating system timestamps. Fourth, the operating system primitives
10858 used to update timestamps may employ yet a different resolution. For
10859 example, in theory a file system might use 10-microsecond resolution
10860 for access time and 100-nanosecond resolution for modification time,
10861 and the operating system might use nanosecond resolution for the
10862 current time and microsecond resolution for the primitive that
10863 @command{touch} uses to set a file's timestamp to an arbitrary value.
10865 @cindex permissions, for changing file timestamps
10866 When setting file timestamps to the current time, @command{touch} can
10867 change the timestamps for files that the user does not own but has
10868 write permission for. Otherwise, the user must own the files. Some
10869 older systems have a further restriction: the user must own the files
10870 unless both the access and modification times are being set to the
10873 Although @command{touch} provides options for changing two of the times---the
10874 times of last access and modification---of a file, there is actually
10875 a standard third one as well: the inode change time. This is often
10876 referred to as a file's @code{ctime}.
10877 The inode change time represents the time when the file's meta-information
10878 last changed. One common example of this is when the permissions of a
10879 file change. Changing the permissions doesn't access the file, so
10880 the atime doesn't change, nor does it modify the file, so the mtime
10881 doesn't change. Yet, something about the file itself has changed,
10882 and this must be noted somewhere. This is the job of the ctime field.
10883 This is necessary, so that, for example, a backup program can make a
10884 fresh copy of the file, including the new permissions value.
10885 Another operation that modifies a file's ctime without affecting
10886 the others is renaming. In any case, it is not possible, in normal
10887 operations, for a user to change the ctime field to a user-specified value.
10888 Some operating systems and file systems support a fourth time: the
10889 birth time, when the file was first created; by definition, this
10890 timestamp never changes.
10893 Time stamps assume the time zone rules specified by the @env{TZ}
10894 environment variable, or by the system default rules if @env{TZ} is
10895 not set. @xref{TZ Variable,, Specifying the Time Zone with @env{TZ},
10896 libc, The GNU C Library Reference Manual}.
10897 You can avoid ambiguities during
10898 daylight saving transitions by using UTC time stamps.
10900 The program accepts the following options. Also see @ref{Common options}.
10905 @itemx --time=atime
10906 @itemx --time=access
10910 @opindex atime@r{, changing}
10911 @opindex access @r{time, changing}
10912 @opindex use @r{time, changing}
10913 Change the access time only.
10918 @opindex --no-create
10919 Do not warn about or create files that do not exist.
10922 @itemx --date=@var{time}
10926 Use @var{time} instead of the current time. It can contain month names,
10927 time zones, @samp{am} and @samp{pm}, @samp{yesterday}, etc. For
10928 example, @option{--date="2004-02-27 14:19:13.489392193 +0530"}
10929 specifies the instant of time that is 489,392,193 nanoseconds after
10930 February 27, 2004 at 2:19:13 PM in a time zone that is 5 hours and 30
10931 minutes east of UTC@. @xref{Date input formats}.
10932 File systems that do not support high-resolution time stamps
10933 silently ignore any excess precision here.
10937 @cindex BSD @command{touch} compatibility
10938 Ignored; for compatibility with BSD versions of @command{touch}.
10941 @itemx --no-dereference
10943 @opindex --no-dereference
10944 @cindex symbolic links, changing time
10946 Attempt to change the timestamps of a symbolic link, rather than what
10947 the link refers to. When using this option, empty files are not
10948 created, but option @option{-c} must also be used to avoid warning
10949 about files that do not exist. Not all systems support changing the
10950 timestamps of symlinks, since underlying system support for this
10951 action was not required until POSIX 2008. Also, on some
10952 systems, the mere act of examining a symbolic link changes the access
10953 time, such that only changes to the modification time will persist
10954 long enough to be observable. When coupled with option @option{-r}, a
10955 reference timestamp is taken from a symbolic link rather than the file
10959 @itemx --time=mtime
10960 @itemx --time=modify
10963 @opindex mtime@r{, changing}
10964 @opindex modify @r{time, changing}
10965 Change the modification time only.
10967 @item -r @var{file}
10968 @itemx --reference=@var{file}
10970 @opindex --reference
10971 Use the times of the reference @var{file} instead of the current time.
10972 If this option is combined with the @option{--date=@var{time}}
10973 (@option{-d @var{time}}) option, the reference @var{file}'s time is
10974 the origin for any relative @var{time}s given, but is otherwise ignored.
10975 For example, @samp{-r foo -d '-5 seconds'} specifies a time stamp
10976 equal to five seconds before the corresponding time stamp for @file{foo}.
10977 If @var{file} is a symbolic link, the reference timestamp is taken
10978 from the target of the symlink, unless @option{-h} was also in effect.
10980 @item -t [[@var{cc}]@var{yy}]@var{mmddhhmm}[.@var{ss}]
10981 @cindex leap seconds
10982 Use the argument (optional four-digit or two-digit years, months,
10983 days, hours, minutes, optional seconds) instead of the current time.
10984 If the year is specified with only two digits, then @var{cc}
10985 is 20 for years in the range 0 @dots{} 68, and 19 for years in
10986 69 @dots{} 99. If no digits of the year are specified,
10987 the argument is interpreted as a date in the current year.
10988 On the atypical systems that support leap seconds, @var{ss} may be
10993 @vindex _POSIX2_VERSION
10994 On older systems, @command{touch} supports an obsolete syntax, as follows.
10995 If no timestamp is given with any of the @option{-d}, @option{-r}, or
10996 @option{-t} options, and if there are two or more @var{file}s and the
10997 first @var{file} is of the form @samp{@var{mmddhhmm}[@var{yy}]} and this
10998 would be a valid argument to the @option{-t} option (if the @var{yy}, if
10999 any, were moved to the front), and if the represented year
11000 is in the range 1969--1999, that argument is interpreted as the time
11001 for the other files instead of as a file name.
11002 This obsolete behavior can be enabled or disabled with the
11003 @env{_POSIX2_VERSION} environment variable (@pxref{Standards
11004 conformance}), but portable scripts should avoid commands whose
11005 behavior depends on this variable.
11006 For example, use @samp{touch ./12312359 main.c} or @samp{touch -t
11007 12312359 main.c} rather than the ambiguous @samp{touch 12312359 main.c}.
11013 @chapter Disk usage
11017 No disk can hold an infinite amount of data. These commands report
11018 how much disk storage is in use or available, report other file and
11019 file status information, and write buffers to disk.
11022 * df invocation:: Report file system disk space usage.
11023 * du invocation:: Estimate file space usage.
11024 * stat invocation:: Report file or file system status.
11025 * sync invocation:: Synchronize memory and disk.
11026 * truncate invocation:: Shrink or extend the size of a file.
11030 @node df invocation
11031 @section @command{df}: Report file system disk space usage
11034 @cindex file system disk usage
11035 @cindex disk usage by file system
11037 @command{df} reports the amount of disk space used and available on
11038 file systems. Synopsis:
11041 df [@var{option}]@dots{} [@var{file}]@dots{}
11044 With no arguments, @command{df} reports the space used and available on all
11045 currently mounted file systems (of all types). Otherwise, @command{df}
11046 reports on the file system containing each argument @var{file}.
11048 Normally the disk space is printed in units of
11049 1024 bytes, but this can be overridden (@pxref{Block size}).
11050 Non-integer quantities are rounded up to the next higher unit.
11052 For bind mounts and without arguments, @command{df} only outputs the statistics
11053 for that device with the shortest mount point name in the list of file systems
11054 (@var{mtab}), i.e., it hides duplicate entries, unless the @option{-a} option is
11057 With the same logic, @command{df} elides a mount entry of a dummy pseudo device
11058 if there is another mount entry of a real block device for that mount point with
11059 the same device number, e.g. the early-boot pseudo file system @samp{rootfs} is
11060 not shown per default when already the real root device has been mounted.
11062 @cindex disk device file
11063 @cindex device file, disk
11064 If an argument @var{file} is an absolute name of a disk device node containing
11065 a mounted file system, @command{df} shows the space available on that
11066 file system rather than on the file system containing the device node.
11067 GNU @command{df} does not attempt to determine the disk usage
11068 on unmounted file systems, because on most kinds of systems doing so
11069 requires extremely nonportable intimate knowledge of file system structures.
11071 The program accepts the following options. Also see @ref{Common options}.
11079 @cindex automounter file systems
11080 @cindex ignore file systems
11081 Include in the listing dummy file systems, which
11082 are omitted by default. Such file systems are typically special-purpose
11083 pseudo-file-systems, such as automounter entries.
11085 @item -B @var{size}
11086 @itemx --block-size=@var{size}
11088 @opindex --block-size
11089 @cindex file system sizes
11090 Scale sizes by @var{size} before printing them (@pxref{Block size}).
11091 For example, @option{-BG} prints sizes in units of 1,073,741,824 bytes.
11095 @cindex grand total of disk size, usage and available space
11096 Print a grand total of all arguments after all arguments have
11097 been processed. This can be used to find out the total disk size, usage
11098 and available space of all listed devices.
11100 For the grand total line, @command{df} prints @samp{"total"} into the
11101 @var{source} column, and @samp{"-"} into the @var{target} column.
11102 If there is no @var{source} column (see @option{--output}), then
11103 @command{df} prints @samp{"total"} into the @var{target} column,
11110 Equivalent to @option{--si}.
11116 @cindex inode usage
11117 List inode usage information instead of block usage. An inode (short
11118 for index node) contains information about a file such as its owner,
11119 permissions, timestamps, and location on the disk.
11123 @cindex kibibytes for file system sizes
11124 Print sizes in 1024-byte blocks, overriding the default block size
11125 (@pxref{Block size}).
11126 This option is equivalent to @option{--block-size=1K}.
11132 @cindex file system types, limiting output to certain
11133 Limit the listing to local file systems. By default, remote file systems
11138 @cindex file system space, retrieving old data more quickly
11139 Do not invoke the @code{sync} system call before getting any usage data.
11140 This may make @command{df} run significantly faster on systems with many
11141 disks, but on some systems (notably SunOS) the results may be slightly
11142 out of date. This is the default.
11145 @itemx @w{@kbd{--output}[=@var{field_list}]}
11147 Use the output format defined by @var{field_list}, or print all fields if
11148 @var{field_list} is omitted. In the latter case, the order of the columns
11149 conforms to the order of the field descriptions below.
11151 The use of the @option{--output} together with each of the options @option{-i},
11152 @option{-P}, and @option{-T} is mutually exclusive.
11154 FIELD_LIST is a comma-separated list of columns to be included in @command{df}'s
11155 output and therefore effectively controls the order of output columns.
11156 Each field can thus be used at the place of choice, but yet must only be
11159 Valid field names in the @var{field_list} are:
11162 The source of the mount point, usually a device.
11167 Total number of inodes.
11169 Number of used inodes.
11171 Number of available inodes.
11173 Percentage of @var{iused} divided by @var{itotal}.
11176 Total number of blocks.
11178 Number of used blocks.
11180 Number of available blocks.
11182 Percentage of @var{used} divided by @var{size}.
11188 The fields for block and inodes statistics are affected by the scaling
11189 options like @option{-h} as usual.
11191 The definition of the @var{field_list} can even be splitted among several
11192 @option{--output} uses.
11196 # Print the TARGET (i.e., the mount point) along with their percentage
11197 # statistic regarding the blocks and the inodes.
11198 df --out=target --output=pcent,ipcent
11200 # Print all available fields.
11206 @itemx --portability
11208 @opindex --portability
11209 @cindex one-line output format
11210 @cindex POSIX output format
11211 @cindex portable output format
11212 @cindex output format, portable
11213 Use the POSIX output format. This is like the default format except
11218 The information about each file system is always printed on exactly
11219 one line; a mount device is never put on a line by itself. This means
11220 that if the mount device name is more than 20 characters long (e.g., for
11221 some network mounts), the columns are misaligned.
11224 The labels in the header output line are changed to conform to POSIX.
11227 The default block size and output format are unaffected by the
11228 @env{DF_BLOCK_SIZE}, @env{BLOCK_SIZE} and @env{BLOCKSIZE} environment
11229 variables. However, the default block size is still affected by
11230 @env{POSIXLY_CORRECT}: it is 512 if @env{POSIXLY_CORRECT} is set, 1024
11231 otherwise. @xref{Block size}.
11238 @cindex file system space, retrieving current data more slowly
11239 Invoke the @code{sync} system call before getting any usage data. On
11240 some systems (notably SunOS), doing this yields more up to date results,
11241 but in general this option makes @command{df} much slower, especially when
11242 there are many or very busy file systems.
11244 @item -t @var{fstype}
11245 @itemx --type=@var{fstype}
11248 @cindex file system types, limiting output to certain
11249 Limit the listing to file systems of type @var{fstype}. Multiple
11250 file system types can be specified by giving multiple @option{-t} options.
11251 By default, nothing is omitted.
11254 @itemx --print-type
11256 @opindex --print-type
11257 @cindex file system types, printing
11258 Print each file system's type. The types printed here are the same ones
11259 you can include or exclude with @option{-t} and @option{-x}. The particular
11260 types printed are whatever is supported by the system. Here are some of
11261 the common names (this list is certainly not exhaustive):
11266 @cindex NFS file system type
11267 An NFS file system, i.e., one mounted over a network from another
11268 machine. This is the one type name which seems to be used uniformly by
11271 @item ext2@r{, }ext3@r{, }ext4@r{, }xfs@r{, }btrfs@dots{}
11272 @cindex Linux file system types
11273 @cindex local file system types
11274 @opindex ext2 @r{file system type}
11275 @opindex ext3 @r{file system type}
11276 @opindex ext4 @r{file system type}
11277 @opindex xfs @r{file system type}
11278 @opindex btrfs @r{file system type}
11279 A file system on a locally-mounted hard disk. (The system might even
11280 support more than one type here; Linux does.)
11282 @item iso9660@r{, }cdfs
11283 @cindex CD-ROM file system type
11284 @cindex DVD file system type
11285 @cindex ISO9660 file system type
11286 @opindex iso9660 @r{file system type}
11287 @opindex cdfs @r{file system type}
11288 A file system on a CD or DVD drive. HP-UX uses @samp{cdfs}, most other
11289 systems use @samp{iso9660}.
11292 @cindex NTFS file system
11293 @cindex DOS file system
11294 @cindex MS-DOS file system
11295 @cindex MS-Windows file system
11296 @opindex ntfs @r{file system file}
11297 @opindex fat @r{file system file}
11298 File systems used by MS-Windows / MS-DOS.
11302 @item -x @var{fstype}
11303 @itemx --exclude-type=@var{fstype}
11305 @opindex --exclude-type
11306 Limit the listing to file systems not of type @var{fstype}.
11307 Multiple file system types can be eliminated by giving multiple
11308 @option{-x} options. By default, no file system types are omitted.
11311 Ignored; for compatibility with System V versions of @command{df}.
11316 Failure includes the case where no output is generated, so you can
11317 inspect the exit status of a command like @samp{df -t ext3 -t reiserfs
11318 @var{dir}} to test whether @var{dir} is on a file system of type
11319 @samp{ext3} or @samp{reiserfs}.
11321 Since the list of file systems (@var{mtab}) is needed to determine the
11322 file system type, failure includes the cases when that list cannot
11323 be read and one or more of the options @option{-a}, @option{-l}, @option{-t}
11324 or @option{-x} is used together with a file name argument.
11327 @node du invocation
11328 @section @command{du}: Estimate file space usage
11331 @cindex file space usage
11332 @cindex disk usage for files
11334 @command{du} reports the amount of disk space used by the specified files
11335 and for each subdirectory (of directory arguments). Synopsis:
11338 du [@var{option}]@dots{} [@var{file}]@dots{}
11341 With no arguments, @command{du} reports the disk space for the current
11342 directory. Normally the disk space is printed in units of
11343 1024 bytes, but this can be overridden (@pxref{Block size}).
11344 Non-integer quantities are rounded up to the next higher unit.
11346 If two or more hard links point to the same file, only one of the hard
11347 links is counted. The @var{file} argument order affects which links
11348 are counted, and changing the argument order may change the numbers
11349 that @command{du} outputs.
11351 The program accepts the following options. Also see @ref{Common options}.
11361 Show counts for all files, not just directories.
11363 @item --apparent-size
11364 @opindex --apparent-size
11365 Print apparent sizes, rather than disk usage. The apparent size of a
11366 file is the number of bytes reported by @code{wc -c} on regular files,
11367 or more generally, @code{ls -l --block-size=1} or @code{stat --format=%s}.
11368 For example, a file containing the word @samp{zoo} with no newline would,
11369 of course, have an apparent size of 3. Such a small file may require
11370 anywhere from 0 to 16 KiB or more of disk space, depending on
11371 the type and configuration of the file system on which the file resides.
11372 However, a sparse file created with this command:
11375 dd bs=1 seek=2GiB if=/dev/null of=big
11379 has an apparent size of 2 GiB, yet on most modern
11380 systems, it actually uses almost no disk space.
11382 @item -B @var{size}
11383 @itemx --block-size=@var{size}
11385 @opindex --block-size
11387 Scale sizes by @var{size} before printing them (@pxref{Block size}).
11388 For example, @option{-BG} prints sizes in units of 1,073,741,824 bytes.
11394 Equivalent to @code{--apparent-size --block-size=1}.
11400 @cindex grand total of disk space
11401 Print a grand total of all arguments after all arguments have
11402 been processed. This can be used to find out the total disk usage of
11403 a given set of files or directories.
11406 @itemx --dereference-args
11408 @opindex --dereference-args
11409 Dereference symbolic links that are command line arguments.
11410 Does not affect other symbolic links. This is helpful for finding
11411 out the disk usage of directories, such as @file{/usr/tmp}, which
11412 are often symbolic links.
11414 @item -d @var{depth}
11415 @itemx --max-depth=@var{depth}
11416 @opindex -d @var{depth}
11417 @opindex --max-depth=@var{depth}
11418 @cindex limiting output of @command{du}
11419 Show the total for each directory (and file if --all) that is at
11420 most MAX_DEPTH levels down from the root of the hierarchy. The root
11421 is at level 0, so @code{du --max-depth=0} is equivalent to @code{du -s}.
11423 @c --files0-from=FILE
11424 @filesZeroFromOption{du,, with the @option{--total} (@option{-c}) option}
11428 Equivalent to @option{--dereference-args} (@option{-D}).
11434 @cindex inode usage, dereferencing in @command{du}
11435 List inode usage information instead of block usage.
11436 This option is useful for finding directories which contain many files, and
11437 therefore eat up most of the inodes space of a file system (see @command{df},
11438 option @option{--inodes}).
11439 It can well be combined with the options @option{-a}, @option{-c},
11440 @option{-h}, @option{-l}, @option{-s}, @option{-S}, @option{-t} and
11441 @option{-x}; however, passing other options regarding the block size, for
11442 example @option{-b}, @option{-m} and @option{--apparent-size}, is ignored.
11446 @cindex kibibytes for file sizes
11447 Print sizes in 1024-byte blocks, overriding the default block size
11448 (@pxref{Block size}).
11449 This option is equivalent to @option{--block-size=1K}.
11452 @itemx --dereference
11454 @opindex --dereference
11455 @cindex symbolic links, dereferencing in @command{du}
11456 Dereference symbolic links (show the disk space used by the file
11457 or directory that the link points to instead of the space used by
11461 @itemx --count-links
11463 @opindex --count-links
11464 @cindex hard links, counting in @command{du}
11465 Count the size of all files, even if they have appeared already (as a
11470 @cindex mebibytes for file sizes
11471 Print sizes in 1,048,576-byte blocks, overriding the default block size
11472 (@pxref{Block size}).
11473 This option is equivalent to @option{--block-size=1M}.
11476 @itemx --no-dereference
11478 @opindex --no-dereference
11479 @cindex symbolic links, dereferencing in @command{du}
11480 For each symbolic links encountered by @command{du},
11481 consider the disk space used by the symbolic link.
11484 @itemx --separate-dirs
11486 @opindex --separate-dirs
11487 Normally, in the output of @command{du} (when not using @option{--summarize}),
11488 the size listed next to a directory name, @var{d}, represents the sum
11489 of sizes of all entries beneath @var{d} as well as the size of @var{d} itself.
11490 With @option{--separate-dirs}, the size reported for a directory name,
11491 @var{d}, will exclude the size of any subdirectories.
11498 @opindex --summarize
11499 Display only a total for each argument.
11501 @item -t @var{size}
11502 @itemx --threshold=@var{size}
11504 @opindex --threshold
11505 Exclude entries based on a given @var{size}. The @var{size} refers to used
11506 blocks in normal mode (@pxref{Block size}), or inodes count in conjunction
11507 with the @option{--inodes} option.
11509 If @var{size} is positive, then @command{du} will only print entries with a size
11510 greater than or equal to that.
11512 If @var{size} is negative, then @command{du} will only print entries with a size
11513 smaller than or equal to that.
11515 Although GNU @command{find} can be used to find files of a certain size,
11516 @command{du}'s @option{--threshold} option can be used to also filter
11517 directories based on a given size.
11519 Please note that the @option{--threshold} option can be combined with the
11520 @option{--apparent-size} option, and in this case would elide entries based on
11523 Please note that the @option{--threshold} option can be combined with the
11524 @option{--inodes} option, and in this case would elide entries based on
11527 Here's how you would use @option{--threshold} to find directories with a size
11528 greater than or equal to 200 megabytes:
11531 du --threshold=200MB
11534 Here's how you would use @option{--threshold} to find directories and files -
11535 note the @option{-a} - with an apparent size smaller than or equal to 500 bytes:
11538 du -a -t -500 --apparent-size
11541 Here's how you would use @option{--threshold} to find directories on the root
11542 file system with more than 20000 inodes used in the directory tree below:
11545 du --inodes -x --threshold=20000 /
11551 @cindex last modified dates, displaying in @command{du}
11552 Show time of the most recent modification of any file in the directory,
11553 or any of its subdirectories.
11556 @itemx --time=status
11559 @opindex ctime@r{, show the most recent}
11560 @opindex status time@r{, show the most recent}
11561 @opindex use time@r{, show the most recent}
11562 Show the most recent status change time (the @samp{ctime} in the inode) of
11563 any file in the directory, instead of the modification time.
11566 @itemx --time=access
11568 @opindex atime@r{, show the most recent}
11569 @opindex access time@r{, show the most recent}
11570 Show the most recent access time (the @samp{atime} in the inode) of
11571 any file in the directory, instead of the modification time.
11573 @item --time-style=@var{style}
11574 @opindex --time-style
11576 List timestamps in style @var{style}. This option has an effect only if
11577 the @option{--time} option is also specified. The @var{style} should
11578 be one of the following:
11581 @item +@var{format}
11583 List timestamps using @var{format}, where @var{format} is interpreted
11584 like the format argument of @command{date} (@pxref{date invocation}).
11585 For example, @option{--time-style="+%Y-%m-%d %H:%M:%S"} causes
11586 @command{du} to list timestamps like @samp{2002-03-30 23:45:56}. As
11587 with @command{date}, @var{format}'s interpretation is affected by the
11588 @env{LC_TIME} locale category.
11591 List timestamps in full using ISO 8601 date, time, and time zone
11592 format with nanosecond precision, e.g., @samp{2002-03-30
11593 23:45:56.477817180 -0700}. This style is equivalent to
11594 @samp{+%Y-%m-%d %H:%M:%S.%N %z}.
11597 List ISO 8601 date and time in minutes, e.g.,
11598 @samp{2002-03-30 23:45}. These timestamps are shorter than
11599 @samp{full-iso} timestamps, and are usually good enough for everyday
11600 work. This style is equivalent to @samp{+%Y-%m-%d %H:%M}.
11603 List ISO 8601 dates for timestamps, e.g., @samp{2002-03-30}.
11604 This style is equivalent to @samp{+%Y-%m-%d}.
11608 You can specify the default value of the @option{--time-style} option
11609 with the environment variable @env{TIME_STYLE}; if @env{TIME_STYLE} is not set
11610 the default style is @samp{long-iso}. For compatibility with @command{ls},
11611 if @env{TIME_STYLE} begins with @samp{+} and contains a newline,
11612 the newline and any later characters are ignored; if @env{TIME_STYLE}
11613 begins with @samp{posix-} the @samp{posix-} is ignored; and if
11614 @env{TIME_STYLE} is @samp{locale} it is ignored.
11616 @item -X @var{file}
11617 @itemx --exclude-from=@var{file}
11618 @opindex -X @var{file}
11619 @opindex --exclude-from=@var{file}
11620 @cindex excluding files from @command{du}
11621 Like @option{--exclude}, except take the patterns to exclude from @var{file},
11622 one per line. If @var{file} is @samp{-}, take the patterns from standard
11625 @item --exclude=@var{pattern}
11626 @opindex --exclude=@var{pattern}
11627 @cindex excluding files from @command{du}
11628 When recursing, skip subdirectories or files matching @var{pattern}.
11629 For example, @code{du --exclude='*.o'} excludes files whose names
11633 @itemx --one-file-system
11635 @opindex --one-file-system
11636 @cindex one file system, restricting @command{du} to
11637 Skip directories that are on different file systems from the one that
11638 the argument being processed is on.
11642 @cindex NFS mounts from BSD to HP-UX
11643 On BSD systems, @command{du} reports sizes that are half the correct
11644 values for files that are NFS-mounted from HP-UX systems. On HP-UX
11645 systems, it reports sizes that are twice the correct values for
11646 files that are NFS-mounted from BSD systems. This is due to a flaw
11647 in HP-UX; it also affects the HP-UX @command{du} program.
11652 @node stat invocation
11653 @section @command{stat}: Report file or file system status
11656 @cindex file status
11657 @cindex file system status
11659 @command{stat} displays information about the specified file(s). Synopsis:
11662 stat [@var{option}]@dots{} [@var{file}]@dots{}
11665 With no option, @command{stat} reports all information about the given files.
11666 But it also can be used to report the information of the file systems the
11667 given files are located on. If the files are links, @command{stat} can
11668 also give information about the files the links point to.
11670 @mayConflictWithShellBuiltIn{stat}
11675 @itemx --dereference
11677 @opindex --dereference
11678 @cindex symbolic links, dereferencing in @command{stat}
11679 Change how @command{stat} treats symbolic links.
11680 With this option, @command{stat} acts on the file referenced
11681 by each symbolic link argument.
11682 Without it, @command{stat} acts on any symbolic link argument directly.
11685 @itemx --file-system
11687 @opindex --file-system
11688 @cindex file systems
11689 Report information about the file systems where the given files are located
11690 instead of information about the files themselves.
11691 This option implies the @option{-L} option.
11694 @itemx --format=@var{format}
11696 @opindex --format=@var{format}
11697 @cindex output format
11698 Use @var{format} rather than the default format.
11699 @var{format} is automatically newline-terminated, so
11700 running a command like the following with two or more @var{file}
11701 operands produces a line of output for each operand:
11703 $ stat --format=%d:%i / /usr
11708 @item --printf=@var{format}
11709 @opindex --printf=@var{format}
11710 @cindex output format
11711 Use @var{format} rather than the default format.
11712 Like @option{--format}, but interpret backslash escapes,
11713 and do not output a mandatory trailing newline.
11714 If you want a newline, include @samp{\n} in the @var{format}.
11715 Here's how you would use @option{--printf} to print the device
11716 and inode numbers of @file{/} and @file{/usr}:
11718 $ stat --printf='%d:%i\n' / /usr
11727 @cindex terse output
11728 Print the information in terse form, suitable for parsing by other programs.
11732 The valid @var{format} directives for files with @option{--format} and
11733 @option{--printf} are:
11736 @item %a - Access rights in octal
11737 @item %A - Access rights in human readable form
11738 @item %b - Number of blocks allocated (see @samp{%B})
11739 @item %B - The size in bytes of each block reported by @samp{%b}
11740 @item %C - The SELinux security context of a file, if available
11741 @item %d - Device number in decimal
11742 @item %D - Device number in hex
11743 @item %f - Raw mode in hex
11744 @item %F - File type
11745 @item %g - Group ID of owner
11746 @item %G - Group name of owner
11747 @item %h - Number of hard links
11748 @item %i - Inode number
11749 @item %m - Mount point (See note below)
11750 @item %n - File name
11751 @item %N - Quoted file name with dereference if symbolic link
11752 @item %o - Optimal I/O transfer size hint
11753 @item %s - Total size, in bytes
11754 @item %t - Major device type in hex (see below)
11755 @item %T - Minor device type in hex (see below)
11756 @item %u - User ID of owner
11757 @item %U - User name of owner
11758 @item %w - Time of file birth, or @samp{-} if unknown
11759 @item %W - Time of file birth as seconds since Epoch, or @samp{0}
11760 @item %x - Time of last access
11761 @item %X - Time of last access as seconds since Epoch
11762 @item %y - Time of last modification
11763 @item %Y - Time of last modification as seconds since Epoch
11764 @item %z - Time of last change
11765 @item %Z - Time of last change as seconds since Epoch
11768 The @samp{%t} and @samp{%T} formats operate on the st_rdev member of
11769 the stat(2) structure, and are only defined for character and block
11770 special files. On some systems or file types, st_rdev may be used to
11771 represent other quantities.
11773 The @samp{%W}, @samp{%X}, @samp{%Y}, and @samp{%Z} formats accept a
11774 precision preceded by a period to specify the number of digits to
11775 print after the decimal point. For example, @samp{%.3X} outputs the
11776 last access time to millisecond precision. If a period is given but no
11777 precision, @command{stat} uses 9 digits, so @samp{%.X} is equivalent to
11778 @samp{%.9X}@. When discarding excess precision, time stamps are truncated
11779 toward minus infinity.
11783 $ stat -c '[%015Y]' /usr
11786 $ stat -c '[%15Y]' /usr
11788 $ stat -c '[%-15Y]' /usr
11791 $ stat -c '[%.3Y]' /usr
11793 $ stat -c '[%.Y]' /usr
11794 [1288929712.114951834]
11797 The mount point printed by @samp{%m} is similar to that output
11798 by @command{df}, except that:
11801 stat does not dereference symlinks by default
11802 (unless @option{-L} is specified)
11804 stat does not search for specified device nodes in the
11805 file system list, instead operating on them directly
11808 stat outputs the alias for a bind mounted file, rather than
11809 the initial mount point of its backing device.
11810 One can recursively call stat until there is no change in output,
11811 to get the current base mount point
11814 When listing file system information (@option{--file-system} (@option{-f})),
11815 you must use a different set of @var{format} directives:
11818 @item %a - Free blocks available to non-super-user
11819 @item %b - Total data blocks in file system
11820 @item %c - Total file nodes in file system
11821 @item %d - Free file nodes in file system
11822 @item %f - Free blocks in file system
11823 @item %i - File System ID in hex
11824 @item %l - Maximum length of file names
11825 @item %n - File name
11826 @item %s - Block size (for faster transfers)
11827 @item %S - Fundamental block size (for block counts)
11828 @item %t - Type in hex
11829 @item %T - Type in human readable form
11833 Time stamps are listed according to the time zone rules specified by
11834 the @env{TZ} environment variable, or by the system default rules if
11835 @env{TZ} is not set. @xref{TZ Variable,, Specifying the Time Zone
11836 with @env{TZ}, libc, The GNU C Library Reference Manual}.
11841 @node sync invocation
11842 @section @command{sync}: Synchronize data on disk with memory
11845 @cindex synchronize disk and memory
11847 @cindex superblock, writing
11848 @cindex inodes, written buffered
11849 @command{sync} writes any data buffered in memory out to disk. This can
11850 include (but is not limited to) modified superblocks, modified inodes,
11851 and delayed reads and writes. This must be implemented by the kernel;
11852 The @command{sync} program does nothing but exercise the @code{sync} system
11855 @cindex crashes and corruption
11856 The kernel keeps data in memory to avoid doing (relatively slow) disk
11857 reads and writes. This improves performance, but if the computer
11858 crashes, data may be lost or the file system corrupted as a
11859 result. The @command{sync} command ensures everything in memory
11860 is written to disk.
11862 Any arguments are ignored, except for a lone @option{--help} or
11863 @option{--version} (@pxref{Common options}).
11868 @node truncate invocation
11869 @section @command{truncate}: Shrink or extend the size of a file
11872 @cindex truncating, file sizes
11874 @command{truncate} shrinks or extends the size of each @var{file} to the
11875 specified size. Synopsis:
11878 truncate @var{option}@dots{} @var{file}@dots{}
11881 @cindex files, creating
11882 Any @var{file} that does not exist is created.
11884 @cindex sparse files, creating
11885 @cindex holes, creating files with
11886 If a @var{file} is larger than the specified size, the extra data is lost.
11887 If a @var{file} is shorter, it is extended and the extended part (or hole)
11888 reads as zero bytes.
11890 The program accepts the following options. Also see @ref{Common options}.
11897 @opindex --no-create
11898 Do not create files that do not exist.
11903 @opindex --io-blocks
11904 Treat @var{size} as number of I/O blocks of the @var{file} rather than bytes.
11906 @item -r @var{rfile}
11907 @itemx --reference=@var{rfile}
11909 @opindex --reference
11910 Base the size of each @var{file} on the size of @var{rfile}.
11912 @item -s @var{size}
11913 @itemx --size=@var{size}
11916 Set or adjust the size of each @var{file} according to @var{size}.
11917 @var{size} is in bytes unless @option{--io-blocks} is specified.
11918 @multiplierSuffixesNoBlocks{size}
11920 @var{size} may also be prefixed by one of the following to adjust
11921 the size of each @var{file} based on their current size:
11923 @samp{+} => extend by
11924 @samp{-} => reduce by
11925 @samp{<} => at most
11926 @samp{>} => at least
11927 @samp{/} => round down to multiple of
11928 @samp{%} => round up to multiple of
11936 @node Printing text
11937 @chapter Printing text
11939 @cindex printing text, commands for
11940 @cindex commands for printing text
11942 This section describes commands that display text strings.
11945 * echo invocation:: Print a line of text.
11946 * printf invocation:: Format and print data.
11947 * yes invocation:: Print a string until interrupted.
11951 @node echo invocation
11952 @section @command{echo}: Print a line of text
11955 @cindex displaying text
11956 @cindex printing text
11957 @cindex text, displaying
11958 @cindex arbitrary text, displaying
11960 @command{echo} writes each given @var{string} to standard output, with a
11961 space between each and a newline after the last one. Synopsis:
11964 echo [@var{option}]@dots{} [@var{string}]@dots{}
11967 @mayConflictWithShellBuiltIn{echo}
11969 The program accepts the following options. Also see @ref{Common options}.
11970 Options must precede operands, and the normally-special argument
11971 @samp{--} has no special meaning and is treated like any other
11977 Do not output the trailing newline.
11981 @cindex backslash escapes
11982 Enable interpretation of the following backslash-escaped characters in
11991 produce no further output
12007 the eight-bit value that is the octal number @var{nnn}
12008 (zero to three octal digits), if @var{nnn} is
12009 a nine-bit value, the ninth bit is ignored
12011 the eight-bit value that is the octal number @var{nnn}
12012 (one to three octal digits), if @var{nnn} is
12013 a nine-bit value, the ninth bit is ignored
12015 the eight-bit value that is the hexadecimal number @var{hh}
12016 (one or two hexadecimal digits)
12021 @cindex backslash escapes
12022 Disable interpretation of backslash escapes in each @var{string}.
12023 This is the default. If @option{-e} and @option{-E} are both
12024 specified, the last one given takes effect.
12028 @vindex POSIXLY_CORRECT
12029 If the @env{POSIXLY_CORRECT} environment variable is set, then when
12030 @command{echo}'s first argument is not @option{-n} it outputs
12031 option-like arguments instead of treating them as options. For
12032 example, @code{echo -ne hello} outputs @samp{-ne hello} instead of
12033 plain @samp{hello}.
12035 POSIX does not require support for any options, and says
12036 that the behavior of @command{echo} is implementation-defined if any
12037 @var{string} contains a backslash or if the first argument is
12038 @option{-n}. Portable programs can use the @command{printf} command
12039 if they need to omit trailing newlines or output control characters or
12040 backslashes. @xref{printf invocation}.
12045 @node printf invocation
12046 @section @command{printf}: Format and print data
12049 @command{printf} does formatted printing of text. Synopsis:
12052 printf @var{format} [@var{argument}]@dots{}
12055 @command{printf} prints the @var{format} string, interpreting @samp{%}
12056 directives and @samp{\} escapes to format numeric and string arguments
12057 in a way that is mostly similar to the C @samp{printf} function.
12058 @xref{Output Conversion Syntax,, @command{printf} format directives,
12059 libc, The GNU C Library Reference Manual}, for details.
12060 The differences are listed below.
12062 @mayConflictWithShellBuiltIn{printf}
12067 The @var{format} argument is reused as necessary to convert all the
12068 given @var{argument}s. For example, the command @samp{printf %s a b}
12072 Missing @var{argument}s are treated as null strings or as zeros,
12073 depending on whether the context expects a string or a number. For
12074 example, the command @samp{printf %sx%d} prints @samp{x0}.
12078 An additional escape, @samp{\c}, causes @command{printf} to produce no
12079 further output. For example, the command @samp{printf 'A%sC\cD%sF' B
12080 E} prints @samp{ABC}.
12083 The hexadecimal escape sequence @samp{\x@var{hh}} has at most two
12084 digits, as opposed to C where it can have an unlimited number of
12085 digits. For example, the command @samp{printf '\x07e'} prints two
12086 bytes, whereas the C statement @samp{printf ("\x07e")} prints just
12091 @command{printf} has an additional directive, @samp{%b}, which prints its
12092 argument string with @samp{\} escapes interpreted in the same way as in
12093 the @var{format} string, except that octal escapes are of the form
12094 @samp{\0@var{ooo}} where @var{ooo} is 0 to 3 octal digits. If
12095 @samp{\@var{ooo}} is nine-bit value, ignore the ninth bit.
12096 If a precision is also given, it limits the number of bytes printed
12097 from the converted string.
12100 Numeric arguments must be single C constants, possibly with leading
12101 @samp{+} or @samp{-}. For example, @samp{printf %.4d -3} outputs
12105 @vindex POSIXLY_CORRECT
12106 If the leading character of a numeric argument is @samp{"} or @samp{'}
12107 then its value is the numeric value of the immediately following
12108 character. Any remaining characters are silently ignored if the
12109 @env{POSIXLY_CORRECT} environment variable is set; otherwise, a
12110 warning is printed. For example, @samp{printf "%d" "'a"} outputs
12111 @samp{97} on hosts that use the ASCII character set, since
12112 @samp{a} has the numeric value 97 in ASCII.
12117 A floating-point argument must use a period before any fractional
12118 digits, but is printed according to the @env{LC_NUMERIC} category of the
12119 current locale. For example, in a locale whose radix character is a
12120 comma, the command @samp{printf %g 3.14} outputs @samp{3,14} whereas
12121 the command @samp{printf %g 3,14} is an error.
12122 @xref{Floating point}.
12126 @command{printf} interprets @samp{\@var{ooo}} in @var{format} as an octal number
12127 (if @var{ooo} is 1 to 3 octal digits) specifying a byte to print,
12128 and @samp{\x@var{hh}} as a hexadecimal number (if @var{hh} is 1 to 2 hex
12129 digits) specifying a character to print.
12130 Note however that when @samp{\@var{ooo}} specifies a number larger than 255,
12131 @command{printf} ignores the ninth bit.
12132 For example, @samp{printf '\400'} is equivalent to @samp{printf '\0'}.
12137 @cindex ISO/IEC 10646
12139 @command{printf} interprets two character syntaxes introduced in
12141 @samp{\u} for 16-bit Unicode (ISO/IEC 10646)
12142 characters, specified as
12143 four hexadecimal digits @var{hhhh}, and @samp{\U} for 32-bit Unicode
12144 characters, specified as eight hexadecimal digits @var{hhhhhhhh}.
12145 @command{printf} outputs the Unicode characters
12146 according to the @env{LC_CTYPE} locale. Unicode characters in the ranges
12147 U+0000@dots{}U+009F, U+D800@dots{}U+DFFF cannot be specified by this syntax,
12148 except for U+0024 ($), U+0040 (@@), and U+0060 (@`).
12150 The processing of @samp{\u} and @samp{\U} requires a full-featured
12151 @code{iconv} facility. It is activated on systems with glibc 2.2 (or newer),
12152 or when @code{libiconv} is installed prior to this package. Otherwise
12153 @samp{\u} and @samp{\U} will print as-is.
12155 The only options are a lone @option{--help} or
12156 @option{--version}. @xref{Common options}.
12157 Options must precede operands.
12159 The Unicode character syntaxes are useful for writing strings in a locale
12160 independent way. For example, a string containing the Euro currency symbol
12163 $ env printf '\u20AC 14.95'
12167 will be output correctly in all locales supporting the Euro symbol
12168 (ISO-8859-15, UTF-8, and others). Similarly, a Chinese string
12171 $ env printf '\u4e2d\u6587'
12175 will be output correctly in all Chinese locales (GB2312, BIG5, UTF-8, etc).
12177 Note that in these examples, the @command{printf} command has been
12178 invoked via @command{env} to ensure that we run the program found via
12179 your shell's search path, and not a shell alias or a built-in function.
12181 For larger strings, you don't need to look up the hexadecimal code
12182 values of each character one by one. ASCII characters mixed with \u
12183 escape sequences is also known as the JAVA source file encoding. You can
12184 use GNU recode 3.5c (or newer) to convert strings to this encoding. Here
12185 is how to convert a piece of text into a shell script which will output
12186 this text in a locale-independent way:
12189 $ LC_CTYPE=zh_CN.big5 /usr/local/bin/printf \
12190 '\u4e2d\u6587\n' > sample.txt
12191 $ recode BIG5..JAVA < sample.txt \
12192 | sed -e "s|^|/usr/local/bin/printf '|" -e "s|$|\\\\n'|" \
12199 @node yes invocation
12200 @section @command{yes}: Print a string until interrupted
12203 @cindex repeated output of a string
12205 @command{yes} prints the command line arguments, separated by spaces and
12206 followed by a newline, forever until it is killed. If no arguments are
12207 given, it prints @samp{y} followed by a newline forever until killed.
12209 Upon a write error, @command{yes} exits with status @samp{1}.
12211 The only options are a lone @option{--help} or @option{--version}.
12212 To output an argument that begins with
12213 @samp{-}, precede it with @option{--}, e.g., @samp{yes -- --help}.
12214 @xref{Common options}.
12218 @chapter Conditions
12221 @cindex commands for exit status
12222 @cindex exit status commands
12224 This section describes commands that are primarily useful for their exit
12225 status, rather than their output. Thus, they are often used as the
12226 condition of shell @code{if} statements, or as the last command in a
12230 * false invocation:: Do nothing, unsuccessfully.
12231 * true invocation:: Do nothing, successfully.
12232 * test invocation:: Check file types and compare values.
12233 * expr invocation:: Evaluate expressions.
12237 @node false invocation
12238 @section @command{false}: Do nothing, unsuccessfully
12241 @cindex do nothing, unsuccessfully
12242 @cindex failure exit status
12243 @cindex exit status of @command{false}
12245 @command{false} does nothing except return an exit status of 1, meaning
12246 @dfn{failure}. It can be used as a place holder in shell scripts
12247 where an unsuccessful command is needed.
12248 In most modern shells, @command{false} is a built-in command, so when
12249 you use @samp{false} in a script, you're probably using the built-in
12250 command, not the one documented here.
12252 @command{false} honors the @option{--help} and @option{--version} options.
12254 This version of @command{false} is implemented as a C program, and is thus
12255 more secure and faster than a shell script implementation, and may safely
12256 be used as a dummy shell for the purpose of disabling accounts.
12258 Note that @command{false} (unlike all other programs documented herein)
12259 exits unsuccessfully, even when invoked with
12260 @option{--help} or @option{--version}.
12262 Portable programs should not assume that the exit status of
12263 @command{false} is 1, as it is greater than 1 on some
12267 @node true invocation
12268 @section @command{true}: Do nothing, successfully
12271 @cindex do nothing, successfully
12273 @cindex successful exit
12274 @cindex exit status of @command{true}
12276 @command{true} does nothing except return an exit status of 0, meaning
12277 @dfn{success}. It can be used as a place holder in shell scripts
12278 where a successful command is needed, although the shell built-in
12279 command @code{:} (colon) may do the same thing faster.
12280 In most modern shells, @command{true} is a built-in command, so when
12281 you use @samp{true} in a script, you're probably using the built-in
12282 command, not the one documented here.
12284 @command{true} honors the @option{--help} and @option{--version} options.
12286 Note, however, that it is possible to cause @command{true}
12287 to exit with nonzero status: with the @option{--help} or @option{--version}
12288 option, and with standard
12289 output already closed or redirected to a file that evokes an I/O error.
12290 For example, using a Bourne-compatible shell:
12293 $ ./true --version >&-
12294 ./true: write error: Bad file number
12295 $ ./true --version > /dev/full
12296 ./true: write error: No space left on device
12299 This version of @command{true} is implemented as a C program, and is thus
12300 more secure and faster than a shell script implementation, and may safely
12301 be used as a dummy shell for the purpose of disabling accounts.
12303 @node test invocation
12304 @section @command{test}: Check file types and compare values
12307 @cindex check file types
12308 @cindex compare values
12309 @cindex expression evaluation
12311 @command{test} returns a status of 0 (true) or 1 (false) depending on the
12312 evaluation of the conditional expression @var{expr}. Each part of the
12313 expression must be a separate argument.
12315 @command{test} has file status checks, string operators, and numeric
12316 comparison operators.
12318 @command{test} has an alternate form that uses opening and closing
12319 square brackets instead a leading @samp{test}. For example, instead
12320 of @samp{test -d /}, you can write @samp{[ -d / ]}. The square
12321 brackets must be separate arguments; for example, @samp{[-d /]} does
12322 not have the desired effect. Since @samp{test @var{expr}} and @samp{[
12323 @var{expr} ]} have the same meaning, only the former form is discussed
12329 test @var{expression}
12331 [ @var{expression} ]
12336 @mayConflictWithShellBuiltIn{test}
12338 If @var{expression} is omitted, @command{test} returns false.
12339 If @var{expression} is a single argument,
12340 @command{test} returns false if the argument is null and true
12341 otherwise. The argument
12342 can be any string, including strings like @samp{-d}, @samp{-1},
12343 @samp{--}, @samp{--help}, and @samp{--version} that most other
12344 programs would treat as options. To get help and version information,
12345 invoke the commands @samp{[ --help} and @samp{[ --version}, without
12346 the usual closing brackets. @xref{Common options}.
12348 @cindex exit status of @command{test}
12352 0 if the expression is true,
12353 1 if the expression is false,
12354 2 if an error occurred.
12358 * File type tests:: -[bcdfhLpSt]
12359 * Access permission tests:: -[gkruwxOG]
12360 * File characteristic tests:: -e -s -nt -ot -ef
12361 * String tests:: -z -n = == !=
12362 * Numeric tests:: -eq -ne -lt -le -gt -ge
12363 * Connectives for test:: ! -a -o
12367 @node File type tests
12368 @subsection File type tests
12370 @cindex file type tests
12372 These options test for particular types of files. (Everything's a file,
12373 but not all files are the same!)
12377 @item -b @var{file}
12379 @cindex block special check
12380 True if @var{file} exists and is a block special device.
12382 @item -c @var{file}
12384 @cindex character special check
12385 True if @var{file} exists and is a character special device.
12387 @item -d @var{file}
12389 @cindex directory check
12390 True if @var{file} exists and is a directory.
12392 @item -f @var{file}
12394 @cindex regular file check
12395 True if @var{file} exists and is a regular file.
12397 @item -h @var{file}
12398 @itemx -L @var{file}
12401 @cindex symbolic link check
12402 True if @var{file} exists and is a symbolic link.
12403 Unlike all other file-related tests, this test does not dereference
12404 @var{file} if it is a symbolic link.
12406 @item -p @var{file}
12408 @cindex named pipe check
12409 True if @var{file} exists and is a named pipe.
12411 @item -S @var{file}
12413 @cindex socket check
12414 True if @var{file} exists and is a socket.
12418 @cindex terminal check
12419 True if @var{fd} is a file descriptor that is associated with a
12425 @node Access permission tests
12426 @subsection Access permission tests
12428 @cindex access permission tests
12429 @cindex permission tests
12431 These options test for particular access permissions.
12435 @item -g @var{file}
12437 @cindex set-group-ID check
12438 True if @var{file} exists and has its set-group-ID bit set.
12440 @item -k @var{file}
12442 @cindex sticky bit check
12443 True if @var{file} exists and has its @dfn{sticky} bit set.
12445 @item -r @var{file}
12447 @cindex readable file check
12448 True if @var{file} exists and read permission is granted.
12450 @item -u @var{file}
12452 @cindex set-user-ID check
12453 True if @var{file} exists and has its set-user-ID bit set.
12455 @item -w @var{file}
12457 @cindex writable file check
12458 True if @var{file} exists and write permission is granted.
12460 @item -x @var{file}
12462 @cindex executable file check
12463 True if @var{file} exists and execute permission is granted
12464 (or search permission, if it is a directory).
12466 @item -O @var{file}
12468 @cindex owned by effective user ID check
12469 True if @var{file} exists and is owned by the current effective user ID.
12471 @item -G @var{file}
12473 @cindex owned by effective group ID check
12474 True if @var{file} exists and is owned by the current effective group ID.
12478 @node File characteristic tests
12479 @subsection File characteristic tests
12481 @cindex file characteristic tests
12483 These options test other file characteristics.
12487 @item -e @var{file}
12489 @cindex existence-of-file check
12490 True if @var{file} exists.
12492 @item -s @var{file}
12494 @cindex nonempty file check
12495 True if @var{file} exists and has a size greater than zero.
12497 @item @var{file1} -nt @var{file2}
12499 @cindex newer-than file check
12500 True if @var{file1} is newer (according to modification date) than
12501 @var{file2}, or if @var{file1} exists and @var{file2} does not.
12503 @item @var{file1} -ot @var{file2}
12505 @cindex older-than file check
12506 True if @var{file1} is older (according to modification date) than
12507 @var{file2}, or if @var{file2} exists and @var{file1} does not.
12509 @item @var{file1} -ef @var{file2}
12511 @cindex same file check
12512 @cindex hard link check
12513 True if @var{file1} and @var{file2} have the same device and inode
12514 numbers, i.e., if they are hard links to each other.
12520 @subsection String tests
12522 @cindex string tests
12524 These options test string characteristics. You may need to quote
12525 @var{string} arguments for the shell. For example:
12531 The quotes here prevent the wrong arguments from being passed to
12532 @command{test} if @samp{$V} is empty or contains special characters.
12536 @item -z @var{string}
12538 @cindex zero-length string check
12539 True if the length of @var{string} is zero.
12541 @item -n @var{string}
12542 @itemx @var{string}
12544 @cindex nonzero-length string check
12545 True if the length of @var{string} is nonzero.
12547 @item @var{string1} = @var{string2}
12549 @cindex equal string check
12550 True if the strings are equal.
12552 @item @var{string1} == @var{string2}
12554 @cindex equal string check
12555 True if the strings are equal (synonym for =).
12557 @item @var{string1} != @var{string2}
12559 @cindex not-equal string check
12560 True if the strings are not equal.
12565 @node Numeric tests
12566 @subsection Numeric tests
12568 @cindex numeric tests
12569 @cindex arithmetic tests
12571 Numeric relational operators. The arguments must be entirely numeric
12572 (possibly negative), or the special expression @w{@code{-l @var{string}}},
12573 which evaluates to the length of @var{string}.
12577 @item @var{arg1} -eq @var{arg2}
12578 @itemx @var{arg1} -ne @var{arg2}
12579 @itemx @var{arg1} -lt @var{arg2}
12580 @itemx @var{arg1} -le @var{arg2}
12581 @itemx @var{arg1} -gt @var{arg2}
12582 @itemx @var{arg1} -ge @var{arg2}
12589 These arithmetic binary operators return true if @var{arg1} is equal,
12590 not-equal, less-than, less-than-or-equal, greater-than, or
12591 greater-than-or-equal than @var{arg2}, respectively.
12598 test -1 -gt -2 && echo yes
12600 test -l abc -gt 1 && echo yes
12603 @error{} test: integer expression expected before -eq
12607 @node Connectives for test
12608 @subsection Connectives for @command{test}
12610 @cindex logical connectives
12611 @cindex connectives, logical
12613 The usual logical connectives.
12619 True if @var{expr} is false.
12621 @item @var{expr1} -a @var{expr2}
12623 @cindex logical and operator
12624 @cindex and operator
12625 True if both @var{expr1} and @var{expr2} are true.
12627 @item @var{expr1} -o @var{expr2}
12629 @cindex logical or operator
12630 @cindex or operator
12631 True if either @var{expr1} or @var{expr2} is true.
12636 @node expr invocation
12637 @section @command{expr}: Evaluate expressions
12640 @cindex expression evaluation
12641 @cindex evaluation of expressions
12643 @command{expr} evaluates an expression and writes the result on standard
12644 output. Each token of the expression must be a separate argument.
12646 Operands are either integers or strings. Integers consist of one or
12647 more decimal digits, with an optional leading @samp{-}.
12648 @command{expr} converts
12649 anything appearing in an operand position to an integer or a string
12650 depending on the operation being applied to it.
12652 Strings are not quoted for @command{expr} itself, though you may need to
12653 quote them to protect characters with special meaning to the shell,
12654 e.g., spaces. However, regardless of whether it is quoted, a string
12655 operand should not be a parenthesis or any of @command{expr}'s
12656 operators like @code{+}, so you cannot safely pass an arbitrary string
12657 @code{$str} to expr merely by quoting it to the shell. One way to
12658 work around this is to use the GNU extension @code{+},
12659 (e.g., @code{+ "$str" = foo}); a more portable way is to use
12660 @code{@w{" $str"}} and to adjust the rest of the expression to take
12661 the leading space into account (e.g., @code{@w{" $str" = " foo"}}).
12663 You should not pass a negative integer or a string with leading
12664 @samp{-} as @command{expr}'s first argument, as it might be
12665 misinterpreted as an option; this can be avoided by parenthesization.
12666 Also, portable scripts should not use a string operand that happens to
12667 take the form of an integer; this can be worked around by inserting
12668 leading spaces as mentioned above.
12670 @cindex parentheses for grouping
12671 Operators may be given as infix symbols or prefix keywords. Parentheses
12672 may be used for grouping in the usual manner. You must quote
12673 parentheses and many operators to avoid the shell evaluating them,
12676 When built with support for the GNU MP library, @command{expr} uses
12677 arbitrary-precision arithmetic; otherwise, it uses native arithmetic
12678 types and may fail due to arithmetic overflow.
12680 The only options are @option{--help} and @option{--version}. @xref{Common
12681 options}. Options must precede operands.
12683 @cindex exit status of @command{expr}
12687 0 if the expression is neither null nor 0,
12688 1 if the expression is null or 0,
12689 2 if the expression is invalid,
12690 3 if an internal error occurred (e.g., arithmetic overflow).
12694 * String expressions:: + : match substr index length
12695 * Numeric expressions:: + - * / %
12696 * Relations for expr:: | & < <= = == != >= >
12697 * Examples of expr:: Examples.
12701 @node String expressions
12702 @subsection String expressions
12704 @cindex string expressions
12705 @cindex expressions, string
12707 @command{expr} supports pattern matching and other string operators. These
12708 have higher precedence than both the numeric and relational operators (in
12709 the next sections).
12713 @item @var{string} : @var{regex}
12714 @cindex pattern matching
12715 @cindex regular expression matching
12716 @cindex matching patterns
12717 Perform pattern matching. The arguments are converted to strings and the
12718 second is considered to be a (basic, a la GNU @code{grep}) regular
12719 expression, with a @code{^} implicitly prepended. The first argument is
12720 then matched against this regular expression.
12722 If the match succeeds and @var{regex} uses @samp{\(} and @samp{\)}, the
12723 @code{:} expression returns the part of @var{string} that matched the
12724 subexpression; otherwise, it returns the number of characters matched.
12726 If the match fails, the @code{:} operator returns the null string if
12727 @samp{\(} and @samp{\)} are used in @var{regex}, otherwise 0.
12729 @kindex \( @r{regexp operator}
12730 Only the first @samp{\( @dots{} \)} pair is relevant to the return
12731 value; additional pairs are meaningful only for grouping the regular
12732 expression operators.
12734 @kindex \+ @r{regexp operator}
12735 @kindex \? @r{regexp operator}
12736 @kindex \| @r{regexp operator}
12737 In the regular expression, @code{\+}, @code{\?}, and @code{\|} are
12738 operators which respectively match one or more, zero or one, or separate
12739 alternatives. SunOS and other @command{expr}'s treat these as regular
12740 characters. (POSIX allows either behavior.)
12741 @xref{Top, , Regular Expression Library, regex, Regex}, for details of
12742 regular expression syntax. Some examples are in @ref{Examples of expr}.
12744 @item match @var{string} @var{regex}
12746 An alternative way to do pattern matching. This is the same as
12747 @w{@samp{@var{string} : @var{regex}}}.
12749 @item substr @var{string} @var{position} @var{length}
12751 Returns the substring of @var{string} beginning at @var{position}
12752 with length at most @var{length}. If either @var{position} or
12753 @var{length} is negative, zero, or non-numeric, returns the null string.
12755 @item index @var{string} @var{charset}
12757 Returns the first position in @var{string} where the first character in
12758 @var{charset} was found. If no character in @var{charset} is found in
12759 @var{string}, return 0.
12761 @item length @var{string}
12763 Returns the length of @var{string}.
12765 @item + @var{token}
12767 Interpret @var{token} as a string, even if it is a keyword like @var{match}
12768 or an operator like @code{/}.
12769 This makes it possible to test @code{expr length + "$x"} or
12770 @code{expr + "$x" : '.*/\(.\)'} and have it do the right thing even if
12771 the value of @var{$x} happens to be (for example) @code{/} or @code{index}.
12772 This operator is a GNU extension. Portable shell scripts should use
12773 @code{@w{" $token"} : @w{' \(.*\)'}} instead of @code{+ "$token"}.
12777 To make @command{expr} interpret keywords as strings, you must use the
12778 @code{quote} operator.
12781 @node Numeric expressions
12782 @subsection Numeric expressions
12784 @cindex numeric expressions
12785 @cindex expressions, numeric
12787 @command{expr} supports the usual numeric operators, in order of increasing
12788 precedence. These numeric operators have lower precedence than the
12789 string operators described in the previous section, and higher precedence
12790 than the connectives (next section).
12798 @cindex subtraction
12799 Addition and subtraction. Both arguments are converted to integers;
12800 an error occurs if this cannot be done.
12806 @cindex multiplication
12809 Multiplication, division, remainder. Both arguments are converted to
12810 integers; an error occurs if this cannot be done.
12815 @node Relations for expr
12816 @subsection Relations for @command{expr}
12818 @cindex connectives, logical
12819 @cindex logical connectives
12820 @cindex relations, numeric or string
12822 @command{expr} supports the usual logical connectives and relations. These
12823 have lower precedence than the string and numeric operators
12824 (previous sections). Here is the list, lowest-precedence operator first.
12830 @cindex logical or operator
12831 @cindex or operator
12832 Returns its first argument if that is neither null nor zero, otherwise
12833 its second argument if it is neither null nor zero, otherwise 0. It
12834 does not evaluate its second argument if its first argument is neither
12839 @cindex logical and operator
12840 @cindex and operator
12841 Return its first argument if neither argument is null or zero, otherwise
12842 0. It does not evaluate its second argument if its first argument is
12845 @item < <= = == != >= >
12852 @cindex comparison operators
12854 Compare the arguments and return 1 if the relation is true, 0 otherwise.
12855 @code{==} is a synonym for @code{=}. @command{expr} first tries to convert
12856 both arguments to integers and do a numeric comparison; if either
12857 conversion fails, it does a lexicographic comparison using the character
12858 collating sequence specified by the @env{LC_COLLATE} locale.
12863 @node Examples of expr
12864 @subsection Examples of using @command{expr}
12866 @cindex examples of @command{expr}
12867 Here are a few examples, including quoting for shell metacharacters.
12869 To add 1 to the shell variable @code{foo}, in Bourne-compatible shells:
12872 foo=$(expr $foo + 1)
12875 To print the non-directory part of the file name stored in
12876 @code{$fname}, which need not contain a @code{/}:
12879 expr $fname : '.*/\(.*\)' '|' $fname
12882 An example showing that @code{\+} is an operator:
12890 expr abc : 'a\(.\)c'
12892 expr index abcdef cz
12895 @error{} expr: syntax error
12896 expr index + index a
12902 @chapter Redirection
12904 @cindex redirection
12905 @cindex commands for redirection
12907 Unix shells commonly provide several forms of @dfn{redirection}---ways
12908 to change the input source or output destination of a command. But one
12909 useful redirection is performed by a separate command, not by the shell;
12910 it's described here.
12913 * tee invocation:: Redirect output to multiple files or processes.
12917 @node tee invocation
12918 @section @command{tee}: Redirect output to multiple files or processes
12921 @cindex pipe fitting
12922 @cindex destinations, multiple output
12923 @cindex read from stdin and write to stdout and files
12925 The @command{tee} command copies standard input to standard output and also
12926 to any files given as arguments. This is useful when you want not only
12927 to send some data down a pipe, but also to save a copy. Synopsis:
12930 tee [@var{option}]@dots{} [@var{file}]@dots{}
12933 If a file being written to does not already exist, it is created. If a
12934 file being written to already exists, the data it previously contained
12935 is overwritten unless the @option{-a} option is used.
12937 A @var{file} of @samp{-} causes @command{tee} to send another copy of
12938 input to standard output, but this is typically not that useful as the
12939 copies are interleaved.
12941 The program accepts the following options. Also see @ref{Common options}.
12948 Append standard input to the given files rather than overwriting
12952 @itemx --ignore-interrupts
12954 @opindex --ignore-interrupts
12955 Ignore interrupt signals.
12959 The @command{tee} command is useful when you happen to be transferring a large
12960 amount of data and also want to summarize that data without reading
12961 it a second time. For example, when you are downloading a DVD image,
12962 you often want to verify its signature or checksum right away.
12963 The inefficient way to do it is simply:
12966 wget http://example.com/some.iso && sha1sum some.iso
12969 One problem with the above is that it makes you wait for the
12970 download to complete before starting the time-consuming SHA1 computation.
12971 Perhaps even more importantly, the above requires reading
12972 the DVD image a second time (the first was from the network).
12974 The efficient way to do it is to interleave the download
12975 and SHA1 computation. Then, you'll get the checksum for
12976 free, because the entire process parallelizes so well:
12979 # slightly contrived, to demonstrate process substitution
12980 wget -O - http://example.com/dvd.iso \
12981 | tee >(sha1sum > dvd.sha1) > dvd.iso
12984 That makes @command{tee} write not just to the expected output file,
12985 but also to a pipe running @command{sha1sum} and saving the final
12986 checksum in a file named @file{dvd.sha1}.
12988 Note, however, that this example relies on a feature of modern shells
12989 called @dfn{process substitution}
12990 (the @samp{>(command)} syntax, above;
12991 @xref{Process Substitution,,Process Substitution, bash,
12992 The Bash Reference Manual}.),
12993 so it works with @command{zsh}, @command{bash}, and @command{ksh},
12994 but not with @command{/bin/sh}. So if you write code like this
12995 in a shell script, be sure to start the script with @samp{#!/bin/bash}.
12997 Since the above example writes to one file and one process,
12998 a more conventional and portable use of @command{tee} is even better:
13001 wget -O - http://example.com/dvd.iso \
13002 | tee dvd.iso | sha1sum > dvd.sha1
13005 You can extend this example to make @command{tee} write to two processes,
13006 computing MD5 and SHA1 checksums in parallel. In this case,
13007 process substitution is required:
13010 wget -O - http://example.com/dvd.iso \
13011 | tee >(sha1sum > dvd.sha1) \
13012 >(md5sum > dvd.md5) \
13016 This technique is also useful when you want to make a @emph{compressed}
13017 copy of the contents of a pipe.
13018 Consider a tool to graphically summarize disk usage data from @samp{du -ak}.
13019 For a large hierarchy, @samp{du -ak} can run for a long time,
13020 and can easily produce terabytes of data, so you won't want to
13021 rerun the command unnecessarily. Nor will you want to save
13022 the uncompressed output.
13024 Doing it the inefficient way, you can't even start the GUI
13025 until after you've compressed all of the @command{du} output:
13028 du -ak | gzip -9 > /tmp/du.gz
13029 gzip -d /tmp/du.gz | xdiskusage -a
13032 With @command{tee} and process substitution, you start the GUI
13033 right away and eliminate the decompression completely:
13036 du -ak | tee >(gzip -9 > /tmp/du.gz) | xdiskusage -a
13039 Finally, if you regularly create more than one type of
13040 compressed tarball at once, for example when @code{make dist} creates
13041 both @command{gzip}-compressed and @command{bzip2}-compressed tarballs,
13042 there may be a better way.
13043 Typical @command{automake}-generated @file{Makefile} rules create
13044 the two compressed tar archives with commands in sequence, like this
13045 (slightly simplified):
13048 tardir=your-pkg-M.N
13049 tar chof - "$tardir" | gzip -9 -c > your-pkg-M.N.tar.gz
13050 tar chof - "$tardir" | bzip2 -9 -c > your-pkg-M.N.tar.bz2
13053 However, if the hierarchy you are archiving and compressing is larger
13054 than a couple megabytes, and especially if you are using a multi-processor
13055 system with plenty of memory, then you can do much better by reading the
13056 directory contents only once and running the compression programs in parallel:
13059 tardir=your-pkg-M.N
13060 tar chof - "$tardir" \
13061 | tee >(gzip -9 -c > your-pkg-M.N.tar.gz) \
13062 | bzip2 -9 -c > your-pkg-M.N.tar.bz2
13068 @node File name manipulation
13069 @chapter File name manipulation
13071 @cindex file name manipulation
13072 @cindex manipulation of file names
13073 @cindex commands for file name manipulation
13075 This section describes commands that manipulate file names.
13078 * basename invocation:: Strip directory and suffix from a file name.
13079 * dirname invocation:: Strip last file name component.
13080 * pathchk invocation:: Check file name validity and portability.
13081 * mktemp invocation:: Create temporary file or directory.
13082 * realpath invocation:: Print resolved file names.
13086 @node basename invocation
13087 @section @command{basename}: Strip directory and suffix from a file name
13090 @cindex strip directory and suffix from file names
13091 @cindex directory, stripping from file names
13092 @cindex suffix, stripping from file names
13093 @cindex file names, stripping directory and suffix
13094 @cindex leading directory components, stripping
13096 @command{basename} removes any leading directory components from
13097 @var{name}. Synopsis:
13100 basename @var{name} [@var{suffix}]
13101 basename @var{option}@dots{} @var{name}@dots{}
13104 If @var{suffix} is specified and is identical to the end of @var{name},
13105 it is removed from @var{name} as well. Note that since trailing slashes
13106 are removed prior to suffix matching, @var{suffix} will do nothing if it
13107 contains slashes. @command{basename} prints the result on standard
13110 @c This test is used both here and in the section on dirname.
13111 @macro basenameAndDirname
13112 Together, @command{basename} and @command{dirname} are designed such
13113 that if @samp{ls "$name"} succeeds, then the command sequence @samp{cd
13114 "$(dirname "$name")"; ls "$(basename "$name")"} will, too. This works
13115 for everything except file names containing a trailing newline.
13117 @basenameAndDirname
13119 POSIX allows the implementation to define the results if
13120 @var{name} is empty or @samp{//}. In the former case, GNU
13121 @command{basename} returns the empty string. In the latter case, the
13122 result is @samp{//} on platforms where @var{//} is distinct from
13123 @var{/}, and @samp{/} on platforms where there is no difference.
13125 The program accepts the following options. Also see @ref{Common options}.
13126 Options must precede operands.
13133 @opindex --multiple
13134 Support more than one argument. Treat every argument as a @var{name}.
13135 With this, an optional @var{suffix} must be specified using the
13136 @option{-s} option.
13138 @item -s @var{suffix}
13139 @itemx --suffix=@var{suffix}
13142 Remove a trailing @var{suffix}.
13143 This option implies the @option{-a} option.
13149 Separate output items with NUL characters.
13159 basename /usr/bin/sort
13162 basename include/stdio.h .h
13165 basename -s .h include/stdio.h
13167 # Output "stdio" followed by "stdlib"
13168 basename -a -s .h include/stdio.h include/stdlib.h
13172 @node dirname invocation
13173 @section @command{dirname}: Strip last file name component
13176 @cindex directory components, printing
13177 @cindex stripping non-directory suffix
13178 @cindex non-directory suffix, stripping
13180 @command{dirname} prints all but the final slash-delimited component
13181 of each @var{name}. Slashes on either side of the final component are
13182 also removed. If the string contains no slash, @command{dirname}
13183 prints @samp{.} (meaning the current directory). Synopsis:
13186 dirname [@var{option}] @var{name}@dots{}
13189 @var{name} need not be a file name, but if it is, this operation
13190 effectively lists the directory that contains the final component,
13191 including the case when the final component is itself a directory.
13193 @basenameAndDirname
13195 POSIX allows the implementation to define the results if
13196 @var{name} is @samp{//}. With GNU @command{dirname}, the
13197 result is @samp{//} on platforms where @var{//} is distinct from
13198 @var{/}, and @samp{/} on platforms where there is no difference.
13200 The program accepts the following option. Also see @ref{Common options}.
13208 Separate output items with NUL characters.
13217 # Output "/usr/bin".
13218 dirname /usr/bin/sort
13219 dirname /usr/bin//.//
13221 # Output "dir1" followed by "dir2"
13222 dirname dir1/str dir2/str
13229 @node pathchk invocation
13230 @section @command{pathchk}: Check file name validity and portability
13233 @cindex file names, checking validity and portability
13234 @cindex valid file names, checking for
13235 @cindex portable file names, checking for
13237 @command{pathchk} checks validity and portability of file names. Synopsis:
13240 pathchk [@var{option}]@dots{} @var{name}@dots{}
13243 For each @var{name}, @command{pathchk} prints an error message if any of
13244 these conditions is true:
13248 One of the existing directories in @var{name} does not have search
13249 (execute) permission,
13251 The length of @var{name} is larger than the maximum supported by the
13254 The length of one component of @var{name} is longer than
13255 its file system's maximum.
13258 A nonexistent @var{name} is not an error, so long a file with that
13259 name could be created under the above conditions.
13261 The program accepts the following options. Also see @ref{Common options}.
13262 Options must precede operands.
13268 Instead of performing checks based on the underlying file system,
13269 print an error message if any of these conditions is true:
13273 A file name is empty.
13276 A file name contains a character outside the POSIX portable file
13277 name character set, namely, the ASCII letters and digits, @samp{.},
13278 @samp{_}, @samp{-}, and @samp{/}.
13281 The length of a file name or one of its components exceeds the
13282 POSIX minimum limits for portability.
13287 Print an error message if a file name is empty, or if it contains a component
13288 that begins with @samp{-}.
13290 @item --portability
13291 @opindex --portability
13292 Print an error message if a file name is not portable to all POSIX
13293 hosts. This option is equivalent to @samp{-p -P}.
13297 @cindex exit status of @command{pathchk}
13301 0 if all specified file names passed all checks,
13305 @node mktemp invocation
13306 @section @command{mktemp}: Create temporary file or directory
13309 @cindex file names, creating temporary
13310 @cindex directory, creating temporary
13311 @cindex temporary files and directories
13313 @command{mktemp} manages the creation of temporary files and
13314 directories. Synopsis:
13317 mktemp [@var{option}]@dots{} [@var{template}]
13320 Safely create a temporary file or directory based on @var{template},
13321 and print its name. If given, @var{template} must include at least
13322 three consecutive @samp{X}s in the last component. If omitted, the template
13323 @samp{tmp.XXXXXXXXXX} is used, and option @option{--tmpdir} is
13324 implied. The final run of @samp{X}s in the @var{template} will be replaced
13325 by alpha-numeric characters; thus, on a case-sensitive file system,
13326 and with a @var{template} including a run of @var{n} instances of @samp{X},
13327 there are @samp{62**@var{n}} potential file names.
13329 Older scripts used to create temporary files by simply joining the
13330 name of the program with the process id (@samp{$$}) as a suffix.
13331 However, that naming scheme is easily predictable, and suffers from a
13332 race condition where the attacker can create an appropriately named
13333 symbolic link, such that when the script then opens a handle to what
13334 it thought was an unused file, it is instead modifying an existing
13335 file. Using the same scheme to create a directory is slightly safer,
13336 since the @command{mkdir} will fail if the target already exists, but
13337 it is still inferior because it allows for denial of service attacks.
13338 Therefore, modern scripts should use the @command{mktemp} command to
13339 guarantee that the generated name will be unpredictable, and that
13340 knowledge of the temporary file name implies that the file was created
13341 by the current script and cannot be modified by other users.
13343 When creating a file, the resulting file has read and write
13344 permissions for the current user, but no permissions for the group or
13345 others; these permissions are reduced if the current umask is more
13348 Here are some examples (although note that if you repeat them, you
13349 will most likely get different file names):
13354 Create a temporary file in the current directory.
13361 Create a temporary file with a known suffix.
13363 $ mktemp --suffix=.txt file-XXXX
13365 $ mktemp file-XXXX-XXXX.txt
13370 Create a secure fifo relative to the user's choice of @env{TMPDIR},
13371 but falling back to the current directory rather than @file{/tmp}.
13372 Note that @command{mktemp} does not create fifos, but can create a
13373 secure directory in which the fifo can live. Exit the shell if the
13374 directory or fifo could not be created.
13376 $ dir=$(mktemp -p "$@{TMPDIR:-.@}" -d dir-XXXX) || exit 1
13378 $ mkfifo "$fifo" || @{ rmdir "$dir"; exit 1; @}
13382 Create and use a temporary file if possible, but ignore failure. The
13383 file will reside in the directory named by @env{TMPDIR}, if specified,
13384 or else in @file{/tmp}.
13386 $ file=$(mktemp -q) && @{
13387 > # Safe to use $file only within this block. Use quotes,
13388 > # since $TMPDIR, and thus $file, may contain whitespace.
13389 > echo ... > "$file"
13395 Act as a semi-random character generator (it is not fully random,
13396 since it is impacted by the contents of the current directory). To
13397 avoid security holes, do not use the resulting names to create a file.
13407 The program accepts the following options. Also see @ref{Common options}.
13414 @opindex --directory
13415 Create a directory rather than a file. The directory will have read,
13416 write, and search permissions for the current user, but no permissions
13417 for the group or others; these permissions are reduced if the current
13418 umask is more restrictive.
13424 Suppress diagnostics about failure to create a file or directory. The
13425 exit status will still reflect whether a file was created.
13431 Generate a temporary name that does not name an existing file, without
13432 changing the file system contents. Using the output of this command
13433 to create a new file is inherently unsafe, as there is a window of
13434 time between generating the name and using it where another process
13435 can create an object by the same name.
13438 @itemx --tmpdir[=@var{dir}]
13441 Treat @var{template} relative to the directory @var{dir}. If
13442 @var{dir} is not specified (only possible with the long option
13443 @option{--tmpdir}) or is the empty string, use the value of
13444 @env{TMPDIR} if available, otherwise use @samp{/tmp}. If this is
13445 specified, @var{template} must not be absolute. However,
13446 @var{template} can still contain slashes, although intermediate
13447 directories must already exist.
13449 @item --suffix=@var{suffix}
13451 Append @var{suffix} to the @var{template}. @var{suffix} must not
13452 contain slash. If @option{--suffix} is specified, @var{template} must
13453 end in @samp{X}; if it is not specified, then an appropriate
13454 @option{--suffix} is inferred by finding the last @samp{X} in
13455 @var{template}. This option exists for use with the default
13456 @var{template} and for the creation of a @var{suffix} that starts with
13461 Treat @var{template} as a single file relative to the value of
13462 @env{TMPDIR} if available, or to the directory specified by
13463 @option{-p}, otherwise to @samp{/tmp}. @var{template} must not
13464 contain slashes. This option is deprecated; the use of @option{-p}
13465 without @option{-t} offers better defaults (by favoring the command
13466 line over @env{TMPDIR}) and more flexibility (by allowing intermediate
13471 @cindex exit status of @command{mktemp}
13475 0 if the file was created,
13480 @node realpath invocation
13481 @section @command{realpath}: Print the resolved file name.
13484 @cindex file names, canonicalization
13485 @cindex symlinks, resolution
13486 @cindex canonical file name
13487 @cindex canonicalize a file name
13491 @command{realpath} expands all symbolic links and resolves references to
13492 @samp{/./}, @samp{/../} and extra @samp{/} characters. By default,
13493 all but the last component of the specified files must exist. Synopsis:
13496 realpath [@var{option}]@dots{} @var{file}@dots{}
13499 The program accepts the following options. Also see @ref{Common options}.
13504 @itemx --canonicalize-existing
13506 @opindex --canonicalize-existing
13507 Ensure that all components of the specified file names exist.
13508 If any component is missing or unavailable, @command{realpath} will output
13509 a diagnostic unless the @option{-q} option is specified, and exit with a
13510 nonzero exit code. A trailing slash requires that the name resolve to a
13514 @itemx --canonicalize-missing
13516 @opindex --canonicalize-missing
13517 If any component of a specified file name is missing or unavailable,
13518 treat it as a directory.
13524 Symbolic links are resolved in the specified file names,
13525 but they are resolved after any subsequent @samp{..} components are processed.
13530 @opindex --physical
13531 Symbolic links are resolved in the specified file names,
13532 and they are resolved before any subsequent @samp{..} components are processed.
13533 This is the default mode of operation.
13539 Suppress diagnostic messages for specified file names.
13543 @itemx --no-symlinks
13546 @opindex --no-symlinks
13547 Do not resolve symbolic links. Only resolve references to
13548 @samp{/./}, @samp{/../} and remove extra @samp{/} characters.
13549 When combined with the @option{-m} option, realpath operates
13550 only on the file name, and does not touch any actual file.
13556 Separate output items with NUL characters.
13558 @item --relative-to=@var{file}
13559 @opindex --relative-to
13561 Print the resolved file names relative to the specified file.
13562 Note this option honors the @option{-m} and @option{-e} options
13563 pertaining to file existence.
13565 @item --relative-base=@var{base}
13566 @opindex --relative-base
13567 This option is valid when used with @option{--relative-to}, and will restrict
13568 the output of @option{--relative-to} so that relative names are output,
13569 only when @var{file}s are descendants of @var{base}. Otherwise output the
13570 absolute file name. If @option{--relative-to} was not specified, then
13571 the descendants of @var{base} are printed relative to @var{base}. If
13572 @option{--relative-to} is specified, then that directory must be a
13573 descendant of @var{base} for this option to have an effect.
13574 Note: this option honors the @option{-m} and @option{-e}
13575 options pertaining to file existence. For example:
13578 realpath --relative-to=/usr /tmp /usr/bin
13581 realpath --relative-base=/usr /tmp /usr/bin
13588 @cindex exit status of @command{realpath}
13592 0 if all file names were printed without issue.
13597 @node Working context
13598 @chapter Working context
13600 @cindex working context
13601 @cindex commands for printing the working context
13603 This section describes commands that display or alter the context in
13604 which you are working: the current directory, the terminal settings, and
13605 so forth. See also the user-related commands in the next section.
13608 * pwd invocation:: Print working directory.
13609 * stty invocation:: Print or change terminal characteristics.
13610 * printenv invocation:: Print environment variables.
13611 * tty invocation:: Print file name of terminal on standard input.
13615 @node pwd invocation
13616 @section @command{pwd}: Print working directory
13619 @cindex print name of current directory
13620 @cindex current working directory, printing
13621 @cindex working directory, printing
13624 @command{pwd} prints the name of the current directory. Synopsis:
13627 pwd [@var{option}]@dots{}
13630 The program accepts the following options. Also see @ref{Common options}.
13637 If the contents of the environment variable @env{PWD} provide an
13638 absolute name of the current directory with no @samp{.} or @samp{..}
13639 components, but possibly with symbolic links, then output those
13640 contents. Otherwise, fall back to default @option{-P} handling.
13645 @opindex --physical
13646 Print a fully resolved name for the current directory. That is, all
13647 components of the printed name will be actual directory names---none
13648 will be symbolic links.
13651 @cindex symbolic links and @command{pwd}
13652 If @option{-L} and @option{-P} are both given, the last one takes
13653 precedence. If neither option is given, then this implementation uses
13654 @option{-P} as the default unless the @env{POSIXLY_CORRECT}
13655 environment variable is set.
13657 @mayConflictWithShellBuiltIn{pwd}
13662 @node stty invocation
13663 @section @command{stty}: Print or change terminal characteristics
13666 @cindex change or print terminal settings
13667 @cindex terminal settings
13668 @cindex line settings of terminal
13670 @command{stty} prints or changes terminal characteristics, such as baud rate.
13674 stty [@var{option}] [@var{setting}]@dots{}
13675 stty [@var{option}]
13678 If given no line settings, @command{stty} prints the baud rate, line
13679 discipline number (on systems that support it), and line settings
13680 that have been changed from the values set by @samp{stty sane}.
13681 By default, mode reading and setting are performed on the tty line
13682 connected to standard input, although this can be modified by the
13683 @option{--file} option.
13685 @command{stty} accepts many non-option arguments that change aspects of
13686 the terminal line operation, as described below.
13688 The program accepts the following options. Also see @ref{Common options}.
13695 Print all current settings in human-readable form. This option may not
13696 be used in combination with any line settings.
13698 @item -F @var{device}
13699 @itemx --file=@var{device}
13702 Set the line opened by the file name specified in @var{device} instead of
13703 the tty line connected to standard input. This option is necessary
13704 because opening a POSIX tty requires use of the
13705 @code{O_NONDELAY} flag to prevent a POSIX tty from blocking
13706 until the carrier detect line is high if
13707 the @code{clocal} flag is not set. Hence, it is not always possible
13708 to allow the shell to open the device in the traditional manner.
13714 @cindex machine-readable @command{stty} output
13715 Print all current settings in a form that can be used as an argument to
13716 another @command{stty} command to restore the current settings. This option
13717 may not be used in combination with any line settings.
13721 Many settings can be turned off by preceding them with a @samp{-}.
13722 Such arguments are marked below with ``May be negated'' in their
13723 description. The descriptions themselves refer to the positive
13724 case, that is, when @emph{not} negated (unless stated otherwise,
13727 Some settings are not available on all POSIX systems, since they use
13728 extensions. Such arguments are marked below with
13729 ``Non-POSIX'' in their description. On non-POSIX
13730 systems, those or other settings also may not
13731 be available, but it's not feasible to document all the variations: just
13737 * Control:: Control settings
13738 * Input:: Input settings
13739 * Output:: Output settings
13740 * Local:: Local settings
13741 * Combination:: Combination settings
13742 * Characters:: Special characters
13743 * Special:: Special settings
13748 @subsection Control settings
13750 @cindex control settings
13756 @cindex two-way parity
13757 Generate parity bit in output and expect parity bit in input.
13763 @cindex even parity
13764 Set odd parity (even if negated). May be negated.
13771 @cindex character size
13772 @cindex eight-bit characters
13773 Set character size to 5, 6, 7, or 8 bits.
13778 Send a hangup signal when the last process closes the tty. May be
13784 Use two stop bits per character (one if negated). May be negated.
13788 Allow input to be received. May be negated.
13792 @cindex modem control
13793 Disable modem control signals. May be negated.
13797 @cindex hardware flow control
13798 @cindex flow control, hardware
13799 @cindex RTS/CTS flow control
13800 Enable RTS/CTS flow control. Non-POSIX@. May be negated.
13804 @cindex hardware flow control
13805 @cindex flow control, hardware
13806 @cindex DTR/DSR flow control
13807 Enable DTR/DSR flow control. Non-POSIX@. May be negated.
13812 @subsection Input settings
13814 @cindex input settings
13815 These settings control operations on data received from the terminal.
13820 @cindex breaks, ignoring
13821 Ignore break characters. May be negated.
13825 @cindex breaks, cause interrupts
13826 Make breaks cause an interrupt signal. May be negated.
13830 @cindex parity, ignoring
13831 Ignore characters with parity errors. May be negated.
13835 @cindex parity errors, marking
13836 Mark parity errors (with a 255-0-character sequence). May be negated.
13840 Enable input parity checking. May be negated.
13844 @cindex eight-bit input
13845 Clear high (8th) bit of input characters. May be negated.
13849 @cindex newline, translating to return
13850 Translate newline to carriage return. May be negated.
13854 @cindex return, ignoring
13855 Ignore carriage return. May be negated.
13859 @cindex return, translating to newline
13860 Translate carriage return to newline. May be negated.
13864 @cindex input encoding, UTF-8
13865 Assume input characters are UTF-8 encoded. May be negated.
13869 @kindex C-s/C-q flow control
13870 @cindex XON/XOFF flow control
13871 Enable XON/XOFF flow control (that is, @kbd{CTRL-S}/@kbd{CTRL-Q}). May
13878 @cindex software flow control
13879 @cindex flow control, software
13880 Enable sending of @code{stop} character when the system input buffer
13881 is almost full, and @code{start} character when it becomes almost
13882 empty again. May be negated.
13886 @cindex uppercase, translating to lowercase
13887 Translate uppercase characters to lowercase. Non-POSIX@. May be
13888 negated. Note ilcuc is not implemented, as one would not be able to issue
13889 almost any (lowercase) Unix command, after invoking it.
13893 Allow any character to restart output (only the start character
13894 if negated). Non-POSIX@. May be negated.
13898 @cindex beeping at input buffer full
13899 Enable beeping and not flushing input buffer if a character arrives
13900 when the input buffer is full. Non-POSIX@. May be negated.
13905 @subsection Output settings
13907 @cindex output settings
13908 These settings control operations on data sent to the terminal.
13913 Postprocess output. May be negated.
13917 @cindex lowercase, translating to output
13918 Translate lowercase characters to uppercase. Non-POSIX@. May be
13919 negated. (Note ouclc is not currently implemented.)
13923 @cindex return, translating to newline
13924 Translate carriage return to newline. Non-POSIX@. May be negated.
13928 @cindex newline, translating to crlf
13929 Translate newline to carriage return-newline. Non-POSIX@. May be
13934 Do not print carriage returns in the first column. Non-POSIX@.
13939 Newline performs a carriage return. Non-POSIX@. May be negated.
13943 @cindex pad instead of timing for delaying
13944 Use fill (padding) characters instead of timing for delays.
13950 @cindex pad character
13951 Use ASCII DEL characters for fill instead of
13952 ASCII NUL characters. Non-POSIX@.
13958 Newline delay style. Non-POSIX.
13965 Carriage return delay style. Non-POSIX.
13971 @opindex tab@var{n}
13972 Horizontal tab delay style. Non-POSIX.
13977 Backspace delay style. Non-POSIX.
13982 Vertical tab delay style. Non-POSIX.
13987 Form feed delay style. Non-POSIX.
13992 @subsection Local settings
13994 @cindex local settings
13999 Enable @code{interrupt}, @code{quit}, and @code{suspend} special
14000 characters. May be negated.
14004 Enable @code{erase}, @code{kill}, @code{werase}, and @code{rprnt}
14005 special characters. May be negated.
14009 Enable non-POSIX special characters. May be negated.
14013 Echo input characters. May be negated.
14019 Echo @code{erase} characters as backspace-space-backspace. May be
14024 @cindex newline echoing after @code{kill}
14025 Echo a newline after a @code{kill} character. May be negated.
14029 @cindex newline, echoing
14030 Echo newline even if not echoing other characters. May be negated.
14034 @cindex flushing, disabling
14035 Disable flushing after @code{interrupt} and @code{quit} special
14036 characters. May be negated.
14040 @cindex case translation
14041 Enable input and output of uppercase characters by preceding their
14042 lowercase equivalents with @samp{\}, when @code{icanon} is set.
14043 Non-POSIX@. May be negated.
14047 @cindex background jobs, stopping at terminal write
14048 Stop background jobs that try to write to the terminal. Non-POSIX@.
14055 Echo erased characters backward, between @samp{\} and @samp{/}.
14056 Non-POSIX@. May be negated.
14062 @cindex control characters, using @samp{^@var{c}}
14063 @cindex hat notation for control characters
14064 Echo control characters in hat notation (@samp{^@var{c}}) instead
14065 of literally. Non-POSIX@. May be negated.
14071 Echo the @code{kill} special character by erasing each character on
14072 the line as indicated by the @code{echoprt} and @code{echoe} settings,
14073 instead of by the @code{echoctl} and @code{echok} settings.
14080 @subsection Combination settings
14082 @cindex combination settings
14083 Combination settings:
14090 Same as @code{parenb -parodd cs7}. May be negated. If negated, same
14091 as @code{-parenb cs8}.
14095 Same as @code{parenb parodd cs7}. May be negated. If negated, same
14096 as @code{-parenb cs8}.
14100 Same as @code{-icrnl -onlcr}. May be negated. If negated, same as
14101 @code{icrnl -inlcr -igncr onlcr -ocrnl -onlret}.
14105 Reset the @code{erase} and @code{kill} special characters to their default
14112 @c This is too long to write inline.
14114 cread -ignbrk brkint -inlcr -igncr icrnl -ixoff
14115 -iuclc -ixany imaxbel opost -olcuc -ocrnl onlcr
14116 -onocr -onlret -ofill -ofdel nl0 cr0 tab0 bs0 vt0
14117 ff0 isig icanon iexten echo echoe echok -echonl
14118 -noflsh -xcase -tostop -echoprt echoctl echoke
14122 and also sets all special characters to their default values.
14126 Same as @code{brkint ignpar istrip icrnl ixon opost isig icanon}, plus
14127 sets the @code{eof} and @code{eol} characters to their default values
14128 if they are the same as the @code{min} and @code{time} characters.
14129 May be negated. If negated, same as @code{raw}.
14136 -ignbrk -brkint -ignpar -parmrk -inpck -istrip
14137 -inlcr -igncr -icrnl -ixon -ixoff -iuclc -ixany
14138 -imaxbel -opost -isig -icanon -xcase min 1 time 0
14142 May be negated. If negated, same as @code{cooked}.
14146 Same as @option{-icanon}. May be negated. If negated, same as
14151 @cindex eight-bit characters
14152 Same as @code{-parenb -istrip cs8}. May be negated. If negated,
14153 same as @code{parenb istrip cs7}.
14157 Same as @option{-parenb -istrip -opost cs8}. May be negated.
14158 If negated, same as @code{parenb istrip opost cs7}.
14162 Same as @option{-ixany}. Non-POSIX@. May be negated.
14166 Same as @code{tab0}. Non-POSIX@. May be negated. If negated, same
14173 Same as @code{xcase iuclc olcuc}. Non-POSIX@. May be negated.
14174 (Used for terminals with uppercase characters only.)
14178 Same as @code{echoe echoctl echoke}.
14182 Same as @code{echoe echoctl echoke -ixany intr ^C erase ^? kill C-u}.
14187 @subsection Special characters
14189 @cindex special characters
14190 @cindex characters, special
14192 The special characters' default values vary from system to system.
14193 They are set with the syntax @samp{name value}, where the names are
14194 listed below and the value can be given either literally, in hat
14195 notation (@samp{^@var{c}}), or as an integer which may start with
14196 @samp{0x} to indicate hexadecimal, @samp{0} to indicate octal, or
14197 any other digit to indicate decimal.
14199 @cindex disabling special characters
14200 @kindex u@r{, and disabling special characters}
14201 For GNU stty, giving a value of @code{^-} or @code{undef} disables that
14202 special character. (This is incompatible with Ultrix @command{stty},
14203 which uses a value of @samp{u} to disable a special character. GNU
14204 @command{stty} treats a value @samp{u} like any other, namely to set that
14205 special character to @key{U}.)
14211 Send an interrupt signal.
14215 Send a quit signal.
14219 Erase the last character typed.
14223 Erase the current line.
14227 Send an end of file (terminate the input).
14235 Alternate character to end the line. Non-POSIX.
14239 Switch to a different shell layer. Non-POSIX.
14243 Restart the output after stopping it.
14251 Send a terminal stop signal.
14255 Send a terminal stop signal after flushing the input. Non-POSIX.
14259 Redraw the current line. Non-POSIX.
14263 Erase the last word typed. Non-POSIX.
14267 Enter the next character typed literally, even if it is a special
14268 character. Non-POSIX.
14273 @subsection Special settings
14275 @cindex special settings
14280 Set the minimum number of characters that will satisfy a read until
14281 the time value has expired, when @option{-icanon} is set.
14285 Set the number of tenths of a second before reads time out if the minimum
14286 number of characters have not been read, when @option{-icanon} is set.
14288 @item ispeed @var{n}
14290 Set the input speed to @var{n}.
14292 @item ospeed @var{n}
14294 Set the output speed to @var{n}.
14298 Tell the tty kernel driver that the terminal has @var{n} rows.
14302 @itemx columns @var{n}
14305 Tell the kernel that the terminal has @var{n} columns. Non-POSIX.
14311 Print the number of rows and columns that the kernel thinks the
14312 terminal has. (Systems that don't support rows and columns in the kernel
14313 typically use the environment variables @env{LINES} and @env{COLUMNS}
14314 instead; however, GNU @command{stty} does not know anything about them.)
14319 Use line discipline @var{n}. Non-POSIX.
14323 Print the terminal speed.
14326 @cindex baud rate, setting
14327 Set the input and output speeds to @var{n}. @var{n} can be one of: 0
14328 50 75 110 134 134.5 150 200 300 600 1200 1800 2400 4800 9600 19200
14329 38400 @code{exta} @code{extb}. @code{exta} is the same as 19200;
14330 @code{extb} is the same as 38400. Many systems, including GNU/Linux,
14331 support higher speeds. The @command{stty} command includes support
14348 4000000 where the system supports these.
14349 0 hangs up the line if @option{-clocal} is set.
14353 @node printenv invocation
14354 @section @command{printenv}: Print all or some environment variables
14357 @cindex printing all or some environment variables
14358 @cindex environment variables, printing
14360 @command{printenv} prints environment variable values. Synopsis:
14363 printenv [@var{option}] [@var{variable}]@dots{}
14366 If no @var{variable}s are specified, @command{printenv} prints the value of
14367 every environment variable. Otherwise, it prints the value of each
14368 @var{variable} that is set, and nothing for those that are not set.
14370 The program accepts the following option. Also see @ref{Common options}.
14378 @cindex exit status of @command{printenv}
14382 0 if all variables specified were found
14383 1 if at least one specified variable was not found
14384 2 if a write error occurred
14388 @node tty invocation
14389 @section @command{tty}: Print file name of terminal on standard input
14392 @cindex print terminal file name
14393 @cindex terminal file name, printing
14395 @command{tty} prints the file name of the terminal connected to its standard
14396 input. It prints @samp{not a tty} if standard input is not a terminal.
14400 tty [@var{option}]@dots{}
14403 The program accepts the following option. Also see @ref{Common options}.
14413 Print nothing; only return an exit status.
14417 @cindex exit status of @command{tty}
14421 0 if standard input is a terminal
14422 1 if standard input is not a terminal
14423 2 if given incorrect arguments
14424 3 if a write error occurs
14428 @node User information
14429 @chapter User information
14431 @cindex user information, commands for
14432 @cindex commands for printing user information
14434 This section describes commands that print user-related information:
14435 logins, groups, and so forth.
14438 * id invocation:: Print user identity.
14439 * logname invocation:: Print current login name.
14440 * whoami invocation:: Print effective user ID.
14441 * groups invocation:: Print group names a user is in.
14442 * users invocation:: Print login names of users currently logged in.
14443 * who invocation:: Print who is currently logged in.
14447 @node id invocation
14448 @section @command{id}: Print user identity
14451 @cindex real user and group IDs, printing
14452 @cindex effective user and group IDs, printing
14453 @cindex printing real and effective user and group IDs
14455 @command{id} prints information about the given user, or the process
14456 running it if no user is specified. Synopsis:
14459 id [@var{option}]@dots{} [@var{username}]
14462 @vindex POSIXLY_CORRECT
14463 By default, it prints the real user ID, real group ID, effective user ID
14464 if different from the real user ID, effective group ID if different from
14465 the real group ID, and supplemental group IDs.
14466 In addition, if SELinux
14467 is enabled and the @env{POSIXLY_CORRECT} environment variable is not set,
14468 then print @samp{context=@var{c}}, where @var{c} is the security context.
14470 Each of these numeric values is preceded by an identifying string and
14471 followed by the corresponding user or group name in parentheses.
14473 The options cause @command{id} to print only part of the above information.
14474 Also see @ref{Common options}.
14481 Print only the group ID.
14487 Print only the group ID and the supplementary groups.
14493 Print the user or group name instead of the ID number. Requires
14494 @option{-u}, @option{-g}, or @option{-G}.
14500 Print the real, instead of effective, user or group ID@. Requires
14501 @option{-u}, @option{-g}, or @option{-G}.
14507 Print only the user ID.
14514 @cindex security context
14515 Print only the security context of the current user.
14516 If SELinux is disabled then print a warning and
14517 set the exit status to 1.
14523 @macro primaryAndSupplementaryGroups{cmd,arg}
14524 Primary and supplementary groups for a process are normally inherited
14525 from its parent and are usually unchanged since login. This means
14526 that if you change the group database after logging in, @command{\cmd\}
14527 will not reflect your changes within your existing login session.
14528 Running @command{\cmd\} with a \arg\ causes the user and group
14529 database to be consulted afresh, and so will give a different result.
14531 @primaryAndSupplementaryGroups{id,user argument}
14533 @node logname invocation
14534 @section @command{logname}: Print current login name
14537 @cindex printing user's login name
14538 @cindex login name, printing
14539 @cindex user name, printing
14542 @command{logname} prints the calling user's name, as found in a
14543 system-maintained file (often @file{/var/run/utmp} or
14544 @file{/etc/utmp}), and exits with a status of 0. If there is no entry
14545 for the calling process, @command{logname} prints
14546 an error message and exits with a status of 1.
14548 The only options are @option{--help} and @option{--version}. @xref{Common
14554 @node whoami invocation
14555 @section @command{whoami}: Print effective user ID
14558 @cindex effective user ID, printing
14559 @cindex printing the effective user ID
14561 @command{whoami} prints the user name associated with the current
14562 effective user ID@. It is equivalent to the command @samp{id -un}.
14564 The only options are @option{--help} and @option{--version}. @xref{Common
14570 @node groups invocation
14571 @section @command{groups}: Print group names a user is in
14574 @cindex printing groups a user is in
14575 @cindex supplementary groups, printing
14577 @command{groups} prints the names of the primary and any supplementary
14578 groups for each given @var{username}, or the current process if no names
14579 are given. If more than one name is given, the name of each user is
14581 the list of that user's groups and the user name is separated from the
14582 group list by a colon. Synopsis:
14585 groups [@var{username}]@dots{}
14588 The group lists are equivalent to the output of the command @samp{id -Gn}.
14590 @primaryAndSupplementaryGroups{groups,list of users}
14592 The only options are @option{--help} and @option{--version}. @xref{Common
14598 @node users invocation
14599 @section @command{users}: Print login names of users currently logged in
14602 @cindex printing current usernames
14603 @cindex usernames, printing current
14605 @cindex login sessions, printing users with
14606 @command{users} prints on a single line a blank-separated list of user
14607 names of users currently logged in to the current host. Each user name
14608 corresponds to a login session, so if a user has more than one login
14609 session, that user's name will appear the same number of times in the
14618 With no @var{file} argument, @command{users} extracts its information from
14619 a system-maintained file (often @file{/var/run/utmp} or
14620 @file{/etc/utmp}). If a file argument is given, @command{users} uses
14621 that file instead. A common choice is @file{/var/log/wtmp}.
14623 The only options are @option{--help} and @option{--version}. @xref{Common
14629 @node who invocation
14630 @section @command{who}: Print who is currently logged in
14633 @cindex printing current user information
14634 @cindex information, about current users
14636 @command{who} prints information about users who are currently logged on.
14640 @command{who} [@var{option}] [@var{file}] [am i]
14643 @cindex terminal lines, currently used
14645 @cindex remote hostname
14646 If given no non-option arguments, @command{who} prints the following
14647 information for each user currently logged on: login name, terminal
14648 line, login time, and remote hostname or X display.
14652 If given one non-option argument, @command{who} uses that instead of
14653 a default system-maintained file (often @file{/var/run/utmp} or
14654 @file{/etc/utmp}) as the name of the file containing the record of
14655 users logged on. @file{/var/log/wtmp} is commonly given as an argument
14656 to @command{who} to look at who has previously logged on.
14660 If given two non-option arguments, @command{who} prints only the entry
14661 for the user running it (determined from its standard input), preceded
14662 by the hostname. Traditionally, the two arguments given are @samp{am
14663 i}, as in @samp{who am i}.
14666 Time stamps are listed according to the time zone rules specified by
14667 the @env{TZ} environment variable, or by the system default rules if
14668 @env{TZ} is not set. @xref{TZ Variable,, Specifying the Time Zone
14669 with @env{TZ}, libc, The GNU C Library Reference Manual}.
14671 The program accepts the following options. Also see @ref{Common options}.
14679 Same as @samp{-b -d --login -p -r -t -T -u}.
14685 Print the date and time of last system boot.
14691 Print information corresponding to dead processes.
14697 Print a line of column headings.
14703 List only the entries that correspond to processes via which the
14704 system is waiting for a user to login. The user name is always @samp{LOGIN}.
14708 Attempt to canonicalize hostnames found in utmp through a DNS lookup. This
14709 is not the default because it can cause significant delays on systems with
14710 automatic dial-up internet access.
14714 Same as @samp{who am i}.
14720 List active processes spawned by init.
14726 Print only the login names and the number of users logged on.
14727 Overrides all other options.
14732 @opindex --runlevel
14733 Print the current (and maybe previous) run-level of the init process.
14737 Ignored; for compatibility with other versions of @command{who}.
14743 Print last system clock change.
14748 After the login time, print the number of hours and minutes that the
14749 user has been idle. @samp{.} means the user was active in the last minute.
14750 @samp{old} means the user has been idle for more than 24 hours.
14761 @opindex --writable
14762 @cindex message status
14763 @pindex write@r{, allowed}
14764 After each login name print a character indicating the user's message status:
14767 @samp{+} allowing @code{write} messages
14768 @samp{-} disallowing @code{write} messages
14769 @samp{?} cannot find terminal device
14777 @node System context
14778 @chapter System context
14780 @cindex system context
14781 @cindex context, system
14782 @cindex commands for system context
14784 This section describes commands that print or change system-wide
14788 * date invocation:: Print or set system date and time.
14789 * arch invocation:: Print machine hardware name.
14790 * nproc invocation:: Print the number of processors.
14791 * uname invocation:: Print system information.
14792 * hostname invocation:: Print or set system name.
14793 * hostid invocation:: Print numeric host identifier.
14794 * uptime invocation:: Print system uptime and load.
14797 @node date invocation
14798 @section @command{date}: Print or set system date and time
14801 @cindex time, printing or setting
14802 @cindex printing the current time
14807 date [@var{option}]@dots{} [+@var{format}]
14808 date [-u|--utc|--universal] @c this avoids a newline in the output
14809 [ MMDDhhmm[[CC]YY][.ss] ]
14813 Invoking @command{date} with no @var{format} argument is equivalent to invoking
14814 it with a default format that depends on the @env{LC_TIME} locale category.
14815 In the default C locale, this format is @samp{'+%a %b %e %H:%M:%S %Z %Y'},
14816 so the output looks like @samp{Thu Mar @ 3 13:47:51 PST 2005}.
14819 Normally, @command{date} uses the time zone rules indicated by the
14820 @env{TZ} environment variable, or the system default rules if @env{TZ}
14821 is not set. @xref{TZ Variable,, Specifying the Time Zone with
14822 @env{TZ}, libc, The GNU C Library Reference Manual}.
14824 @findex strftime @r{and @command{date}}
14825 @cindex time formats
14826 @cindex formatting times
14827 If given an argument that starts with a @samp{+}, @command{date} prints the
14828 current date and time (or the date and time specified by the
14829 @option{--date} option, see below) in the format defined by that argument,
14830 which is similar to that of the @code{strftime} function. Except for
14831 conversion specifiers, which start with @samp{%}, characters in the
14832 format string are printed unchanged. The conversion specifiers are
14838 * Time conversion specifiers:: %[HIklMNpPrRsSTXzZ]
14839 * Date conversion specifiers:: %[aAbBcCdDeFgGhjmuUVwWxyY]
14840 * Literal conversion specifiers:: %[%nt]
14841 * Padding and other flags:: Pad with zeros, spaces, etc.
14842 * Setting the time:: Changing the system clock.
14843 * Options for date:: Instead of the current time.
14845 * Date input formats:: Specifying date strings.
14847 * Examples of date:: Examples.
14850 @node Time conversion specifiers
14851 @subsection Time conversion specifiers
14853 @cindex time conversion specifiers
14854 @cindex conversion specifiers, time
14856 @command{date} conversion specifiers related to times.
14860 hour (@samp{00}@dots{}@samp{23})
14862 hour (@samp{01}@dots{}@samp{12})
14864 hour, space padded (@samp{ 0}@dots{}@samp{23}); equivalent to @samp{%_H}@.
14865 This is a GNU extension.
14867 hour, space padded (@samp{ 1}@dots{}@samp{12}); equivalent to @samp{%_I}@.
14868 This is a GNU extension.
14870 minute (@samp{00}@dots{}@samp{59})
14872 nanoseconds (@samp{000000000}@dots{}@samp{999999999}).
14873 This is a GNU extension.
14875 locale's equivalent of either @samp{AM} or @samp{PM};
14876 blank in many locales.
14877 Noon is treated as @samp{PM} and midnight as @samp{AM}.
14879 like @samp{%p}, except lower case.
14880 This is a GNU extension.
14882 locale's 12-hour clock time (e.g., @samp{11:11:04 PM})
14884 24-hour hour and minute. Same as @samp{%H:%M}.
14886 @cindex epoch, seconds since
14887 @cindex seconds since the epoch
14888 @cindex beginning of time
14889 @cindex leap seconds
14890 seconds since the epoch, i.e., since 1970-01-01 00:00:00 UTC@.
14891 Leap seconds are not counted unless leap second support is available.
14892 @xref{%s-examples}, for examples.
14893 This is a GNU extension.
14895 @cindex leap seconds
14896 second (@samp{00}@dots{}@samp{60}).
14897 This may be @samp{60} if leap seconds are supported.
14899 24-hour hour, minute, and second. Same as @samp{%H:%M:%S}.
14901 locale's time representation (e.g., @samp{23:13:48})
14903 @w{RFC 2822/ISO 8601} style numeric time zone
14904 (e.g., @samp{-0600} or @samp{+0530}), or nothing if no
14905 time zone is determinable. This value reflects the numeric time zone
14906 appropriate for the current time, using the time zone rules specified
14907 by the @env{TZ} environment variable.
14908 The time (and optionally, the time zone rules) can be overridden
14909 by the @option{--date} option.
14911 @w{RFC 3339/ISO 8601} style numeric time zone with
14912 @samp{:} (e.g., @samp{-06:00} or @samp{+05:30}), or nothing if no time
14913 zone is determinable.
14914 This is a GNU extension.
14916 Numeric time zone to the nearest second with @samp{:} (e.g.,
14917 @samp{-06:00:00} or @samp{+05:30:00}), or nothing if no time zone is
14919 This is a GNU extension.
14921 Numeric time zone with @samp{:} using the minimum necessary precision
14922 (e.g., @samp{-06}, @samp{+05:30}, or @samp{-04:56:02}), or nothing if
14923 no time zone is determinable.
14924 This is a GNU extension.
14926 alphabetic time zone abbreviation (e.g., @samp{EDT}), or nothing if no
14927 time zone is determinable. See @samp{%z} for how it is determined.
14931 @node Date conversion specifiers
14932 @subsection Date conversion specifiers
14934 @cindex date conversion specifiers
14935 @cindex conversion specifiers, date
14937 @command{date} conversion specifiers related to dates.
14941 locale's abbreviated weekday name (e.g., @samp{Sun})
14943 locale's full weekday name, variable length (e.g., @samp{Sunday})
14945 locale's abbreviated month name (e.g., @samp{Jan})
14947 locale's full month name, variable length (e.g., @samp{January})
14949 locale's date and time (e.g., @samp{Thu Mar @ 3 23:05:25 2005})
14951 century. This is like @samp{%Y}, except the last two digits are omitted.
14952 For example, it is @samp{20} if @samp{%Y} is @samp{2000},
14953 and is @samp{-0} if @samp{%Y} is @samp{-001}.
14954 It is normally at least two characters, but it may be more.
14956 day of month (e.g., @samp{01})
14958 date; same as @samp{%m/%d/%y}
14960 day of month, space padded; same as @samp{%_d}
14962 full date in ISO 8601 format; same as @samp{%Y-%m-%d}.
14963 This is a good choice for a date format, as it is standard and
14964 is easy to sort in the usual case where years are in the range
14967 year corresponding to the ISO week number, but without the century
14968 (range @samp{00} through @samp{99}). This has the same format and value
14969 as @samp{%y}, except that if the ISO week number (see
14971 to the previous or next year, that year is used instead.
14973 year corresponding to the ISO week number. This has the
14974 same format and value as @samp{%Y}, except that if the ISO
14976 @samp{%V}) belongs to the previous or next year, that year is used
14978 It is normally useful only if @samp{%V} is also used;
14979 for example, the format @samp{%G-%m-%d} is probably a mistake,
14980 since it combines the ISO week number year with the conventional month and day.
14984 day of year (@samp{001}@dots{}@samp{366})
14986 month (@samp{01}@dots{}@samp{12})
14988 day of week (@samp{1}@dots{}@samp{7}) with @samp{1} corresponding to Monday
14990 week number of year, with Sunday as the first day of the week
14991 (@samp{00}@dots{}@samp{53}).
14992 Days in a new year preceding the first Sunday are in week zero.
14994 ISO week number, that is, the
14995 week number of year, with Monday as the first day of the week
14996 (@samp{01}@dots{}@samp{53}).
14997 If the week containing January 1 has four or more days in
14998 the new year, then it is considered week 1; otherwise, it is week 53 of
14999 the previous year, and the next week is week 1. (See the ISO 8601
15002 day of week (@samp{0}@dots{}@samp{6}) with 0 corresponding to Sunday
15004 week number of year, with Monday as first day of week
15005 (@samp{00}@dots{}@samp{53}).
15006 Days in a new year preceding the first Monday are in week zero.
15008 locale's date representation (e.g., @samp{12/31/99})
15010 last two digits of year (@samp{00}@dots{}@samp{99})
15012 year. This is normally at least four characters, but it may be more.
15013 Year @samp{0000} precedes year @samp{0001}, and year @samp{-001}
15014 precedes year @samp{0000}.
15018 @node Literal conversion specifiers
15019 @subsection Literal conversion specifiers
15021 @cindex literal conversion specifiers
15022 @cindex conversion specifiers, literal
15024 @command{date} conversion specifiers that produce literal strings.
15036 @node Padding and other flags
15037 @subsection Padding and other flags
15039 @cindex numeric field padding
15040 @cindex padding of numeric fields
15041 @cindex fields, padding numeric
15043 Unless otherwise specified, @command{date} normally pads numeric fields
15044 with zeros, so that, for
15045 example, numeric months are always output as two digits.
15046 Seconds since the epoch are not padded, though,
15047 since there is no natural width for them.
15049 As a GNU extension, @command{date} recognizes any of the
15050 following optional flags after the @samp{%}:
15054 (hyphen) Do not pad the field; useful if the output is intended for
15057 (underscore) Pad with spaces; useful if you need a fixed
15058 number of characters in the output, but zeros are too distracting.
15060 (zero) Pad with zeros even if the conversion specifier
15061 would normally pad with spaces.
15063 Use upper case characters if possible.
15065 Use opposite case characters if possible.
15066 A field that is normally upper case becomes lower case, and vice versa.
15070 Here are some examples of padding:
15073 date +%d/%m -d "Feb 1"
15075 date +%-d/%-m -d "Feb 1"
15077 date +%_d/%_m -d "Feb 1"
15081 As a GNU extension, you can specify the field width
15082 (after any flag, if present) as a decimal number. If the natural size of the
15083 output of the field has less than the specified number of characters,
15084 the result is written right adjusted and padded to the given
15085 size. For example, @samp{%9B} prints the right adjusted month name in
15086 a field of width 9.
15088 An optional modifier can follow the optional flag and width
15089 specification. The modifiers are:
15093 Use the locale's alternate representation for date and time. This
15094 modifier applies to the @samp{%c}, @samp{%C}, @samp{%x}, @samp{%X},
15095 @samp{%y} and @samp{%Y} conversion specifiers. In a Japanese locale, for
15096 example, @samp{%Ex} might yield a date format based on the Japanese
15100 Use the locale's alternate numeric symbols for numbers. This modifier
15101 applies only to numeric conversion specifiers.
15104 If the format supports the modifier but no alternate representation
15105 is available, it is ignored.
15108 @node Setting the time
15109 @subsection Setting the time
15111 @cindex setting the time
15112 @cindex time setting
15113 @cindex appropriate privileges
15115 If given an argument that does not start with @samp{+}, @command{date} sets
15116 the system clock to the date and time specified by that argument (as
15117 described below). You must have appropriate privileges to set the
15118 system clock. Note for changes to persist across a reboot, the
15119 hardware clock may need to be updated from the system clock, which
15120 might not happen automatically on your system.
15122 The argument must consist entirely of digits, which have the following
15135 first two digits of year (optional)
15137 last two digits of year (optional)
15142 Note, the @option{--date} and @option{--set} options may not be used with an
15143 argument in the above format. The @option{--universal} option may be used
15144 with such an argument to indicate that the specified date and time are
15145 relative to Coordinated Universal Time rather than to the local time zone.
15148 @node Options for date
15149 @subsection Options for @command{date}
15151 @cindex @command{date} options
15152 @cindex options for @command{date}
15154 The program accepts the following options. Also see @ref{Common options}.
15158 @item -d @var{datestr}
15159 @itemx --date=@var{datestr}
15162 @cindex parsing date strings
15163 @cindex date strings, parsing
15164 @cindex arbitrary date strings, parsing
15167 @opindex next @var{day}
15168 @opindex last @var{day}
15169 Display the date and time specified in @var{datestr} instead of the
15170 current date and time. @var{datestr} can be in almost any common
15171 format. It can contain month names, time zones, @samp{am} and @samp{pm},
15172 @samp{yesterday}, etc. For example, @option{--date="2004-02-27
15173 14:19:13.489392193 +0530"} specifies the instant of time that is
15174 489,392,193 nanoseconds after February 27, 2004 at 2:19:13 PM in a
15175 time zone that is 5 hours and 30 minutes east of UTC.@*
15176 Note: input currently must be in locale independent format. E.g., the
15177 LC_TIME=C below is needed to print back the correct date in many locales:
15179 date -d "$(LC_TIME=C date)"
15181 @xref{Date input formats}.
15183 @item -f @var{datefile}
15184 @itemx --file=@var{datefile}
15187 Parse each line in @var{datefile} as with @option{-d} and display the
15188 resulting date and time. If @var{datefile} is @samp{-}, use standard
15189 input. This is useful when you have many dates to process, because the
15190 system overhead of starting up the @command{date} executable many times can
15193 @item -I[@var{timespec}]
15194 @itemx --iso-8601[=@var{timespec}]
15195 @opindex -I[@var{timespec}]
15196 @opindex --iso-8601[=@var{timespec}]
15197 Display the date using the ISO 8601 format, @samp{%Y-%m-%d}.
15199 The argument @var{timespec} specifies the number of additional
15200 terms of the time to include. It can be one of the following:
15203 Print just the date. This is the default if @var{timespec} is omitted.
15206 Append the hour of the day to the date.
15209 Append the hours and minutes.
15212 Append the hours, minutes and seconds.
15215 Append the hours, minutes, seconds and nanoseconds.
15218 If showing any time terms, then include the time zone using the format
15221 @item -r @var{file}
15222 @itemx --reference=@var{file}
15224 @opindex --reference
15225 Display the date and time of the last modification of @var{file},
15226 instead of the current date and time.
15233 @opindex --rfc-2822
15234 Display the date and time using the format @samp{%a, %d %b %Y %H:%M:%S
15235 %z}, evaluated in the C locale so abbreviations are always in English.
15239 Fri, 09 Sep 2005 13:51:39 -0700
15242 This format conforms to
15243 @uref{ftp://ftp.rfc-editor.org/in-notes/rfc2822.txt, Internet
15245 @uref{ftp://ftp.rfc-editor.org/in-notes/rfc822.txt, 822}, the
15246 current and previous standards for Internet email.
15248 @item --rfc-3339=@var{timespec}
15249 @opindex --rfc-3339=@var{timespec}
15250 Display the date using a format specified by
15251 @uref{ftp://ftp.rfc-editor.org/in-notes/rfc3339.txt, Internet
15252 RFC 3339}. This is a subset of the ISO 8601
15253 format, except that it also permits applications to use a space rather
15254 than a @samp{T} to separate dates from times. Unlike the other
15255 standard formats, RFC 3339 format is always suitable as
15256 input for the @option{--date} (@option{-d}) and @option{--file}
15257 (@option{-f}) options, regardless of the current locale.
15259 The argument @var{timespec} specifies how much of the time to include.
15260 It can be one of the following:
15264 Print just the full-date, e.g., @samp{2005-09-14}.
15265 This is equivalent to the format @samp{%Y-%m-%d}.
15268 Print the full-date and full-time separated by a space, e.g.,
15269 @samp{2005-09-14 00:56:06+05:30}. The output ends with a numeric
15270 time-offset; here the @samp{+05:30} means that local time is five
15271 hours and thirty minutes east of UTC@. This is equivalent to
15272 the format @samp{%Y-%m-%d %H:%M:%S%:z}.
15275 Like @samp{seconds}, but also print nanoseconds, e.g.,
15276 @samp{2005-09-14 00:56:06.998458565+05:30}.
15277 This is equivalent to the format @samp{%Y-%m-%d %H:%M:%S.%N%:z}.
15281 @item -s @var{datestr}
15282 @itemx --set=@var{datestr}
15285 Set the date and time to @var{datestr}. See @option{-d} above.
15286 See also @ref{Setting the time}.
15293 @opindex --universal
15294 @cindex Coordinated Universal Time
15296 @cindex Greenwich Mean Time
15298 @cindex leap seconds
15300 Use Coordinated Universal Time (UTC) by operating as if the
15301 @env{TZ} environment variable were set to the string @samp{UTC0}.
15303 Universal Time is often called ``Greenwich Mean Time'' (GMT) for
15304 historical reasons.
15305 Typically, systems ignore leap seconds and thus implement an
15306 approximation to UTC rather than true UTC.
15310 @node Examples of date
15311 @subsection Examples of @command{date}
15313 @cindex examples of @command{date}
15315 Here are a few examples. Also see the documentation for the @option{-d}
15316 option in the previous section.
15321 To print the date of the day before yesterday:
15324 date --date='2 days ago'
15328 To print the date of the day three months and one day hence:
15331 date --date='3 months 1 day'
15335 To print the day of year of Christmas in the current year:
15338 date --date='25 Dec' +%j
15342 To print the current full month name and the day of the month:
15348 But this may not be what you want because for the first nine days of
15349 the month, the @samp{%d} expands to a zero-padded two-digit field,
15350 for example @samp{date -d 1may '+%B %d'} will print @samp{May 01}.
15353 To print a date without the leading zero for one-digit days
15354 of the month, you can use the (GNU extension)
15355 @samp{-} flag to suppress
15356 the padding altogether:
15359 date -d 1may '+%B %-d
15363 To print the current date and time in the format required by many
15364 non-GNU versions of @command{date} when setting the system clock:
15367 date +%m%d%H%M%Y.%S
15371 To set the system clock forward by two minutes:
15374 date --set='+2 minutes'
15378 To print the date in RFC 2822 format,
15379 use @samp{date --rfc-2822}. Here is some example output:
15382 Fri, 09 Sep 2005 13:51:39 -0700
15385 @anchor{%s-examples}
15387 To convert a date string to the number of seconds since the epoch
15388 (which is 1970-01-01 00:00:00 UTC), use the @option{--date} option with
15389 the @samp{%s} format. That can be useful in sorting and/or graphing
15390 and/or comparing data by date. The following command outputs the
15391 number of the seconds since the epoch for the time two minutes after the
15395 date --date='1970-01-01 00:02:00 +0000' +%s
15399 If you do not specify time zone information in the date string,
15400 @command{date} uses your computer's idea of the time zone when
15401 interpreting the string. For example, if your computer's time zone is
15402 that of Cambridge, Massachusetts, which was then 5 hours (i.e., 18,000
15403 seconds) behind UTC:
15406 # local time zone used
15407 date --date='1970-01-01 00:02:00' +%s
15412 If you're sorting or graphing dated data, your raw date values may be
15413 represented as seconds since the epoch. But few people can look at
15414 the date @samp{946684800} and casually note ``Oh, that's the first second
15415 of the year 2000 in Greenwich, England.''
15418 date --date='2000-01-01 UTC' +%s
15422 An alternative is to use the @option{--utc} (@option{-u}) option.
15423 Then you may omit @samp{UTC} from the date string. Although this
15424 produces the same result for @samp{%s} and many other format sequences,
15425 with a time zone offset different from zero, it would give a different
15426 result for zone-dependent formats like @samp{%z}.
15429 date -u --date=2000-01-01 +%s
15433 To convert such an unwieldy number of seconds back to
15434 a more readable form, use a command like this:
15437 # local time zone used
15438 date -d '1970-01-01 UTC 946684800 seconds' +"%Y-%m-%d %T %z"
15439 1999-12-31 19:00:00 -0500
15442 Or if you do not mind depending on the @samp{@@} feature present since
15443 coreutils 5.3.0, you could shorten this to:
15446 date -d @@946684800 +"%F %T %z"
15447 1999-12-31 19:00:00 -0500
15450 Often it is better to output UTC-relative date and time:
15453 date -u -d '1970-01-01 946684800 seconds' +"%Y-%m-%d %T %z"
15454 2000-01-01 00:00:00 +0000
15458 @cindex leap seconds
15459 Typically the seconds count omits leap seconds, but some systems are
15460 exceptions. Because leap seconds are not predictable, the mapping
15461 between the seconds count and a future timestamp is not reliable on
15462 the atypical systems that include leap seconds in their counts.
15464 Here is how the two kinds of systems handle the leap second at
15465 2012-06-30 23:59:60 UTC:
15468 # Typical systems ignore leap seconds:
15469 date --date='2012-06-30 23:59:59 +0000' +%s
15471 date --date='2012-06-30 23:59:60 +0000' +%s
15472 date: invalid date '2012-06-30 23:59:60 +0000'
15473 date --date='2012-07-01 00:00:00 +0000' +%s
15478 # Atypical systems count leap seconds:
15479 date --date='2012-06-30 23:59:59 +0000' +%s
15481 date --date='2012-06-30 23:59:60 +0000' +%s
15483 date --date='2012-07-01 00:00:00 +0000' +%s
15490 @node arch invocation
15491 @section @command{arch}: Print machine hardware name
15494 @cindex print machine hardware name
15495 @cindex system information, printing
15497 @command{arch} prints the machine hardware name,
15498 and is equivalent to @samp{uname -m}.
15502 arch [@var{option}]
15505 The program accepts the @ref{Common options} only.
15510 @node nproc invocation
15511 @section @command{nproc}: Print the number of available processors
15514 @cindex Print the number of processors
15515 @cindex system information, printing
15517 Print the number of processing units available to the current process,
15518 which may be less than the number of online processors.
15519 If this information is not accessible, then print the number of
15520 processors installed. If the @env{OMP_NUM_THREADS} environment variable is
15521 set, then it will determine the returned value. The result is guaranteed to be
15522 greater than zero. Synopsis:
15525 nproc [@var{option}]
15528 The program accepts the following options. Also see @ref{Common options}.
15534 Print the number of installed processors on the system, which may
15535 be greater than the number online or available to the current process.
15536 The @env{OMP_NUM_THREADS} environment variable is not honored in this case.
15538 @item --ignore=@var{number}
15540 If possible, exclude this @var{number} of processing units.
15547 @node uname invocation
15548 @section @command{uname}: Print system information
15551 @cindex print system information
15552 @cindex system information, printing
15554 @command{uname} prints information about the machine and operating system
15555 it is run on. If no options are given, @command{uname} acts as if the
15556 @option{-s} option were given. Synopsis:
15559 uname [@var{option}]@dots{}
15562 If multiple options or @option{-a} are given, the selected information is
15563 printed in this order:
15566 @var{kernel-name} @var{nodename} @var{kernel-release} @var{kernel-version}
15567 @var{machine} @var{processor} @var{hardware-platform} @var{operating-system}
15570 The information may contain internal spaces, so such output cannot be
15571 parsed reliably. In the following example, @var{release} is
15572 @samp{2.2.18ss.e820-bda652a #4 SMP Tue Jun 5 11:24:08 PDT 2001}:
15576 @result{} Linux dumdum 2.2.18 #4 SMP Tue Jun 5 11:24:08 PDT 2001 i686@c
15577 unknown unknown GNU/Linux
15581 The program accepts the following options. Also see @ref{Common options}.
15589 Print all of the below information, except omit the processor type
15590 and the hardware platform name if they are unknown.
15593 @itemx --hardware-platform
15595 @opindex --hardware-platform
15596 @cindex implementation, hardware
15597 @cindex hardware platform
15598 @cindex platform, hardware
15599 Print the hardware platform name
15600 (sometimes called the hardware implementation).
15601 Print @samp{unknown} if the kernel does not make this information
15602 easily available, as is the case with Linux kernels.
15608 @cindex machine type
15609 @cindex hardware class
15610 @cindex hardware type
15611 Print the machine hardware name (sometimes called the hardware class
15617 @opindex --nodename
15620 @cindex network node name
15621 Print the network node hostname.
15626 @opindex --processor
15627 @cindex host processor type
15628 Print the processor type (sometimes called the instruction set
15629 architecture or ISA).
15630 Print @samp{unknown} if the kernel does not make this information
15631 easily available, as is the case with Linux kernels.
15634 @itemx --operating-system
15636 @opindex --operating-system
15637 @cindex operating system name
15638 Print the name of the operating system.
15641 @itemx --kernel-release
15643 @opindex --kernel-release
15644 @cindex kernel release
15645 @cindex release of kernel
15646 Print the kernel release.
15649 @itemx --kernel-name
15651 @opindex --kernel-name
15652 @cindex kernel name
15653 @cindex name of kernel
15654 Print the kernel name.
15655 POSIX 1003.1-2001 (@pxref{Standards conformance}) calls this
15656 ``the implementation of the operating system'', because the
15657 POSIX specification itself has no notion of ``kernel''.
15658 The kernel name might be the same as the operating system name printed
15659 by the @option{-o} or @option{--operating-system} option, but it might
15660 differ. Some operating systems (e.g., FreeBSD, HP-UX) have the same
15661 name as their underlying kernels; others (e.g., GNU/Linux, Solaris)
15665 @itemx --kernel-version
15667 @opindex --kernel-version
15668 @cindex kernel version
15669 @cindex version of kernel
15670 Print the kernel version.
15677 @node hostname invocation
15678 @section @command{hostname}: Print or set system name
15681 @cindex setting the hostname
15682 @cindex printing the hostname
15683 @cindex system name, printing
15684 @cindex appropriate privileges
15686 With no arguments, @command{hostname} prints the name of the current host
15687 system. With one argument, it sets the current host name to the
15688 specified string. You must have appropriate privileges to set the host
15692 hostname [@var{name}]
15695 The only options are @option{--help} and @option{--version}. @xref{Common
15701 @node hostid invocation
15702 @section @command{hostid}: Print numeric host identifier
15705 @cindex printing the host identifier
15707 @command{hostid} prints the numeric identifier of the current host
15708 in hexadecimal. This command accepts no arguments.
15709 The only options are @option{--help} and @option{--version}.
15710 @xref{Common options}.
15712 For example, here's what it prints on one system I use:
15719 On that system, the 32-bit quantity happens to be closely
15720 related to the system's Internet address, but that isn't always
15725 @node uptime invocation
15726 @section @command{uptime}: Print system uptime and load
15729 @cindex printing the system uptime and load
15731 @command{uptime} prints the current time, the system's uptime, the
15732 number of logged-in users and the current load average.
15734 If an argument is specified, it is used as the file to be read
15735 to discover how many users are logged in. If no argument is
15736 specified, a system default is used (@command{uptime --help} indicates
15737 the default setting).
15739 The only options are @option{--help} and @option{--version}.
15740 @xref{Common options}.
15742 For example, here's what it prints right now on one system I use:
15746 14:07 up 3:35, 3 users, load average: 1.39, 1.15, 1.04
15749 The precise method of calculation of load average varies somewhat
15750 between systems. Some systems calculate it as the average number of
15751 runnable processes over the last 1, 5 and 15 minutes, but some systems
15752 also include processes in the uninterruptible sleep state (that is,
15753 those processes which are waiting for disk I/O). The Linux kernel
15754 includes uninterruptible processes.
15756 @node SELinux context
15757 @chapter SELinux context
15759 @cindex SELinux context
15760 @cindex SELinux, context
15761 @cindex commands for SELinux context
15763 This section describes commands for operations with SELinux
15767 * chcon invocation:: Change SELinux context of file
15768 * runcon invocation:: Run a command in specified SELinux context
15771 @node chcon invocation
15772 @section @command{chcon}: Change SELinux context of file
15775 @cindex changing security context
15776 @cindex change SELinux context
15778 @command{chcon} changes the SELinux security context of the selected files.
15782 chcon [@var{option}]@dots{} @var{context} @var{file}@dots{}
15783 chcon [@var{option}]@dots{} [-u @var{user}] [-r @var{role}] [-l @var{range}]@c
15784 [-t @var{type}] @var{file}@dots{}
15785 chcon [@var{option}]@dots{} --reference=@var{rfile} @var{file}@dots{}
15788 Change the SELinux security context of each @var{file} to @var{context}.
15789 With @option{--reference}, change the security context of each @var{file}
15790 to that of @var{rfile}.
15792 The program accepts the following options. Also see @ref{Common options}.
15796 @item --dereference
15797 @opindex --dereference
15798 Do not affect symbolic links but what they refer to; this is the default.
15801 @itemx --no-dereference
15803 @opindex --no-dereference
15804 @cindex no dereference
15805 Affect the symbolic links themselves instead of any referenced file.
15807 @item --reference=@var{rfile}
15808 @opindex --reference
15809 @cindex reference file
15810 Use @var{rfile}'s security context rather than specifying a @var{context} value.
15815 @opindex --recursive
15816 Operate on files and directories recursively.
15818 @item --preserve-root
15819 @opindex --preserve-root
15820 Refuse to operate recursively on the root directory, @file{/},
15821 when used together with the @option{--recursive} option.
15822 @xref{Treating / specially}.
15824 @item --no-preserve-root
15825 @opindex --no-preserve-root
15826 Do not treat the root directory, @file{/}, specially when operating
15827 recursively; this is the default.
15828 @xref{Treating / specially}.
15831 @xref{Traversing symlinks}.
15834 @xref{Traversing symlinks}.
15837 @xref{Traversing symlinks}.
15844 Output a diagnostic for every file processed.
15846 @item -u @var{user}
15847 @itemx --user=@var{user}
15850 Set user @var{user} in the target security context.
15852 @item -r @var{role}
15853 @itemx --role=@var{role}
15856 Set role @var{role} in the target security context.
15858 @item -t @var{type}
15859 @itemx --type=@var{type}
15862 Set type @var{type} in the target security context.
15864 @item -l @var{range}
15865 @itemx --range=@var{range}
15868 Set range @var{range} in the target security context.
15874 @node runcon invocation
15875 @section @command{runcon}: Run a command in specified SELinux context
15878 @cindex run with security context
15881 @command{runcon} runs file in specified SELinux security context.
15885 runcon @var{context} @var{command} [@var{args}]
15886 runcon [ -c ] [-u @var{user}] [-r @var{role}] [-t @var{type}]@c
15887 [-l @var{range}] @var{command} [@var{args}]
15890 Run @var{command} with completely-specified @var{context}, or with
15891 current or transitioned security context modified by one or more of @var{level},
15892 @var{role}, @var{type} and @var{user}.
15894 If none of @option{-c}, @option{-t}, @option{-u}, @option{-r}, or @option{-l}
15895 is specified, the first argument is used as the complete context.
15896 Any additional arguments after @var{command}
15897 are interpreted as arguments to the command.
15899 With neither @var{context} nor @var{command}, print the current
15902 The program accepts the following options. Also see @ref{Common options}.
15910 Compute process transition context before modifying.
15912 @item -u @var{user}
15913 @itemx --user=@var{user}
15916 Set user @var{user} in the target security context.
15918 @item -r @var{role}
15919 @itemx --role=@var{role}
15922 Set role @var{role} in the target security context.
15924 @item -t @var{type}
15925 @itemx --type=@var{type}
15928 Set type @var{type} in the target security context.
15930 @item -l @var{range}
15931 @itemx --range=@var{range}
15934 Set range @var{range} in the target security context.
15938 @cindex exit status of @command{runcon}
15942 126 if @var{command} is found but cannot be invoked
15943 127 if @command{runcon} itself fails or if @var{command} cannot be found
15944 the exit status of @var{command} otherwise
15947 @node Modified command invocation
15948 @chapter Modified command invocation
15950 @cindex modified command invocation
15951 @cindex invocation of commands, modified
15952 @cindex commands for invoking other commands
15954 This section describes commands that run other commands in some context
15955 different than the current one: a modified environment, as a different
15959 * chroot invocation:: Modify the root directory.
15960 * env invocation:: Modify environment variables.
15961 * nice invocation:: Modify niceness.
15962 * nohup invocation:: Immunize to hangups.
15963 * stdbuf invocation:: Modify buffering of standard streams.
15964 * timeout invocation:: Run with time limit.
15968 @node chroot invocation
15969 @section @command{chroot}: Run a command with a different root directory
15972 @cindex running a program in a specified root directory
15973 @cindex root directory, running a program in a specified
15975 @command{chroot} runs a command with a specified root directory.
15976 On many systems, only the super-user can do this.@footnote{However,
15977 some systems (e.g., FreeBSD) can be configured to allow certain regular
15978 users to use the @code{chroot} system call, and hence to run this program.
15979 Also, on Cygwin, anyone can run the @command{chroot} command, because the
15980 underlying function is non-privileged due to lack of support in MS-Windows.}
15984 chroot @var{option} @var{newroot} [@var{command} [@var{args}]@dots{}]
15985 chroot @var{option}
15988 Ordinarily, file names are looked up starting at the root of the
15989 directory structure, i.e., @file{/}. @command{chroot} changes the root to
15990 the directory @var{newroot} (which must exist) and then runs
15991 @var{command} with optional @var{args}. If @var{command} is not
15992 specified, the default is the value of the @env{SHELL} environment
15993 variable or @command{/bin/sh} if not set, invoked with the @option{-i} option.
15994 @var{command} must not be a special built-in utility
15995 (@pxref{Special built-in utilities}).
15997 The program accepts the following options. Also see @ref{Common options}.
15998 Options must precede operands.
16002 @item --userspec=@var{user}[:@var{group}]
16003 @opindex --userspec
16004 By default, @var{command} is run with the same credentials
16005 as the invoking process.
16006 Use this option to run it as a different @var{user} and/or with a
16007 different primary @var{group}.
16009 @item --groups=@var{groups}
16011 Use this option to specify the supplementary @var{groups} to be
16012 used by the new process.
16013 The items in the list (names or numeric IDs) must be separated by commas.
16017 Here are a few tips to help avoid common problems in using chroot.
16018 To start with a simple example, make @var{command} refer to a statically
16019 linked binary. If you were to use a dynamically linked executable, then
16020 you'd have to arrange to have the shared libraries in the right place under
16021 your new root directory.
16023 For example, if you create a statically linked @command{ls} executable,
16024 and put it in @file{/tmp/empty}, you can run this command as root:
16027 $ chroot /tmp/empty /ls -Rl /
16030 Then you'll see output like this:
16035 -rwxr-xr-x 1 0 0 1041745 Aug 16 11:17 ls
16038 If you want to use a dynamically linked executable, say @command{bash},
16039 then first run @samp{ldd bash} to see what shared objects it needs.
16040 Then, in addition to copying the actual binary, also copy the listed
16041 files to the required positions under your intended new root directory.
16042 Finally, if the executable requires any other files (e.g., data, state,
16043 device files), copy them into place, too.
16045 @cindex exit status of @command{chroot}
16049 125 if @command{chroot} itself fails
16050 126 if @var{command} is found but cannot be invoked
16051 127 if @var{command} cannot be found
16052 the exit status of @var{command} otherwise
16056 @node env invocation
16057 @section @command{env}: Run a command in a modified environment
16060 @cindex environment, running a program in a modified
16061 @cindex modified environment, running a program in a
16062 @cindex running a program in a modified environment
16064 @command{env} runs a command with a modified environment. Synopses:
16067 env [@var{option}]@dots{} [@var{name}=@var{value}]@dots{} @c
16068 [@var{command} [@var{args}]@dots{}]
16072 Operands of the form @samp{@var{variable}=@var{value}} set
16073 the environment variable @var{variable} to value @var{value}.
16074 @var{value} may be empty (@samp{@var{variable}=}). Setting a variable
16075 to an empty value is different from unsetting it.
16076 These operands are evaluated left-to-right, so if two operands
16077 mention the same variable the earlier is ignored.
16079 Environment variable names can be empty, and can contain any
16080 characters other than @samp{=} and ASCII NUL.
16081 However, it is wise to limit yourself to names that
16082 consist solely of underscores, digits, and ASCII letters,
16083 and that begin with a non-digit, as applications like the shell do not
16084 work well with other names.
16087 The first operand that does not contain the character @samp{=}
16088 specifies the program to invoke; it is
16089 searched for according to the @env{PATH} environment variable. Any
16090 remaining arguments are passed as arguments to that program.
16091 The program should not be a special built-in utility
16092 (@pxref{Special built-in utilities}).
16094 Modifications to @env{PATH} take effect prior to searching for
16095 @var{command}. Use caution when reducing @env{PATH}; behavior is
16096 not portable when @env{PATH} is undefined or omits key directories
16097 such as @file{/bin}.
16099 In the rare case that a utility contains a @samp{=} in the name, the
16100 only way to disambiguate it from a variable assignment is to use an
16101 intermediate command for @var{command}, and pass the problematic
16102 program name via @var{args}. For example, if @file{./prog=} is an
16103 executable in the current @env{PATH}:
16106 env prog= true # runs 'true', with prog= in environment
16107 env ./prog= true # runs 'true', with ./prog= in environment
16108 env -- prog= true # runs 'true', with prog= in environment
16109 env sh -c '\prog= true' # runs 'prog=' with argument 'true'
16110 env sh -c 'exec "$@@"' sh prog= true # also runs 'prog='
16113 @cindex environment, printing
16115 If no command name is specified following the environment
16116 specifications, the resulting environment is printed. This is like
16117 specifying the @command{printenv} program.
16119 For some examples, suppose the environment passed to @command{env}
16120 contains @samp{LOGNAME=rms}, @samp{EDITOR=emacs}, and
16121 @samp{PATH=.:/gnubin:/hacks}:
16126 Output the current environment.
16128 $ env | LC_ALL=C sort
16131 PATH=.:/gnubin:/hacks
16135 Run @command{foo} with a reduced environment, preserving only the
16136 original @env{PATH} to avoid problems in locating @command{foo}.
16138 env - PATH="$PATH" foo
16142 Run @command{foo} with the environment containing @samp{LOGNAME=rms},
16143 @samp{EDITOR=emacs}, and @samp{PATH=.:/gnubin:/hacks}, and guarantees
16144 that @command{foo} was found in the file system rather than as a shell
16151 Run @command{nemacs} with the environment containing @samp{LOGNAME=foo},
16152 @samp{EDITOR=emacs}, @samp{PATH=.:/gnubin:/hacks}, and
16153 @samp{DISPLAY=gnu:0}.
16155 env DISPLAY=gnu:0 LOGNAME=foo nemacs
16159 Attempt to run the program @command{/energy/--} (as that is the only
16160 possible path search result); if the command exists, the environment
16161 will contain @samp{LOGNAME=rms} and @samp{PATH=/energy}, and the
16162 arguments will be @samp{e=mc2}, @samp{bar}, and @samp{baz}.
16164 env -u EDITOR PATH=/energy -- e=mc2 bar baz
16170 The program accepts the following options. Also see @ref{Common options}.
16171 Options must precede operands.
16177 @item -u @var{name}
16178 @itemx --unset=@var{name}
16181 Remove variable @var{name} from the environment, if it was in the
16186 @itemx --ignore-environment
16189 @opindex --ignore-environment
16190 Start with an empty environment, ignoring the inherited environment.
16194 @cindex exit status of @command{env}
16198 0 if no @var{command} is specified and the environment is output
16199 125 if @command{env} itself fails
16200 126 if @var{command} is found but cannot be invoked
16201 127 if @var{command} cannot be found
16202 the exit status of @var{command} otherwise
16206 @node nice invocation
16207 @section @command{nice}: Run a command with modified niceness
16211 @cindex scheduling, affecting
16212 @cindex appropriate privileges
16214 @command{nice} prints a process's @dfn{niceness}, or runs
16215 a command with modified niceness. @dfn{niceness} affects how
16216 favorably the process is scheduled in the system.
16220 nice [@var{option}]@dots{} [@var{command} [@var{arg}]@dots{}]
16223 If no arguments are given, @command{nice} prints the current niceness.
16224 Otherwise, @command{nice} runs the given @var{command} with its
16225 niceness adjusted. By default, its niceness is incremented by 10.
16227 Niceness values range at least from @minus{}20 (process has high priority
16228 and gets more resources, thus slowing down other processes) through 19
16229 (process has lower priority and runs slowly itself, but has less impact
16230 on the speed of other running processes). Some systems
16231 may have a wider range of niceness values; conversely, other systems may
16232 enforce more restrictive limits. An attempt to set the niceness
16233 outside the supported range is treated as an attempt to use the
16234 minimum or maximum supported value.
16236 A niceness should not be confused with a scheduling priority, which
16237 lets applications determine the order in which threads are scheduled
16238 to run. Unlike a priority, a niceness is merely advice to the
16239 scheduler, which the scheduler is free to ignore. Also, as a point of
16240 terminology, POSIX defines the behavior of @command{nice} in
16241 terms of a @dfn{nice value}, which is the nonnegative difference
16242 between a niceness and the minimum niceness. Though @command{nice}
16243 conforms to POSIX, its documentation and diagnostics use the
16244 term ``niceness'' for compatibility with historical practice.
16246 @var{command} must not be a special built-in utility (@pxref{Special
16247 built-in utilities}).
16249 @mayConflictWithShellBuiltIn{nice}
16251 Note to change the @dfn{niceness} of an existing process,
16252 one needs to use the @command{renice} command.
16254 The program accepts the following option. Also see @ref{Common options}.
16255 Options must precede operands.
16258 @item -n @var{adjustment}
16259 @itemx --adjustment=@var{adjustment}
16261 @opindex --adjustment
16262 Add @var{adjustment} instead of 10 to the command's niceness. If
16263 @var{adjustment} is negative and you lack appropriate privileges,
16264 @command{nice} issues a warning but otherwise acts as if you specified
16267 For compatibility @command{nice} also supports an obsolete
16268 option syntax @option{-@var{adjustment}}. New scripts should use
16269 @option{-n @var{adjustment}} instead.
16273 @cindex exit status of @command{nice}
16277 0 if no @var{command} is specified and the niceness is output
16278 125 if @command{nice} itself fails
16279 126 if @var{command} is found but cannot be invoked
16280 127 if @var{command} cannot be found
16281 the exit status of @var{command} otherwise
16284 It is sometimes useful to run a non-interactive program with reduced niceness.
16287 $ nice factor 4611686018427387903
16290 Since @command{nice} prints the current niceness,
16291 you can invoke it through itself to demonstrate how it works.
16293 The default behavior is to increase the niceness by @samp{10}:
16304 The @var{adjustment} is relative to the current niceness. In the
16305 next example, the first @command{nice} invocation runs the second one
16306 with niceness 10, and it in turn runs the final one with a niceness
16310 $ nice nice -n 3 nice
16314 Specifying a niceness larger than the supported range
16315 is the same as specifying the maximum supported value:
16318 $ nice -n 10000000000 nice
16322 Only a privileged user may run a process with lower niceness:
16326 nice: cannot set niceness: Permission denied
16328 $ sudo nice -n -1 nice
16333 @node nohup invocation
16334 @section @command{nohup}: Run a command immune to hangups
16337 @cindex hangups, immunity to
16338 @cindex immunity to hangups
16339 @cindex logging out and continuing to run
16342 @command{nohup} runs the given @var{command} with hangup signals ignored,
16343 so that the command can continue running in the background after you log
16347 nohup @var{command} [@var{arg}]@dots{}
16350 If standard input is a terminal, it is redirected from
16351 @file{/dev/null} so that terminal sessions do not mistakenly consider
16352 the terminal to be used by the command. This is a GNU
16353 extension; programs intended to be portable to non-GNU hosts
16354 should use @samp{nohup @var{command} [@var{arg}]@dots{} </dev/null}
16358 If standard output is a terminal, the command's standard output is appended
16359 to the file @file{nohup.out}; if that cannot be written to, it is appended
16360 to the file @file{$HOME/nohup.out}; and if that cannot be written to, the
16361 command is not run.
16362 Any @file{nohup.out} or @file{$HOME/nohup.out} file created by
16363 @command{nohup} is made readable and writable only to the user,
16364 regardless of the current umask settings.
16366 If standard error is a terminal, it is normally redirected to the same file
16367 descriptor as the (possibly-redirected) standard output.
16368 However, if standard output is closed, standard error terminal output
16369 is instead appended to the file @file{nohup.out} or
16370 @file{$HOME/nohup.out} as above.
16372 To capture the command's output to a file other than @file{nohup.out}
16373 you can redirect it. For example, to capture the output of
16377 nohup make > make.log
16380 @command{nohup} does not automatically put the command it runs in the
16381 background; you must do that explicitly, by ending the command line
16382 with an @samp{&}. Also, @command{nohup} does not alter the
16383 niceness of @var{command}; use @command{nice} for that,
16384 e.g., @samp{nohup nice @var{command}}.
16386 @var{command} must not be a special built-in utility (@pxref{Special
16387 built-in utilities}).
16389 The only options are @option{--help} and @option{--version}. @xref{Common
16390 options}. Options must precede operands.
16392 @cindex exit status of @command{nohup}
16396 125 if @command{nohup} itself fails, and @env{POSIXLY_CORRECT} is not set
16397 126 if @var{command} is found but cannot be invoked
16398 127 if @var{command} cannot be found
16399 the exit status of @var{command} otherwise
16402 If @env{POSIXLY_CORRECT} is set, internal failures give status 127
16406 @node stdbuf invocation
16407 @section @command{stdbuf}: Run a command with modified I/O stream buffering
16410 @cindex standard streams, buffering
16411 @cindex line buffered
16413 @command{stdbuf} allows one to modify the buffering operations of the
16414 three standard I/O streams associated with a program. Synopsis:
16417 stdbuf @var{option}@dots{} @var{command}
16420 @var{command} must start with the name of a program that
16423 uses the ISO C @code{FILE} streams for input/output (note the
16424 programs @command{dd} and @command{cat} don't do that),
16427 does not adjust the buffering of its standard streams (note the
16428 program @command{tee} is not in this category).
16431 Any additional @var{arg}s are passed as additional arguments to the
16434 The program accepts the following options. Also see @ref{Common options}.
16438 @item -i @var{mode}
16439 @itemx --input=@var{mode}
16442 Adjust the standard input stream buffering.
16444 @item -o @var{mode}
16445 @itemx --output=@var{mode}
16448 Adjust the standard output stream buffering.
16450 @item -e @var{mode}
16451 @itemx --error=@var{mode}
16454 Adjust the standard error stream buffering.
16458 The @var{mode} can be specified as follows:
16463 Set the stream to line buffered mode.
16464 In this mode data is coalesced until a newline is output or
16465 input is read from any stream attached to a terminal device.
16466 This option is invalid with standard input.
16469 Disable buffering of the selected stream.
16470 In this mode, data is output immediately and only the
16471 amount of data requested is read from input.
16472 Note the difference in function for input and output.
16473 Disabling buffering for input will not influence the responsiveness
16474 or blocking behavior of the stream input functions.
16475 For example @code{fread} will still block until @code{EOF} or error,
16476 even if the underlying @code{read} returns less data than requested.
16479 Specify the size of the buffer to use in fully buffered mode.
16480 @multiplierSuffixesNoBlocks{size}
16484 @cindex exit status of @command{stdbuf}
16488 125 if @command{stdbuf} itself fails
16489 126 if @var{command} is found but cannot be invoked
16490 127 if @var{command} cannot be found
16491 the exit status of @var{command} otherwise
16495 @node timeout invocation
16496 @section @command{timeout}: Run a command with a time limit
16500 @cindex run commands with bounded time
16502 @command{timeout} runs the given @var{command} and kills it if it is
16503 still running after the specified time interval. Synopsis:
16506 timeout [@var{option}] @var{duration} @var{command} [@var{arg}]@dots{}
16509 @var{command} must not be a special built-in utility (@pxref{Special
16510 built-in utilities}).
16512 The program accepts the following options. Also see @ref{Common options}.
16513 Options must precede operands.
16516 @item --preserve-status
16517 @opindex --preserve-status
16518 Return the exit status of the managed @var{command} on timeout, rather than
16519 a specific exit status indicating a timeout. This is useful if the
16520 managed @var{command} supports running for an indeterminite amount of time.
16523 @opindex --foreground
16524 Don't create a separate background program group, so that
16525 the managed @var{command} can use the foreground TTY normally.
16526 This is needed to support timing out commands not started
16527 directly from an interactive shell, in two situations.
16530 @var{command} is interactive and needs to read from the terminal for example
16532 the user wants to support sending signals directly to @var{command}
16533 from the terminal (like Ctrl-C for example)
16536 Note in this mode of operation, any children of @var{command}
16537 will not be timed out.
16539 @item -k @var{duration}
16540 @itemx --kill-after=@var{duration}
16542 @opindex --kill-after
16543 Ensure the monitored @var{command} is killed by also sending a @samp{KILL}
16544 signal, after the specified @var{duration}. Without this option, if the
16545 selected signal proves not to be fatal, @command{timeout} does not kill
16548 @item -s @var{signal}
16549 @itemx --signal=@var{signal}
16552 Send this @var{signal} to @var{command} on timeout, rather than the
16553 default @samp{TERM} signal. @var{signal} may be a name like @samp{HUP}
16554 or a number. @xref{Signal specifications}.
16558 @var{duration} is a floating point number followed by an optional unit:
16560 @samp{s} for seconds (the default)
16561 @samp{m} for minutes
16565 A duration of 0 disables the associated timeout.
16566 Note that the actual timeout duration is dependent on system conditions,
16567 which should be especially considered when specifying sub-second timeouts.
16569 @cindex exit status of @command{timeout}
16573 124 if @var{command} times out
16574 125 if @command{timeout} itself fails
16575 126 if @var{command} is found but cannot be invoked
16576 127 if @var{command} cannot be found
16577 137 if @var{command} is sent the KILL(9) signal (128+9)
16578 the exit status of @var{command} otherwise
16582 @node Process control
16583 @chapter Process control
16585 @cindex processes, commands for controlling
16586 @cindex commands for controlling processes
16589 * kill invocation:: Sending a signal to processes.
16593 @node kill invocation
16594 @section @command{kill}: Send a signal to processes
16597 @cindex send a signal to processes
16599 The @command{kill} command sends a signal to processes, causing them
16600 to terminate or otherwise act upon receiving the signal in some way.
16601 Alternatively, it lists information about signals. Synopses:
16604 kill [-s @var{signal} | --signal @var{signal} | -@var{signal}] @var{pid}@dots{}
16605 kill [-l | --list | -t | --table] [@var{signal}]@dots{}
16608 @mayConflictWithShellBuiltIn{kill}
16610 The first form of the @command{kill} command sends a signal to all
16611 @var{pid} arguments. The default signal to send if none is specified
16612 is @samp{TERM}@. The special signal number @samp{0} does not denote a
16613 valid signal, but can be used to test whether the @var{pid} arguments
16614 specify processes to which a signal could be sent.
16616 If @var{pid} is positive, the signal is sent to the process with the
16617 process ID @var{pid}. If @var{pid} is zero, the signal is sent to all
16618 processes in the process group of the current process. If @var{pid}
16619 is @minus{}1, the signal is sent to all processes for which the user has
16620 permission to send a signal. If @var{pid} is less than @minus{}1, the signal
16621 is sent to all processes in the process group that equals the absolute
16622 value of @var{pid}.
16624 If @var{pid} is not positive, a system-dependent set of system
16625 processes is excluded from the list of processes to which the signal
16628 If a negative @var{pid} argument is desired as the first one, it
16629 should be preceded by @option{--}. However, as a common extension to
16630 POSIX, @option{--} is not required with @samp{kill
16631 -@var{signal} -@var{pid}}. The following commands are equivalent:
16640 The first form of the @command{kill} command succeeds if every @var{pid}
16641 argument specifies at least one process that the signal was sent to.
16643 The second form of the @command{kill} command lists signal information.
16644 Either the @option{-l} or @option{--list} option, or the @option{-t}
16645 or @option{--table} option must be specified. Without any
16646 @var{signal} argument, all supported signals are listed. The output
16647 of @option{-l} or @option{--list} is a list of the signal names, one
16648 per line; if @var{signal} is already a name, the signal number is
16649 printed instead. The output of @option{-t} or @option{--table} is a
16650 table of signal numbers, names, and descriptions. This form of the
16651 @command{kill} command succeeds if all @var{signal} arguments are valid
16652 and if there is no output error.
16654 The @command{kill} command also supports the @option{--help} and
16655 @option{--version} options. @xref{Common options}.
16657 A @var{signal} may be a signal name like @samp{HUP}, or a signal
16658 number like @samp{1}, or an exit status of a process terminated by the
16659 signal. A signal name can be given in canonical form or prefixed by
16660 @samp{SIG}@. The case of the letters is ignored, except for the
16661 @option{-@var{signal}} option which must use upper case to avoid
16662 ambiguity with lower case option letters.
16663 @xref{Signal specifications}, for a list of supported
16664 signal names and numbers.
16669 @cindex delaying commands
16670 @cindex commands for delaying
16672 @c Perhaps @command{wait} or other commands should be described here also?
16675 * sleep invocation:: Delay for a specified time.
16679 @node sleep invocation
16680 @section @command{sleep}: Delay for a specified time
16683 @cindex delay for a specified time
16685 @command{sleep} pauses for an amount of time specified by the sum of
16686 the values of the command line arguments.
16690 sleep @var{number}[smhd]@dots{}
16694 Each argument is a number followed by an optional unit; the default
16695 is seconds. The units are:
16708 Historical implementations of @command{sleep} have required that
16709 @var{number} be an integer, and only accepted a single argument
16710 without a suffix. However, GNU @command{sleep} accepts
16711 arbitrary floating point numbers. @xref{Floating point}.
16713 The only options are @option{--help} and @option{--version}. @xref{Common
16716 @c sleep is a shell built-in at least with Solaris 11's /bin/sh
16717 @mayConflictWithShellBuiltIn{sleep}
16722 @node Numeric operations
16723 @chapter Numeric operations
16725 @cindex numeric operations
16726 These programs do numerically-related operations.
16729 * factor invocation:: Show factors of numbers.
16730 * seq invocation:: Print sequences of numbers.
16734 @node factor invocation
16735 @section @command{factor}: Print prime factors
16738 @cindex prime factors
16740 @command{factor} prints prime factors. Synopses:
16743 factor [@var{number}]@dots{}
16744 factor @var{option}
16747 If no @var{number} is specified on the command line, @command{factor} reads
16748 numbers from standard input, delimited by newlines, tabs, or spaces.
16750 The @command{factor} command supports only a small number of options:
16754 Print a short help on standard output, then exit without further
16758 Print the program version on standard output, then exit without further
16762 Factoring the product of the eighth and ninth Mersenne primes
16763 takes about 30 milliseconds of CPU time on a 2.2 GHz Athlon.
16766 M8=$(echo 2^31-1|bc)
16767 M9=$(echo 2^61-1|bc)
16768 n=$(echo "$M8 * $M9" | bc)
16769 /usr/bin/time -f %U factor $n
16770 4951760154835678088235319297: 2147483647 2305843009213693951
16774 Similarly, factoring the eighth Fermat number @math{2^{256}+1} takes
16775 about 20 seconds on the same machine.
16777 Factoring large numbers is, in general, hard. The Pollard Rho
16778 algorithm used by @command{factor} is particularly effective for
16779 numbers with relatively small factors. If you wish to factor large
16780 numbers which do not have small factors (for example, numbers which
16781 are the product of two large primes), other methods are far better.
16783 If @command{factor} is built without using GNU MP, only
16784 single-precision arithmetic is available, and so large numbers
16785 (typically @math{2^{64}} and above) will not be supported. The single-precision
16786 code uses an algorithm which is designed for factoring smaller
16792 @node seq invocation
16793 @section @command{seq}: Print numeric sequences
16796 @cindex numeric sequences
16797 @cindex sequence of numbers
16799 @command{seq} prints a sequence of numbers to standard output. Synopses:
16802 seq [@var{option}]@dots{} @var{last}
16803 seq [@var{option}]@dots{} @var{first} @var{last}
16804 seq [@var{option}]@dots{} @var{first} @var{increment} @var{last}
16807 @command{seq} prints the numbers from @var{first} to @var{last} by
16808 @var{increment}. By default, each number is printed on a separate line.
16809 When @var{increment} is not specified, it defaults to @samp{1},
16810 even when @var{first} is larger than @var{last}.
16811 @var{first} also defaults to @samp{1}. So @code{seq 1} prints
16812 @samp{1}, but @code{seq 0} and @code{seq 10 5} produce no output.
16813 The sequence of numbers ends when the sum of the current number and
16814 @var{increment} would become greater than @var{last},
16815 so @code{seq 1 10 10} only produces @samp{1}.
16816 Floating-point numbers may be specified. @xref{Floating point}.
16818 The program accepts the following options. Also see @ref{Common options}.
16819 Options must precede operands.
16822 @item -f @var{format}
16823 @itemx --format=@var{format}
16824 @opindex -f @var{format}
16825 @opindex --format=@var{format}
16826 @cindex formatting of numbers in @command{seq}
16827 Print all numbers using @var{format}.
16828 @var{format} must contain exactly one of the @samp{printf}-style
16829 floating point conversion specifications @samp{%a}, @samp{%e},
16830 @samp{%f}, @samp{%g}, @samp{%A}, @samp{%E}, @samp{%F}, @samp{%G}@.
16831 The @samp{%} may be followed by zero or more flags taken from the set
16832 @samp{-+#0 '}, then an optional width containing one or more digits,
16833 then an optional precision consisting of a @samp{.} followed by zero
16834 or more digits. @var{format} may also contain any number of @samp{%%}
16835 conversion specifications. All conversion specifications have the
16836 same meaning as with @samp{printf}.
16838 The default format is derived from @var{first}, @var{step}, and
16839 @var{last}. If these all use a fixed point decimal representation,
16840 the default format is @samp{%.@var{p}f}, where @var{p} is the minimum
16841 precision that can represent the output numbers exactly. Otherwise,
16842 the default format is @samp{%g}.
16844 @item -s @var{string}
16845 @itemx --separator=@var{string}
16846 @cindex separator for numbers in @command{seq}
16847 Separate numbers with @var{string}; default is a newline.
16848 The output always terminates with a newline.
16851 @itemx --equal-width
16852 Print all numbers with the same width, by padding with leading zeros.
16853 @var{first}, @var{step}, and @var{last} should all use a fixed point
16854 decimal representation.
16855 (To have other kinds of padding, use @option{--format}).
16859 You can get finer-grained control over output with @option{-f}:
16862 $ seq -f '(%9.2E)' -9e5 1.1e6 1.3e6
16868 If you want hexadecimal integer output, you can use @command{printf}
16869 to perform the conversion:
16872 $ printf '%x\n' $(seq 1048575 1024 1050623)
16878 For very long lists of numbers, use xargs to avoid
16879 system limitations on the length of an argument list:
16882 $ seq 1000000 | xargs printf '%x\n' | tail -n 3
16888 To generate octal output, use the printf @code{%o} format instead
16891 On most systems, seq can produce whole-number output for values up to
16892 at least @math{2^{53}}. Larger integers are approximated. The details
16893 differ depending on your floating-point implementation.
16894 @xref{Floating point}. A common
16895 case is that @command{seq} works with integers through @math{2^{64}},
16896 and larger integers may not be numerically correct:
16899 $ seq 50000000000000000000 2 50000000000000000004
16900 50000000000000000000
16901 50000000000000000000
16902 50000000000000000004
16905 However, note that when limited to non-negative whole numbers,
16906 an increment of 1 and no format-specifying option, seq can print
16907 arbitrarily large numbers.
16909 Be careful when using @command{seq} with outlandish values: otherwise
16910 you may see surprising results, as @command{seq} uses floating point
16911 internally. For example, on the x86 platform, where the internal
16912 representation uses a 64-bit fraction, the command:
16915 seq 1 0.0000000000000000001 1.0000000000000000009
16918 outputs 1.0000000000000000007 twice and skips 1.0000000000000000008.
16923 @node File permissions
16924 @chapter File permissions
16927 @include parse-datetime.texi
16931 @node Opening the software toolbox
16932 @chapter Opening the Software Toolbox
16934 An earlier version of this chapter appeared in
16935 @uref{http://www.linuxjournal.com/article.php?sid=2762, the
16936 @cite{What's GNU@?} column of the June 1994 @cite{Linux Journal}}.
16937 It was written by Arnold Robbins.
16940 * Toolbox introduction:: Toolbox introduction
16941 * I/O redirection:: I/O redirection
16942 * The who command:: The @command{who} command
16943 * The cut command:: The @command{cut} command
16944 * The sort command:: The @command{sort} command
16945 * The uniq command:: The @command{uniq} command
16946 * Putting the tools together:: Putting the tools together
16950 @node Toolbox introduction
16951 @unnumberedsec Toolbox Introduction
16953 This month's column is only peripherally related to the GNU Project, in
16954 that it describes a number of the GNU tools on your GNU/Linux system
16956 might be used. What it's really about is the ``Software Tools'' philosophy
16957 of program development and usage.
16959 The software tools philosophy was an important and integral concept
16960 in the initial design and development of Unix (of which Linux and GNU are
16961 essentially clones). Unfortunately, in the modern day press of
16962 Internetworking and flashy GUIs, it seems to have fallen by the
16963 wayside. This is a shame, since it provides a powerful mental model
16964 for solving many kinds of problems.
16966 Many people carry a Swiss Army knife around in their pants pockets (or
16967 purse). A Swiss Army knife is a handy tool to have: it has several knife
16968 blades, a screwdriver, tweezers, toothpick, nail file, corkscrew, and perhaps
16969 a number of other things on it. For the everyday, small miscellaneous jobs
16970 where you need a simple, general purpose tool, it's just the thing.
16972 On the other hand, an experienced carpenter doesn't build a house using
16973 a Swiss Army knife. Instead, he has a toolbox chock full of specialized
16974 tools---a saw, a hammer, a screwdriver, a plane, and so on. And he knows
16975 exactly when and where to use each tool; you won't catch him hammering nails
16976 with the handle of his screwdriver.
16978 The Unix developers at Bell Labs were all professional programmers and trained
16979 computer scientists. They had found that while a one-size-fits-all program
16980 might appeal to a user because there's only one program to use, in practice
16985 difficult to write,
16988 difficult to maintain and
16992 difficult to extend to meet new situations.
16995 Instead, they felt that programs should be specialized tools. In short, each
16996 program ``should do one thing well.'' No more and no less. Such programs are
16997 simpler to design, write, and get right---they only do one thing.
16999 Furthermore, they found that with the right machinery for hooking programs
17000 together, that the whole was greater than the sum of the parts. By combining
17001 several special purpose programs, you could accomplish a specific task
17002 that none of the programs was designed for, and accomplish it much more
17003 quickly and easily than if you had to write a special purpose program.
17004 We will see some (classic) examples of this further on in the column.
17005 (An important additional point was that, if necessary, take a detour
17006 and build any software tools you may need first, if you don't already
17007 have something appropriate in the toolbox.)
17009 @node I/O redirection
17010 @unnumberedsec I/O Redirection
17012 Hopefully, you are familiar with the basics of I/O redirection in the
17013 shell, in particular the concepts of ``standard input,'' ``standard output,''
17014 and ``standard error''. Briefly, ``standard input'' is a data source, where
17015 data comes from. A program should not need to either know or care if the
17016 data source is a disk file, a keyboard, a magnetic tape, or even a punched
17017 card reader. Similarly, ``standard output'' is a data sink, where data goes
17018 to. The program should neither know nor care where this might be.
17019 Programs that only read their standard input, do something to the data,
17020 and then send it on, are called @dfn{filters}, by analogy to filters in a
17023 With the Unix shell, it's very easy to set up data pipelines:
17026 program_to_create_data | filter1 | ... | filterN > final.pretty.data
17029 We start out by creating the raw data; each filter applies some successive
17030 transformation to the data, until by the time it comes out of the pipeline,
17031 it is in the desired form.
17033 This is fine and good for standard input and standard output. Where does the
17034 standard error come in to play? Well, think about @command{filter1} in
17035 the pipeline above. What happens if it encounters an error in the data it
17036 sees? If it writes an error message to standard output, it will just
17037 disappear down the pipeline into @command{filter2}'s input, and the
17038 user will probably never see it. So programs need a place where they can send
17039 error messages so that the user will notice them. This is standard error,
17040 and it is usually connected to your console or window, even if you have
17041 redirected standard output of your program away from your screen.
17043 For filter programs to work together, the format of the data has to be
17044 agreed upon. The most straightforward and easiest format to use is simply
17045 lines of text. Unix data files are generally just streams of bytes, with
17046 lines delimited by the ASCII LF (Line Feed) character,
17047 conventionally called a ``newline'' in the Unix literature. (This is
17048 @code{'\n'} if you're a C programmer.) This is the format used by all
17049 the traditional filtering programs. (Many earlier operating systems
17050 had elaborate facilities and special purpose programs for managing
17051 binary data. Unix has always shied away from such things, under the
17052 philosophy that it's easiest to simply be able to view and edit your
17053 data with a text editor.)
17055 OK, enough introduction. Let's take a look at some of the tools, and then
17056 we'll see how to hook them together in interesting ways. In the following
17057 discussion, we will only present those command line options that interest
17058 us. As you should always do, double check your system documentation
17059 for the full story.
17061 @node The who command
17062 @unnumberedsec The @command{who} Command
17064 The first program is the @command{who} command. By itself, it generates a
17065 list of the users who are currently logged in. Although I'm writing
17066 this on a single-user system, we'll pretend that several people are
17071 @print{} arnold console Jan 22 19:57
17072 @print{} miriam ttyp0 Jan 23 14:19(:0.0)
17073 @print{} bill ttyp1 Jan 21 09:32(:0.0)
17074 @print{} arnold ttyp2 Jan 23 20:48(:0.0)
17077 Here, the @samp{$} is the usual shell prompt, at which I typed @samp{who}.
17078 There are three people logged in, and I am logged in twice. On traditional
17079 Unix systems, user names are never more than eight characters long. This
17080 little bit of trivia will be useful later. The output of @command{who} is nice,
17081 but the data is not all that exciting.
17083 @node The cut command
17084 @unnumberedsec The @command{cut} Command
17086 The next program we'll look at is the @command{cut} command. This program
17087 cuts out columns or fields of input data. For example, we can tell it
17088 to print just the login name and full name from the @file{/etc/passwd}
17089 file. The @file{/etc/passwd} file has seven fields, separated by
17093 arnold:xyzzy:2076:10:Arnold D. Robbins:/home/arnold:/bin/bash
17096 To get the first and fifth fields, we would use @command{cut} like this:
17099 $ cut -d: -f1,5 /etc/passwd
17100 @print{} root:Operator
17102 @print{} arnold:Arnold D. Robbins
17103 @print{} miriam:Miriam A. Robbins
17107 With the @option{-c} option, @command{cut} will cut out specific characters
17108 (i.e., columns) in the input lines. This is useful for input data
17109 that has fixed width fields, and does not have a field separator. For
17110 example, list the Monday dates for the current month:
17112 @c Is using cal ok? Looked at gcal, but I don't like it.
17123 @node The sort command
17124 @unnumberedsec The @command{sort} Command
17126 Next we'll look at the @command{sort} command. This is one of the most
17127 powerful commands on a Unix-style system; one that you will often find
17128 yourself using when setting up fancy data plumbing.
17131 command reads and sorts each file named on the command line. It then
17132 merges the sorted data and writes it to standard output. It will read
17133 standard input if no files are given on the command line (thus
17134 making it into a filter). The sort is based on the character collating
17135 sequence or based on user-supplied ordering criteria.
17138 @node The uniq command
17139 @unnumberedsec The @command{uniq} Command
17141 Finally (at least for now), we'll look at the @command{uniq} program. When
17142 sorting data, you will often end up with duplicate lines, lines that
17143 are identical. Usually, all you need is one instance of each line.
17144 This is where @command{uniq} comes in. The @command{uniq} program reads its
17145 standard input. It prints only one
17146 copy of each repeated line. It does have several options. Later on,
17147 we'll use the @option{-c} option, which prints each unique line, preceded
17148 by a count of the number of times that line occurred in the input.
17151 @node Putting the tools together
17152 @unnumberedsec Putting the Tools Together
17154 Now, let's suppose this is a large ISP server system with dozens of users
17155 logged in. The management wants the system administrator to write a
17157 generate a sorted list of logged in users. Furthermore, even if a user
17158 is logged in multiple times, his or her name should only show up in the
17161 The administrator could sit down with the system documentation and write a C
17162 program that did this. It would take perhaps a couple of hundred lines
17163 of code and about two hours to write it, test it, and debug it.
17164 However, knowing the software toolbox, the administrator can instead start out
17165 by generating just a list of logged on users:
17175 Next, sort the list:
17178 $ who | cut -c1-8 | sort
17185 Finally, run the sorted list through @command{uniq}, to weed out duplicates:
17188 $ who | cut -c1-8 | sort | uniq
17194 The @command{sort} command actually has a @option{-u} option that does what
17195 @command{uniq} does. However, @command{uniq} has other uses for which one
17196 cannot substitute @samp{sort -u}.
17198 The administrator puts this pipeline into a shell script, and makes it
17200 all the users on the system (@samp{#} is the system administrator,
17201 or @code{root}, prompt):
17204 # cat > /usr/local/bin/listusers
17205 who | cut -c1-8 | sort | uniq
17207 # chmod +x /usr/local/bin/listusers
17210 There are four major points to note here. First, with just four
17211 programs, on one command line, the administrator was able to save about two
17212 hours worth of work. Furthermore, the shell pipeline is just about as
17213 efficient as the C program would be, and it is much more efficient in
17214 terms of programmer time. People time is much more expensive than
17215 computer time, and in our modern ``there's never enough time to do
17216 everything'' society, saving two hours of programmer time is no mean
17219 Second, it is also important to emphasize that with the
17220 @emph{combination} of the tools, it is possible to do a special
17221 purpose job never imagined by the authors of the individual programs.
17223 Third, it is also valuable to build up your pipeline in stages, as we did here.
17224 This allows you to view the data at each stage in the pipeline, which helps
17225 you acquire the confidence that you are indeed using these tools correctly.
17227 Finally, by bundling the pipeline in a shell script, other users can use
17228 your command, without having to remember the fancy plumbing you set up for
17229 them. In terms of how you run them, shell scripts and compiled programs are
17232 After the previous warm-up exercise, we'll look at two additional, more
17233 complicated pipelines. For them, we need to introduce two more tools.
17235 The first is the @command{tr} command, which stands for ``transliterate.''
17236 The @command{tr} command works on a character-by-character basis, changing
17237 characters. Normally it is used for things like mapping upper case to
17241 $ echo ThIs ExAmPlE HaS MIXED case! | tr '[:upper:]' '[:lower:]'
17242 @print{} this example has mixed case!
17245 There are several options of interest:
17249 work on the complement of the listed characters, i.e.,
17250 operations apply to characters not in the given set
17253 delete characters in the first set from the output
17256 squeeze repeated characters in the output into just one character.
17259 We will be using all three options in a moment.
17261 The other command we'll look at is @command{comm}. The @command{comm}
17262 command takes two sorted input files as input data, and prints out the
17263 files' lines in three columns. The output columns are the data lines
17264 unique to the first file, the data lines unique to the second file, and
17265 the data lines that are common to both. The @option{-1}, @option{-2}, and
17266 @option{-3} command line options @emph{omit} the respective columns. (This is
17267 non-intuitive and takes a little getting used to.) For example:
17289 The file name @file{-} tells @command{comm} to read standard input
17290 instead of a regular file.
17292 Now we're ready to build a fancy pipeline. The first application is a word
17293 frequency counter. This helps an author determine if he or she is over-using
17296 The first step is to change the case of all the letters in our input file
17297 to one case. ``The'' and ``the'' are the same word when doing counting.
17300 $ tr '[:upper:]' '[:lower:]' < whats.gnu | ...
17303 The next step is to get rid of punctuation. Quoted words and unquoted words
17304 should be treated identically; it's easiest to just get the punctuation out of
17308 $ tr '[:upper:]' '[:lower:]' < whats.gnu | tr -cd '[:alnum:]_ \n' | ...
17311 The second @command{tr} command operates on the complement of the listed
17312 characters, which are all the letters, the digits, the underscore, and
17313 the blank. The @samp{\n} represents the newline character; it has to
17314 be left alone. (The ASCII tab character should also be included for
17315 good measure in a production script.)
17317 At this point, we have data consisting of words separated by blank space.
17318 The words only contain alphanumeric characters (and the underscore). The
17319 next step is break the data apart so that we have one word per line. This
17320 makes the counting operation much easier, as we will see shortly.
17323 $ tr '[:upper:]' '[:lower:]' < whats.gnu | tr -cd '[:alnum:]_ \n' |
17324 > tr -s ' ' '\n' | ...
17327 This command turns blanks into newlines. The @option{-s} option squeezes
17328 multiple newline characters in the output into just one. This helps us
17329 avoid blank lines. (The @samp{>} is the shell's ``secondary prompt.''
17330 This is what the shell prints when it notices you haven't finished
17331 typing in all of a command.)
17333 We now have data consisting of one word per line, no punctuation, all one
17334 case. We're ready to count each word:
17337 $ tr '[:upper:]' '[:lower:]' < whats.gnu | tr -cd '[:alnum:]_ \n' |
17338 > tr -s ' ' '\n' | sort | uniq -c | ...
17341 At this point, the data might look something like this:
17354 The output is sorted by word, not by count! What we want is the most
17355 frequently used words first. Fortunately, this is easy to accomplish,
17356 with the help of two more @command{sort} options:
17360 do a numeric sort, not a textual one
17363 reverse the order of the sort
17366 The final pipeline looks like this:
17369 $ tr '[:upper:]' '[:lower:]' < whats.gnu | tr -cd '[:alnum:]_ \n' |
17370 > tr -s ' ' '\n' | sort | uniq -c | sort -n -r
17379 Whew! That's a lot to digest. Yet, the same principles apply. With six
17380 commands, on two lines (really one long one split for convenience), we've
17381 created a program that does something interesting and useful, in much
17382 less time than we could have written a C program to do the same thing.
17384 A minor modification to the above pipeline can give us a simple spelling
17385 checker! To determine if you've spelled a word correctly, all you have to
17386 do is look it up in a dictionary. If it is not there, then chances are
17387 that your spelling is incorrect. So, we need a dictionary.
17388 The conventional location for a dictionary is @file{/usr/dict/words}.
17389 On my GNU/Linux system,@footnote{Redhat Linux 6.1, for the November 2000
17390 revision of this article.}
17391 this is a sorted, 45,402 word dictionary.
17393 Now, how to compare our file with the dictionary? As before, we generate
17394 a sorted list of words, one per line:
17397 $ tr '[:upper:]' '[:lower:]' < whats.gnu | tr -cd '[:alnum:]_ \n' |
17398 > tr -s ' ' '\n' | sort -u | ...
17401 Now, all we need is a list of words that are @emph{not} in the
17402 dictionary. Here is where the @command{comm} command comes in.
17405 $ tr '[:upper:]' '[:lower:]' < whats.gnu | tr -cd '[:alnum:]_ \n' |
17406 > tr -s ' ' '\n' | sort -u |
17407 > comm -23 - /usr/dict/words
17410 The @option{-2} and @option{-3} options eliminate lines that are only in the
17411 dictionary (the second file), and lines that are in both files. Lines
17412 only in the first file (standard input, our stream of words), are
17413 words that are not in the dictionary. These are likely candidates for
17414 spelling errors. This pipeline was the first cut at a production
17415 spelling checker on Unix.
17417 There are some other tools that deserve brief mention.
17421 search files for text that matches a regular expression
17424 count lines, words, characters
17427 a T-fitting for data pipes, copies data to files and to standard output
17430 the stream editor, an advanced tool
17433 a data manipulation language, another advanced tool
17436 The software tools philosophy also espoused the following bit of
17437 advice: ``Let someone else do the hard part.'' This means, take
17438 something that gives you most of what you need, and then massage it the
17439 rest of the way until it's in the form that you want.
17445 Each program should do one thing well. No more, no less.
17448 Combining programs with appropriate plumbing leads to results where
17449 the whole is greater than the sum of the parts. It also leads to novel
17450 uses of programs that the authors might never have imagined.
17453 Programs should never print extraneous header or trailer data, since these
17454 could get sent on down a pipeline. (A point we didn't mention earlier.)
17457 Let someone else do the hard part.
17460 Know your toolbox! Use each program appropriately. If you don't have an
17461 appropriate tool, build one.
17464 As of this writing, all the programs discussed are available from
17465 @uref{http://ftp.gnu.org/old-gnu/textutils/textutils-1.22.tar.gz},
17466 with more recent versions available from
17467 @uref{http://ftp.gnu.org/gnu/coreutils}.
17469 None of what I have presented in this column is new. The Software Tools
17470 philosophy was first introduced in the book @cite{Software Tools}, by
17471 Brian Kernighan and P.J. Plauger (Addison-Wesley, ISBN 0-201-03669-X).
17472 This book showed how to write and use software tools. It was written in
17473 1976, using a preprocessor for FORTRAN named @command{ratfor} (RATional
17474 FORtran). At the time, C was not as ubiquitous as it is now; FORTRAN
17475 was. The last chapter presented a @command{ratfor} to FORTRAN
17476 processor, written in @command{ratfor}. @command{ratfor} looks an awful
17477 lot like C; if you know C, you won't have any problem following the
17480 In 1981, the book was updated and made available as @cite{Software Tools
17481 in Pascal} (Addison-Wesley, ISBN 0-201-10342-7). Both books are
17482 still in print and are well worth
17483 reading if you're a programmer. They certainly made a major change in
17484 how I view programming.
17486 The programs in both books are available from
17487 @uref{http://cm.bell-labs.com/who/bwk, Brian Kernighan's home page}.
17488 For a number of years, there was an active
17489 Software Tools Users Group, whose members had ported the original
17490 @command{ratfor} programs to essentially every computer system with a
17491 FORTRAN compiler. The popularity of the group waned in the middle 1980s
17492 as Unix began to spread beyond universities.
17494 With the current proliferation of GNU code and other clones of Unix programs,
17495 these programs now receive little attention; modern C versions are
17496 much more efficient and do more than these programs do. Nevertheless, as
17497 exposition of good programming style, and evangelism for a still-valuable
17498 philosophy, these books are unparalleled, and I recommend them highly.
17500 Acknowledgment: I would like to express my gratitude to Brian Kernighan
17501 of Bell Labs, the original Software Toolsmith, for reviewing this column.
17503 @node GNU Free Documentation License
17504 @appendix GNU Free Documentation License
17508 @node Concept index
17515 @c Local variables:
17516 @c texinfo-column-for-description: 32