src/parallel_alternatives.pod

   1 #!/usr/bin/perl -w
   2
   3 # SPDX-FileCopyrightText: 2021-2024 Ole Tange, http://ole.tange.dk and Free Software and Foundation, Inc.
   4 # SPDX-License-Identifier: GFDL-1.3-or-later
   5 # SPDX-License-Identifier: CC-BY-SA-4.0
   6
   7 =encoding utf8
   8
   9 =head1 NAME
  10
  11 parallel_alternatives - Alternatives to GNU B<parallel>
  12
  13
  14 =head1 DIFFERENCES BETWEEN GNU Parallel AND ALTERNATIVES
  15
  16 There are a lot programs that share functionality with GNU
  17 B<parallel>. Some of these are specialized tools, and while GNU
  18 B<parallel> can emulate many of them, a specialized tool can be better
  19 at a given task. GNU B<parallel> strives to include the best of the
  20 general functionality without sacrificing ease of use.
  21
  22 B<parallel> has existed since 2002-01-06 and as GNU B<parallel> since
  23 2010. A lot of the alternatives have not had the vitality to survive
  24 that long, but have come and gone during that time.
  25
  26 GNU B<parallel> is actively maintained with a new release every month
  27 since 2010. Most other alternatives are fleeting interests of the
  28 developers with irregular releases and only maintained for a few
  29 years.
  30
  31
  32 =head2 SUMMARY LEGEND
  33
  34 The following features are in some of the comparable tools:
  35
  36 =head3 Inputs
  37
  38 =over
  39
  40 =item I1. Arguments can be read from stdin
  41
  42 =item I2. Arguments can be read from a file
  43
  44 =item I3. Arguments can be read from multiple files
  45
  46 =item I4. Arguments can be read from command line
  47
  48 =item I5. Arguments can be read from a table
  49
  50 =item I6. Arguments can be read from the same file using #! (shebang)
  51
  52 =item I7. Line oriented input as default (Quoting of special chars not needed)
  53
  54 =back
  55
  56
  57 =head3 Manipulation of input
  58
  59 =over
  60
  61 =item M1. Composed command
  62
  63 =item M2. Multiple arguments can fill up an execution line
  64
  65 =item M3. Arguments can be put anywhere in the execution line
  66
  67 =item M4. Multiple arguments can be put anywhere in the execution line
  68
  69 =item M5. Arguments can be replaced with context
  70
  71 =item M6. Input can be treated as the complete command line
  72
  73 =back
  74
  75
  76 =head3 Outputs
  77
  78 =over
  79
  80 =item O1. Grouping output so output from different jobs do not mix
  81
  82 =item O2. Send stderr (standard error) to stderr (standard error)
  83
  84 =item O3. Send stdout (standard output) to stdout (standard output)
  85
  86 =item O4. Order of output can be same as order of input
  87
  88 =item O5. Stdout only contains stdout (standard output) from the command
  89
  90 =item O6. Stderr only contains stderr (standard error) from the command
  91
  92 =item O7. Buffering on disk
  93
  94 =item O8. No temporary files left if killed
  95
  96 =item O9. Test if disk runs full during run
  97
  98 =item O10. Output of a line bigger than 4 GB
  99
 100 =back
 101
 102
 103 =head3 Execution
 104
 105 =over
 106
 107 =item E1. Run jobs in parallel
 108
 109 =item E2. List running jobs
 110
 111 =item E3. Finish running jobs, but do not start new jobs
 112
 113 =item E4. Number of running jobs can depend on number of cpus
 114
 115 =item E5. Finish running jobs, but do not start new jobs after first failure
 116
 117 =item E6. Number of running jobs can be adjusted while running
 118
 119 =item E7. Only spawn new jobs if load is less than a limit
 120
 121 =back
 122
 123
 124 =head3 Remote execution
 125
 126 =over
 127
 128 =item R1. Jobs can be run on remote computers
 129
 130 =item R2. Basefiles can be transferred
 131
 132 =item R3. Argument files can be transferred
 133
 134 =item R4. Result files can be transferred
 135
 136 =item R5. Cleanup of transferred files
 137
 138 =item R6. No config files needed
 139
 140 =item R7. Do not run more than SSHD's MaxStartups can handle
 141
 142 =item R8. Configurable SSH command
 143
 144 =item R9. Retry if connection breaks occasionally
 145
 146 =back
 147
 148
 149 =head3 Semaphore
 150
 151 =over
 152
 153 =item S1. Possibility to work as a mutex
 154
 155 =item S2. Possibility to work as a counting semaphore
 156
 157 =back
 158
 159
 160 =head3 Legend
 161
 162 =over
 163
 164 =item - = no
 165
 166 =item x = not applicable
 167
 168 =item ID = yes
 169
 170 =back
 171
 172 As every new version of the programs are not tested the table may be
 173 outdated. Please file a bug report if you find errors (See REPORTING
 174 BUGS).
 175
 176 parallel:
 177
 178 =over
 179
 180 =item I1 I2 I3 I4 I5 I6 I7
 181
 182 =item M1 M2 M3 M4 M5 M6
 183
 184 =item O1 O2 O3 O4 O5 O6 O7 O8 O9 O10
 185
 186 =item E1 E2 E3 E4 E5 E6 E7
 187
 188 =item R1 R2 R3 R4 R5 R6 R7 R8 R9
 189
 190 =item S1 S2
 191
 192 =back
 193
 194
 195 =head2 DIFFERENCES BETWEEN xargs AND GNU Parallel
 196
 197 Summary (see legend above):
 198
 199 =over
 200
 201 =item I1 I2 - - - - -
 202
 203 =item - M2 M3 - - -
 204
 205 =item - O2 O3 - O5 O6
 206
 207 =item E1 - - - - - -
 208
 209 =item - - - - - x - - -
 210
 211 =item - -
 212
 213 =back
 214
 215 B<xargs> offers some of the same possibilities as GNU B<parallel>.
 216
 217 B<xargs> deals badly with special characters (such as space, \, ' and
 218 "). To see the problem try this:
 219
 220   touch important_file
 221   touch 'not important_file'
 222   ls not* | xargs rm
 223   mkdir -p "My brother's 12\" records"
 224   ls | xargs rmdir
 225   touch 'c:\windows\system32\clfs.sys'
 226   echo 'c:\windows\system32\clfs.sys' | xargs ls -l
 227
 228 You can specify B<-0>, but many input generators are not optimized for
 229 using B<NUL> as separator but are optimized for B<newline> as
 230 separator. E.g. B<awk>, B<ls>, B<echo>, B<tar -v>, B<head> (requires
 231 using B<-z>), B<tail> (requires using B<-z>), B<sed> (requires using
 232 B<-z>), B<perl> (B<-0> and \0 instead of \n), B<locate> (requires
 233 using B<-0>), B<find> (requires using B<-print0>), B<grep> (requires
 234 using B<-z> or B<-Z>), B<sort> (requires using B<-z>).
 235
 236 GNU B<parallel>'s newline separation can be emulated with:
 237
 238   cat | xargs -d "\n" -n1 command
 239
 240 B<xargs> can run a given number of jobs in parallel, but has no
 241 support for running number-of-cpu-cores jobs in parallel.
 242
 243 B<xargs> has no support for grouping the output, therefore output may
 244 run together, e.g. the first half of a line is from one process and
 245 the last half of the line is from another process. The example
 246 B<Parallel grep> cannot be done reliably with B<xargs> because of
 247 this. To see this in action try:
 248
 249   parallel perl -e "'"'$a="1"."{}"x10000000;print $a,"\n"'"'" \
 250     '>' {} ::: a b c d e f g h
 251   # Serial = no mixing = the wanted result
 252   # 'tr -s a-z' squeezes repeating letters into a single letter
 253   echo a b c d e f g h | xargs -P1 -n1 grep 1 | tr -s a-z
 254   # Compare to 8 jobs in parallel
 255   parallel -kP8 -n1 grep 1 ::: a b c d e f g h | tr -s a-z
 256   echo a b c d e f g h | xargs -P8 -n1 grep 1 | tr -s a-z
 257   echo a b c d e f g h | xargs -P8 -n1 grep --line-buffered 1 | \
 258     tr -s a-z
 259
 260 Or try this:
 261
 262   slow_seq() {
 263     echo Count to "$@"
 264     seq "$@" |
 265       perl -ne '$|=1; for(split//){ print; select($a,$a,$a,0.100);}'
 266   }
 267   export -f slow_seq
 268   # Serial = no mixing = the wanted result
 269   seq 8 | xargs -n1 -P1 -I {} bash -c 'slow_seq {}'
 270   # Compare to 8 jobs in parallel
 271   seq 8 | parallel -P8 slow_seq {}
 272   seq 8 | xargs -n1 -P8 -I {} bash -c 'slow_seq {}'
 273
 274 B<xargs> has no support for keeping the order of the output, therefore
 275 if running jobs in parallel using B<xargs> the output of the second
 276 job cannot be postponed till the first job is done.
 277
 278 B<xargs> has no support for running jobs on remote computers.
 279
 280 B<xargs> has no support for context replace, so you will have to create the
 281 arguments.
 282
 283 If you use a replace string in B<xargs> (B<-I>) you can not force
 284 B<xargs> to use more than one argument.
 285
 286 Quoting in B<xargs> works like B<-q> in GNU B<parallel>. This means
 287 composed commands and redirection require using B<bash -c>.
 288
 289   ls | parallel "wc {} >{}.wc"
 290   ls | parallel "echo {}; ls {}|wc"
 291
 292 becomes (assuming you have 8 cores and that none of the filenames
 293 contain space, " or ').
 294
 295   ls | xargs -d "\n" -P8 -I {} bash -c "wc {} >{}.wc"
 296   ls | xargs -d "\n" -P8 -I {} bash -c "echo {}; ls {}|wc"
 297
 298 A more extreme example can be found on:
 299 https://unix.stackexchange.com/q/405552/
 300
 301 https://www.gnu.org/software/findutils/
 302
 303
 304 =head2 DIFFERENCES BETWEEN find -exec AND GNU Parallel
 305
 306 Summary (see legend above):
 307
 308 =over
 309
 310 =item -  -  -  x  -  x  -
 311
 312 =item -  M2 M3 -  -  -  -
 313
 314 =item -  O2 O3 O4 O5 O6
 315
 316 =item -  -  -  -  -  -  -
 317
 318 =item -  -  -  -  -  -  -  -  -
 319
 320 =item x  x
 321
 322 =back
 323
 324 B<find -exec> offers some of the same possibilities as GNU B<parallel>.
 325
 326 B<find -exec> only works on files. Processing other input (such as
 327 hosts or URLs) will require creating these inputs as files. B<find
 328 -exec> has no support for running commands in parallel.
 329
 330 https://www.gnu.org/software/findutils/
 331 (Last checked: 2019-01)
 332
 333
 334 =head2 DIFFERENCES BETWEEN make -j AND GNU Parallel
 335
 336 Summary (see legend above):
 337
 338 =over
 339
 340 =item -  -  -  -  -  -  -
 341
 342 =item -  -  -  -  -  -
 343
 344 =item O1 O2 O3 -  x  O6
 345
 346 =item E1 -  -  -  E5 -
 347
 348 =item -  -  -  -  -  -  -  -  -
 349
 350 =item -  -
 351
 352 =back
 353
 354 B<make -j> can run jobs in parallel, but requires a crafted Makefile
 355 to do this. That results in extra quoting to get filenames containing
 356 newlines to work correctly.
 357
 358 B<make -j> computes a dependency graph before running jobs. Jobs run
 359 by GNU B<parallel> does not depend on each other.
 360
 361 (Very early versions of GNU B<parallel> were coincidentally implemented
 362 using B<make -j>).
 363
 364 https://www.gnu.org/software/make/
 365 (Last checked: 2019-01)
 366
 367
 368 =head2 DIFFERENCES BETWEEN ppss AND GNU Parallel
 369
 370 Summary (see legend above):
 371
 372 =over
 373
 374 =item I1 I2 - - - - I7
 375
 376 =item M1 - M3 - - M6
 377
 378 =item O1 - - x - -
 379
 380 =item E1 E2 ?E3 E4 - - -
 381
 382 =item R1 R2 R3 R4 - - ?R7 ? ?
 383
 384 =item - -
 385
 386 =back
 387
 388 B<ppss> is also a tool for running jobs in parallel.
 389
 390 The output of B<ppss> is status information and thus not useful for
 391 using as input for another command. The output from the jobs are put
 392 into files.
 393
 394 The argument replace string ($ITEM) cannot be changed. Arguments must
 395 be quoted - thus arguments containing special characters (space '"&!*)
 396 may cause problems. More than one argument is not supported. Filenames
 397 containing newlines are not processed correctly. When reading input
 398 from a file null cannot be used as a terminator. B<ppss> needs to read
 399 the whole input file before starting any jobs.
 400
 401 Output and status information is stored in ppss_dir and thus requires
 402 cleanup when completed. If the dir is not removed before running
 403 B<ppss> again it may cause nothing to happen as B<ppss> thinks the
 404 task is already done. GNU B<parallel> will normally not need cleaning
 405 up if running locally and will only need cleaning up if stopped
 406 abnormally and running remote (B<--cleanup> may not complete if
 407 stopped abnormally). The example B<Parallel grep> would require extra
 408 postprocessing if written using B<ppss>.
 409
 410 For remote systems PPSS requires 3 steps: config, deploy, and
 411 start. GNU B<parallel> only requires one step.
 412
 413 =head3 EXAMPLES FROM ppss MANUAL
 414
 415 Here are the examples from B<ppss>'s manual page with the equivalent
 416 using GNU B<parallel>:
 417
 418   1$ ./ppss.sh standalone -d /path/to/files -c 'gzip '
 419
 420   1$ find /path/to/files -type f | parallel gzip
 421
 422   2$ ./ppss.sh standalone -d /path/to/files \
 423        -c 'cp "$ITEM" /destination/dir '
 424
 425   2$ find /path/to/files -type f | parallel cp {} /destination/dir
 426
 427   3$ ./ppss.sh standalone -f list-of-urls.txt -c 'wget -q '
 428
 429   3$ parallel -a list-of-urls.txt wget -q
 430
 431   4$ ./ppss.sh standalone -f list-of-urls.txt -c 'wget -q "$ITEM"'
 432
 433   4$ parallel -a list-of-urls.txt wget -q {}
 434
 435   5$ ./ppss config -C config.cfg -c 'encode.sh ' -d /source/dir \
 436        -m 192.168.1.100 -u ppss -k ppss-key.key -S ./encode.sh \
 437        -n nodes.txt -o /some/output/dir --upload --download;
 438      ./ppss deploy -C config.cfg
 439      ./ppss start -C config
 440
 441   5$ # parallel does not use configs. If you want
 442      # a different username put it in nodes.txt: user@hostname
 443      find source/dir -type f |
 444        parallel --sshloginfile nodes.txt --trc {.}.mp3 \
 445          lame -a {} -o {.}.mp3 --preset standard --quiet
 446
 447   6$ ./ppss stop -C config.cfg
 448
 449   6$ killall -TERM parallel
 450
 451   7$ ./ppss pause -C config.cfg
 452
 453   7$ Press: CTRL-Z or killall -SIGTSTP parallel
 454
 455   8$ ./ppss continue -C config.cfg
 456
 457   8$ Enter: fg or killall -SIGCONT parallel
 458
 459   9$ ./ppss.sh status -C config.cfg
 460
 461   9$ killall -SIGUSR2 parallel
 462
 463 https://github.com/louwrentius/PPSS
 464 (Last checked: 2010-12)
 465
 466
 467 =head2 DIFFERENCES BETWEEN pexec AND GNU Parallel
 468
 469 Summary (see legend above):
 470
 471 =over
 472
 473 =item I1 I2 - I4 I5 - -
 474
 475 =item M1 - M3 - - M6
 476
 477 =item O1 O2 O3 - O5 O6
 478
 479 =item E1 - - E4 - E6 -
 480
 481 =item R1 - - - - R6 - - -
 482
 483 =item S1 -
 484
 485 =back
 486
 487 B<pexec> is also a tool for running jobs in parallel.
 488
 489 =head3 EXAMPLES FROM pexec MANUAL
 490
 491 Here are the examples from B<pexec>'s info page with the equivalent
 492 using GNU B<parallel>:
 493
 494   1$ pexec -o sqrt-%s.dat -p "$(seq 10)" -e NUM -n 4 -c -- \
 495        'echo "scale=10000;sqrt($NUM)" | bc'
 496
 497   1$ seq 10 | parallel -j4 'echo "scale=10000;sqrt({})" | \
 498        bc > sqrt-{}.dat'
 499
 500   2$ pexec -p "$(ls myfiles*.ext)" -i %s -o %s.sort -- sort
 501
 502   2$ ls myfiles*.ext | parallel sort {} ">{}.sort"
 503
 504   3$ pexec -f image.list -n auto -e B -u star.log -c -- \
 505        'fistar $B.fits -f 100 -F id,x,y,flux -o $B.star'
 506
 507   3$ parallel -a image.list \
 508        'fistar {}.fits -f 100 -F id,x,y,flux -o {}.star' 2>star.log
 509
 510   4$ pexec -r *.png -e IMG -c -o - -- \
 511        'convert $IMG ${IMG%.png}.jpeg ; "echo $IMG: done"'
 512
 513   4$ ls *.png | parallel 'convert {} {.}.jpeg; echo {}: done'
 514
 515   5$ pexec -r *.png -i %s -o %s.jpg -c 'pngtopnm | pnmtojpeg'
 516
 517   5$ ls *.png | parallel 'pngtopnm < {} | pnmtojpeg > {}.jpg'
 518
 519   6$ for p in *.png ; do echo ${p%.png} ; done | \
 520        pexec -f - -i %s.png -o %s.jpg -c 'pngtopnm | pnmtojpeg'
 521
 522   6$ ls *.png | parallel 'pngtopnm < {} | pnmtojpeg > {.}.jpg'
 523
 524   7$ LIST=$(for p in *.png ; do echo ${p%.png} ; done)
 525      pexec -r $LIST -i %s.png -o %s.jpg -c 'pngtopnm | pnmtojpeg'
 526
 527   7$ ls *.png | parallel 'pngtopnm < {} | pnmtojpeg > {.}.jpg'
 528
 529   8$ pexec -n 8 -r *.jpg -y unix -e IMG -c \
 530        'pexec -j -m blockread -d $IMG | \
 531         jpegtopnm | pnmscale 0.5 | pnmtojpeg | \
 532         pexec -j -m blockwrite -s th_$IMG'
 533
 534   8$ # Combining GNU B<parallel> and GNU B<sem>.
 535      ls *jpg | parallel -j8 'sem --id blockread cat {} | jpegtopnm |' \
 536        'pnmscale 0.5 | pnmtojpeg | sem --id blockwrite cat > th_{}'
 537
 538      # If reading and writing is done to the same disk, this may be
 539      # faster as only one process will be either reading or writing:
 540      ls *jpg | parallel -j8 'sem --id diskio cat {} | jpegtopnm |' \
 541        'pnmscale 0.5 | pnmtojpeg | sem --id diskio cat > th_{}'
 542
 543 https://www.gnu.org/software/pexec/
 544 (Last checked: 2010-12)
 545
 546
 547 =head2 DIFFERENCES BETWEEN xjobs AND GNU Parallel
 548
 549 B<xjobs> is also a tool for running jobs in parallel. It only supports
 550 running jobs on your local computer.
 551
 552 B<xjobs> deals badly with special characters just like B<xargs>. See
 553 the section B<DIFFERENCES BETWEEN xargs AND GNU Parallel>.
 554
 555 =head3 EXAMPLES FROM xjobs MANUAL
 556
 557 Here are the examples from B<xjobs>'s man page with the equivalent
 558 using GNU B<parallel>:
 559
 560   1$ ls -1 *.zip | xjobs unzip
 561
 562   1$ ls *.zip | parallel unzip
 563
 564   2$ ls -1 *.zip | xjobs -n unzip
 565
 566   2$ ls *.zip | parallel unzip >/dev/null
 567
 568   3$ find . -name '*.bak' | xjobs gzip
 569
 570   3$ find . -name '*.bak' | parallel gzip
 571
 572   4$ ls -1 *.jar | sed 's/\(.*\)/\1 > \1.idx/' | xjobs jar tf
 573
 574   4$ ls *.jar | parallel jar tf {} '>' {}.idx
 575
 576   5$ xjobs -s script
 577
 578   5$ cat script | parallel
 579
 580   6$ mkfifo /var/run/my_named_pipe;
 581      xjobs -s /var/run/my_named_pipe &
 582      echo unzip 1.zip >> /var/run/my_named_pipe;
 583      echo tar cf /backup/myhome.tar /home/me >> /var/run/my_named_pipe
 584
 585   6$ mkfifo /var/run/my_named_pipe;
 586      cat /var/run/my_named_pipe | parallel &
 587      echo unzip 1.zip >> /var/run/my_named_pipe;
 588      echo tar cf /backup/myhome.tar /home/me >> /var/run/my_named_pipe
 589
 590 https://www.maier-komor.de/xjobs.html
 591 (Last checked: 2019-01)
 592
 593
 594 =head2 DIFFERENCES BETWEEN prll AND GNU Parallel
 595
 596 B<prll> is also a tool for running jobs in parallel. It does not
 597 support running jobs on remote computers.
 598
 599 B<prll> encourages using BASH aliases and BASH functions instead of
 600 scripts. GNU B<parallel> supports scripts directly, functions if they
 601 are exported using B<export -f>, and aliases if using B<env_parallel>.
 602
 603 B<prll> generates a lot of status information on stderr (standard
 604 error) which makes it harder to use the stderr (standard error) output
 605 of the job directly as input for another program.
 606
 607 =head3 EXAMPLES FROM prll's MANUAL
 608
 609 Here is the example from B<prll>'s man page with the equivalent
 610 using GNU B<parallel>:
 611
 612   1$ prll -s 'mogrify -flip $1' *.jpg
 613
 614   1$ parallel mogrify -flip ::: *.jpg
 615
 616 https://github.com/exzombie/prll
 617 (Last checked: 2019-01)
 618
 619
 620 =head2 DIFFERENCES BETWEEN dxargs AND GNU Parallel
 621
 622 B<dxargs> is also a tool for running jobs in parallel.
 623
 624 B<dxargs> does not deal well with more simultaneous jobs than SSHD's
 625 MaxStartups. B<dxargs> is only built for remote run jobs, but does not
 626 support transferring of files.
 627
 628 https://web.archive.org/web/20120518070250/http://www.
 629 semicomplete.com/blog/geekery/distributed-xargs.html
 630 (Last checked: 2019-01)
 631
 632
 633 =head2 DIFFERENCES BETWEEN mdm/middleman AND GNU Parallel
 634
 635 middleman(mdm) is also a tool for running jobs in parallel.
 636
 637 =head3 EXAMPLES FROM middleman's WEBSITE
 638
 639 Here are the shellscripts of
 640 https://web.archive.org/web/20110728064735/http://mdm.
 641 berlios.de/usage.html ported to GNU B<parallel>:
 642
 643   1$ seq 19 | parallel buffon -o - | sort -n > result
 644      cat files | parallel cmd
 645      find dir -execdir sem cmd {} \;
 646
 647 https://github.com/cklin/mdm
 648 (Last checked: 2019-01)
 649
 650
 651 =head2 DIFFERENCES BETWEEN xapply AND GNU Parallel
 652
 653 B<xapply> can run jobs in parallel on the local computer.
 654
 655 =head3 EXAMPLES FROM xapply's MANUAL
 656
 657 Here are the examples from B<xapply>'s man page with the equivalent
 658 using GNU B<parallel>:
 659
 660   1$ xapply '(cd %1 && make all)' */
 661
 662   1$ parallel 'cd {} && make all' ::: */
 663
 664   2$ xapply -f 'diff %1 ../version5/%1' manifest | more
 665
 666   2$ parallel diff {} ../version5/{} < manifest | more
 667
 668   3$ xapply -p/dev/null -f 'diff %1 %2' manifest1 checklist1
 669
 670   3$ parallel --link diff {1} {2} :::: manifest1 checklist1
 671
 672   4$ xapply 'indent' *.c
 673
 674   4$ parallel indent ::: *.c
 675
 676   5$ find ~ksb/bin -type f ! -perm -111 -print | \
 677        xapply -f -v 'chmod a+x' -
 678
 679   5$ find ~ksb/bin -type f ! -perm -111 -print | \
 680        parallel -v chmod a+x
 681
 682   6$ find */ -... | fmt 960 1024 | xapply -f -i /dev/tty 'vi' -
 683
 684   6$ sh <(find */ -... | parallel -s 1024 echo vi)
 685
 686   6$ find */ -... | parallel -s 1024 -Xuj1 vi
 687
 688   7$ find ... | xapply -f -5 -i /dev/tty 'vi' - - - - -
 689
 690   7$ sh <(find ... | parallel -n5 echo vi)
 691
 692   7$ find ... | parallel -n5 -uj1 vi
 693
 694   8$ xapply -fn "" /etc/passwd
 695
 696   8$ parallel -k echo < /etc/passwd
 697
 698   9$ tr ':' '\012' < /etc/passwd | \
 699        xapply -7 -nf 'chown %1 %6' - - - - - - -
 700
 701   9$ tr ':' '\012' < /etc/passwd | parallel -N7 chown {1} {6}
 702
 703   10$ xapply '[ -d %1/RCS ] || echo %1' */
 704
 705   10$ parallel '[ -d {}/RCS ] || echo {}' ::: */
 706
 707   11$ xapply -f '[ -f %1 ] && echo %1' List | ...
 708
 709   11$ parallel '[ -f {} ] && echo {}' < List | ...
 710
 711 https://www.databits.net/~ksb/msrc/local/bin/xapply/xapply.html (Last
 712 checked: 2010-12)
 713
 714
 715 =head2 DIFFERENCES BETWEEN AIX apply AND GNU Parallel
 716
 717 B<apply> can build command lines based on a template and arguments -
 718 very much like GNU B<parallel>. B<apply> does not run jobs in
 719 parallel. B<apply> does not use an argument separator (like B<:::>);
 720 instead the template must be the first argument.
 721
 722 =head3 EXAMPLES FROM IBM's KNOWLEDGE CENTER
 723
 724 Here are the examples from IBM's Knowledge Center and the
 725 corresponding command using GNU B<parallel>:
 726
 727 =head4 To obtain results similar to those of the B<ls> command, enter:
 728
 729   1$ apply echo *
 730   1$ parallel echo ::: *
 731
 732 =head4 To compare the file named a1 to the file named b1, and
 733 the file named a2 to the file named b2, enter:
 734
 735   2$ apply -2 cmp a1 b1 a2 b2
 736   2$ parallel -N2 cmp ::: a1 b1 a2 b2
 737
 738 =head4 To run the B<who> command five times, enter:
 739
 740   3$ apply -0 who 1 2 3 4 5
 741   3$ parallel -N0 who ::: 1 2 3 4 5
 742
 743 =head4 To link all files in the current directory to the directory
 744 /usr/joe, enter:
 745
 746   4$ apply 'ln %1 /usr/joe' *
 747   4$ parallel ln {} /usr/joe ::: *
 748
 749 https://www-01.ibm.com/support/knowledgecenter/
 750 ssw_aix_71/com.ibm.aix.cmds1/apply.htm
 751 (Last checked: 2019-01)
 752
 753
 754 =head2 DIFFERENCES BETWEEN paexec AND GNU Parallel
 755
 756 B<paexec> can run jobs in parallel on both the local and remote computers.
 757
 758 B<paexec> requires commands to print a blank line as the last
 759 output. This means you will have to write a wrapper for most programs.
 760
 761 B<paexec> has a job dependency facility so a job can depend on another
 762 job to be executed successfully. Sort of a poor-man's B<make>.
 763
 764 =head3 EXAMPLES FROM paexec's EXAMPLE CATALOG
 765
 766 Here are the examples from B<paexec>'s example catalog with the equivalent
 767 using GNU B<parallel>:
 768
 769 =head4 1_div_X_run
 770
 771   1$ ../../paexec -s -l -c "`pwd`/1_div_X_cmd" -n +1 <<EOF [...]
 772
 773   1$ parallel echo {} '|' `pwd`/1_div_X_cmd <<EOF [...]
 774
 775 =head4 all_substr_run
 776
 777   2$ ../../paexec -lp -c "`pwd`/all_substr_cmd" -n +3 <<EOF [...]
 778
 779   2$ parallel echo {} '|' `pwd`/all_substr_cmd <<EOF [...]
 780
 781 =head4 cc_wrapper_run
 782
 783   3$ ../../paexec -c "env CC=gcc CFLAGS=-O2 `pwd`/cc_wrapper_cmd" \
 784              -n 'host1 host2' \
 785              -t '/usr/bin/ssh -x' <<EOF [...]
 786
 787   3$ parallel echo {} '|' "env CC=gcc CFLAGS=-O2 `pwd`/cc_wrapper_cmd" \
 788              -S host1,host2 <<EOF [...]
 789
 790      # This is not exactly the same, but avoids the wrapper
 791      parallel gcc -O2 -c -o {.}.o {} \
 792              -S host1,host2 <<EOF [...]
 793
 794 =head4 toupper_run
 795
 796   4$ ../../paexec -lp -c "`pwd`/toupper_cmd" -n +10 <<EOF [...]
 797
 798   4$ parallel echo {} '|' ./toupper_cmd <<EOF [...]
 799
 800      # Without the wrapper:
 801      parallel echo {} '| awk {print\ toupper\(\$0\)}' <<EOF [...]
 802
 803 https://github.com/cheusov/paexec
 804 (Last checked: 2010-12)
 805
 806
 807 =head2 DIFFERENCES BETWEEN map(sitaramc) AND GNU Parallel
 808
 809 Summary (see legend above):
 810
 811 =over
 812
 813 =item I1 - - I4 - - (I7)
 814
 815 =item M1 (M2) M3 (M4) M5 M6
 816
 817 =item - O2 O3 - O5 - - x x O10
 818
 819 =item E1 - - - - - -
 820
 821 =item - - - - - - - - -
 822
 823 =item - -
 824
 825 =back
 826
 827 (I7): Only under special circumstances. See below.
 828
 829 (M2+M4): Only if there is a single replacement string.
 830
 831 B<map> rejects input with special characters:
 832
 833   echo "The Cure" > My\ brother\'s\ 12\"\ records
 834
 835   ls | map 'echo %; wc %'
 836
 837 It works with GNU B<parallel>:
 838
 839   ls | parallel 'echo {}; wc {}'
 840
 841 Under some circumstances it also works with B<map>:
 842
 843   ls | map 'echo % works %'
 844
 845 But tiny changes make it reject the input with special characters:
 846
 847   ls | map 'echo % does not work "%"'
 848
 849 This means that many UTF-8 characters will be rejected. This is by
 850 design. From the web page: "As such, programs that I<quietly handle
 851 them, with no warnings at all,> are doing their users a disservice."
 852
 853 B<map> delays each job by 0.01 s. This can be emulated by using
 854 B<parallel --delay 0.01>.
 855
 856 B<map> prints '+' on stderr when a job starts, and '-' when a job
 857 finishes. This cannot be disabled. B<parallel> has B<--bar> if you
 858 need to see progress.
 859
 860 B<map>'s replacement strings (% %D %B %E) can be simulated in GNU
 861 B<parallel> by putting this in B<~/.parallel/config>:
 862
 863   --rpl '%'
 864   --rpl '%D $_=Q(::dirname($_));'
 865   --rpl '%B s:.*/::;s:\.[^/.]+$::;'
 866   --rpl '%E s:.*\.::'
 867
 868 B<map> does not have an argument separator on the command line, but
 869 uses the first argument as command. This makes quoting harder which again
 870 may affect readability. Compare:
 871
 872   map -p 2 'perl -ne '"'"'/^\S+\s+\S+$/ and print $ARGV,"\n"'"'" *
 873
 874   parallel -q perl -ne '/^\S+\s+\S+$/ and print $ARGV,"\n"' ::: *
 875
 876 B<map> can do multiple arguments with context replace, but not without
 877 context replace:
 878
 879   parallel --xargs echo 'BEGIN{'{}'}END' ::: 1 2 3
 880
 881   map "echo 'BEGIN{'%'}END'" 1 2 3
 882
 883 B<map> has no support for grouping. So this gives the wrong results:
 884
 885   parallel perl -e '\$a=\"1{}\"x10000000\;print\ \$a,\"\\n\"' '>' {} \
 886     ::: a b c d e f
 887   ls -l a b c d e f
 888   parallel -kP4 -n1 grep 1 ::: a b c d e f > out.par
 889   map -n1 -p 4 'grep 1' a b c d e f > out.map-unbuf
 890   map -n1 -p 4 'grep --line-buffered 1' a b c d e f > out.map-linebuf
 891   map -n1 -p 1 'grep --line-buffered 1' a b c d e f > out.map-serial
 892   ls -l out*
 893   md5sum out*
 894
 895 =head3 EXAMPLES FROM map's WEBSITE
 896
 897 Here are the examples from B<map>'s web page with the equivalent using
 898 GNU B<parallel>:
 899
 900   1$ ls *.gif | map convert % %B.png         # default max-args: 1
 901
 902   1$ ls *.gif | parallel convert {} {.}.png
 903
 904   2$ map "mkdir %B; tar -C %B -xf %" *.tgz   # default max-args: 1
 905
 906   2$ parallel 'mkdir {.}; tar -C {.} -xf {}' :::  *.tgz
 907
 908   3$ ls *.gif | map cp % /tmp                # default max-args: 100
 909
 910   3$ ls *.gif | parallel -X cp {} /tmp
 911
 912   4$ ls *.tar | map -n 1 tar -xf %
 913
 914   4$ ls *.tar | parallel tar -xf
 915
 916   5$ map "cp % /tmp" *.tgz
 917
 918   5$ parallel cp {} /tmp ::: *.tgz
 919
 920   6$ map "du -sm /home/%/mail" alice bob carol
 921
 922   6$ parallel "du -sm /home/{}/mail" ::: alice bob carol
 923   or if you prefer running a single job with multiple args:
 924   6$ parallel -Xj1 "du -sm /home/{}/mail" ::: alice bob carol
 925
 926   7$ cat /etc/passwd | map -d: 'echo user %1 has shell %7'
 927
 928   7$ cat /etc/passwd | parallel --colsep : 'echo user {1} has shell {7}'
 929
 930   8$ export MAP_MAX_PROCS=$(( `nproc` / 2 ))
 931
 932   8$ export PARALLEL=-j50%
 933
 934 https://github.com/sitaramc/map
 935 (Last checked: 2020-05)
 936
 937
 938 =head2 DIFFERENCES BETWEEN ladon AND GNU Parallel
 939
 940 B<ladon> can run multiple jobs on files in parallel.
 941
 942 B<ladon> only works on files and the only way to specify files is
 943 using a quoted glob string (such as \*.jpg). It is not possible to
 944 list the files manually.
 945
 946 As replacement strings it uses FULLPATH DIRNAME BASENAME EXT RELDIR
 947 RELPATH
 948
 949 These can be simulated using GNU B<parallel> by putting this in
 950 B<~/.parallel/config>:
 951
 952   --rpl 'FULLPATH $_=Q($_);chomp($_=qx{readlink -f $_});'
 953   --rpl 'DIRNAME $_=Q(::dirname($_));chomp($_=qx{readlink -f $_});'
 954   --rpl 'BASENAME s:.*/::;s:\.[^/.]+$::;'
 955   --rpl 'EXT s:.*\.::'
 956   --rpl 'RELDIR $_=Q($_);chomp(($_,$c)=qx{readlink -f $_;pwd});
 957          s:\Q$c/\E::;$_=::dirname($_);'
 958   --rpl 'RELPATH $_=Q($_);chomp(($_,$c)=qx{readlink -f $_;pwd});
 959          s:\Q$c/\E::;'
 960
 961 B<ladon> deals badly with filenames containing " and newline, and it
 962 fails for output larger than 200k:
 963
 964   ladon '*' -- seq 36000 | wc
 965
 966 =head3 EXAMPLES FROM ladon MANUAL
 967
 968 It is assumed that the '--rpl's above are put in B<~/.parallel/config>
 969 and that it is run under a shell that supports '**' globbing (such as B<zsh>):
 970
 971   1$ ladon "**/*.txt" -- echo RELPATH
 972
 973   1$ parallel echo RELPATH ::: **/*.txt
 974
 975   2$ ladon "~/Documents/**/*.pdf" -- shasum FULLPATH >hashes.txt
 976
 977   2$ parallel shasum FULLPATH ::: ~/Documents/**/*.pdf >hashes.txt
 978
 979   3$ ladon -m thumbs/RELDIR "**/*.jpg" -- convert FULLPATH \
 980        -thumbnail 100x100^ -gravity center -extent 100x100 \
 981        thumbs/RELPATH
 982
 983   3$ parallel mkdir -p thumbs/RELDIR\; convert FULLPATH
 984        -thumbnail 100x100^ -gravity center -extent 100x100 \
 985        thumbs/RELPATH ::: **/*.jpg
 986
 987   4$ ladon "~/Music/*.wav" -- lame -V 2 FULLPATH DIRNAME/BASENAME.mp3
 988
 989   4$ parallel lame -V 2 FULLPATH DIRNAME/BASENAME.mp3 ::: ~/Music/*.wav
 990
 991 https://github.com/danielgtaylor/ladon
 992 (Last checked: 2019-01)
 993
 994
 995 =head2 DIFFERENCES BETWEEN jobflow AND GNU Parallel
 996
 997 Summary (see legend above):
 998
 999 =over
1000
1001 =item I1 - - - - - I7
1002
1003 =item - - M3 - - (M6)
1004
1005 =item O1 O2 O3 - O5 O6 (O7) - - O10
1006
1007 =item E1 - - - - E6 -
1008
1009 =item - - - - - - - - -
1010
1011 =item - -
1012
1013 =back
1014
1015
1016 B<jobflow> can run multiple jobs in parallel.
1017
1018 Just like B<xargs> output from B<jobflow> jobs running in parallel mix
1019 together by default. B<jobflow> can buffer into files with
1020 B<-buffered> (placed in /run/shm), but these are not cleaned up if
1021 B<jobflow> dies unexpectedly (e.g. by Ctrl-C). If the total output is
1022 big (in the order of RAM+swap) it can cause the system to slow to a
1023 crawl and eventually run out of memory.
1024
1025 Just like B<xargs> redirection and composed commands require wrapping
1026 with B<bash -c>.
1027
1028 Input lines can at most be 4096 bytes.
1029
1030 B<jobflow> is faster than GNU B<parallel> but around 6 times slower
1031 than B<parallel-bash>.
1032
1033 B<jobflow> has no equivalent for B<--pipe>, or B<--sshlogin>.
1034
1035 B<jobflow> makes it possible to set resource limits on the running
1036 jobs. This can be emulated by GNU B<parallel> using B<bash>'s B<ulimit>:
1037
1038   jobflow -limits=mem=100M,cpu=3,fsize=20M,nofiles=300 myjob
1039
1040   parallel 'ulimit -v 102400 -t 3 -f 204800 -n 300 myjob'
1041
1042
1043 =head3 EXAMPLES FROM jobflow README
1044
1045   1$ cat things.list | jobflow -threads=8 -exec ./mytask {}
1046
1047   1$ cat things.list | parallel -j8 ./mytask {}
1048
1049   2$ seq 100 | jobflow -threads=100 -exec echo {}
1050
1051   2$ seq 100 | parallel -j100 echo {}
1052
1053   3$ cat urls.txt | jobflow -threads=32 -exec wget {}
1054
1055   3$ cat urls.txt | parallel -j32 wget {}
1056
1057   4$ find . -name '*.bmp' | \
1058        jobflow -threads=8 -exec bmp2jpeg {.}.bmp {.}.jpg
1059
1060   4$ find . -name '*.bmp' | \
1061        parallel -j8 bmp2jpeg {.}.bmp {.}.jpg
1062
1063   5$ seq 100 | jobflow -skip 10 -count 10
1064
1065   5$ seq 100 | parallel --filter '{1} > 10 and {1} <= 20' echo
1066
1067   5$ seq 100 | parallel echo '{= $_>10 and $_<=20 or skip() =}'
1068
1069 https://github.com/rofl0r/jobflow
1070 (Last checked: 2022-05)
1071
1072
1073 =head2 DIFFERENCES BETWEEN gargs AND GNU Parallel
1074
1075 B<gargs> can run multiple jobs in parallel.
1076
1077 Older versions cache output in memory. This causes it to be extremely
1078 slow when the output is larger than the physical RAM, and can cause
1079 the system to run out of memory.
1080
1081 See more details on this in B<man parallel_design>.
1082
1083 Newer versions cache output in files, but leave files in $TMPDIR if it
1084 is killed.
1085
1086 Output to stderr (standard error) is changed if the command fails.
1087
1088 =head3 EXAMPLES FROM gargs WEBSITE
1089
1090   1$ seq 12 -1 1 | gargs -p 4 -n 3 "sleep {0}; echo {1} {2}"
1091
1092   1$ seq 12 -1 1 | parallel -P 4 -n 3 "sleep {1}; echo {2} {3}"
1093
1094   2$ cat t.txt | gargs --sep "\s+" \
1095        -p 2 "echo '{0}:{1}-{2}' full-line: \'{}\'"
1096
1097   2$ cat t.txt | parallel --colsep "\\s+" \
1098        -P 2 "echo '{1}:{2}-{3}' full-line: \'{}\'"
1099
1100 https://github.com/brentp/gargs
1101 (Last checked: 2016-08)
1102
1103
1104 =head2 DIFFERENCES BETWEEN orgalorg AND GNU Parallel
1105
1106 B<orgalorg> can run the same job on multiple machines. This is related
1107 to B<--onall> and B<--nonall>.
1108
1109 B<orgalorg> supports entering the SSH password - provided it is the
1110 same for all servers. GNU B<parallel> advocates using B<ssh-agent>
1111 instead, but it is possible to emulate B<orgalorg>'s behavior by
1112 setting SSHPASS and by using B<--ssh "sshpass ssh">.
1113
1114 To make the emulation easier, make a simple alias:
1115
1116   alias par_emul="parallel -j0 --ssh 'sshpass ssh' --nonall --tag --lb"
1117
1118 If you want to supply a password run:
1119
1120   SSHPASS=`ssh-askpass`
1121
1122 or set the password directly:
1123
1124   SSHPASS=P4$$w0rd!
1125
1126 If the above is set up you can then do:
1127
1128   orgalorg -o frontend1 -o frontend2 -p -C uptime
1129   par_emul -S frontend1 -S frontend2 uptime
1130
1131   orgalorg -o frontend1 -o frontend2 -p -C top -bid 1
1132   par_emul -S frontend1 -S frontend2 top -bid 1
1133
1134   orgalorg -o frontend1 -o frontend2 -p -er /tmp -n \
1135     'md5sum /tmp/bigfile' -S bigfile
1136   par_emul -S frontend1 -S frontend2 --basefile bigfile \
1137     --workdir /tmp md5sum /tmp/bigfile
1138
1139 B<orgalorg> has a progress indicator for the transferring of a
1140 file. GNU B<parallel> does not.
1141
1142 https://github.com/reconquest/orgalorg
1143 (Last checked: 2016-08)
1144
1145
1146 =head2 DIFFERENCES BETWEEN Rust parallel(mmstick) AND GNU Parallel
1147
1148 Rust parallel focuses on speed. It is almost as fast as B<xargs>, but
1149 not as fast as B<parallel-bash>. It implements a few features from GNU
1150 B<parallel>, but lacks many functions. All these fail:
1151
1152   # Read arguments from file
1153   parallel -a file echo
1154   # Changing the delimiter
1155   parallel -d _ echo ::: a_b_c_
1156
1157 These do something different from GNU B<parallel>
1158
1159   # -q to protect quoted $ and space
1160   parallel -q perl -e '$a=shift; print "$a"x10000000' ::: a b c
1161   # Generation of combination of inputs
1162   parallel echo {1} {2} ::: red green blue ::: S M L XL XXL
1163   # {= perl expression =} replacement string
1164   parallel echo '{= s/new/old/ =}' ::: my.new your.new
1165   # --pipe
1166   seq 100000 | parallel --pipe wc
1167   # linked arguments
1168   parallel echo ::: S M L :::+ sml med lrg ::: R G B :::+ red grn blu
1169   # Run different shell dialects
1170   zsh -c 'parallel echo \={} ::: zsh && true'
1171   csh -c 'parallel echo \$\{\} ::: shell && true'
1172   bash -c 'parallel echo \$\({}\) ::: pwd && true'
1173   # Rust parallel does not start before the last argument is read
1174   (seq 10; sleep 5; echo 2) | time parallel -j2 'sleep 2; echo'
1175   tail -f /var/log/syslog | parallel echo
1176
1177 Most of the examples from the book GNU Parallel 2018 do not work, thus
1178 Rust parallel is not close to being a compatible replacement.
1179
1180 Rust parallel has no remote facilities.
1181
1182 It uses /tmp/parallel for tmp files and does not clean up if
1183 terminated abruptly. If another user on the system uses Rust parallel,
1184 then /tmp/parallel will have the wrong permissions and Rust parallel
1185 will fail. A malicious user can setup the right permissions and
1186 symlink the output file to one of the user's files and next time the
1187 user uses Rust parallel it will overwrite this file.
1188
1189   attacker$ mkdir /tmp/parallel
1190   attacker$ chmod a+rwX /tmp/parallel
1191   # Symlink to the file the attacker wants to zero out
1192   attacker$ ln -s ~victim/.important-file /tmp/parallel/stderr_1
1193   victim$ seq 1000 | parallel echo
1194   # This file is now overwritten with stderr from 'echo'
1195   victim$ cat ~victim/.important-file
1196
1197 If /tmp/parallel runs full during the run, Rust parallel does not
1198 report this, but finishes with success - thereby risking data loss.
1199
1200 https://github.com/mmstick/parallel
1201 (Last checked: 2016-08)
1202
1203
1204 =head2 DIFFERENCES BETWEEN Rush AND GNU Parallel
1205
1206 B<rush> (https://github.com/shenwei356/rush) is written in Go and
1207 based on B<gargs>.
1208
1209 Just like GNU B<parallel> B<rush> buffers in temporary files. But
1210 opposite GNU B<parallel> B<rush> does not clean up, if the process
1211 dies abnormally.
1212
1213 B<rush> has some string manipulations that can be emulated by putting
1214 this into ~/.parallel/config (/ is used instead of %, and % is used
1215 instead of ^ as that is closer to bash's ${var%postfix}):
1216
1217   --rpl '{:} s:(\.[^/]+)*$::'
1218   --rpl '{:%([^}]+?)} s:$$1(\.[^/]+)*$::'
1219   --rpl '{/:%([^}]*?)} s:.*/(.*)$$1(\.[^/]+)*$:$1:'
1220   --rpl '{/:} s:(.*/)?([^/.]+)(\.[^/]+)*$:$2:'
1221   --rpl '{@(.*?)} /$$1/ and $_=$1;'
1222
1223 =head3 EXAMPLES FROM rush's WEBSITE
1224
1225 Here are the examples from B<rush>'s website with the equivalent
1226 command in GNU B<parallel>.
1227
1228 B<1. Simple run, quoting is not necessary>
1229
1230   1$ seq 1 3 | rush echo {}
1231
1232   1$ seq 1 3 | parallel echo {}
1233
1234 B<2. Read data from file (`-i`)>
1235
1236   2$ rush echo {} -i data1.txt -i data2.txt
1237
1238   2$ cat data1.txt data2.txt | parallel echo {}
1239
1240 B<3. Keep output order (`-k`)>
1241
1242   3$ seq 1 3 | rush 'echo {}' -k
1243
1244   3$ seq 1 3 | parallel -k echo {}
1245
1246
1247 B<4. Timeout (`-t`)>
1248
1249   4$ time seq 1 | rush 'sleep 2; echo {}' -t 1
1250
1251   4$ time seq 1 | parallel --timeout 1 'sleep 2; echo {}'
1252
1253 B<5. Retry (`-r`)>
1254
1255   5$ seq 1 | rush 'python unexisted_script.py' -r 1
1256
1257   5$ seq 1 | parallel --retries 2 'python unexisted_script.py'
1258
1259 Use B<-u> to see it is really run twice:
1260
1261   5$ seq 1 | parallel -u --retries 2 'python unexisted_script.py'
1262
1263 B<6. Dirname (`{/}`) and basename (`{%}`) and remove custom
1264 suffix (`{^suffix}`)>
1265
1266   6$ echo dir/file_1.txt.gz | rush 'echo {/} {%} {^_1.txt.gz}'
1267
1268   6$ echo dir/file_1.txt.gz |
1269        parallel --plus echo {//} {/} {%_1.txt.gz}
1270
1271 B<7. Get basename, and remove last (`{.}`) or any (`{:}`) extension>
1272
1273   7$ echo dir.d/file.txt.gz | rush 'echo {.} {:} {%.} {%:}'
1274
1275   7$ echo dir.d/file.txt.gz | parallel 'echo {.} {:} {/.} {/:}'
1276
1277 B<8. Job ID, combine fields index and other replacement strings>
1278
1279   8$ echo 12 file.txt dir/s_1.fq.gz |
1280        rush 'echo job {#}: {2} {2.} {3%:^_1}'
1281
1282   8$ echo 12 file.txt dir/s_1.fq.gz |
1283        parallel --colsep ' ' 'echo job {#}: {2} {2.} {3/:%_1}'
1284
1285 B<9. Capture submatch using regular expression (`{@regexp}`)>
1286
1287   9$ echo read_1.fq.gz | rush 'echo {@(.+)_\d}'
1288
1289   9$ echo read_1.fq.gz | parallel 'echo {@(.+)_\d}'
1290
1291 B<10. Custom field delimiter (`-d`)>
1292
1293   10$ echo a=b=c | rush 'echo {1} {2} {3}' -d =
1294
1295   10$ echo a=b=c | parallel -d = echo {1} {2} {3}
1296
1297 B<11. Send multi-lines to every command (`-n`)>
1298
1299   11$ seq 5 | rush -n 2 -k 'echo "{}"; echo'
1300
1301   11$ seq 5 |
1302         parallel -n 2 -k \
1303           'echo {=-1 $_=join"\n",@arg[1..$#arg] =}; echo'
1304
1305   11$ seq 5 | rush -n 2 -k 'echo "{}"; echo' -J ' '
1306
1307   11$ seq 5 | parallel -n 2 -k 'echo {}; echo'
1308
1309
1310 B<12. Custom record delimiter (`-D`), note that empty records are not used.>
1311
1312   12$ echo a b c d | rush -D " " -k 'echo {}'
1313
1314   12$ echo a b c d | parallel -d " " -k 'echo {}'
1315
1316   12$ echo abcd | rush -D "" -k 'echo {}'
1317
1318   Cannot be done by GNU Parallel
1319
1320   12$ cat fasta.fa
1321   >seq1
1322   tag
1323   >seq2
1324   cat
1325   gat
1326   >seq3
1327   attac
1328   a
1329   cat
1330
1331   12$ cat fasta.fa | rush -D ">" \
1332         'echo FASTA record {#}: name: {1} sequence: {2}' -k -d "\n"
1333       # rush fails to join the multiline sequences
1334
1335   12$ cat fasta.fa | (read -n1 ignore_first_char;
1336         parallel -d '>' --colsep '\n' echo FASTA record {#}: \
1337           name: {1} sequence: '{=2 $_=join"",@arg[2..$#arg]=}'
1338       )
1339
1340 B<13. Assign value to variable, like `awk -v` (`-v`)>
1341
1342   13$ seq 1 |
1343         rush 'echo Hello, {fname} {lname}!' -v fname=Wei -v lname=Shen
1344
1345   13$ seq 1 |
1346         parallel -N0 \
1347           'fname=Wei; lname=Shen; echo Hello, ${fname} ${lname}!'
1348
1349   13$ for var in a b; do \
1350   13$   seq 1 3 | rush -k -v var=$var 'echo var: {var}, data: {}'; \
1351   13$ done
1352
1353 In GNU B<parallel> you would typically do:
1354
1355   13$ seq 1 3 | parallel -k echo var: {1}, data: {2} ::: a b :::: -
1356
1357 If you I<really> want the var:
1358
1359   13$ seq 1 3 |
1360         parallel -k var={1} ';echo var: $var, data: {}' ::: a b :::: -
1361
1362 If you I<really> want the B<for>-loop:
1363
1364   13$ for var in a b; do
1365         export var;
1366         seq 1 3 | parallel -k 'echo var: $var, data: {}';
1367       done
1368
1369 Contrary to B<rush> this also works if the value is complex like:
1370
1371   My brother's 12" records
1372
1373
1374 B<14. Preset variable (`-v`), avoid repeatedly writing verbose replacement strings>
1375
1376   14$ # naive way
1377       echo read_1.fq.gz | rush 'echo {:^_1} {:^_1}_2.fq.gz'
1378
1379   14$ echo read_1.fq.gz | parallel 'echo {:%_1} {:%_1}_2.fq.gz'
1380
1381   14$ # macro + removing suffix
1382       echo read_1.fq.gz |
1383         rush -v p='{:^_1}' 'echo {p} {p}_2.fq.gz'
1384
1385   14$ echo read_1.fq.gz |
1386         parallel 'p={:%_1}; echo $p ${p}_2.fq.gz'
1387
1388   14$ # macro + regular expression
1389       echo read_1.fq.gz | rush -v p='{@(.+?)_\d}' 'echo {p} {p}_2.fq.gz'
1390
1391   14$ echo read_1.fq.gz | parallel 'p={@(.+?)_\d}; echo $p ${p}_2.fq.gz'
1392
1393 Contrary to B<rush> GNU B<parallel> works with complex values:
1394
1395   14$ echo "My brother's 12\"read_1.fq.gz" |
1396         parallel 'p={@(.+?)_\d}; echo $p ${p}_2.fq.gz'
1397
1398 B<15. Interrupt jobs by `Ctrl-C`, rush will stop unfinished commands and exit.>
1399
1400   15$ seq 1 20 | rush 'sleep 1; echo {}'
1401       ^C
1402
1403   15$ seq 1 20 | parallel 'sleep 1; echo {}'
1404       ^C
1405
1406 B<16. Continue/resume jobs (`-c`). When some jobs failed (by
1407 execution failure, timeout, or canceling by user with `Ctrl + C`),
1408 please switch flag `-c/--continue` on and run again, so that `rush`
1409 can save successful commands and ignore them in I<NEXT> run.>
1410
1411   16$ seq 1 3 | rush 'sleep {}; echo {}' -t 3 -c
1412       cat successful_cmds.rush
1413       seq 1 3 | rush 'sleep {}; echo {}' -t 3 -c
1414
1415   16$ seq 1 3 | parallel --joblog mylog --timeout 2 \
1416         'sleep {}; echo {}'
1417       cat mylog
1418       seq 1 3 | parallel --joblog mylog --retry-failed \
1419         'sleep {}; echo {}'
1420
1421 Multi-line jobs:
1422
1423   16$ seq 1 3 | rush 'sleep {}; echo {}; \
1424         echo finish {}' -t 3 -c -C finished.rush
1425       cat finished.rush
1426       seq 1 3 | rush 'sleep {}; echo {}; \
1427         echo finish {}' -t 3 -c -C finished.rush
1428
1429   16$ seq 1 3 |
1430         parallel --joblog mylog --timeout 2 'sleep {}; echo {}; \
1431           echo finish {}'
1432       cat mylog
1433       seq 1 3 |
1434         parallel --joblog mylog --retry-failed 'sleep {}; echo {}; \
1435           echo finish {}'
1436
1437 B<17. A comprehensive example: downloading 1K+ pages given by
1438 three URL list files using `phantomjs save_page.js` (some page
1439 contents are dynamically generated by Javascript, so `wget` does not
1440 work). Here I set max jobs number (`-j`) as `20`, each job has a max
1441 running time (`-t`) of `60` seconds and `3` retry changes
1442 (`-r`). Continue flag `-c` is also switched on, so we can continue
1443 unfinished jobs. Luckily, it's accomplished in one run :)>
1444
1445   17$ for f in $(seq 2014 2016); do \
1446         /bin/rm -rf $f; mkdir -p $f; \
1447         cat $f.html.txt | rush -v d=$f -d = \
1448           'phantomjs save_page.js "{}" > {d}/{3}.html' \
1449           -j 20 -t 60 -r 3 -c; \
1450       done
1451
1452 GNU B<parallel> can append to an existing joblog with '+':
1453
1454   17$ rm mylog
1455       for f in $(seq 2014 2016); do
1456         /bin/rm -rf $f; mkdir -p $f;
1457         cat $f.html.txt |
1458           parallel -j20 --timeout 60 --retries 4 --joblog +mylog \
1459             --colsep = \
1460             phantomjs save_page.js {1}={2}={3} '>' $f/{3}.html
1461       done
1462
1463 B<18. A bioinformatics example: mapping with `bwa`, and
1464 processing result with `samtools`:>
1465
1466   18$ ref=ref/xxx.fa
1467       threads=25
1468       ls -d raw.cluster.clean.mapping/* \
1469         | rush -v ref=$ref -v j=$threads -v p='{}/{%}' \
1470         'bwa mem -t {j} -M -a {ref} {p}_1.fq.gz {p}_2.fq.gz >{p}.sam;\
1471         samtools view -bS {p}.sam > {p}.bam; \
1472         samtools sort -T {p}.tmp -@ {j} {p}.bam -o {p}.sorted.bam; \
1473         samtools index {p}.sorted.bam; \
1474         samtools flagstat {p}.sorted.bam > {p}.sorted.bam.flagstat; \
1475         /bin/rm {p}.bam {p}.sam;' \
1476         -j 2 --verbose -c -C mapping.rush
1477
1478 GNU B<parallel> would use a function:
1479
1480   18$ ref=ref/xxx.fa
1481       export ref
1482       thr=25
1483       export thr
1484       bwa_sam() {
1485         p="$1"
1486         bam="$p".bam
1487         sam="$p".sam
1488         sortbam="$p".sorted.bam
1489         bwa mem -t $thr -M -a $ref ${p}_1.fq.gz ${p}_2.fq.gz > "$sam"
1490         samtools view -bS "$sam" > "$bam"
1491         samtools sort -T ${p}.tmp -@ $thr "$bam" -o "$sortbam"
1492         samtools index "$sortbam"
1493         samtools flagstat "$sortbam" > "$sortbam".flagstat
1494         /bin/rm "$bam" "$sam"
1495       }
1496       export -f bwa_sam
1497       ls -d raw.cluster.clean.mapping/* |
1498         parallel -j 2 --verbose --joblog mylog bwa_sam
1499
1500 =head3 Other B<rush> features
1501
1502 B<rush> has:
1503
1504 =over 4
1505
1506 =item * B<awk -v> like custom defined variables (B<-v>)
1507
1508 With GNU B<parallel> you would simply set a shell variable:
1509
1510    parallel 'v={}; echo "$v"' ::: foo
1511    echo foo | rush -v v={} 'echo {v}'
1512
1513 Also B<rush> does not like special chars. So these B<do not work>:
1514
1515    echo does not work | rush -v v=\" 'echo {v}'
1516    echo "My  brother's  12\"  records" | rush -v v={} 'echo {v}'
1517
1518 Whereas the corresponding GNU B<parallel> version works:
1519
1520    parallel 'v=\"; echo "$v"' ::: works
1521    parallel 'v={}; echo "$v"' ::: "My  brother's  12\"  records"
1522
1523 =item * Exit on first error(s) (-e)
1524
1525 This is called B<--halt now,fail=1> (or shorter: B<--halt 2>) when
1526 used with GNU B<parallel>.
1527
1528 =item * Settable records sending to every command (B<-n>, default 1)
1529
1530 This is also called B<-n> in GNU B<parallel>.
1531
1532 =item * Practical replacement strings
1533
1534 =over 4
1535
1536 =item {:} remove any extension
1537
1538 With GNU B<parallel> this can be emulated by:
1539
1540   parallel --plus echo '{/\..*/}' ::: foo.ext.bar.gz
1541
1542 =item {^suffix}, remove suffix
1543
1544 With GNU B<parallel> this can be emulated by:
1545
1546   parallel --plus echo '{%.bar.gz}' ::: foo.ext.bar.gz
1547
1548 =item {@regexp}, capture submatch using regular expression
1549
1550 With GNU B<parallel> this can be emulated by:
1551
1552   parallel --rpl '{@(.*?)} /$$1/ and $_=$1;' \
1553     echo '{@\d_(.*).gz}' ::: 1_foo.gz
1554
1555 =item {%.}, {%:}, basename without extension
1556
1557 With GNU B<parallel> this can be emulated by:
1558
1559   parallel echo '{= s:.*/::;s/\..*// =}' ::: dir/foo.bar.gz
1560
1561 And if you need it often, you define a B<--rpl> in
1562 B<$HOME/.parallel/config>:
1563
1564   --rpl '{%.} s:.*/::;s/\..*//'
1565   --rpl '{%:} s:.*/::;s/\..*//'
1566
1567 Then you can use them as:
1568
1569   parallel echo {%.} {%:} ::: dir/foo.bar.gz
1570
1571 =back
1572
1573 =item * Preset variable (macro)
1574
1575 E.g.
1576
1577   echo foosuffix | rush -v p={^suffix} 'echo {p}_new_suffix'
1578
1579 With GNU B<parallel> this can be emulated by:
1580
1581   echo foosuffix |
1582     parallel --plus 'p={%suffix}; echo ${p}_new_suffix'
1583
1584 Opposite B<rush> GNU B<parallel> works fine if the input contains
1585 double space, ' and ":
1586
1587   echo "1'6\"  foosuffix" |
1588     parallel --plus 'p={%suffix}; echo "${p}"_new_suffix'
1589
1590
1591 =item * Commands of multi-lines
1592
1593 While you I<can> use multi-lined commands in GNU B<parallel>, to
1594 improve readability GNU B<parallel> discourages the use of multi-line
1595 commands. In most cases it can be written as a function:
1596
1597   seq 1 3 |
1598     parallel --timeout 2 --joblog my.log 'sleep {}; echo {}; \
1599       echo finish {}'
1600
1601 Could be written as:
1602
1603   doit() {
1604     sleep "$1"
1605     echo "$1"
1606     echo finish "$1"
1607   }
1608   export -f doit
1609   seq 1 3 | parallel --timeout 2 --joblog my.log doit
1610
1611 The failed commands can be resumed with:
1612
1613   seq 1 3 |
1614     parallel --resume-failed --joblog my.log 'sleep {}; echo {};\
1615       echo finish {}'
1616
1617 =back
1618
1619 https://github.com/shenwei356/rush
1620 (Last checked: 2017-05)
1621
1622
1623 =head2 DIFFERENCES BETWEEN ClusterSSH AND GNU Parallel
1624
1625 ClusterSSH solves a different problem than GNU B<parallel>.
1626
1627 ClusterSSH opens a terminal window for each computer and using a
1628 master window you can run the same command on all the computers. This
1629 is typically used for administrating several computers that are almost
1630 identical.
1631
1632 GNU B<parallel> runs the same (or different) commands with different
1633 arguments in parallel possibly using remote computers to help
1634 computing. If more than one computer is listed in B<-S> GNU B<parallel> may
1635 only use one of these (e.g. if there are 8 jobs to be run and one
1636 computer has 8 cores).
1637
1638 GNU B<parallel> can be used as a poor-man's version of ClusterSSH:
1639
1640 B<parallel --nonall -S server-a,server-b do_stuff foo bar>
1641
1642 https://github.com/duncs/clusterssh
1643 (Last checked: 2010-12)
1644
1645
1646 =head2 DIFFERENCES BETWEEN coshell AND GNU Parallel
1647
1648 B<coshell> only accepts full commands on standard input. Any quoting
1649 needs to be done by the user.
1650
1651 Commands are run in B<sh> so any B<bash>/B<tcsh>/B<zsh> specific
1652 syntax will not work.
1653
1654 Output can be buffered by using B<-d>. Output is buffered in memory,
1655 so big output can cause swapping and therefore be terrible slow or
1656 even cause out of memory.
1657
1658 https://github.com/gdm85/coshell
1659 (Last checked: 2019-01)
1660
1661
1662 =head2 DIFFERENCES BETWEEN spread AND GNU Parallel
1663
1664 =over
1665
1666 =item - - - I4 - - I7
1667
1668 =item M1 - - - - -
1669
1670 =item O1 O2 O3 O4 O5 O6 - O8 - O10
1671
1672 =item - - - - - - -
1673
1674 =item - - - - - - - - -
1675
1676 =item - -
1677
1678 =back
1679
1680 B<spread> runs commands on all directories. It does not run jobs in parallel.
1681
1682 It can be emulated with GNU B<parallel> using this Bash function:
1683
1684   spread() {
1685     _cmds() {
1686       perl -e '$"=" && ";print "@ARGV"' "cd {}" "$@"
1687     }
1688     parallel $(_cmds "$@")'|| echo exit status $?' ::: */
1689   }
1690
1691 https://github.com/tfogo/spread
1692 (Last checked: 2024-04)
1693
1694
1695 =head2 DIFFERENCES BETWEEN pyargs AND GNU Parallel
1696
1697 B<pyargs> deals badly with input containing spaces. It buffers stdout,
1698 but not stderr. It buffers in RAM. {} does not work as replacement
1699 string. It does not support running functions.
1700
1701 B<pyargs> does not support composed commands if run with B<--lines>,
1702 and fails on B<pyargs traceroute gnu.org fsf.org>.
1703
1704 =head3 Examples
1705
1706   seq 5 | pyargs -P50 -L seq
1707   seq 5 | parallel -P50 --lb seq
1708
1709   seq 5 | pyargs -P50 --mark -L seq
1710   seq 5 | parallel -P50 --lb \
1711     --tagstring OUTPUT'[{= $_=$job->replaced() =}]' seq
1712   # Similar, but not precisely the same
1713   seq 5 | parallel -P50 --lb --tag seq
1714
1715   seq 5 | pyargs -P50  --mark command
1716   # Somewhat longer with GNU Parallel due to the special
1717   #   --mark formatting
1718   cmd="$(echo "command" | parallel --shellquote)"
1719   wrap_cmd() {
1720      echo "MARK $cmd $@================================" >&3
1721      echo "OUTPUT START[$cmd $@]:"
1722      eval $cmd "$@"
1723      echo "OUTPUT END[$cmd $@]"
1724   }
1725   (seq 5 | env_parallel -P2 wrap_cmd) 3>&1
1726   # Similar, but not exactly the same
1727   seq 5 | parallel -t --tag command
1728
1729   (echo '1  2  3';echo 4 5 6) | pyargs  --stream seq
1730   (echo '1  2  3';echo 4 5 6) | perl -pe 's/\n/ /' |
1731     parallel -r -d' ' seq
1732   # Similar, but not exactly the same
1733   parallel seq ::: 1 2 3 4 5 6
1734
1735 https://github.com/robertblackwell/pyargs
1736 (Last checked: 2019-01)
1737
1738
1739 =head2 DIFFERENCES BETWEEN concurrently AND GNU Parallel
1740
1741 B<concurrently> runs jobs in parallel.
1742
1743 The output is prepended with the job number, and may be incomplete:
1744
1745   $ concurrently 'seq 100000' | (sleep 3;wc -l)
1746   7165
1747
1748 When pretty printing it caches output in memory. Output mixes by using
1749 test MIX below whether or not output is cached.
1750
1751 There seems to be no way of making a template command and have
1752 B<concurrently> fill that with different args. The full commands must
1753 be given on the command line.
1754
1755 There is also no way of controlling how many jobs should be run in
1756 parallel at a time - i.e. "number of jobslots". Instead all jobs are
1757 simply started in parallel.
1758
1759 https://github.com/kimmobrunfeldt/concurrently
1760 (Last checked: 2019-01)
1761
1762
1763 =head2 DIFFERENCES BETWEEN map(soveran) AND GNU Parallel
1764
1765 B<map> does not run jobs in parallel by default. The README suggests using:
1766
1767   ... | map t 'sleep $t && say done &'
1768
1769 But this fails if more jobs are run in parallel than the number of
1770 available processes. Since there is no support for parallelization in
1771 B<map> itself, the output also mixes:
1772
1773   seq 10 | map i 'echo start-$i && sleep 0.$i && echo end-$i &'
1774
1775 The major difference is that GNU B<parallel> is built for parallelization
1776 and B<map> is not. So GNU B<parallel> has lots of ways of dealing with the
1777 issues that parallelization raises:
1778
1779 =over 4
1780
1781 =item *
1782
1783 Keep the number of processes manageable
1784
1785 =item *
1786
1787 Make sure output does not mix
1788
1789 =item *
1790
1791 Make Ctrl-C kill all running processes
1792
1793 =back
1794
1795 =head3 EXAMPLES FROM maps WEBSITE
1796
1797 Here are the 5 examples converted to GNU Parallel:
1798
1799   1$ ls *.c | map f 'foo $f'
1800   1$ ls *.c | parallel foo
1801
1802   2$ ls *.c | map f 'foo $f; bar $f'
1803   2$ ls *.c | parallel 'foo {}; bar {}'
1804
1805   3$ cat urls | map u 'curl -O $u'
1806   3$ cat urls | parallel curl -O
1807
1808   4$ printf "1\n1\n1\n" | map t 'sleep $t && say done'
1809   4$ printf "1\n1\n1\n" | parallel 'sleep {} && say done'
1810   4$ parallel 'sleep {} && say done' ::: 1 1 1
1811
1812   5$ printf "1\n1\n1\n" | map t 'sleep $t && say done &'
1813   5$ printf "1\n1\n1\n" | parallel -j0 'sleep {} && say done'
1814   5$ parallel -j0 'sleep {} && say done' ::: 1 1 1
1815
1816 https://github.com/soveran/map
1817 (Last checked: 2019-01)
1818
1819
1820 =head2 DIFFERENCES BETWEEN loop AND GNU Parallel
1821
1822 B<loop> mixes stdout and stderr:
1823
1824     loop 'ls /no-such-file' >/dev/null
1825
1826 B<loop>'s replacement string B<$ITEM> does not quote strings:
1827
1828     echo 'two  spaces' | loop 'echo $ITEM'
1829
1830 B<loop> cannot run functions:
1831
1832     myfunc() { echo joe; }
1833     export -f myfunc
1834     loop 'myfunc this fails'
1835
1836 =head3 EXAMPLES FROM loop's WEBSITE
1837
1838 Some of the examples from https://github.com/Miserlou/Loop/ can be
1839 emulated with GNU B<parallel>:
1840
1841     # A couple of functions will make the code easier to read
1842     $ loopy() {
1843         yes | parallel -uN0 -j1 "$@"
1844       }
1845     $ export -f loopy
1846     $ time_out() {
1847         parallel -uN0 -q --timeout "$@" ::: 1
1848       }
1849     $ match() {
1850         perl -0777 -ne 'grep /'"$1"'/,$_ and print or exit 1'
1851       }
1852     $ export -f match
1853
1854     $ loop 'ls' --every 10s
1855     $ loopy --delay 10s ls
1856
1857     $ loop 'touch $COUNT.txt' --count-by 5
1858     $ loopy touch '{= $_=seq()*5 =}'.txt
1859
1860     $ loop --until-contains 200 -- \
1861         ./get_response_code.sh --site mysite.biz`
1862     $ loopy --halt now,success=1 \
1863         './get_response_code.sh --site mysite.biz | match 200'
1864
1865     $ loop './poke_server' --for-duration 8h
1866     $ time_out 8h loopy ./poke_server
1867
1868     $ loop './poke_server' --until-success
1869     $ loopy --halt now,success=1 ./poke_server
1870
1871     $ cat files_to_create.txt | loop 'touch $ITEM'
1872     $ cat files_to_create.txt | parallel touch {}
1873
1874     $ loop 'ls' --for-duration 10min --summary
1875     # --joblog is somewhat more verbose than --summary
1876     $ time_out 10m loopy --joblog my.log ./poke_server; cat my.log
1877
1878     $ loop 'echo hello'
1879     $ loopy echo hello
1880
1881     $ loop 'echo $COUNT'
1882     # GNU Parallel counts from 1
1883     $ loopy echo {#}
1884     # Counting from 0 can be forced
1885     $ loopy echo '{= $_=seq()-1 =}'
1886
1887     $ loop 'echo $COUNT' --count-by 2
1888     $ loopy echo '{= $_=2*(seq()-1) =}'
1889
1890     $ loop 'echo $COUNT' --count-by 2 --offset 10
1891     $ loopy echo '{= $_=10+2*(seq()-1) =}'
1892
1893     $ loop 'echo $COUNT' --count-by 1.1
1894     # GNU Parallel rounds 3.3000000000000003 to 3.3
1895     $ loopy echo '{= $_=1.1*(seq()-1) =}'
1896
1897     $ loop 'echo $COUNT $ACTUALCOUNT' --count-by 2
1898     $ loopy echo '{= $_=2*(seq()-1) =} {#}'
1899
1900     $ loop 'echo $COUNT' --num 3 --summary
1901     # --joblog is somewhat more verbose than --summary
1902     $ seq 3 | parallel --joblog my.log echo; cat my.log
1903
1904     $ loop 'ls -foobarbatz' --num 3 --summary
1905     # --joblog is somewhat more verbose than --summary
1906     $ seq 3 | parallel --joblog my.log -N0 ls -foobarbatz; cat my.log
1907
1908     $ loop 'echo $COUNT' --count-by 2 --num 50 --only-last
1909     # Can be emulated by running 2 jobs
1910     $ seq 49 | parallel echo '{= $_=2*(seq()-1) =}' >/dev/null
1911     $ echo 50| parallel echo '{= $_=2*(seq()-1) =}'
1912
1913     $ loop 'date' --every 5s
1914     $ loopy --delay 5s date
1915
1916     $ loop 'date' --for-duration 8s --every 2s
1917     $ time_out 8s loopy --delay 2s date
1918
1919     $ loop 'date -u' --until-time '2018-05-25 20:50:00' --every 5s
1920     $ seconds=$((`date -d 2019-05-25T20:50:00 +%s` - `date  +%s`))s
1921     $ time_out $seconds loopy --delay 5s date -u
1922
1923     $ loop 'echo $RANDOM' --until-contains "666"
1924     $ loopy --halt now,success=1 'echo $RANDOM | match 666'
1925
1926     $ loop 'if (( RANDOM % 2 )); then
1927               (echo "TRUE"; true);
1928             else
1929               (echo "FALSE"; false);
1930             fi' --until-success
1931     $ loopy --halt now,success=1 'if (( $RANDOM % 2 )); then
1932                                     (echo "TRUE"; true);
1933                                   else
1934                                     (echo "FALSE"; false);
1935                                   fi'
1936
1937     $ loop 'if (( RANDOM % 2 )); then
1938         (echo "TRUE"; true);
1939       else
1940         (echo "FALSE"; false);
1941       fi' --until-error
1942     $ loopy --halt now,fail=1 'if (( $RANDOM % 2 )); then
1943                                  (echo "TRUE"; true);
1944                                else
1945                                  (echo "FALSE"; false);
1946                                fi'
1947
1948     $ loop 'date' --until-match "(\d{4})"
1949     $ loopy --halt now,success=1 'date | match [0-9][0-9][0-9][0-9]'
1950
1951     $ loop 'echo $ITEM' --for red,green,blue
1952     $ parallel echo ::: red green blue
1953
1954     $ cat /tmp/my-list-of-files-to-create.txt | loop 'touch $ITEM'
1955     $ cat /tmp/my-list-of-files-to-create.txt | parallel touch
1956
1957     $ ls | loop 'cp $ITEM $ITEM.bak'; ls
1958     $ ls | parallel cp {} {}.bak; ls
1959
1960     $ loop 'echo $ITEM | tr a-z A-Z' -i
1961     $ parallel 'echo {} | tr a-z A-Z'
1962     # Or more efficiently:
1963     $ parallel --pipe tr a-z A-Z
1964
1965     $ loop 'echo $ITEM' --for "`ls`"
1966     $ parallel echo {} ::: "`ls`"
1967
1968     $ ls | loop './my_program $ITEM' --until-success;
1969     $ ls | parallel --halt now,success=1 ./my_program {}
1970
1971     $ ls | loop './my_program $ITEM' --until-fail;
1972     $ ls | parallel --halt now,fail=1 ./my_program {}
1973
1974     $ ./deploy.sh;
1975       loop 'curl -sw "%{http_code}" http://coolwebsite.biz' \
1976         --every 5s --until-contains 200;
1977       ./announce_to_slack.sh
1978     $ ./deploy.sh;
1979       loopy --delay 5s --halt now,success=1 \
1980       'curl -sw "%{http_code}" http://coolwebsite.biz | match 200';
1981       ./announce_to_slack.sh
1982
1983     $ loop "ping -c 1 mysite.com" --until-success; ./do_next_thing
1984     $ loopy --halt now,success=1 ping -c 1 mysite.com; ./do_next_thing
1985
1986     $ ./create_big_file -o my_big_file.bin;
1987       loop 'ls' --until-contains 'my_big_file.bin';
1988       ./upload_big_file my_big_file.bin
1989     # inotifywait is a better tool to detect file system changes.
1990     # It can even make sure the file is complete
1991     # so you are not uploading an incomplete file
1992     $ inotifywait -qmre MOVED_TO -e CLOSE_WRITE --format %w%f . |
1993         grep my_big_file.bin
1994
1995     $ ls | loop 'cp $ITEM $ITEM.bak'
1996     $ ls | parallel cp {} {}.bak
1997
1998     $ loop './do_thing.sh' --every 15s --until-success --num 5
1999     $ parallel --retries 5 --delay 15s ::: ./do_thing.sh
2000
2001 https://github.com/Miserlou/Loop/
2002 (Last checked: 2018-10)
2003
2004
2005 =head2 DIFFERENCES BETWEEN lorikeet AND GNU Parallel
2006
2007 B<lorikeet> can run jobs in parallel. It does this based on a
2008 dependency graph described in a file, so this is similar to B<make>.
2009
2010 https://github.com/cetra3/lorikeet
2011 (Last checked: 2018-10)
2012
2013
2014 =head2 DIFFERENCES BETWEEN spp AND GNU Parallel
2015
2016 B<spp> can run jobs in parallel. B<spp> does not use a command
2017 template to generate the jobs, but requires jobs to be in a
2018 file. Output from the jobs mix.
2019
2020 https://github.com/john01dav/spp
2021 (Last checked: 2019-01)
2022
2023
2024 =head2 DIFFERENCES BETWEEN paral AND GNU Parallel
2025
2026 B<paral> prints a lot of status information and stores the output from
2027 the commands run into files. This means it cannot be used the middle
2028 of a pipe like this
2029
2030   paral "echo this" "echo does not" "echo work" | wc
2031
2032 Instead it puts the output into files named like
2033 B<out_#_I<command>.out.log>. To get a very similar behaviour with GNU
2034 B<parallel> use B<--results
2035 'out_{#}_{=s/[^\sa-z_0-9]//g;s/\s+/_/g=}.log' --eta>
2036
2037 B<paral> only takes arguments on the command line and each argument
2038 should be a full command. Thus it does not use command templates.
2039
2040 This limits how many jobs it can run in total, because they all need
2041 to fit on a single command line.
2042
2043 B<paral> has no support for running jobs remotely.
2044
2045 =head3 EXAMPLES FROM README.markdown
2046
2047 The examples from B<README.markdown> and the corresponding command run
2048 with GNU B<parallel> (B<--results
2049 'out_{#}_{=s/[^\sa-z_0-9]//g;s/\s+/_/g=}.log' --eta> is omitted from
2050 the GNU B<parallel> command):
2051
2052   1$ paral "command 1" "command 2 --flag" "command arg1 arg2"
2053   1$ parallel ::: "command 1" "command 2 --flag" "command arg1 arg2"
2054
2055   2$ paral "sleep 1 && echo c1" "sleep 2 && echo c2" \
2056        "sleep 3 && echo c3" "sleep 4 && echo c4"  "sleep 5 && echo c5"
2057   2$ parallel ::: "sleep 1 && echo c1" "sleep 2 && echo c2" \
2058        "sleep 3 && echo c3" "sleep 4 && echo c4"  "sleep 5 && echo c5"
2059      # Or shorter:
2060      parallel "sleep {} && echo c{}" ::: {1..5}
2061
2062   3$ paral -n=0 "sleep 5 && echo c5" "sleep 4 && echo c4" \
2063        "sleep 3 && echo c3" "sleep 2 && echo c2" "sleep 1 && echo c1"
2064   3$ parallel ::: "sleep 5 && echo c5" "sleep 4 && echo c4" \
2065        "sleep 3 && echo c3" "sleep 2 && echo c2" "sleep 1 && echo c1"
2066      # Or shorter:
2067      parallel -j0 "sleep {} && echo c{}" ::: 5 4 3 2 1
2068
2069   4$ paral -n=1 "sleep 5 && echo c5" "sleep 4 && echo c4" \
2070        "sleep 3 && echo c3" "sleep 2 && echo c2" "sleep 1 && echo c1"
2071   4$ parallel -j1 "sleep {} && echo c{}" ::: 5 4 3 2 1
2072
2073   5$ paral -n=2 "sleep 5 && echo c5" "sleep 4 && echo c4" \
2074        "sleep 3 && echo c3" "sleep 2 && echo c2" "sleep 1 && echo c1"
2075   5$ parallel -j2 "sleep {} && echo c{}" ::: 5 4 3 2 1
2076
2077   6$ paral -n=5 "sleep 5 && echo c5" "sleep 4 && echo c4" \
2078        "sleep 3 && echo c3" "sleep 2 && echo c2" "sleep 1 && echo c1"
2079   6$ parallel -j5 "sleep {} && echo c{}" ::: 5 4 3 2 1
2080
2081   7$ paral -n=1 "echo a && sleep 0.5 && echo b && sleep 0.5 && \
2082        echo c && sleep 0.5 && echo d && sleep 0.5 && \
2083        echo e && sleep 0.5 && echo f && sleep 0.5 && \
2084        echo g && sleep 0.5 && echo h"
2085   7$ parallel ::: "echo a && sleep 0.5 && echo b && sleep 0.5 && \
2086        echo c && sleep 0.5 && echo d && sleep 0.5 && \
2087        echo e && sleep 0.5 && echo f && sleep 0.5 && \
2088        echo g && sleep 0.5 && echo h"
2089
2090 https://github.com/amattn/paral
2091 (Last checked: 2019-01)
2092
2093
2094 =head2 DIFFERENCES BETWEEN concurr AND GNU Parallel
2095
2096 B<concurr> is built to run jobs in parallel using a client/server
2097 model.
2098
2099 =head3 EXAMPLES FROM README.md
2100
2101 The examples from B<README.md>:
2102
2103   1$ concurr 'echo job {#} on slot {%}: {}' : arg1 arg2 arg3 arg4
2104   1$ parallel 'echo job {#} on slot {%}: {}' ::: arg1 arg2 arg3 arg4
2105
2106   2$ concurr 'echo job {#} on slot {%}: {}' :: file1 file2 file3
2107   2$ parallel 'echo job {#} on slot {%}: {}' :::: file1 file2 file3
2108
2109   3$ concurr 'echo {}' < input_file
2110   3$ parallel 'echo {}' < input_file
2111
2112   4$ cat file | concurr 'echo {}'
2113   4$ cat file | parallel 'echo {}'
2114
2115 B<concurr> deals badly empty input files and with output larger than
2116 64 KB.
2117
2118 https://github.com/mmstick/concurr
2119 (Last checked: 2019-01)
2120
2121
2122 =head2 DIFFERENCES BETWEEN lesser-parallel AND GNU Parallel
2123
2124 B<lesser-parallel> is the inspiration for B<parallel --embed>. Both
2125 B<lesser-parallel> and B<parallel --embed> define bash functions that
2126 can be included as part of a bash script to run jobs in parallel.
2127
2128 B<lesser-parallel> implements a few of the replacement strings, but
2129 hardly any options, whereas B<parallel --embed> gives you the full
2130 GNU B<parallel> experience.
2131
2132 https://github.com/kou1okada/lesser-parallel
2133 (Last checked: 2019-01)
2134
2135
2136 =head2 DIFFERENCES BETWEEN npm-parallel AND GNU Parallel
2137
2138 B<npm-parallel> can run npm tasks in parallel.
2139
2140 There are no examples and very little documentation, so it is hard to
2141 compare to GNU B<parallel>.
2142
2143 https://github.com/spion/npm-parallel
2144 (Last checked: 2019-01)
2145
2146
2147 =head2 DIFFERENCES BETWEEN machma AND GNU Parallel
2148
2149 B<machma> runs tasks in parallel. It gives time stamped
2150 output. It buffers in RAM.
2151
2152 =head3 EXAMPLES FROM README.md
2153
2154 The examples from README.md:
2155
2156   1$ # Put shorthand for timestamp in config for the examples
2157      echo '--rpl '\
2158        \''{time} $_=::strftime("%Y-%m-%d %H:%M:%S",localtime())'\' \
2159        > ~/.parallel/machma
2160      echo '--line-buffer --tagstring "{#} {time} {}"' \
2161        >> ~/.parallel/machma
2162
2163   2$ find . -iname '*.jpg' |
2164        machma --  mogrify -resize 1200x1200 -filter Lanczos {}
2165      find . -iname '*.jpg' |
2166        parallel --bar -Jmachma mogrify -resize 1200x1200 \
2167          -filter Lanczos {}
2168
2169   3$ cat /tmp/ips | machma -p 2 -- ping -c 2 -q {}
2170   3$ cat /tmp/ips | parallel -j2 -Jmachma ping -c 2 -q {}
2171
2172   4$ cat /tmp/ips |
2173        machma -- sh -c 'ping -c 2 -q $0 > /dev/null && echo alive' {}
2174   4$ cat /tmp/ips |
2175        parallel -Jmachma 'ping -c 2 -q {} > /dev/null && echo alive'
2176
2177   5$ find . -iname '*.jpg' |
2178        machma --timeout 5s -- mogrify -resize 1200x1200 \
2179          -filter Lanczos {}
2180   5$ find . -iname '*.jpg' |
2181        parallel --timeout 5s --bar mogrify -resize 1200x1200 \
2182          -filter Lanczos {}
2183
2184   6$ find . -iname '*.jpg' -print0 |
2185        machma --null --  mogrify -resize 1200x1200 -filter Lanczos {}
2186   6$ find . -iname '*.jpg' -print0 |
2187        parallel --null --bar mogrify -resize 1200x1200 \
2188          -filter Lanczos {}
2189
2190 https://github.com/fd0/machma
2191 (Last checked: 2019-06)
2192
2193
2194 =head2 DIFFERENCES BETWEEN interlace AND GNU Parallel
2195
2196 Summary (see legend above):
2197
2198 =over
2199
2200 =item - I2 I3 I4 - - -
2201
2202 =item M1 - M3 - - M6
2203
2204 =item - O2 O3 - - - - x x
2205
2206 =item E1 E2 - - - - -
2207
2208 =item - - - - - - - - -
2209
2210 =item - -
2211
2212 =back
2213
2214 B<interlace> is built for network analysis to run network tools in parallel.
2215
2216 B<interface> does not buffer output, so output from different jobs mixes.
2217
2218 The overhead for each target is O(n*n), so with 1000 targets it
2219 becomes very slow with an overhead in the order of 500ms/target.
2220
2221 =head3 EXAMPLES FROM interlace's WEBSITE
2222
2223 Using B<prips> most of the examples from
2224 https://github.com/codingo/Interlace can be run with GNU B<parallel>:
2225
2226 Blocker
2227
2228   commands.txt:
2229     mkdir -p _output_/_target_/scans/
2230     _blocker_
2231     nmap _target_ -oA _output_/_target_/scans/_target_-nmap
2232   interlace -tL ./targets.txt -cL commands.txt -o $output
2233
2234   parallel -a targets.txt \
2235     mkdir -p $output/{}/scans/\; nmap {} -oA $output/{}/scans/{}-nmap
2236
2237 Blocks
2238
2239   commands.txt:
2240     _block:nmap_
2241     mkdir -p _target_/output/scans/
2242     nmap _target_ -oN _target_/output/scans/_target_-nmap
2243     _block:nmap_
2244     nikto --host _target_
2245   interlace -tL ./targets.txt -cL commands.txt
2246
2247   _nmap() {
2248     mkdir -p $1/output/scans/
2249     nmap $1 -oN $1/output/scans/$1-nmap
2250   }
2251   export -f _nmap
2252   parallel ::: _nmap "nikto --host" :::: targets.txt
2253
2254 Run Nikto Over Multiple Sites
2255
2256   interlace -tL ./targets.txt -threads 5 \
2257     -c "nikto --host _target_ > ./_target_-nikto.txt" -v
2258
2259   parallel -a targets.txt -P5 nikto --host {} \> ./{}_-nikto.txt
2260
2261 Run Nikto Over Multiple Sites and Ports
2262
2263   interlace -tL ./targets.txt -threads 5 -c \
2264     "nikto --host _target_:_port_ > ./_target_-_port_-nikto.txt" \
2265     -p 80,443 -v
2266
2267   parallel -P5 nikto --host {1}:{2} \> ./{1}-{2}-nikto.txt \
2268     :::: targets.txt ::: 80 443
2269
2270 Run a List of Commands against Target Hosts
2271
2272   commands.txt:
2273     nikto --host _target_:_port_ > _output_/_target_-nikto.txt
2274     sslscan _target_:_port_ >  _output_/_target_-sslscan.txt
2275     testssl.sh _target_:_port_ > _output_/_target_-testssl.txt
2276   interlace -t example.com -o ~/Engagements/example/ \
2277     -cL ./commands.txt -p 80,443
2278
2279   parallel --results ~/Engagements/example/{2}:{3}{1} {1} {2}:{3} \
2280     ::: "nikto --host" sslscan testssl.sh ::: example.com ::: 80 443
2281
2282 CIDR notation with an application that doesn't support it
2283
2284   interlace -t 192.168.12.0/24 -c "vhostscan _target_ \
2285     -oN _output_/_target_-vhosts.txt" -o ~/scans/ -threads 50
2286
2287   prips 192.168.12.0/24 |
2288     parallel -P50 vhostscan {} -oN ~/scans/{}-vhosts.txt
2289
2290 Glob notation with an application that doesn't support it
2291
2292   interlace -t 192.168.12.* -c "vhostscan _target_ \
2293     -oN _output_/_target_-vhosts.txt" -o ~/scans/ -threads 50
2294
2295   # Glob is not supported in prips
2296   prips 192.168.12.0/24 |
2297     parallel -P50 vhostscan {} -oN ~/scans/{}-vhosts.txt
2298
2299 Dash (-) notation with an application that doesn't support it
2300
2301   interlace -t 192.168.12.1-15 -c \
2302     "vhostscan _target_ -oN _output_/_target_-vhosts.txt" \
2303     -o ~/scans/ -threads 50
2304
2305   # Dash notation is not supported in prips
2306   prips 192.168.12.1 192.168.12.15 |
2307     parallel -P50 vhostscan {} -oN ~/scans/{}-vhosts.txt
2308
2309 Threading Support for an application that doesn't support it
2310
2311   interlace -tL ./target-list.txt -c \
2312     "vhostscan -t _target_ -oN _output_/_target_-vhosts.txt" \
2313     -o ~/scans/ -threads 50
2314
2315   cat ./target-list.txt |
2316     parallel -P50 vhostscan -t {} -oN ~/scans/{}-vhosts.txt
2317
2318 alternatively
2319
2320   ./vhosts-commands.txt:
2321     vhostscan -t $target -oN _output_/_target_-vhosts.txt
2322   interlace -cL ./vhosts-commands.txt -tL ./target-list.txt \
2323     -threads 50 -o ~/scans
2324
2325   ./vhosts-commands.txt:
2326     vhostscan -t "$1" -oN "$2"
2327   parallel -P50 ./vhosts-commands.txt {} ~/scans/{}-vhosts.txt \
2328     :::: ./target-list.txt
2329
2330 Exclusions
2331
2332   interlace -t 192.168.12.0/24 -e 192.168.12.0/26 -c \
2333     "vhostscan _target_ -oN _output_/_target_-vhosts.txt" \
2334     -o ~/scans/ -threads 50
2335
2336   prips 192.168.12.0/24 | grep -xv -Ff <(prips 192.168.12.0/26) |
2337     parallel -P50 vhostscan {} -oN ~/scans/{}-vhosts.txt
2338
2339 Run Nikto Using Multiple Proxies
2340
2341    interlace -tL ./targets.txt -pL ./proxies.txt -threads 5 -c \
2342      "nikto --host _target_:_port_ -useproxy _proxy_ > \
2343       ./_target_-_port_-nikto.txt" -p 80,443 -v
2344
2345    parallel -j5 \
2346      "nikto --host {1}:{2} -useproxy {3} > ./{1}-{2}-nikto.txt" \
2347      :::: ./targets.txt ::: 80 443 :::: ./proxies.txt
2348
2349 https://github.com/codingo/Interlace
2350 (Last checked: 2019-09)
2351
2352
2353 =head2 DIFFERENCES BETWEEN otonvm Parallel AND GNU Parallel
2354
2355 I have been unable to get the code to run at all. It seems unfinished.
2356
2357 https://github.com/otonvm/Parallel
2358 (Last checked: 2019-02)
2359
2360
2361 =head2 DIFFERENCES BETWEEN k-bx par AND GNU Parallel
2362
2363 B<par> requires Haskell to work. This limits the number of platforms
2364 this can work on.
2365
2366 B<par> does line buffering in memory. The memory usage is 3x the
2367 longest line (compared to 1x for B<parallel --lb>). Commands must be
2368 given as arguments. There is no template.
2369
2370 These are the examples from https://github.com/k-bx/par with the
2371 corresponding GNU B<parallel> command.
2372
2373   par "echo foo; sleep 1; echo foo; sleep 1; echo foo" \
2374       "echo bar; sleep 1; echo bar; sleep 1; echo bar" && echo "success"
2375   parallel --lb ::: "echo foo; sleep 1; echo foo; sleep 1; echo foo" \
2376       "echo bar; sleep 1; echo bar; sleep 1; echo bar" && echo "success"
2377
2378   par "echo foo; sleep 1; foofoo" \
2379       "echo bar; sleep 1; echo bar; sleep 1; echo bar" && echo "success"
2380   parallel --lb --halt 1 ::: "echo foo; sleep 1; foofoo" \
2381       "echo bar; sleep 1; echo bar; sleep 1; echo bar" && echo "success"
2382
2383   par "PARPREFIX=[fooechoer] echo foo" "PARPREFIX=[bar] echo bar"
2384   parallel --lb --colsep , --tagstring {1} {2} \
2385     ::: "[fooechoer],echo foo" "[bar],echo bar"
2386
2387   par --succeed "foo" "bar" && echo 'wow'
2388   parallel "foo" "bar"; true && echo 'wow'
2389
2390 https://github.com/k-bx/par
2391 (Last checked: 2019-02)
2392
2393 =head2 DIFFERENCES BETWEEN parallelshell AND GNU Parallel
2394
2395 B<parallelshell> does not allow for composed commands:
2396
2397   # This does not work
2398   parallelshell 'echo foo;echo bar' 'echo baz;echo quuz'
2399
2400 Instead you have to wrap that in a shell:
2401
2402   parallelshell 'sh -c "echo foo;echo bar"' 'sh -c "echo baz;echo quuz"'
2403
2404 It buffers output in RAM. All commands must be given on the command
2405 line and all commands are started in parallel at the same time. This
2406 will cause the system to freeze if there are so many jobs that there
2407 is not enough memory to run them all at the same time.
2408
2409 https://github.com/keithamus/parallelshell
2410 (Last checked: 2019-02)
2411
2412 https://github.com/darkguy2008/parallelshell
2413 (Last checked: 2019-03)
2414
2415
2416 =head2 DIFFERENCES BETWEEN shell-executor AND GNU Parallel
2417
2418 B<shell-executor> does not allow for composed commands:
2419
2420   # This does not work
2421   sx 'echo foo;echo bar' 'echo baz;echo quuz'
2422
2423 Instead you have to wrap that in a shell:
2424
2425   sx 'sh -c "echo foo;echo bar"' 'sh -c "echo baz;echo quuz"'
2426
2427 It buffers output in RAM. All commands must be given on the command
2428 line and all commands are started in parallel at the same time. This
2429 will cause the system to freeze if there are so many jobs that there
2430 is not enough memory to run them all at the same time.
2431
2432 https://github.com/royriojas/shell-executor
2433 (Last checked: 2019-02)
2434
2435
2436 =head2 DIFFERENCES BETWEEN non-GNU par AND GNU Parallel
2437
2438 B<par> buffers in memory to avoid mixing of jobs. It takes 1s per 1
2439 million output lines.
2440
2441 B<par> needs to have all commands before starting the first job. The
2442 jobs are read from stdin (standard input) so any quoting will have to
2443 be done by the user.
2444
2445 Stdout (standard output) is prepended with o:. Stderr (standard error)
2446 is sendt to stdout (standard output) and prepended with e:.
2447
2448 For short jobs with little output B<par> is 20% faster than GNU
2449 B<parallel> and 60% slower than B<xargs>.
2450
2451 https://github.com/UnixJunkie/PAR
2452
2453 https://savannah.nongnu.org/projects/par
2454 (Last checked: 2019-02)
2455
2456
2457 =head2 DIFFERENCES BETWEEN fd AND GNU Parallel
2458
2459 B<fd> does not support composed commands, so commands must be wrapped
2460 in B<sh -c>.
2461
2462 It buffers output in RAM.
2463
2464 It only takes file names from the filesystem as input (similar to B<find>).
2465
2466 https://github.com/sharkdp/fd
2467 (Last checked: 2019-02)
2468
2469
2470 =head2 DIFFERENCES BETWEEN lateral AND GNU Parallel
2471
2472 B<lateral> is very similar to B<sem>: It takes a single command and
2473 runs it in the background. The design means that output from parallel
2474 running jobs may mix. If it dies unexpectly it leaves a socket in
2475 ~/.lateral/socket.PID.
2476
2477 B<lateral> deals badly with too long command lines. This makes the
2478 B<lateral> server crash:
2479
2480   lateral run echo `seq 100000| head -c 1000k`
2481
2482 Any options will be read by B<lateral> so this does not work
2483 (B<lateral> interprets the B<-l>):
2484
2485   lateral run ls -l
2486
2487 Composed commands do not work:
2488
2489   lateral run pwd ';' ls
2490
2491 Functions do not work:
2492
2493   myfunc() { echo a; }
2494   export -f myfunc
2495   lateral run myfunc
2496
2497 Running B<emacs> in the terminal causes the parent shell to die:
2498
2499   echo '#!/bin/bash' > mycmd
2500   echo emacs -nw >> mycmd
2501   chmod +x mycmd
2502   lateral start
2503   lateral run ./mycmd
2504
2505 Here are the examples from https://github.com/akramer/lateral with the
2506 corresponding GNU B<sem> and GNU B<parallel> commands:
2507
2508   1$ lateral start
2509      for i in $(cat /tmp/names); do
2510        lateral run -- some_command $i
2511      done
2512      lateral wait
2513
2514   1$ for i in $(cat /tmp/names); do
2515        sem some_command $i
2516      done
2517      sem --wait
2518
2519   1$ parallel some_command :::: /tmp/names
2520
2521   2$ lateral start
2522      for i in $(seq 1 100); do
2523        lateral run -- my_slow_command < workfile$i > /tmp/logfile$i
2524      done
2525      lateral wait
2526
2527   2$ for i in $(seq 1 100); do
2528        sem my_slow_command < workfile$i > /tmp/logfile$i
2529      done
2530      sem --wait
2531
2532   2$ parallel 'my_slow_command < workfile{} > /tmp/logfile{}' \
2533        ::: {1..100}
2534
2535   3$ lateral start -p 0 # yup, it will just queue tasks
2536      for i in $(seq 1 100); do
2537        lateral run -- command_still_outputs_but_wont_spam inputfile$i
2538      done
2539      # command output spam can commence
2540      lateral config -p 10; lateral wait
2541
2542   3$ for i in $(seq 1 100); do
2543        echo "command inputfile$i" >> joblist
2544      done
2545      parallel -j 10 :::: joblist
2546
2547   3$ echo 1 > /tmp/njobs
2548      parallel -j /tmp/njobs command inputfile{} \
2549        ::: {1..100} &
2550      echo 10 >/tmp/njobs
2551      wait
2552
2553 https://github.com/akramer/lateral
2554 (Last checked: 2019-03)
2555
2556
2557 =head2 DIFFERENCES BETWEEN with-this AND GNU Parallel
2558
2559 The examples from https://github.com/amritb/with-this.git and the
2560 corresponding GNU B<parallel> command:
2561
2562   with -v "$(cat myurls.txt)" "curl -L this"
2563   parallel curl -L ::: myurls.txt
2564
2565   with -v "$(cat myregions.txt)" \
2566     "aws --region=this ec2 describe-instance-status"
2567   parallel aws --region={} ec2 describe-instance-status \
2568     :::: myregions.txt
2569
2570   with -v "$(ls)" "kubectl --kubeconfig=this get pods"
2571   ls | parallel kubectl --kubeconfig={} get pods
2572
2573   with -v "$(ls | grep config)" "kubectl --kubeconfig=this get pods"
2574   ls | grep config | parallel kubectl --kubeconfig={} get pods
2575
2576   with -v "$(echo {1..10})" "echo 123"
2577   parallel -N0 echo 123 ::: {1..10}
2578
2579 Stderr is merged with stdout. B<with-this> buffers in RAM. It uses 3x
2580 the output size, so you cannot have output larger than 1/3rd the
2581 amount of RAM. The input values cannot contain spaces. Composed
2582 commands do not work.
2583
2584 B<with-this> gives some additional information, so the output has to
2585 be cleaned before piping it to the next command.
2586
2587 https://github.com/amritb/with-this.git
2588 (Last checked: 2019-03)
2589
2590
2591 =head2 DIFFERENCES BETWEEN Tollef's parallel (moreutils) AND GNU Parallel
2592
2593 Summary (see legend above):
2594
2595 =over
2596
2597 =item - - - I4 - - I7
2598
2599 =item - - M3 - - M6
2600
2601 =item - O2 O3 - O5 O6 - x x
2602
2603 =item E1 - - - - - E7
2604
2605 =item - x x x x x x x x
2606
2607 =item - -
2608
2609 =back
2610
2611 =head3 EXAMPLES FROM Tollef's parallel MANUAL
2612
2613 B<Tollef> parallel sh -c "echo hi; sleep 2; echo bye" -- 1 2 3
2614
2615 B<GNU> parallel "echo hi; sleep 2; echo bye" ::: 1 2 3
2616
2617 B<Tollef> parallel -j 3 ufraw -o processed -- *.NEF
2618
2619 B<GNU> parallel -j 3 ufraw -o processed ::: *.NEF
2620
2621 B<Tollef> parallel -j 3 -- ls df "echo hi"
2622
2623 B<GNU> parallel -j 3 ::: ls df "echo hi"
2624
2625 (Last checked: 2019-08)
2626
2627 =head2 DIFFERENCES BETWEEN rargs AND GNU Parallel
2628
2629 Summary (see legend above):
2630
2631 =over
2632
2633 =item I1 - - - - - I7
2634
2635 =item - - M3 M4 - -
2636
2637 =item - O2 O3 - O5 O6 - O8 -
2638
2639 =item E1 - - E4 - - -
2640
2641 =item - - - - - - - - -
2642
2643 =item - -
2644
2645 =back
2646
2647 B<rargs> has elegant ways of doing named regexp capture and field ranges.
2648
2649 With GNU B<parallel> you can use B<--rpl> to get a similar
2650 functionality as regexp capture gives, and use B<join> and B<@arg> to
2651 get the field ranges. But the syntax is longer. This:
2652
2653   --rpl '{r(\d+)\.\.(\d+)} $_=join"$opt::colsep",@arg[$$1..$$2]'
2654
2655 would make it possible to use:
2656
2657   {1r3..6}
2658
2659 for field 3..6.
2660
2661 For full support of {n..m:s} including negative numbers use a dynamic
2662 replacement string like this:
2663
2664
2665   PARALLEL=--rpl\ \''{r((-?\d+)?)\.\.((-?\d+)?)((:([^}]*))?)}
2666           $a = defined $$2 ? $$2 < 0 ? 1+$#arg+$$2 : $$2 : 1;
2667           $b = defined $$4 ? $$4 < 0 ? 1+$#arg+$$4 : $$4 : $#arg+1;
2668           $s = defined $$6 ? $$7 : " ";
2669           $_ = join $s,@arg[$a..$b]'\'
2670   export PARALLEL
2671
2672 You can then do:
2673
2674   head /etc/passwd | parallel --colsep : echo ..={1r..} ..3={1r..3} \
2675     4..={1r4..} 2..4={1r2..4} 3..3={1r3..3} ..3:-={1r..3:-} \
2676     ..3:/={1r..3:/} -1={-1} -5={-5} -6={-6} -3..={1r-3..}
2677
2678 =head3 EXAMPLES FROM rargs MANUAL
2679
2680   1$ ls *.bak | rargs -p '(.*)\.bak' mv {0} {1}
2681
2682   1$ ls *.bak | parallel mv {} {.}
2683
2684   2$ cat download-list.csv |
2685        rargs -p '(?P<url>.*),(?P<filename>.*)' wget {url} -O {filename}
2686
2687   2$ cat download-list.csv |
2688        parallel --csv wget {1} -O {2}
2689   # or use regexps:
2690   2$ cat download-list.csv |
2691        parallel --rpl '{url} s/,.*//' --rpl '{filename} s/.*?,//' \
2692          wget {url} -O {filename}
2693
2694   3$ cat /etc/passwd |
2695        rargs -d: echo -e 'id: "{1}"\t name: "{5}"\t rest: "{6..::}"'
2696
2697   3$ cat /etc/passwd |
2698        parallel -q --colsep : \
2699          echo -e 'id: "{1}"\t name: "{5}"\t rest: "{=6 $_=join":",@arg[6..$#arg]=}"'
2700
2701 https://github.com/lotabout/rargs
2702 (Last checked: 2020-01)
2703
2704
2705 =head2 DIFFERENCES BETWEEN threader AND GNU Parallel
2706
2707 Summary (see legend above):
2708
2709 =over
2710
2711 =item I1 - - - - - -
2712
2713 =item M1 - M3 - - M6
2714
2715 =item O1 - O3 - O5 - - x x
2716
2717 =item E1 - - E4 - - -
2718
2719 =item - - - - - - - - -
2720
2721 =item - -
2722
2723 =back
2724
2725 Newline separates arguments, but newline at the end of file is treated
2726 as an empty argument. So this runs 2 jobs:
2727
2728   echo two_jobs | threader -run 'echo "$THREADID"'
2729
2730 B<threader> ignores stderr, so any output to stderr is
2731 lost. B<threader> buffers in RAM, so output bigger than the machine's
2732 virtual memory will cause the machine to crash.
2733
2734 https://github.com/voodooEntity/threader
2735 (Last checked: 2020-04)
2736
2737
2738 =head2 DIFFERENCES BETWEEN runp AND GNU Parallel
2739
2740 Summary (see legend above):
2741
2742 =over
2743
2744 =item I1 I2 - - - - -
2745
2746 =item M1 - (M3) - - M6
2747
2748 =item O1 O2 O3 - O5 O6 - x x -
2749
2750 =item E1 - - - - - -
2751
2752 =item - - - - - - - - -
2753
2754 =item - -
2755
2756 =back
2757
2758 (M3): You can add a prefix and a postfix to the input, so it means you can
2759 only insert the argument on the command line once.
2760
2761 B<runp> runs 10 jobs in parallel by default.  B<runp> blocks if output
2762 of a command is > 64 Kbytes.  Quoting of input is needed.  It adds
2763 output to stderr (this can be prevented with -q)
2764
2765 =head3 Examples as GNU Parallel
2766
2767   base='https://images-api.nasa.gov/search'
2768   query='jupiter'
2769   desc='planet'
2770   type='image'
2771   url="$base?q=$query&description=$desc&media_type=$type"
2772
2773   # Download the images in parallel using runp
2774   curl -s $url | jq -r .collection.items[].href | \
2775     runp -p 'curl -s' | jq -r .[] | grep large | \
2776     runp -p 'curl -s -L -O'
2777
2778   time curl -s $url | jq -r .collection.items[].href | \
2779     runp -g 1 -q -p 'curl -s' | jq -r .[] | grep large | \
2780     runp -g 1 -q -p 'curl -s -L -O'
2781
2782   # Download the images in parallel
2783   curl -s $url | jq -r .collection.items[].href | \
2784     parallel curl -s | jq -r .[] | grep large | \
2785     parallel curl -s -L -O
2786
2787   time curl -s $url | jq -r .collection.items[].href | \
2788     parallel -j 1 curl -s | jq -r .[] | grep large | \
2789     parallel -j 1 curl -s -L -O
2790
2791
2792 =head4 Run some test commands (read from file)
2793
2794   # Create a file containing commands to run in parallel.
2795   cat << EOF > /tmp/test-commands.txt
2796   sleep 5
2797   sleep 3
2798   blah     # this will fail
2799   ls $PWD  # PWD shell variable is used here
2800   EOF
2801
2802   # Run commands from the file.
2803   runp /tmp/test-commands.txt > /dev/null
2804
2805   parallel -a /tmp/test-commands.txt > /dev/null
2806
2807 =head4 Ping several hosts and see packet loss (read from stdin)
2808
2809   # First copy this line and press Enter
2810   runp -p 'ping -c 5 -W 2' -s '| grep loss'
2811   localhost
2812   1.1.1.1
2813   8.8.8.8
2814   # Press Enter and Ctrl-D when done entering the hosts
2815
2816   # First copy this line and press Enter
2817   parallel ping -c 5 -W 2 {} '| grep loss'
2818   localhost
2819   1.1.1.1
2820   8.8.8.8
2821   # Press Enter and Ctrl-D when done entering the hosts
2822
2823 =head4 Get directories' sizes (read from stdin)
2824
2825   echo -e "$HOME\n/etc\n/tmp" | runp -q -p 'sudo du -sh'
2826
2827   echo -e "$HOME\n/etc\n/tmp" | parallel sudo du -sh
2828   # or:
2829   parallel sudo du -sh ::: "$HOME" /etc /tmp
2830
2831 =head4 Compress files
2832
2833   find . -iname '*.txt' | runp -p 'gzip --best'
2834
2835   find . -iname '*.txt' | parallel gzip --best
2836
2837 =head4 Measure HTTP request + response time
2838
2839   export CURL="curl -w 'time_total:  %{time_total}\n'"
2840   CURL="$CURL -o /dev/null -s https://golang.org/"
2841   perl -wE 'for (1..10) { say $ENV{CURL} }' |
2842      runp -q  # Make 10 requests
2843
2844   perl -wE 'for (1..10) { say $ENV{CURL} }' | parallel
2845   # or:
2846   parallel -N0 "$CURL" ::: {1..10}
2847
2848 =head4 Find open TCP ports
2849
2850   cat << EOF > /tmp/host-port.txt
2851   localhost 22
2852   localhost 80
2853   localhost 81
2854   127.0.0.1 443
2855   127.0.0.1 444
2856   scanme.nmap.org 22
2857   scanme.nmap.org 23
2858   scanme.nmap.org 443
2859   EOF
2860
2861   1$ cat /tmp/host-port.txt |
2862        runp -q -p 'netcat -v -w2 -z' 2>&1 | egrep '(succeeded!|open)$'
2863
2864   # --colsep is needed to split the line
2865   1$ cat /tmp/host-port.txt |
2866        parallel --colsep ' ' netcat -v -w2 -z 2>&1 |
2867        egrep '(succeeded!|open)$'
2868   # or use uq for unquoted:
2869   1$ cat /tmp/host-port.txt |
2870        parallel netcat -v -w2 -z {=uq=} 2>&1 |
2871        egrep '(succeeded!|open)$'
2872
2873 https://github.com/jreisinger/runp
2874 (Last checked: 2020-04)
2875
2876
2877 =head2 DIFFERENCES BETWEEN papply AND GNU Parallel
2878
2879 Summary (see legend above):
2880
2881 =over
2882
2883 =item - - - I4 - - -
2884
2885 =item M1 - M3 - - M6
2886
2887 =item - - O3 - O5 - - x x O10
2888
2889 =item E1 - - E4 - - -
2890
2891 =item - - - - - - - - -
2892
2893 =item - -
2894
2895 =back
2896
2897 B<papply> does not print the output if the command fails:
2898
2899   $ papply 'echo %F; false' foo
2900   "echo foo; false" did not succeed
2901
2902 B<papply>'s replacement strings (%F %d %f %n %e %z) can be simulated in GNU
2903 B<parallel> by putting this in B<~/.parallel/config>:
2904
2905   --rpl '%F'
2906   --rpl '%d $_=Q(::dirname($_));'
2907   --rpl '%f s:.*/::;'
2908   --rpl '%n s:.*/::;s:\.[^/.]+$::;'
2909   --rpl '%e s:.*\.:.:'
2910   --rpl '%z $_=""'
2911
2912 B<papply> buffers in RAM, and uses twice the amount of output. So
2913 output of 5 GB takes 10 GB RAM.
2914
2915 The buffering is very CPU intensive: Buffering a line of 5 GB takes 40
2916 seconds (compared to 10 seconds with GNU B<parallel>).
2917
2918
2919 =head3 Examples as GNU Parallel
2920
2921   1$ papply gzip *.txt
2922
2923   1$ parallel gzip ::: *.txt
2924
2925   2$ papply "convert %F %n.jpg" *.png
2926
2927   2$ parallel convert {} {.}.jpg ::: *.png
2928
2929
2930 https://pypi.org/project/papply/
2931 (Last checked: 2020-04)
2932
2933
2934 =head2 DIFFERENCES BETWEEN async AND GNU Parallel
2935
2936 Summary (see legend above):
2937
2938 =over
2939
2940 =item - - - I4 - - I7
2941
2942 =item - - - - - M6
2943
2944 =item - O2 O3 - O5 O6 - x x O10
2945
2946 =item E1 - - E4 - E6 -
2947
2948 =item - - - - - - - - -
2949
2950 =item S1 S2
2951
2952 =back
2953
2954 B<async> is very similary to GNU B<parallel>'s B<--semaphore> mode
2955 (aka B<sem>). B<async> requires the user to start a server process.
2956
2957 The input is quoted like B<-q> so you need B<bash -c "...;..."> to run
2958 composed commands.
2959
2960 =head3 Examples as GNU Parallel
2961
2962   1$ S="/tmp/example_socket"
2963
2964   1$ ID=myid
2965
2966   2$ async -s="$S" server --start
2967
2968   2$ # GNU Parallel does not need a server to run
2969
2970   3$ for i in {1..20}; do
2971          # prints command output to stdout
2972          async -s="$S" cmd -- bash -c "sleep 1 && echo test $i"
2973      done
2974
2975   3$ for i in {1..20}; do
2976          # prints command output to stdout
2977          sem --id "$ID" -j100% "sleep 1 && echo test $i"
2978          # GNU Parallel will only print job when it is done
2979          # If you need output from different jobs to mix
2980          # use -u or --line-buffer
2981          sem --id "$ID" -j100% --line-buffer "sleep 1 && echo test $i"
2982      done
2983
2984   4$ # wait until all commands are finished
2985      async -s="$S" wait
2986
2987   4$ sem --id "$ID" --wait
2988
2989   5$ # configure the server to run four commands in parallel
2990      async -s="$S" server -j4
2991
2992   5$ export PARALLEL=-j4
2993
2994   6$ mkdir "/tmp/ex_dir"
2995      for i in {21..40}; do
2996        # redirects command output to /tmp/ex_dir/file*
2997        async -s="$S" cmd -o "/tmp/ex_dir/file$i" -- \
2998          bash -c "sleep 1 && echo test $i"
2999      done
3000
3001   6$ mkdir "/tmp/ex_dir"
3002      for i in {21..40}; do
3003        # redirects command output to /tmp/ex_dir/file*
3004        sem --id "$ID" --result '/tmp/my-ex/file-{=$_=""=}'"$i" \
3005          "sleep 1 && echo test $i"
3006      done
3007
3008   7$ sem --id "$ID" --wait
3009
3010   7$ async -s="$S" wait
3011
3012   8$ # stops server
3013      async -s="$S" server --stop
3014
3015   8$ # GNU Parallel does not need to stop a server
3016
3017
3018 https://github.com/ctbur/async/
3019 (Last checked: 2023-01)
3020
3021
3022 =head2 DIFFERENCES BETWEEN pardi AND GNU Parallel
3023
3024 Summary (see legend above):
3025
3026 =over
3027
3028 =item I1 I2 - - - - I7
3029
3030 =item M1 - - - - M6
3031
3032 =item O1 O2 O3 O4 O5 - O7 - - O10
3033
3034 =item E1 - - E4 - - -
3035
3036 =item - - - - - - - - -
3037
3038 =item - -
3039
3040 =back
3041
3042 B<pardi> is very similar to B<parallel --pipe --cat>: It reads blocks
3043 of data and not arguments. So it cannot insert an argument in the
3044 command line. It puts the block into a temporary file, and this file
3045 name (%IN) can be put in the command line. You can only use %IN once.
3046
3047 It can also run full command lines in parallel (like: B<cat file |
3048 parallel>).
3049
3050 =head3 EXAMPLES FROM pardi test.sh
3051
3052   1$ time pardi -v -c 100 -i data/decoys.smi -ie .smi -oe .smi \
3053        -o data/decoys_std_pardi.smi \
3054           -w '(standardiser -i %IN -o %OUT 2>&1) > /dev/null'
3055
3056   1$ cat data/decoys.smi |
3057        time parallel -N 100 --pipe --cat \
3058          '(standardiser -i {} -o {#} 2>&1) > /dev/null; cat {#}; rm {#}' \
3059          > data/decoys_std_pardi.smi
3060
3061   2$ pardi -n 1 -i data/test_in.types -o data/test_out.types \
3062              -d 'r:^#atoms:' -w 'cat %IN > %OUT'
3063
3064   2$ cat data/test_in.types |
3065        parallel -n 1 -k --pipe --cat --regexp --recstart '^#atoms' \
3066          'cat {}' > data/test_out.types
3067
3068   3$ pardi -c 6 -i data/test_in.types -o data/test_out.types \
3069              -d 'r:^#atoms:' -w 'cat %IN > %OUT'
3070
3071   3$ cat data/test_in.types |
3072        parallel -n 6 -k --pipe --cat --regexp --recstart '^#atoms' \
3073          'cat {}' > data/test_out.types
3074
3075   4$ pardi -i data/decoys.mol2 -o data/still_decoys.mol2 \
3076              -d 's:@<TRIPOS>MOLECULE' -w 'cp %IN %OUT'
3077
3078   4$ cat data/decoys.mol2 |
3079        parallel -n 1 --pipe --cat --recstart '@<TRIPOS>MOLECULE' \
3080          'cp {} {#}; cat {#}; rm {#}' > data/still_decoys.mol2
3081
3082   5$ pardi -i data/decoys.mol2 -o data/decoys2.mol2 \
3083              -d b:10000 -w 'cp %IN %OUT' --preserve
3084
3085   5$ cat data/decoys.mol2 |
3086        parallel -k --pipe --block 10k --recend '' --cat \
3087          'cat {} > {#}; cat {#}; rm {#}' > data/decoys2.mol2
3088
3089 https://github.com/UnixJunkie/pardi
3090 (Last checked: 2021-01)
3091
3092
3093 =head2 DIFFERENCES BETWEEN bthread AND GNU Parallel
3094
3095 Summary (see legend above):
3096
3097 =over
3098
3099 =item - - - I4 -  - -
3100
3101 =item - - - - - M6
3102
3103 =item O1 - O3 - - - O7 O8 - -
3104
3105 =item E1 - - - - - -
3106
3107 =item - - - - - - - - -
3108
3109 =item - -
3110
3111 =back
3112
3113 B<bthread> takes around 1 sec per MB of output. The maximal output
3114 line length is 1073741759.
3115
3116 You cannot quote space in the command, so you cannot run composed
3117 commands like B<sh -c "echo a; echo b">.
3118
3119 https://gitlab.com/netikras/bthread
3120 (Last checked: 2021-01)
3121
3122
3123 =head2 DIFFERENCES BETWEEN simple_gpu_scheduler AND GNU Parallel
3124
3125 Summary (see legend above):
3126
3127 =over
3128
3129 =item I1 - - - - - I7
3130
3131 =item M1 - - - - M6
3132
3133 =item - O2 O3 - - O6 - x x O10
3134
3135 =item E1 - - - - - -
3136
3137 =item - - - - - - - - -
3138
3139 =item - -
3140
3141 =back
3142
3143 =head3 EXAMPLES FROM simple_gpu_scheduler MANUAL
3144
3145   1$ simple_gpu_scheduler --gpus 0 1 2 < gpu_commands.txt
3146
3147   1$ parallel -j3 --shuf \
3148      CUDA_VISIBLE_DEVICES='{=1 $_=slot()-1 =} {=uq;=}' \
3149        < gpu_commands.txt
3150
3151   2$ simple_hypersearch \
3152        "python3 train_dnn.py --lr {lr} --batch_size {bs}" \
3153        -p lr 0.001 0.0005 0.0001 -p bs 32 64 128 |
3154        simple_gpu_scheduler --gpus 0,1,2
3155
3156   2$ parallel --header : --shuf -j3 -v \
3157        CUDA_VISIBLE_DEVICES='{=1 $_=slot()-1 =}' \
3158        python3 train_dnn.py --lr {lr} --batch_size {bs} \
3159        ::: lr 0.001 0.0005 0.0001 ::: bs 32 64 128
3160
3161   3$ simple_hypersearch \
3162        "python3 train_dnn.py --lr {lr} --batch_size {bs}" \
3163        --n-samples 5 -p lr 0.001 0.0005 0.0001 -p bs 32 64 128 |
3164        simple_gpu_scheduler --gpus 0,1,2
3165
3166   3$ parallel --header : --shuf \
3167        CUDA_VISIBLE_DEVICES='{=1 $_=slot()-1; seq()>5 and skip() =}' \
3168        python3 train_dnn.py --lr {lr} --batch_size {bs} \
3169        ::: lr 0.001 0.0005 0.0001 ::: bs 32 64 128
3170
3171   4$ touch gpu.queue
3172      tail -f -n 0 gpu.queue | simple_gpu_scheduler --gpus 0,1,2 &
3173      echo "my_command_with | and stuff > logfile" >> gpu.queue
3174
3175   4$ touch gpu.queue
3176      tail -f -n 0 gpu.queue |
3177        parallel -j3 CUDA_VISIBLE_DEVICES='{=1 $_=slot()-1 =} {=uq;=}' &
3178      # Needed to fill job slots once
3179      seq 3 | parallel echo true >> gpu.queue
3180      # Add jobs
3181      echo "my_command_with | and stuff > logfile" >> gpu.queue
3182      # Needed to flush output from completed jobs
3183      seq 3 | parallel echo true >> gpu.queue
3184
3185 https://github.com/ExpectationMax/simple_gpu_scheduler
3186 (Last checked: 2021-01)
3187
3188
3189 =head2 DIFFERENCES BETWEEN parasweep AND GNU Parallel
3190
3191 B<parasweep> is a Python module for facilitating parallel parameter
3192 sweeps.
3193
3194 A B<parasweep> job will normally take a text file as input. The text
3195 file contains arguments for the job. Some of these arguments will be
3196 fixed and some of them will be changed by B<parasweep>.
3197
3198 It does this by having a template file such as template.txt:
3199
3200   Xval: {x}
3201   Yval: {y}
3202   FixedValue: 9
3203   # x with 2 decimals
3204   DecimalX: {x:.2f}
3205   TenX: ${x*10}
3206   RandomVal: {r}
3207
3208 and from this template it generates the file to be used by the job by
3209 replacing the replacement strings.
3210
3211 Being a Python module B<parasweep> integrates tighter with Python than
3212 GNU B<parallel>. You get the parameters directly in a Python data
3213 structure. With GNU B<parallel> you can use the JSON or CSV output
3214 format to get something similar, but you would have to read the
3215 output.
3216
3217 B<parasweep> has a filtering method to ignore parameter combinations
3218 you do not need.
3219
3220 Instead of calling the jobs directly, B<parasweep> can use Python's
3221 Distributed Resource Management Application API to make jobs run with
3222 different cluster software.
3223
3224
3225 GNU B<parallel> B<--tmpl> supports templates with replacement
3226 strings. Such as:
3227
3228   Xval: {x}
3229   Yval: {y}
3230   FixedValue: 9
3231   # x with 2 decimals
3232   DecimalX: {=x $_=sprintf("%.2f",$_) =}
3233   TenX: {=x $_=$_*10 =}
3234   RandomVal: {=1 $_=rand() =}
3235
3236 that can be used like:
3237
3238   parallel --header : --tmpl my.tmpl={#}.t myprog {#}.t \
3239     ::: x 1 2 3 ::: y 1 2 3
3240
3241 Filtering is supported as:
3242
3243   parallel --filter '{1} > {2}' echo ::: 1 2 3 ::: 1 2 3
3244
3245 https://github.com/eviatarbach/parasweep
3246 (Last checked: 2021-01)
3247
3248
3249 =head2 DIFFERENCES BETWEEN parallel-bash AND GNU Parallel
3250
3251 Summary (see legend above):
3252
3253 =over
3254
3255 =item I1 I2 - - - - -
3256
3257 =item - - M3 - - M6
3258
3259 =item - O2 O3 - O5 O6 - O8 x O10
3260
3261 =item E1 - - - - - -
3262
3263 =item - - - - - - - - -
3264
3265 =item - -
3266
3267 =back
3268
3269 B<parallel-bash> is written in pure bash. It is really fast (overhead
3270 of ~0.05 ms/job compared to GNU B<parallel>'s 3-10 ms/job). So if your
3271 jobs are extremely short lived, and you can live with the quite
3272 limited command, this may be useful.
3273
3274 It works by making a queue for each process. Then the jobs are
3275 distributed to the queues in a round robin fashion. Finally the queues
3276 are started in parallel. This works fine, if you are lucky, but if
3277 not, all the long jobs may end up in the same queue, so you may see:
3278
3279   $ printf "%b\n" 1 1 1 4 1 1 1 4 1 1 1 4 |
3280       time parallel -P4 sleep {}
3281   (7 seconds)
3282   $ printf "%b\n" 1 1 1 4 1 1 1 4 1 1 1 4 |
3283       time ./parallel-bash.bash -p 4 -c sleep {}
3284   (12 seconds)
3285
3286 Because it uses bash lists, the total number of jobs is limited to
3287 167000..265000 depending on your environment. You get a segmentation
3288 fault, when you reach the limit.
3289
3290 Ctrl-C does not stop spawning new jobs. Ctrl-Z does not suspend
3291 running jobs.
3292
3293
3294 =head3 EXAMPLES FROM parallel-bash
3295
3296   1$ some_input | parallel-bash -p 5 -c echo
3297
3298   1$ some_input | parallel -j 5 echo
3299
3300   2$ parallel-bash -p 5 -c echo < some_file
3301
3302   2$ parallel -j 5 echo < some_file
3303
3304   3$ parallel-bash -p 5 -c echo <<< 'some string'
3305
3306   3$ parallel -j 5 -c echo <<< 'some string'
3307
3308   4$ something | parallel-bash -p 5 -c echo {} {}
3309
3310   4$ something | parallel -j 5 echo {} {}
3311
3312 https://reposhub.com/python/command-line-tools/Akianonymus-parallel-bash.html
3313 (Last checked: 2021-06)
3314
3315
3316 =head2 DIFFERENCES BETWEEN bash-concurrent AND GNU Parallel
3317
3318 B<bash-concurrent> is more an alternative to B<make> than to GNU
3319 B<parallel>. Its input is very similar to a Makefile, where jobs
3320 depend on other jobs.
3321
3322 It has a nice progress indicator where you can see which jobs
3323 completed successfully, which jobs are currently running, which jobs
3324 failed, and which jobs were skipped due to a depending job failed.
3325 The indicator does not deal well with resizing the window.
3326
3327 Output is cached in tempfiles on disk, but is only shown if there is
3328 an error, so it is not meant to be part of a UNIX pipeline. If
3329 B<bash-concurrent> crashes these tempfiles are not removed.
3330
3331 It uses an O(n*n) algorithm, so if you have 1000 independent jobs it
3332 takes 22 seconds to start it.
3333
3334 https://github.com/themattrix/bash-concurrent
3335 (Last checked: 2021-02)
3336
3337
3338 =head2 DIFFERENCES BETWEEN spawntool AND GNU Parallel
3339
3340 Summary (see legend above):
3341
3342 =over
3343
3344 =item I1 - - - - - -
3345
3346 =item M1 - - - - M6
3347
3348 =item - O2 O3 - O5 O6 - x x O10
3349
3350 =item E1 - - - - - -
3351
3352 =item - - - - - - - - -
3353
3354 =item - -
3355
3356 =back
3357
3358 B<spawn> reads a full command line from stdin which it executes in
3359 parallel.
3360
3361
3362 http://code.google.com/p/spawntool/
3363 (Last checked: 2021-07)
3364
3365
3366 =head2 DIFFERENCES BETWEEN go-pssh AND GNU Parallel
3367
3368 Summary (see legend above):
3369
3370 =over
3371
3372 =item - - - - - - -
3373
3374 =item M1 - - - - -
3375
3376 =item O1 - - - - - - x x O10
3377
3378 =item E1 - - - - - -
3379
3380 =item R1 R2 - - - R6 - - -
3381
3382 =item - -
3383
3384 =back
3385
3386 B<go-pssh> does B<ssh> in parallel to multiple machines. It runs the
3387 same command on multiple machines similar to B<--nonall>.
3388
3389 The hostnames must be given as IP-addresses (not as hostnames).
3390
3391 Output is sent to stdout (standard output) if command is successful,
3392 and to stderr (standard error) if the command fails.
3393
3394 =head3 EXAMPLES FROM go-pssh
3395
3396   1$ go-pssh -l <ip>,<ip> -u <user> -p <port> -P <passwd> -c "<command>"
3397
3398   1$ parallel -S 'sshpass -p <passwd> ssh -p <port> <user>@<ip>' \
3399        --nonall "<command>"
3400
3401   2$ go-pssh scp -f host.txt -u <user> -p <port> -P <password> \
3402        -s /local/file_or_directory -d /remote/directory
3403
3404   2$ parallel --nonall --slf host.txt \
3405        --basefile /local/file_or_directory/./ --wd /remote/directory
3406        --ssh 'sshpass -p <password> ssh -p <port> -l <user>' true
3407
3408   3$ go-pssh scp -l <ip>,<ip> -u <user> -p <port> -P <password> \
3409        -s /local/file_or_directory -d /remote/directory
3410
3411   3$ parallel --nonall -S <ip>,<ip> \
3412        --basefile /local/file_or_directory/./ --wd /remote/directory
3413        --ssh 'sshpass -p <password> ssh -p <port> -l <user>' true
3414
3415 https://github.com/xuchenCN/go-pssh
3416 (Last checked: 2021-07)
3417
3418
3419 =head2 DIFFERENCES BETWEEN go-parallel AND GNU Parallel
3420
3421 Summary (see legend above):
3422
3423 =over
3424
3425 =item I1 I2 - - - - I7
3426
3427 =item - - M3 - - M6
3428
3429 =item - O2 O3 - O5 - - x x - O10
3430
3431 =item E1 - - E4 - - -
3432
3433 =item - - - - - - - - -
3434
3435 =item - -
3436
3437 =back
3438
3439 B<go-parallel> uses Go templates for replacement strings. Quite
3440 similar to the I<{= perl expr =}> replacement string.
3441
3442 =head3 EXAMPLES FROM go-parallel
3443
3444   1$ go-parallel -a ./files.txt -t 'cp {{.Input}} {{.Input | dirname | dirname}}'
3445
3446   1$ parallel -a ./files.txt cp {} '{= $_=::dirname(::dirname($_)) =}'
3447
3448   2$ go-parallel -a ./files.txt -t 'mkdir -p {{.Input}} {{noExt .Input}}'
3449
3450   2$ parallel -a ./files.txt echo mkdir -p {} {.}
3451
3452   3$ go-parallel -a ./files.txt -t 'mkdir -p {{.Input}} {{.Input | basename | noExt}}'
3453
3454   3$ parallel -a ./files.txt echo mkdir -p {} {/.}
3455
3456 https://github.com/mylanconnolly/parallel
3457 (Last checked: 2021-07)
3458
3459
3460 =head2 DIFFERENCES BETWEEN p AND GNU Parallel
3461
3462 Summary (see legend above):
3463
3464 =over
3465
3466 =item - - - I4 - - x
3467
3468 =item - - - - - M6
3469
3470 =item - O2 O3 - O5 O6 - x x - O10
3471
3472 =item E1 - - - - - -
3473
3474 =item - - - - - - - - -
3475
3476 =item - -
3477
3478 =back
3479
3480 B<p> is a tiny shell script. It can color output with some predefined
3481 colors, but is otherwise quite limited.
3482
3483 It maxes out at around 116000 jobs (probably due to limitations in Bash).
3484
3485 =head3 EXAMPLES FROM p
3486
3487 Some of the examples from B<p> cannot be implemented 100% by GNU
3488 B<parallel>: The coloring is a bit different, and GNU B<parallel>
3489 cannot have B<--tag> for some inputs and not for others.
3490
3491 The coloring done by GNU B<parallel> is not exactly the same as B<p>.
3492
3493   1$ p -bc blue "ping 127.0.0.1" -uc red "ping 192.168.0.1" \
3494      -rc yellow "ping 192.168.1.1" -t example "ping example.com"
3495
3496   1$ parallel --lb -j0 --color --tag ping \
3497      ::: 127.0.0.1 192.168.0.1 192.168.1.1 example.com
3498
3499   2$ p "tail -f /var/log/httpd/access_log" \
3500      -bc red "tail -f /var/log/httpd/error_log"
3501
3502   2$ cd /var/log/httpd;
3503      parallel --lb --color --tag tail -f ::: access_log error_log
3504
3505   3$ p tail -f "some file" \& p tail -f "other file with space.txt"
3506
3507   3$ parallel --lb tail -f ::: 'some file' "other file with space.txt"
3508
3509   4$ p -t project1 "hg pull project1" -t project2 \
3510      "hg pull project2" -t project3 "hg pull project3"
3511
3512   4$ parallel --lb hg pull ::: project{1..3}
3513
3514 https://github.com/rudymatela/evenmoreutils/blob/master/man/p.1.adoc
3515 (Last checked: 2022-04)
3516
3517
3518 =head2 DIFFERENCES BETWEEN senechal AND GNU Parallel
3519
3520 Summary (see legend above):
3521
3522 =over
3523
3524 =item I1 - - - - - -
3525
3526 =item M1 - M3 - - M6
3527
3528 =item O1 - O3 O4 - - - x x -
3529
3530 =item E1 - - - - - -
3531
3532 =item - - - - - - - - -
3533
3534 =item - -
3535
3536 =back
3537
3538 B<seneschal> only starts the first job after reading the last job, and
3539 output from the first job is only printed after the last job finishes.
3540
3541 1 byte of output requites 3.5 bytes of RAM.
3542
3543 This makes it impossible to have a total output bigger than the
3544 virtual memory.
3545
3546 Even though output is kept in RAM outputing is quite slow: 30 MB/s.
3547
3548 Output larger than 4 GB causes random problems - it looks like a race
3549 condition.
3550
3551 This:
3552
3553   echo 1 | seneschal  --prefix='yes `seq 1000`|head -c 1G' >/dev/null
3554
3555 takes 4100(!) CPU seconds to run on a 64C64T server, but only 140 CPU
3556 seconds on a 4C8T laptop. So it looks like B<seneschal> wastes a lot
3557 of CPU time coordinating the CPUs.
3558
3559 Compare this to:
3560
3561   echo 1 | time -v parallel -N0 'yes `seq 1000`|head -c 1G' >/dev/null
3562
3563 which takes 3-8 CPU seconds.
3564
3565 =head3 EXAMPLES FROM seneschal README.md
3566
3567   1$ echo $REPOS | seneschal --prefix="cd {} && git pull"
3568
3569   # If $REPOS is newline separated
3570   1$ echo "$REPOS" | parallel -k "cd {} && git pull"
3571   # If $REPOS is space separated
3572   1$ echo -n "$REPOS" | parallel -d' ' -k "cd {} && git pull"
3573
3574   COMMANDS="pwd
3575   sleep 5 && echo boom
3576   echo Howdy
3577   whoami"
3578
3579   2$ echo "$COMMANDS" | seneschal --debug
3580
3581   2$ echo "$COMMANDS" | parallel -k -v
3582
3583   3$ ls -1 | seneschal --prefix="pushd {}; git pull; popd;"
3584
3585   3$ ls -1 | parallel -k "pushd {}; git pull; popd;"
3586   # Or if current dir also contains files:
3587   3$ parallel -k "pushd {}; git pull; popd;" ::: */
3588
3589 https://github.com/TheWizardTower/seneschal
3590 (Last checked: 2022-06)
3591
3592
3593 =head2 DIFFERENCES BETWEEN async AND GNU Parallel
3594
3595 Summary (see legend above):
3596
3597 =over
3598
3599 =item x x x x x x x
3600
3601 =item - x x x x x
3602
3603 =item x O2 O3 O4 O5 O6 - x x O10
3604
3605 =item E1 - - E4 - - -
3606
3607 =item - - - - - - - - -
3608
3609 =item S1 S2
3610
3611 =back
3612
3613 B<async> works like B<sem>.
3614
3615
3616 =head3 EXAMPLES FROM async
3617
3618   1$ S="/tmp/example_socket"
3619
3620      async -s="$S" server --start
3621
3622      for i in {1..20}; do
3623          # prints command output to stdout
3624          async -s="$S" cmd -- bash -c "sleep 1 && echo test $i"
3625      done
3626
3627      # wait until all commands are finished
3628      async -s="$S" wait
3629
3630   1$ S="example_id"
3631
3632      # server not needed
3633
3634      for i in {1..20}; do
3635          # prints command output to stdout
3636          sem --bg --id "$S" -j100% "sleep 1 && echo test $i"
3637      done
3638
3639      # wait until all commands are finished
3640      sem --fg --id "$S" --wait
3641
3642   2$ # configure the server to run four commands in parallel
3643      async -s="$S" server -j4
3644
3645      mkdir "/tmp/ex_dir"
3646      for i in {21..40}; do
3647          # redirects command output to /tmp/ex_dir/file*
3648          async -s="$S" cmd -o "/tmp/ex_dir/file$i" -- \
3649            bash -c "sleep 1 && echo test $i"
3650      done
3651
3652      async -s="$S" wait
3653
3654      # stops server
3655      async -s="$S" server --stop
3656
3657   2$ # starting server not needed
3658
3659      mkdir "/tmp/ex_dir"
3660      for i in {21..40}; do
3661          # redirects command output to /tmp/ex_dir/file*
3662          sem --bg --id "$S" --results "/tmp/ex_dir/file$i{}" \
3663            "sleep 1 && echo test $i"
3664      done
3665
3666      sem --fg --id "$S" --wait
3667
3668      # there is no server to stop
3669
3670 https://github.com/ctbur/async
3671 (Last checked: 2023-01)
3672
3673
3674 =head2 DIFFERENCES BETWEEN tandem AND GNU Parallel
3675
3676 Summary (see legend above):
3677
3678 =over
3679
3680 =item - - - I4 - - x
3681
3682 =item M1 - - - - M6
3683
3684 =item - - O3 - - - - x - -
3685
3686 =item E1 - E3 - E5 - -
3687
3688 =item - - - - - - - - -
3689
3690 =item - -
3691
3692 =back
3693
3694 B<tandem> runs full commands in parallel. It is made for starting a
3695 "server", running a job against the server, and when the job is done,
3696 the server is killed.
3697
3698 More generally: it kills all jobs when the first job completes -
3699 similar to '--halt now,done=1'.
3700
3701 B<tandem> silently discards some output. It is unclear exactly when
3702 this happens. It looks like a race condition, because it varies for
3703 each run.
3704
3705   $ tandem "seq 10000" | wc -l
3706   6731 <- This should always be 10002
3707
3708
3709 =head3 EXAMPLES FROM Demo
3710
3711   tandem \
3712     'php -S localhost:8000' \
3713     'esbuild src/*.ts --bundle --outdir=dist --watch' \
3714     'tailwind -i src/index.css -o dist/index.css --watch'
3715
3716   # Emulate tandem's behaviour
3717   PARALLEL='--color --lb  --halt now,done=1 --tagstring '
3718   PARALLEL="$PARALLEL'"'{=s/ .*//; $_.=".".$app{$_}++;=}'"'"
3719   export PARALLEL
3720
3721   parallel ::: \
3722     'php -S localhost:8000' \
3723     'esbuild src/*.ts --bundle --outdir=dist --watch' \
3724     'tailwind -i src/index.css -o dist/index.css --watch'
3725
3726
3727 =head3 EXAMPLES FROM tandem -h
3728
3729   # Emulate tandem's behaviour
3730   PARALLEL='--color --lb  --halt now,done=1 --tagstring '
3731   PARALLEL="$PARALLEL'"'{=s/ .*//; $_.=".".$app{$_}++;=}'"'"
3732   export PARALLEL
3733
3734   1$ tandem 'sleep 5 && echo "hello"' 'sleep 2 && echo "world"'
3735
3736   1$ parallel ::: 'sleep 5 && echo "hello"' 'sleep 2 && echo "world"'
3737
3738   # '-t 0' fails. But '--timeout 0 works'
3739   2$ tandem --timeout 0 'sleep 5 && echo "hello"' \
3740        'sleep 2 && echo "world"'
3741
3742   2$ parallel --timeout 0 ::: 'sleep 5 && echo "hello"' \
3743        'sleep 2 && echo "world"'
3744
3745 =head3 EXAMPLES FROM tandem's readme.md
3746
3747   # Emulate tandem's behaviour
3748   PARALLEL='--color --lb  --halt now,done=1 --tagstring '
3749   PARALLEL="$PARALLEL'"'{=s/ .*//; $_.=".".$app{$_}++;=}'"'"
3750   export PARALLEL
3751
3752   1$ tandem 'next dev' 'nodemon --quiet ./server.js'
3753
3754   1$ parallel ::: 'next dev' 'nodemon --quiet ./server.js'
3755
3756   2$ cat package.json
3757      {
3758        "scripts": {
3759          "dev:php": "...",
3760          "dev:js": "...",
3761          "dev:css": "..."
3762        }
3763      }
3764
3765      tandem 'npm:dev:php' 'npm:dev:js' 'npm:dev:css'
3766
3767   # GNU Parallel uses bash functions instead
3768   2$ cat package.sh
3769      dev:php() { ... ; }
3770      dev:js() { ... ; }
3771      dev:css() { ... ; }
3772      export -f dev:php dev:js dev:css
3773
3774      . package.sh
3775      parallel ::: dev:php dev:js dev:css
3776
3777   3$ tandem 'npm:dev:*'
3778
3779   3$ compgen -A function | grep ^dev: | parallel
3780
3781 For usage in Makefiles, include a copy of GNU Parallel with your
3782 source using `parallel --embed`. This has the added benefit of also
3783 working if access to the internet is down or restricted.
3784
3785 https://github.com/rosszurowski/tandem
3786 (Last checked: 2023-01)
3787
3788
3789 =head2 DIFFERENCES BETWEEN rust-parallel(aaronriekenberg) AND GNU Parallel
3790
3791 Summary (see legend above):
3792
3793 =over
3794
3795 =item I1 I2 I3 - - - -
3796
3797 =item - - - - - M6
3798
3799 =item O1 O2 O3 - O5 O6 - x - O10
3800
3801 =item E1 - - E4 - - -
3802
3803 =item - - - - - - - - -
3804
3805 =item - -
3806
3807 =back
3808
3809 B<rust-parallel> has a goal of only using Rust. It seems it is
3810 impossible to call bash functions from the command line. You would
3811 need to put these in a script.
3812
3813 Calling a script that misses the shebang line (#! as first line)
3814 fails.
3815
3816 =head3 EXAMPLES FROM rust-parallel's README.md
3817
3818   $ cat >./test <<EOL
3819   echo hi
3820   echo there
3821   echo how
3822   echo are
3823   echo you
3824   EOL
3825
3826   1$ cat test | rust-parallel -j5
3827
3828   1$ cat test | parallel -j5
3829
3830   2$ cat test | rust-parallel -j1
3831
3832   2$ cat test | parallel -j1
3833
3834   3$ head -100 /usr/share/dict/words | rust-parallel md5 -s
3835
3836   3$ head -100 /usr/share/dict/words | parallel md5 -s
3837
3838   4$ find . -type f -print0 | rust-parallel -0 gzip -f -k
3839
3840   4$ find . -type f -print0 | parallel -0 gzip -f -k
3841
3842   5$ head -100 /usr/share/dict/words |
3843        awk '{printf "md5 -s %s\n", $1}' | rust-parallel
3844
3845   5$ head -100 /usr/share/dict/words |
3846        awk '{printf "md5 -s %s\n", $1}' | parallel
3847
3848   6$ head -100 /usr/share/dict/words | rust-parallel md5 -s |
3849        grep -i abba
3850
3851   6$ head -100 /usr/share/dict/words | parallel md5 -s |
3852        grep -i abba
3853
3854 https://github.com/aaronriekenberg/rust-parallel
3855 (Last checked: 2023-01)
3856
3857
3858 =head2 DIFFERENCES BETWEEN parallelium AND GNU Parallel
3859
3860 Summary (see legend above):
3861
3862 =over
3863
3864 =item - I2 - - - - -
3865
3866 =item M1 - - - - M6
3867
3868 =item O1 - O3 - - - - x - -
3869
3870 =item E1 - - E4 - - -
3871
3872 =item - - - - - - - - -
3873
3874 =item - -
3875
3876 =back
3877
3878 B<parallelium> merges standard output (stdout) and standard error
3879 (stderr). The maximal output of a command is 8192 bytes. Bigger output
3880 makes B<parallelium> go into an infinite loop.
3881
3882 In the input file for B<parallelium> you can define a tag, so that you
3883 can select to run only these commands. A bit like a target in a
3884 Makefile.
3885
3886 Progress is printed on standard output (stdout) prepended with '#'
3887 with similar information as GNU B<parallel>'s B<--bar>.
3888
3889 =head3 EXAMPLES
3890
3891     $ cat testjobs.txt
3892     #tag common sleeps classA
3893     (sleep 4.495;echo "job 000")
3894     :
3895     (sleep 2.587;echo "job 016")
3896
3897     #tag common sleeps classB
3898     (sleep 0.218;echo "job 017")
3899     :
3900     (sleep 2.269;echo "job 040")
3901
3902     #tag common sleeps classC
3903     (sleep 2.586;echo "job 041")
3904     :
3905     (sleep 1.626;echo "job 099")
3906
3907     #tag lasthalf, sleeps, classB
3908     (sleep 1.540;echo "job 100")
3909     :
3910     (sleep 2.001;echo "job 199")
3911
3912     1$ parallelium -f testjobs.txt -l logdir -t classB,classC
3913
3914     1$ cat testjobs.txt |
3915          parallel --plus --results logdir/testjobs.txt_{0#}.output \
3916            '{= if(/^#tag /) { @tag = split/,|\s+/ }
3917                (grep /^(classB|classC)$/, @tag) or skip =}'
3918
3919 https://github.com/beomagi/parallelium
3920 (Last checked: 2023-01)
3921
3922
3923 =head2 DIFFERENCES BETWEEN forkrun AND GNU Parallel
3924
3925 Summary (see legend above):
3926
3927 =over
3928
3929 =item I1 - - - - - I7
3930
3931 =item - - - - - -
3932
3933 =item - O2 O3 - O5 - - - - O10
3934
3935 =item E1 - - E4 - - -
3936
3937 =item - - - - - - - - -
3938
3939 =item - -
3940
3941 =back
3942
3943
3944 B<forkrun> blocks if it receives fewer jobs than slots:
3945
3946   echo | forkrun -p 2 echo
3947
3948 or when it gets some specific commands e.g.:
3949
3950   f() { seq "$@" | pv -qL 3; }
3951   seq 10 | forkrun f
3952
3953 It is not clear why.
3954
3955 It is faster than GNU B<parallel> (overhead: 1.2 ms/job vs 3 ms/job),
3956 but way slower than B<parallel-bash> (0.059 ms/job).
3957
3958 Running jobs cannot be stopped by pressing CTRL-C.
3959
3960 B<-k> is supposed to keep the order but fails on the MIX testing
3961 example below. If used with B<-k> it caches output in RAM.
3962
3963 If B<forkrun> is killed, it leaves temporary files in
3964 B</tmp/.forkrun.*> that has to be cleaned up manually.
3965
3966 =head3 EXAMPLES
3967
3968   1$ time find ./ -type f |
3969        forkrun -l512 -- sha256sum 2>/dev/null | wc -l
3970   1$ time find ./ -type f |
3971        parallel -j28 -m -- sha256sum 2>/dev/null | wc -l
3972
3973   2$ time find ./ -type f |
3974        forkrun -l512 -k -- sha256sum 2>/dev/null | wc -l
3975   2$ time find ./ -type f |
3976        parallel -j28 -k -m -- sha256sum 2>/dev/null | wc -l
3977
3978 https://github.com/jkool702/forkrun
3979 (Last checked: 2023-02)
3980
3981
3982 =head2 DIFFERENCES BETWEEN parallel-sh AND GNU Parallel
3983
3984 Summary (see legend above):
3985
3986 =over
3987
3988 =item I1 I2 - I4 - - -
3989
3990 =item M1 - - - - M6
3991
3992 =item O1 O2 O3 - O5 O6 - - - O10
3993
3994 =item E1 - - E4 - - -
3995
3996 =item - - - - - - - - -
3997
3998 =item - -
3999
4000 =back
4001
4002 B<parallel-sh> buffers in RAM. The buffering data takes O(n^1.5) time:
4003
4004 2MB=0.107s 4MB=0.175s 8MB=0.342s 16MB=0.766s 32MB=2.2s 64MB=6.7s
4005 128MB=20s 256MB=64s 512MB=248s 1024MB=998s 2048MB=3756s
4006
4007 It limits the practical usability to jobs outputting < 256 MB. GNU
4008 B<parallel> buffers on disk, yet is faster for jobs with outputs > 16
4009 MB and is only limited by the free space in $TMPDIR.
4010
4011 B<parallel-sh> can kill running jobs if a job fails (Similar to
4012 B<--halt now,fail=1>).
4013
4014 =head3 EXAMPLES
4015
4016   1$ parallel-sh "sleep 2 && echo first" "sleep 1 && echo second"
4017
4018   1$ parallel ::: "sleep 2 && echo first" "sleep 1 && echo second"
4019
4020   2$ cat /tmp/commands
4021      sleep 2 && echo first
4022      sleep 1 && echo second
4023
4024   2$ parallel-sh -f /tmp/commands
4025
4026   2$ parallel -a /tmp/commands
4027
4028   3$ echo -e 'sleep 2 && echo first\nsleep 1 && echo second' |
4029        parallel-sh
4030
4031   3$ echo -e 'sleep 2 && echo first\nsleep 1 && echo second' |
4032        parallel
4033
4034 https://github.com/thyrc/parallel-sh
4035 (Last checked: 2023-04)
4036
4037
4038 =head2 DIFFERENCES BETWEEN bash-parallel AND GNU Parallel
4039
4040 Summary (see legend above):
4041
4042 =over
4043
4044 =item - I2 - - - - I7
4045
4046 =item M1 - M3 - M5 M6
4047
4048 =item - O2 O3 - - O6 - O8 - O10
4049
4050 =item E1 - - - - - -
4051
4052 =item - - - - - - - - -
4053
4054 =item - -
4055
4056 =back
4057
4058 B<bash-parallel> is not as much a command as it is a shell script that
4059 you have to alter. It requires you to change the shell function
4060 process_job that runs the job, and set $MAX_POOL_SIZE to the number of
4061 jobs to run in parallel.
4062
4063 It is half as fast as GNU B<parallel> for short jobs.
4064
4065 https://github.com/thilinaba/bash-parallel
4066 (Last checked: 2023-05)
4067
4068
4069 =head2 DIFFERENCES BETWEEN PaSH AND GNU Parallel
4070
4071 Summary (see legend above): N/A
4072
4073 B<pash> is quite different from GNU B<parallel>. It is not a general
4074 parallelizer. It takes a shell script and analyses it and parallelizes
4075 parts of it by replacing the parts with commands that will give the same
4076 result.
4077
4078 This will replace B<sort> with a command that does pretty much the
4079 same as B<parsort --parallel=8> (except somewhat slower):
4080
4081   pa.sh --width 8 -c 'cat bigfile | sort'
4082
4083 However, even a simple change will confuse B<pash> and you will get no
4084 parallelization:
4085
4086   pa.sh --width 8 -c 'mysort() { sort; }; cat bigfile | mysort'
4087   pa.sh --width 8 -c 'cat bigfile | sort | md5sum'
4088
4089 From the source it seems B<pash> only looks at: awk cat col comm cut
4090 diff grep head mkfifo mv rm sed seq sort tail tee tr uniq wc xargs
4091
4092 For pipelines where these commands are bottlenecks, it might be worth
4093 testing if B<pash> is faster than GNU B<parallel>.
4094
4095 B<pash> does not respect $TMPDIR but always uses /tmp. If B<pash> dies
4096 unexpectantly it does not clean up.
4097
4098 https://github.com/binpash/pash
4099 (Last checked: 2023-05)
4100
4101
4102 =head2 DIFFERENCES BETWEEN korovkin-parallel AND GNU Parallel
4103
4104 Summary (see legend above):
4105
4106 =over
4107
4108 =item I1 - - - - - -
4109
4110 =item M1 - - - - M6
4111
4112 =item - - O3 - - - - x x -
4113
4114 =item E1 - - - - - -
4115
4116 =item R1 - - - - R6 x x -
4117
4118 =item - -
4119
4120 =back
4121
4122 B<korovkin-parallel> prepends all lines with some info.
4123
4124 The output is colored with 6 color combinations, so job 1 and 7 will
4125 get the same color.
4126
4127 You can get similar output with:
4128
4129   (echo ...) |
4130     parallel --color -j 10 --lb --tagstring \
4131       '[l:{#}:{=$_=sprintf("%7.03f",::now()-$^T)=} {=$_=hh_mm_ss($^T)=} {%}]'
4132
4133 Lines longer than 8192 chars are broken into lines shorter than
4134 8192. B<korovkin-parallel> loses the last char for lines exactly 8193
4135 chars long.
4136
4137 Short lines from different jobs do not mix, but long lines do:
4138
4139   fun() {
4140     perl -e '$a="'$1'"x1000000; for(1..'$2') { print $a };';
4141     echo;
4142   }
4143   export -f fun
4144   (echo fun a 100;echo fun b 100) | korovkin-parallel | tr -s abcdef
4145   # Compare to:
4146   (echo fun a 100;echo fun b 100) | parallel | tr -s abcdef
4147
4148 There should be only one line of a's and one line of b's.
4149
4150 Just like GNU B<parallel> B<korovkin-parallel> offers a master/slave
4151 model, so workers on other servers can do some of the tasks. But
4152 contrary to GNU B<parallel> you must manually start workers on these
4153 servers. The communication is neither authenticated nor encrypted.
4154
4155 It caches output in RAM: a 1GB line uses ~2.5GB RAM
4156
4157 https://github.com/korovkin/parallel
4158 (Last checked: 2023-07)
4159
4160
4161 =head2 DIFFERENCES BETWEEN xe AND GNU Parallel
4162
4163 Summary (see legend above):
4164
4165 =over
4166
4167 =item I1 I2 - I4 - - I7
4168
4169 =item M1 - M3 M4 - M6
4170
4171 =item - O2 O3 - O5 O6 - O8 - O10
4172
4173 =item E1 - - E4 - - -
4174
4175 =item - - - - - - - - -
4176
4177 =item - -
4178
4179 =back
4180
4181 B<xe> has a peculiar limitation:
4182
4183   echo /bin/echo | xe {} OK
4184   echo echo | xe /bin/{} fails
4185
4186
4187 =head3 EXAMPLES
4188
4189 Compress all .c files in the current directory, using all CPU cores:
4190
4191   1$ xe -a -j0 gzip -- *.c
4192
4193   1$ parallel gzip ::: *.c
4194
4195 Remove all empty files, using lr(1):
4196
4197   2$ lr -U -t 'size == 0' | xe -N0 rm
4198
4199   2$ lr -U -t 'size == 0' | parallel -X rm
4200
4201 Convert .mp3 to .ogg, using all CPU cores:
4202
4203   3$ xe -a -j0 -s 'ffmpeg -i "${1}" "${1%.mp3}.ogg"' -- *.mp3
4204
4205   3$ parallel ffmpeg -i {} {.}.ogg ::: *.mp3
4206
4207 Same, using percent rules:
4208
4209   4$ xe -a -j0 -p %.mp3 ffmpeg -i %.mp3 %.ogg -- *.mp3
4210
4211   4$ parallel --rpl '% s/\.mp3// or skip' ffmpeg -i %.mp3 %.ogg ::: *.mp3
4212
4213 Similar, but hiding output of ffmpeg, instead showing spawned jobs:
4214
4215   5$ xe -ap -j0 -vvq '%.{m4a,ogg,opus}' ffmpeg -y -i {} out/%.mp3 -- *
4216
4217   5$ parallel -v --rpl '% s/\.(m4a|ogg|opus)// or skip' \
4218        ffmpeg -y -i {} out/%.mp3 '2>/dev/null' ::: *
4219
4220   5$ parallel -v ffmpeg -y -i {} out/{.}.mp3 '2>/dev/null' ::: *
4221
4222 https://github.com/leahneukirchen/xe
4223 (Last checked: 2023-08)
4224
4225
4226 =head2 DIFFERENCES BETWEEN sp AND GNU Parallel
4227
4228 Summary (see legend above):
4229
4230 =over
4231
4232 =item - - - I4 - - -
4233
4234 =item M1 - M3 - - M6
4235
4236 =item - O2 O3 - O5 (O6) - x x O10
4237
4238 =item E1 - - - - - -
4239
4240 =item - - - - - - - - -
4241
4242 =item - -
4243
4244 =back
4245
4246 B<sp> has very few options.
4247
4248 It can either be used like:
4249
4250   sp command {} option :: arg1 arg2 arg3
4251
4252 which is similar to:
4253
4254   parallel command {} option ::: arg1 arg2 arg3
4255
4256 Or:
4257
4258   sp command1 :: "command2 -option" :: "command3 foo bar"
4259
4260 which is similar to:
4261
4262   parallel ::: command1 "command2 -option" "command3 foo bar"
4263
4264 B<sp> deals badly with too many commands: This causes B<sp> to run out
4265 of file handles and gives data loss.
4266
4267 For each command that fails, B<sp> will print an error message on
4268 stderr (standard error).
4269
4270 You cannot used exported shell functions as commands.
4271
4272 =head3 EXAMPLES
4273
4274   1$ sp echo {} :: 1 2 3
4275
4276   1$ parallel echo {} ::: 1 2 3
4277
4278   2$ sp echo {} {} :: 1 2 3
4279
4280   2$ parallel echo {} {} :: 1 2 3
4281
4282   3$ sp echo 1 :: echo 2 :: echo 3
4283
4284   3$ parallel ::: 'echo 1' 'echo 2' 'echo 3'
4285
4286   4$ sp a foo bar :: "b 'baz  bar'" :: c
4287
4288   4$ parallel ::: 'a foo bar' "b 'baz  bar'" :: c
4289
4290 https://github.com/SergioBenitez/sp
4291 (Last checked: 2023-10)
4292
4293
4294 =head2 DIFFERENCES BETWEEN repeater AND GNU Parallel
4295
4296 Summary (see legend above):
4297
4298 =over
4299
4300 =item - - - - - - -
4301
4302 =item - - - - - -
4303
4304 =item - O2 O3 N/A - O6 - x x ?O10
4305
4306 =item E1 - - - E5 - -
4307
4308 =item - - - - - - - - -
4309
4310 =item - -
4311
4312 =back
4313
4314 B<repeater> runs the same job repeatedly. In other words: It does not
4315 read arguments, thus is it an alternative for GNU B<parallel> for only
4316 quite limited applications.
4317
4318 B<repeater> has an overhead of around 0.23 ms/job. Compared to GNU
4319 B<parallel>'s 2-3 ms this is fast. Compared to B<bash-parallel>'s 0.05
4320 ms/job it is slow.
4321
4322 =head3 Memory use and run time for large output
4323
4324 Output takes O(n^2) time for output of size n. 10 MB takes ~1 second,
4325 30 MB takes ~7 seconds, 100 MB takes ~60 seconds, 300 MB takes ~480
4326 seconds, 1000 MB takes ~10000 seconds.
4327
4328 100 MB of output takes around 1 GB of RAM.
4329
4330     # Run time = 15 sec
4331     # Memory use = 20 MB
4332     # Output = 1 GB per job
4333     \time -v parallel -j1 seq ::: 120000000 120000000 >/dev/null
4334
4335     # Run time = 4.7 sec
4336     # Memory use = 95 MB
4337     # Output = 8 MB per job
4338     \time -v repeater -w 1 -n 2 -reportFile ./run_output seq 1200000 >/dev/null
4339
4340     # Run time = 42 sec
4341     # Memory use = 277 MB
4342     # Output = 27 MB per job
4343     \time -v repeater -w 1 -n 2 -reportFile ./run_output seq 3600000 >/dev/null
4344
4345     # Run time = 530 sec
4346     # Memory use = 1000 MB
4347     # Output = 97 MB per job
4348     \time -v repeater -w 1 -n 2 -reportFile ./run_output seq 12000000 >/dev/null
4349
4350     # Run time = 2h41m
4351     # Memory use = 8.6 GB
4352     # Output = 1 GB per job
4353     \time -v repeater -w 1 -n 2 -reportFile ./run_output seq 120000000 >/dev/null
4354
4355 For even just moderate sized outputs GNU B<parallel> will be faster
4356 and use less memory.
4357
4358
4359 =head3 EXAMPLES
4360
4361   1$ repeater -n 100 -w 10 -reportFile ./run_output
4362        -output REPORT_FILE -progress BOTH curl example.com
4363
4364   1$ seq 100 | parallel --joblog run.log --eta curl example.com > output
4365
4366   2$ repeater -n 100 -increment -progress HIDDEN -reportFile foo
4367        echo "this is increment: " INC
4368   2$ seq 100 | parallel echo {}
4369   2$ seq 100 | parallel echo '{= $_ = ++$myvar =}'
4370
4371 https://github.com/baalimago/repeater
4372 (Last checked: 2023-12)
4373
4374
4375 =head2 DIFFERENCES BETWEEN parallelize AND GNU Parallel
4376
4377 Summary (see legend above):
4378
4379 =over
4380
4381 =item I1 - - - - - I7
4382
4383 =item - - - - - M6
4384
4385 =item O1 - O3 O4 O5 - O7 - - -
4386
4387 =item E1 - - E4 - - -
4388
4389 =item - - - - - - - - -
4390
4391 =item - -
4392
4393 =back
4394
4395 B<parallelize> runs the full line as a command. If the command is not
4396 found, there is no warning.
4397
4398 The output at most ~1000000 lines/s. If the lines are short this is
4399 quite slow. The lines can at most be 2047999 bytes long. Longer lines
4400 cause segfault.
4401
4402
4403 =head3 EXAMPLES
4404
4405 simple.dat:
4406
4407   sleep 5
4408   foo
4409   cat alire.toml
4410   loc src/parallelize.adb
4411   sh loc src/*.ad?
4412
4413 1$ bin/parallelize -v <simple.dat
4414
4415 1$ parallel <simple.dat
4416
4417 https://github.com/simonjwright/parallelize
4418 (Last checked: 2024-04)
4419
4420 =head2 Todo
4421
4422 https://github.com/justanhduc/task-spooler
4423
4424 https://manpages.ubuntu.com/manpages/xenial/man1/tsp.1.html
4425
4426 https://www.npmjs.com/package/concurrently
4427
4428 http://code.google.com/p/push/ (cannot compile)
4429
4430 https://github.com/krashanoff/parallel
4431
4432 https://github.com/Nukesor/pueue
4433
4434 https://arxiv.org/pdf/2012.15443.pdf KumQuat
4435
4436 https://github.com/JeiKeiLim/simple_distribute_job
4437
4438 https://github.com/reggi/pkgrun - not obvious how to use
4439
4440 https://github.com/benoror/better-npm-run - not obvious how to use
4441
4442 https://github.com/bahmutov/with-package
4443
4444 https://github.com/flesler/parallel
4445
4446 https://github.com/Julian/Verge
4447
4448 https://vicerveza.homeunix.net/~viric/soft/ts/
4449
4450 https://github.com/chapmanjacobd/que
4451
4452
4453
4454 =head1 TESTING OTHER TOOLS
4455
4456 There are certain issues that are very common on parallelizing
4457 tools. Here are a few stress tests. Be warned: If the tool is badly
4458 coded it may overload your machine.
4459
4460
4461 =head2 MIX: Output mixes
4462
4463 Output from 2 jobs should not mix. If the output is not used, this
4464 does not matter; but if the output I<is> used then it is important
4465 that you do not get half a line from one job followed by half a line
4466 from another job.
4467
4468 If the tool does not buffer, output will most likely mix now and then.
4469
4470 This test stresses whether output mixes.
4471
4472   #!/bin/bash
4473
4474   paralleltool="parallel -j 30"
4475
4476   cat <<-EOF > mycommand
4477   #!/bin/bash
4478
4479   # If a, b, c, d, e, and f mix: Very bad
4480   perl -e 'print STDOUT "a"x3000_000," "'
4481   perl -e 'print STDERR "b"x3000_000," "'
4482   perl -e 'print STDOUT "c"x3000_000," "'
4483   perl -e 'print STDERR "d"x3000_000," "'
4484   perl -e 'print STDOUT "e"x3000_000," "'
4485   perl -e 'print STDERR "f"x3000_000," "'
4486   echo
4487   echo >&2
4488   EOF
4489   chmod +x mycommand
4490
4491   # Run 30 jobs in parallel
4492   seq 30 |
4493     $paralleltool ./mycommand > >(tr -s abcdef) 2> >(tr -s abcdef >&2)
4494
4495   # 'a c e' and 'b d f' should always stay together
4496   # and there should only be a single line per job
4497
4498
4499 =head2 STDERRMERGE: Stderr is merged with stdout
4500
4501 Output from stdout and stderr should not be merged, but kept separated.
4502
4503 This test shows whether stdout is mixed with stderr.
4504
4505   #!/bin/bash
4506
4507   paralleltool="parallel -j0"
4508
4509   cat <<-EOF > mycommand
4510   #!/bin/bash
4511
4512   echo stdout
4513   echo stderr >&2
4514   echo stdout
4515   echo stderr >&2
4516   EOF
4517   chmod +x mycommand
4518
4519   # Run one job
4520   echo |
4521     $paralleltool ./mycommand > stdout 2> stderr
4522   cat stdout
4523   cat stderr
4524
4525
4526 =head2 RAM: Output limited by RAM
4527
4528 Some tools cache output in RAM. This makes them extremely slow if the
4529 output is bigger than physical memory and crash if the output is
4530 bigger than the virtual memory.
4531
4532   #!/bin/bash
4533
4534   paralleltool="parallel -j0"
4535
4536   cat <<'EOF' > mycommand
4537   #!/bin/bash
4538
4539   # Generate 1 GB output
4540   yes "`perl -e 'print \"c\"x30_000'`" | head -c 1G
4541   EOF
4542   chmod +x mycommand
4543
4544   # Run 20 jobs in parallel
4545   # Adjust 20 to be > physical RAM and < free space on /tmp
4546   seq 20 | time $paralleltool ./mycommand | wc -c
4547
4548
4549 =head2 DISKFULL: Incomplete data if /tmp runs full
4550
4551 If caching is done on disk, the disk can run full during the run. Not
4552 all programs discover this. GNU Parallel discovers it, if it stays
4553 full for at least 2 seconds.
4554
4555   #!/bin/bash
4556
4557   paralleltool="parallel -j0"
4558
4559   # This should be a dir with less than 100 GB free space
4560   smalldisk=/tmp/shm/parallel
4561
4562   TMPDIR="$smalldisk"
4563   export TMPDIR
4564
4565   max_output() {
4566       # Force worst case scenario:
4567       # Make GNU Parallel only check once per second
4568       sleep 10
4569       # Generate 100 GB to fill $TMPDIR
4570       # Adjust if /tmp is bigger than 100 GB
4571       yes | head -c 100G >$TMPDIR/$$
4572       # Generate 10 MB output that will not be buffered
4573       # due to full disk
4574       perl -e 'print "X"x10_000_000' | head -c 10M
4575       echo This part is missing from incomplete output
4576       sleep 2
4577       rm $TMPDIR/$$
4578       echo Final output
4579   }
4580
4581   export -f max_output
4582   seq 10 | $paralleltool max_output | tr -s X
4583
4584
4585 =head2 CLEANUP: Leaving tmp files at unexpected death
4586
4587 Some tools do not clean up tmp files if they are killed. If the tool
4588 buffers on disk, they may not clean up, if they are killed.
4589
4590   #!/bin/bash
4591
4592   paralleltool=parallel
4593
4594   ls /tmp >/tmp/before
4595   seq 10 | $paralleltool sleep &
4596   pid=$!
4597   # Give the tool time to start up
4598   sleep 1
4599   # Kill it without giving it a chance to cleanup
4600   kill -9 $!
4601   # Should be empty: No files should be left behind
4602   diff <(ls /tmp) /tmp/before
4603
4604
4605 =head2 SPCCHAR: Dealing badly with special file names.
4606
4607 It is not uncommon for users to create files like:
4608
4609   My brother's 12" *** record  (costs $$$).jpg
4610
4611 Some tools break on this.
4612
4613   #!/bin/bash
4614
4615   paralleltool=parallel
4616
4617   touch "My brother's 12\" *** record  (costs \$\$\$).jpg"
4618   ls My*jpg | $paralleltool ls -l
4619
4620
4621 =head2 COMPOSED: Composed commands do not work
4622
4623 Some tools require you to wrap composed commands into B<bash -c>.
4624
4625   echo bar | $paralleltool echo foo';' echo {}
4626
4627
4628 =head2 ONEREP: Only one replacement string allowed
4629
4630 Some tools can only insert the argument once.
4631
4632   echo bar | $paralleltool echo {} foo {}
4633
4634
4635 =head2 INPUTSIZE: Length of input should not be limited
4636
4637 Some tools limit the length of the input lines artificially with no good
4638 reason. GNU B<parallel> does not:
4639
4640   perl -e 'print "foo."."x"x100_000_000' | parallel echo {.}
4641
4642 GNU B<parallel> limits the command to run to 128 KB due to execve(1):
4643
4644   perl -e 'print "x"x131_000' | parallel echo {} | wc
4645
4646
4647 =head2 NUMWORDS: Speed depends on number of words
4648
4649 Some tools become very slow if output lines have many words.
4650
4651   #!/bin/bash
4652
4653   paralleltool=parallel
4654
4655   cat <<-EOF > mycommand
4656   #!/bin/bash
4657
4658   # 10 MB of lines with 1000 words
4659   yes "`seq 1000`" | head -c 10M
4660   EOF
4661   chmod +x mycommand
4662
4663   # Run 30 jobs in parallel
4664   seq 30 | time $paralleltool -j0 ./mycommand > /dev/null
4665
4666 =head2 4GB: Output with a line > 4GB should be OK
4667
4668   #!/bin/bash
4669
4670   paralleltool="parallel -j0"
4671
4672   cat <<-EOF > mycommand
4673   #!/bin/bash
4674
4675   perl -e '\$a="a"x1000_000; for(1..5000) { print \$a }'
4676   EOF
4677   chmod +x mycommand
4678
4679   # Run 1 job
4680   seq 1 | $paralleltool ./mycommand | LC_ALL=C wc
4681
4682
4683 =head1 AUTHOR
4684
4685 When using GNU B<parallel> for a publication please cite:
4686
4687 O. Tange (2011): GNU Parallel - The Command-Line Power Tool, ;login:
4688 The USENIX Magazine, February 2011:42-47.
4689
4690 This helps funding further development; and it won't cost you a cent.
4691 If you pay 10000 EUR you should feel free to use GNU Parallel without citing.
4692
4693 Copyright (C) 2007-10-18 Ole Tange, http://ole.tange.dk
4694
4695 Copyright (C) 2008-2010 Ole Tange, http://ole.tange.dk
4696
4697 Copyright (C) 2010-2024 Ole Tange, http://ole.tange.dk and Free
4698 Software Foundation, Inc.
4699
4700 Parts of the manual concerning B<xargs> compatibility is inspired by
4701 the manual of B<xargs> from GNU findutils 4.4.2.
4702
4703
4704 =head1 LICENSE
4705
4706 This program is free software; you can redistribute it and/or modify
4707 it under the terms of the GNU General Public License as published by
4708 the Free Software Foundation; either version 3 of the License, or
4709 at your option any later version.
4710
4711 This program is distributed in the hope that it will be useful,
4712 but WITHOUT ANY WARRANTY; without even the implied warranty of
4713 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
4714 GNU General Public License for more details.
4715
4716 You should have received a copy of the GNU General Public License
4717 along with this program.  If not, see <https://www.gnu.org/licenses/>.
4718
4719 =head2 Documentation license I
4720
4721 Permission is granted to copy, distribute and/or modify this
4722 documentation under the terms of the GNU Free Documentation License,
4723 Version 1.3 or any later version published by the Free Software
4724 Foundation; with no Invariant Sections, with no Front-Cover Texts, and
4725 with no Back-Cover Texts.  A copy of the license is included in the
4726 file LICENSES/GFDL-1.3-or-later.txt.
4727
4728 =head2 Documentation license II
4729
4730 You are free:
4731
4732 =over 9
4733
4734 =item B<to Share>
4735
4736 to copy, distribute and transmit the work
4737
4738 =item B<to Remix>
4739
4740 to adapt the work
4741
4742 =back
4743
4744 Under the following conditions:
4745
4746 =over 9
4747
4748 =item B<Attribution>
4749
4750 You must attribute the work in the manner specified by the author or
4751 licensor (but not in any way that suggests that they endorse you or
4752 your use of the work).
4753
4754 =item B<Share Alike>
4755
4756 If you alter, transform, or build upon this work, you may distribute
4757 the resulting work only under the same, similar or a compatible
4758 license.
4759
4760 =back
4761
4762 With the understanding that:
4763
4764 =over 9
4765
4766 =item B<Waiver>
4767
4768 Any of the above conditions can be waived if you get permission from
4769 the copyright holder.
4770
4771 =item B<Public Domain>
4772
4773 Where the work or any of its elements is in the public domain under
4774 applicable law, that status is in no way affected by the license.
4775
4776 =item B<Other Rights>
4777
4778 In no way are any of the following rights affected by the license:
4779
4780 =over 2
4781
4782 =item *
4783
4784 Your fair dealing or fair use rights, or other applicable
4785 copyright exceptions and limitations;
4786
4787 =item *
4788
4789 The author's moral rights;
4790
4791 =item *
4792
4793 Rights other persons may have either in the work itself or in
4794 how the work is used, such as publicity or privacy rights.
4795
4796 =back
4797
4798 =back
4799
4800 =over 9
4801
4802 =item B<Notice>
4803
4804 For any reuse or distribution, you must make clear to others the
4805 license terms of this work.
4806
4807 =back
4808
4809 A copy of the full license is included in the file as
4810 LICENCES/CC-BY-SA-4.0.txt
4811
4812
4813 =head1 DEPENDENCIES
4814
4815 GNU B<parallel> uses Perl, and the Perl modules Getopt::Long,
4816 IPC::Open3, Symbol, IO::File, POSIX, and File::Temp. For remote usage
4817 it also uses rsync with ssh.
4818
4819
4820 =head1 SEE ALSO
4821
4822 B<find>(1), B<xargs>(1), B<make>(1), B<pexec>(1), B<ppss>(1),
4823 B<xjobs>(1), B<prll>(1), B<dxargs>(1), B<mdm>(1)
4824
4825 =cut