fangle.tm

   1 <TeXmacs|1.0.7.10>
   2
   3 <style|<tuple|book|fangle|header-book|tmdoc-keyboard>>
   4
   5 <\body>
   6   <hide-preamble|<assign|LyX|<macro|L<space|-0.1667em><move|Y|0fn|-0.25em><space|-0.125em>X>><assign|par-first|0fn><assign|par-par-sep|0.5fn>>
   7
   8   <doc-data|<doc-title|fangle>|<doc-author-data|<author-name|Sam
   9   Liddicott>|<\author-address>
  10     sam@liddicott.com
  11   </author-address>>|<doc-date|August 2009>>
  12
  13   <section*|Introduction>
  14
  15   <name|Fangle> is a tool for fangled literate programming. Newfangled is
  16   defined as <em|New and often needlessly novel> by
  17   <name|TheFreeDictionary.com>.
  18
  19   In this case, fangled means yet another not-so-new<footnote|but improved.>
  20   method for literate programming.
  21
  22   <name|Literate Programming> has a long history starting with the great
  23   <name|Donald Knuth> himself, whose literate programming tools seem to make
  24   use of as many escape sequences for semantic markup as <TeX> (also by
  25   <name|Donald Knuth>).
  26
  27   <name|Norman Ramsey> wrote the <name|Noweb> set of tools
  28   (<verbatim|notangle>, <verbatim|noweave> and <verbatim|noroots>) and
  29   helpfully reduced the amount of magic character sequences to pretty much
  30   just <verbatim|\<less\>\<less\>>, <verbatim|\<gtr\>\<gtr\>> and
  31   <verbatim|@>, and in doing so brought the wonders of literate programming
  32   within my reach.
  33
  34   While using the <LyX> editor for <LaTeX> editing I had various troubles
  35   with the noweb tools, some of which were my fault, some of which were
  36   noweb's fault and some of which were <LyX>'s fault.
  37
  38   <name|Noweb> generally brought literate programming to the masses through
  39   removing some of the complexity of the original literate programming, but
  40   this would be of no advantage to me if the <LyX> / <LaTeX> combination
  41   brought more complications in their place.
  42
  43   <name|Fangle> was thus born (originally called <name|Newfangle>) as an awk
  44   replacement for notangle, adding some important features, like better
  45   integration with <LyX> and <LaTeX> (and later <TeXmacs>), multiple output
  46   format conversions, and fixing notangle bugs like indentation when using -L
  47   for line numbers.
  48
  49   Significantly, fangle is just one program which replaces various programs
  50   in <name|Noweb>. Noweave is done away with and implemented directly as
  51   <LaTeX> macros, and noroots is implemented as a function of the untangler
  52   fangle.
  53
  54   Fangle is written in awk for portability reasons, awk being available for
  55   most platforms. A Python version<\footnote>
  56     hasn't anyone implemented awk in python yet?
  57   </footnote> was considered for the benefit of <LyX> but a scheme version
  58   for <TeXmacs> will probably materialise first; as <TeXmacs> macro
  59   capabilities help make edit-time and format-time rendering of fangle chunks
  60   simple enough for my weak brain.
  61
  62   As an extension to many literate-programming styles, Fangle permits code
  63   chunks to take parameters and thus operate somewhat like C pre-processor
  64   macros, or like C++ templates. Name parameters (or even local
  65   <em|variables> in the callers scope) are anticipated, as parameterized
  66   chunks <emdash> useful though they are <emdash> are hard to comprehend in
  67   the literate document.
  68
  69   <section*|License><new-page*><label|License>
  70
  71   Fangle is licensed under the GPL 3 (or later).
  72
  73   This doesn't mean that sources generated by fangle must be licensed under
  74   the GPL 3.
  75
  76   This doesn't mean that you can't use or distribute fangle with sources of
  77   an incompatible license, but it means you must make the source of fangle
  78   available too.
  79
  80   As fangle is currently written in awk, an interpreted language, this should
  81   not be too hard.
  82
  83   <\nf-chunk|gpl3-copyright>
  84     <item>fangle - fully featured notangle replacement in awk
  85
  86     <item>
  87
  88     <item>Copyright (C) 2009-2010 Sam Liddicott
  89     \<less\>sam@liddicott.com\<gtr\>
  90
  91     <item>
  92
  93     <item>This program is free software: you can redistribute it and/or
  94     modify
  95
  96     <item>it under the terms of the GNU General Public License as published
  97     by
  98
  99     <item>the Free Software Foundation, either version 3 of the License, or
 100
 101     <item>(at your option) any later version.
 102
 103     <item>
 104
 105     <item>This program is distributed in the hope that it will be useful,
 106
 107     <item>but WITHOUT ANY WARRANTY; without even the implied warranty of
 108
 109     <item>MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. \ See the
 110
 111     <item>GNU General Public License for more details.
 112
 113     <item>
 114
 115     <item>You should have received a copy of the GNU General Public License
 116
 117     <item>along with this program. \ If not, see
 118     \<less\>http://www.gnu.org/licenses/\<gtr\>.
 119   </nf-chunk|text|>
 120
 121   <\table-of-contents|toc>
 122   </table-of-contents>
 123
 124   <part|Using Fangle>
 125
 126   <chapter|Introduction to Literate Programming>
 127
 128   Todo: Should really follow on from a part-0 explanation of what literate
 129   programming is.
 130
 131   <chapter|Running Fangle>
 132
 133   Fangle is a replacement for <name|noweb>, which consists of
 134   <verbatim|notangle>, <verbatim|noroots> and <verbatim|noweave>.
 135
 136   Like <verbatim|notangle> and <verbatim|noroots>, <verbatim|fangle> can read
 137   multiple named files, or from stdin.
 138
 139   <section|Listing roots>
 140
 141   The -r option causes fangle to behave like noroots.
 142
 143   <code*|fangle -r filename.tex>
 144
 145   will print out the fangle roots of a tex file.\
 146
 147   Unlike the <verbatim|noroots> command, the printed roots are not enclosed
 148   in angle brackets e.g. <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>>,
 149   unless at least one of the roots is defined using the <verbatim|notangle>
 150   notation <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>=>.
 151
 152   Also, unlike noroots, it prints out all roots --- not just those that are
 153   not used elsewhere. I find that a root not being used doesn't make it
 154   particularly top level <emdash> and so-called top level roots could also be
 155   included in another root as well.\
 156
 157   My convention is that top level roots to be extracted begin with
 158   <verbatim|./> and have the form of a filename.
 159
 160   Makefile.inc, discussed in <reference|makefile.inc>, can automatically
 161   extract all such sources prefixed with <verbatim|./>
 162
 163   <section|Extracting roots>
 164
 165   notangle's <verbatim|-R> and <verbatim|-L> options are supported.
 166
 167   If you are using <LyX> or <LaTeX>, the standard way to extract a file would
 168   be:
 169
 170   <verbatim|fangle -R./Makefile.inc fangle.tex \<gtr\> ./Makefile.inc>
 171
 172   If you are using <TeXmacs>, the standard way to extract a file would
 173   similarly be:
 174
 175   <verbatim|fangle -R./Makefile.inc fangle.txt \<gtr\> ./Makefile.inc>
 176
 177   <TeXmacs> users would obtain the text file with a <em|verbatim> export from
 178   <TeXmacs> which can be done on the command line with <verbatim|texmacs -s
 179   -c fangle.tm fangle.txt -q>
 180
 181   Unlike the <verbatim|noroots> command, the <verbatim|<verbatim|-L>> option
 182   to generate C pre-preocessor <verbatim|#file> style line-number
 183   directives,does not break indenting of the generated file..
 184
 185   Also, thanks to mode tracking (described in <reference|modes>) the
 186   <verbatim|-L> option does not interrupt (and break) multi-line C macros
 187   either.
 188
 189   This does mean that sometimes the compiler might calculate the source line
 190   wrongly when generating error messages in such cases, but there isn't any
 191   other way around if multi-line macros include other chunks.
 192
 193   Future releases will include a mapping file so that line/character
 194   references from the C compiler can be converted to the correct part of the
 195   source document.
 196
 197   <section|Formatting the document>
 198
 199   The noweave replacement built into the editing and formatting environment
 200   for <TeXmacs>, <LyX> (which uses <LaTeX>), and even for raw <LaTeX>.
 201
 202   Use of fangle with <TeXmacs>, <LyX> and <LaTeX> are explained the the next
 203   few chapters.
 204
 205   <chapter|Using Fangle with <LaTeX>>
 206
 207   Because the noweave replacement is impemented in <LaTeX>, there is no
 208   processing stage required before running the <LaTeX> command. Of course,
 209   <LaTeX> may need running two or more times, so that the code chunk
 210   references can be fully calculated.
 211
 212   The formatting is managed by a set of macros shown in
 213   <reference|latex-source>, and can be included with:
 214
 215   <verbatim|\\usepackage{fangle.sty}>
 216
 217   Norman Ramsay's origial <filename|noweb.sty> package is currently required
 218   as it is used for formatting the code chunk captions.
 219
 220   The <filename|listings.sty> package is required, and is used for formatting
 221   the code chunks and syntax highlighting.
 222
 223   The <filename|xargs.sty> package is also required, and makes writing
 224   <LaTeX> macro so much more pleasant.
 225
 226   <todo|Add examples of use of Macros>
 227
 228   <chapter|Using Fangle with <LyX>>
 229
 230   <LyX> uses the same <LaTeX> macros shown in <reference|latex-source> as
 231   part of a <LyX> module file <filename|fangle.module>, which automatically
 232   includes the macros in the document pre-amble provided that the fangle
 233   <LyX> module is used in the document.
 234
 235   <section|Installing the <LyX> module>
 236
 237   Copy <filename|fangle.module> to your <LyX> layouts directory, which for
 238   unix users will be <filename|~/.lyx/layouts>
 239
 240   In order to make the new literate styles availalble, you will need to
 241   reconfigure <LyX> by clicking Tools-\<gtr\>Reconfigure, and then re-start
 242   <LyX>.
 243
 244   <section|Obtaining a decent mono font>
 245
 246   The syntax high-lighting features of <name|lstlistings> makes use of bold;
 247   however a mono-space tt font is used to typeset the listings. Obtaining a
 248   <with|font-family|tt|<strong|bold> tt font> can be impossibly difficult and
 249   amazingly easy. I spent many hours at it, following complicated
 250   instructions from those who had spend many hours over it, and was finally
 251   delivered the simple solution on the lyx mailing list.
 252
 253   <subsection|txfonts>
 254
 255   The simple way was to add this to my preamble:
 256
 257   <\verbatim>
 258     \\usepackage{txfonts}
 259
 260     \\renewcommand{\\ttdefault}{txtt}
 261   </verbatim>
 262
 263   \;
 264
 265   <subsection|ams pmb>
 266
 267   The next simplest way was to use ams poor-mans-bold, by adding this to the
 268   pre-amble:
 269
 270   <\verbatim>
 271     \\usepackage{amsbsy}
 272
 273     %\\renewcommand{\\ttdefault}{txtt}
 274
 275     %somehow make \\pmb be the command for bold, forgot how, sorry, above
 276     line not work
 277   </verbatim>
 278
 279   It works, but looks wretched on the dvi viewer.
 280
 281   <subsection|Luximono>
 282
 283   The lstlistings documention suggests using Luximono.
 284
 285   Luximono was installed according to the instructions in Ubuntu Forums
 286   thread 1159181<\footnote>
 287     http://ubuntuforums.org/showthread.php?t=1159181
 288   </footnote> with tips from miknight<\footnote>
 289     http://miknight.blogspot.com/2005/11/how-to-install-luxi-mono-font-in.html
 290   </footnote> stating that <verbatim|sudo updmap --enable MixedMap ul9.map>
 291   is required. It looks fine in PDF and PS view but still looks rotten in dvi
 292   view.
 293
 294   <section|Formatting your Lyx document>
 295
 296   It is not necessary to base your literate document on any of the original
 297   <LyX> literate classes; so select a regular class for your document type.
 298
 299   Add the new module <em|Fangle Literate Listings> and also <em|Logical
 300   Markup> which is very useful.
 301
 302   In the drop-down style listbox you should notice a new style defined,
 303   called <em|Chunk>.
 304
 305   When you wish to insert a literate chunk, you enter it's plain name in the
 306   Chunk style, instead of the old <name|noweb> method that uses
 307   <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>=> type tags. In the line (or
 308   paragraph) following the chunk name, you insert a listing with:
 309   Insert-\<gtr\>Program Listing.
 310
 311   Inside the white listing box you can type (or paste using
 312   <kbd|shift+ctrl+V>) your listing. There is no need to use <kbd|ctrl+enter>
 313   at the end of lines as with some older <LyX> literate techniques --- just
 314   press enter as normal.
 315
 316   <subsection|Customising the listing appearance>
 317
 318   The code is formatted using the <name|lstlistings> package. The chunk style
 319   doesn't just define the chunk name, but can also define any other chunk
 320   options supported by the lstlistings package <verbatim|\\lstset> command.
 321   In fact, what you type in the chunk style is raw latex. If you want to set
 322   the chunk language without having to right-click the listing, just add
 323   <verbatim|,lanuage=C> after the chunk name. (Currently the language will
 324   affect all subsequent listings, so you may need to specify
 325   <verbatim|,language=> quite a lot).
 326
 327   <todo|so fix the bug>
 328
 329   Of course you can do this by editing the listings box advanced properties
 330   by right-clicking on the listings box, but that takes longer, and you can't
 331   see at-a-glance what the advanced settings are while editing the document;
 332   also advanced settings apply only to that box --- the chunk settings apply
 333   through the rest of the document<\footnote>
 334     It ought to apply only to subsequent chunks of the same name. I'll fix
 335     that later
 336   </footnote>.
 337
 338   <todo|So make sure they only apply to chunks of that name>
 339
 340   <subsection|Global customisations>
 341
 342   As lstlistings is used to set the code chunks, it's <verbatim|\\lstset>
 343   command can be used in the pre-amble to set some document wide settings.
 344
 345   If your source has many words with long sequences of capital letters, then
 346   <verbatim|columns=fullflexible> may be a good idea, or the capital letters
 347   will get crowded. (I think lstlistings ought to use a slightly smaller font
 348   for captial letters so that they still fit).
 349
 350   The font family <verbatim|\\ttfamily> looks more normal for code, but has
 351   no bold (an alternate typewriter font is used).\
 352
 353   With <verbatim|\\ttfamily>, I must also specify
 354   <verbatim|columns=fullflexible> or the wrong letter spacing is used.
 355
 356   In my <LaTeX> pre-amble I usually specialise my code format with:
 357
 358   <\nf-chunk|document-preamble>
 359     <item>\\lstset{
 360
 361     <item>numbers=left, stepnumber=1, numbersep=5pt,
 362
 363     <item>breaklines=false,
 364
 365     <item>basicstyle=\\footnotesize\\ttfamily,
 366
 367     <item>numberstyle=\\tiny,
 368
 369     <item>language=C,
 370
 371     <item>columns=fullflexible,
 372
 373     <item>numberfirstline=true
 374
 375     <item>}
 376   </nf-chunk|tex|>
 377
 378   \;
 379
 380   <section|Configuring the build script>
 381
 382   You can invoke code extraction and building from the <LyX> menu option
 383   Document-\<gtr\>Build Program.
 384
 385   First, make sure you don't have a conversion defined for Lyx-\<gtr\>Program
 386
 387   From the menu Tools-\<gtr\>Preferences, add a conversion from
 388   Latex(Plain)-\<gtr\>Program as:
 389
 390   <\verbatim>
 391     set -x ; fangle -Rlyx-build $$i \|\
 392
 393     \ \ env LYX_b=$$b LYX_i=$$i LYX_o=$$o LYX_p=$$p LYX_r=$$r bash
 394   </verbatim>
 395
 396   (But don't cut-n-paste it from this document or you may be be pasting a
 397   multi-line string which will break your lyx preferences file).\
 398
 399   I hope that one day, <LyX> will set these into the environment when calling
 400   the build script.
 401
 402   You may also want to consider adding options to this conversion...
 403
 404   <verbatim|parselog=/usr/share/lyx/scripts/listerrors>
 405
 406   ...but if you do you will lose your stderr<\footnote>
 407     There is some bash plumbing to get a copy of stderr but this footnote is
 408     too small
 409   </footnote>.
 410
 411   Now, a shell script chunk called <filename|lyx-build> will be extracted and
 412   run whenever you choose the Document-\<gtr\>Build Program menu item.
 413
 414   This document was originally managed using <LyX> and lyx-build script for
 415   this document is shown here for historical reference.\
 416
 417   <\verbatim>
 418     lyx -e latex fangle.lyx && \\
 419
 420     \ \ fangle fangle.lyx \<gtr\> ./autoboot
 421   </verbatim>
 422
 423   This looks simple enough, but as mentioned, fangle has to be had from
 424   somewhere before it can be extracted.
 425
 426   <subsection|...>
 427
 428   When the lyx-build chunk is executed, the current directory will be a
 429   temporary directory, and <verbatim|LYX_SOURCE> will refer to the tex file
 430   in this temporary directory. This is unfortunate as our makefile wants to
 431   run from the project directory where the Lyx file is kept.
 432
 433   We can extract the project directory from <verbatim|$$r>, and derive the
 434   probable Lyx filename from the noweb file that Lyx generated.
 435
 436   <\nf-chunk|lyx-build-helper>
 437     <item>PROJECT_DIR="$LYX_r"
 438
 439     <item>LYX_SRC="$PROJECT_DIR/${LYX_i%.tex}.lyx"
 440
 441     <item>TEX_DIR="$LYX_p"
 442
 443     <item>TEX_SRC="$TEX_DIR/$LYX_i"
 444   </nf-chunk|sh|>
 445
 446   And then we can define a lyx-build fragment similar to the autoboot
 447   fragment
 448
 449   <\nf-chunk|lyx-build>
 450     <item>#! /bin/sh
 451
 452     <item><nf-ref|lyx-build-helper|>
 453
 454     <item>cd $PROJECT_DIR \|\| exit 1
 455
 456     <item>
 457
 458     <item>#/usr/bin/fangle -filter ./notanglefix-filter \\
 459
 460     <item># \ -R./Makefile.inc "../../noweb-lyx/noweb-lyx3.lyx" \\
 461
 462     <item># \ \| sed '/NOWEB_SOURCE=/s/=.*/=samba4-dfs.lyx/' \\
 463
 464     <item># \ \<gtr\> ./Makefile.inc
 465
 466     <item>#
 467
 468     <item>#make -f ./Makefile.inc fangle_sources
 469   </nf-chunk|sh|>
 470
 471   \;
 472
 473   <chapter|Using Fangle with <TeXmacs>>
 474
 475   <todo|Write this chapter>
 476
 477   <chapter|Fangle with Makefiles><label|makefile.inc>
 478
 479   Here we describe a <filename|Makefile.inc> that you can include in your own
 480   Makefiles, or glue as a recursive make to other projects.
 481
 482   <filename|Makefile.inc> will cope with extracting all the other source
 483   files from this or any specified literate document and keeping them up to
 484   date.\
 485
 486   It may also be included by a <verbatim|Makefile> or <verbatim|Makefile.am>
 487   defined in a literate document to automatically deal with the extraction of
 488   source files and documents during normal builds.
 489
 490   Thus, if <verbatim|Makefile.inc> is included into a main project makefile
 491   it add rules for the source files, capable of extracting the source files
 492   from the literate document.
 493
 494   <section|A word about makefiles formats>
 495
 496   Whitespace formatting is very important in a Makefile. The first character
 497   of each action line must be a TAB.\
 498
 499   <\verbatim>
 500     target: pre-requisite
 501
 502     <nf-tab>action
 503
 504     <nf-tab>action
 505   </verbatim>
 506
 507   This requires that the literate programming environment have the ability to
 508   represent a TAB character in a way that fangle will generate an actual TAB
 509   character.
 510
 511   We also adopt a convention that code chunks whose names beginning with
 512   <verbatim|./> should always be automatically extracted from the document.
 513   Code chunks whose names do not begin with <verbatim|./> are for internal
 514   reference. Such chunks may be extracted directly, but will not be
 515   automatically extracted by this Makefile.
 516
 517   <section|Extracting Sources>
 518
 519   Our makefile has two parts; variables must be defined before the targets
 520   that use them.
 521
 522   As we progress through this chapter, explaining concepts, we will be adding
 523   lines to <nf-ref|Makefile.inc-vars|> and <nf-ref|Makefile.inc-targets|>
 524   which are included in <nf-ref|./Makefile.inc|> below.
 525
 526   <\nf-chunk|./Makefile.inc>
 527     <item><nf-ref|Makefile.inc-vars|>
 528
 529     <item><nf-ref|Makefile.inc-default-targets|>
 530
 531     <item><nf-ref|Makefile.inc-targets|>
 532   </nf-chunk|make|>
 533
 534   We first define a placeholder for the tool <verbatim|fangle> in case it
 535   cannot be found in the path.
 536
 537   <\nf-chunk|Makefile.inc-vars>
 538     <item>FANGLE=fangle
 539   </nf-chunk||>
 540
 541   We also define a placeholder for <verbatim|LITERATE_SOURCE> to hold the
 542   name of this document. This will normally be passed on the command line.
 543
 544   <\nf-chunk|Makefile.inc-vars>
 545     <item>LITERATE_SOURCE=
 546   </nf-chunk||>
 547
 548   Fangle cannot process <LyX> or <TeXmacs> documents directly, so the first
 549   stage is to convert these to more suitable text based formats<\footnote>
 550     <LyX> and <TeXmacs> formats are text-based, but not suitable for fangle
 551   </footnote>.
 552
 553   <subsection|Converting from <LyX> to <LaTeX>><label|Converting-from-Lyx>
 554
 555   The first stage will always be to convert the <LyX> file to a <LaTeX> file.
 556   Fangle must run on a <TeX> file because the <LyX> command
 557   <verbatim|server-goto-file-line><\footnote>
 558     The Lyx command <verbatim|server-goto-file-line> is used to position the
 559     Lyx cursor at the compiler errors.
 560   </footnote> requries that the line number provided be a line of the <TeX>
 561   file and always maps this the line in the <LyX> docment. We use
 562   <verbatim|server-goto-file-line> when moving the cursor to error lines
 563   during compile failures.
 564
 565   The command <verbatim|lyx -e literate fangle.lyx> will produce
 566   <verbatim|fangle.tex>, a <TeX> file; so we define a make target to be the
 567   same as the <LyX> file but with the <verbatim|.tex> extension.
 568
 569   The <verbatim|EXTRA_DIST> is for automake support so that the <TeX> files
 570   will automaticaly be distributed with the source, to help those who don't
 571   have <LyX> installed.
 572
 573   <\nf-chunk|Makefile.inc-vars>
 574     <item>LYX_SOURCE=$(LITERATE_SOURCE) # but only the .lyx files
 575
 576     <item>TEX_SOURCE=$(LYX_SOURCE:.lyx=.tex)
 577
 578     <item>EXTRA_DIST+=$(TEX_SOURCE)
 579   </nf-chunk||>
 580
 581   We then specify that the <TeX> source is to be generated from the <LyX>
 582   source.
 583
 584   <\nf-chunk|Makefile.inc-targets>
 585     <item>.SUFFIXES: .tex .lyx
 586
 587     <item>.lyx.tex:
 588
 589     <item><nf-tab>lyx -e latex $\<less\>
 590
 591     <item>clean_tex:
 592
 593     <item><nf-tab>rm -f -- $(TEX_SOURCE)
 594
 595     <item>clean: clean_tex
 596   </nf-chunk||>
 597
 598   <subsection|Converting from <TeXmacs>><label|Converting-from-Lyx>
 599
 600   Fangle cannot process <TeXmacs> files directly<\footnote>
 601     but this is planned when <TeXmacs> uses xml as it's native format
 602   </footnote>, but must first convert them to text files.
 603
 604   The command <verbatim|texmacs -c fangle.tm fangle.txt -q> will produce
 605   <verbatim|fangle.txt>, a text file; so we define a make target to be the
 606   same as the <TeXmacs> file but with the <verbatim|.txt> extension.
 607
 608   The <verbatim|EXTRA_DIST> is for automake support so that the <TeX> files
 609   will automaticaly be distributed with the source, to help those who don't
 610   have <LyX> installed.
 611
 612   <\nf-chunk|Makefile.inc-vars>
 613     <item>TEXMACS_SOURCE=$(LITERATE_SOURCE) # but only the .tm files
 614
 615     <item>TXT_SOURCE=$(LITERATE_SOURCE:.tm=.txt)
 616
 617     <item>EXTRA_DIST+=$(TXT_SOURCE)
 618   </nf-chunk||>
 619
 620   <todo|Add loop around each $\<less\> so multiple targets can be specified>
 621
 622   <\nf-chunk|Makefile.inc-targets>
 623     <item>.SUFFIXES: .txt .tm
 624
 625     <item>.tm.txt:
 626
 627     <item><nf-tab>texmacs -s -c $\<less\> $@ -q
 628
 629     <item>.PHONEY: clean_txt
 630
 631     <item>clean_txt:
 632
 633     <item><nf-tab>rm -f -- $(TXT_SOURCE)
 634
 635     <item>clean: clean_txt
 636   </nf-chunk||>
 637
 638   <section|Extracting Program Source>
 639
 640   The program source is extracted using fangle, which is designed to operate
 641   on text or a <LaTeX> documents<\footnote>
 642     <LaTeX> documents are just slightly special text documents
 643   </footnote>.
 644
 645   <\nf-chunk|Makefile.inc-vars>
 646     <item>FANGLE_SOURCE=$(TXT_SOURCE)
 647   </nf-chunk||>
 648
 649   The literate document can result in any number of source files, but not all
 650   of these will be changed each time the document is updated. We certainly
 651   don't want to update the timestamps of these files and cause the whole
 652   source tree to be recompiled just because the literate explanation was
 653   revised. We use <verbatim|CPIF> from the <em|Noweb> tools to avoid updating
 654   the file if the content has not changed, but should probably write our own.
 655
 656   However, if a source file is not updated, then the fangle file will always
 657   have a newer time-stamp and the makefile would always re-attempt to extact
 658   a newer source file which would be a waste of time.
 659
 660   Because of this, we use a stamp file which is always updated each time the
 661   sources are fully extracted from the <LaTeX> document. If the stamp file is
 662   newer than the document, then we can avoid an attempt to re-extract any of
 663   the sources. Because this stamp file is only updated when extraction is
 664   complete, it is safe for the user to interrupt the build-process
 665   mid-extraction.
 666
 667   We use <verbatim|echo> rather than <verbatim|touch> to update the stamp
 668   file beause the <verbatim|touch> command does not work very well over an
 669   <verbatim|sshfs> mount \ that I was using.
 670
 671   <\nf-chunk|Makefile.inc-vars>
 672     <item>FANGLE_SOURCE_STAMP=$(FANGLE_SOURCE).stamp
 673   </nf-chunk||>
 674
 675   <\nf-chunk|Makefile.inc-targets>
 676     <item>$(FANGLE_SOURCE_STAMP): $(FANGLE_SOURCE) \\
 677
 678     <item><nf-tab> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ $(FANGLE_SOURCES) ; \\
 679
 680     <item><nf-tab>echo -n \<gtr\> $(FANGLE_SOURCE_STAMP)
 681
 682     <item>clean_stamp:
 683
 684     <item><nf-tab>rm -f $(FANGLE_SOURCE_STAMP)
 685
 686     <item>clean: clean_stamp
 687   </nf-chunk||>
 688
 689   <section|Extracting Source Files>
 690
 691   We compute <verbatim|FANGLE_SOURCES> to hold the names of all the source
 692   files defined in the document. We compute this only once, by means of
 693   <verbatim|:=> in assignent. The sed deletes the any
 694   <verbatim|\<less\>\<less\>> and <verbatim|\<gtr\>\<gtr\>> which may
 695   surround the roots names (for compatibility with Noweb's noroots command).
 696
 697   As we use chunk names beginning with <filename|./> to denote top level
 698   fragments that should be extracted, we filter out all fragments that do not
 699   begin with <filename|./>
 700
 701   <\note>
 702     <verbatim|FANGLE_PREFIX> is set to <verbatim|./> by default, but whatever
 703     it may be overridden to, the prefix is replaced by a literal
 704     <verbatim|./> before extraction so that files will be extracted in the
 705     current directory whatever the prefix. This helps namespace or
 706     sub-project prefixes like <verbatim|documents:> for chunks like
 707     <verbatim|documents:docbook/intro.xml>
 708   </note>
 709
 710   <todo|This doesn't work though, because it loses the full name and doesn't
 711   know what to extact!>
 712
 713   <\nf-chunk|Makefile.inc-vars>
 714     <item>FANGLE_PREFIX:=\\.\\/
 715
 716     <item>FANGLE_SOURCES:=$(shell \\
 717
 718     <item> \ $(FANGLE) -r $(FANGLE_SOURCE) \|\\
 719
 720     <item> \ sed -e 's/^[\<less\>][\<less\>]//;s/[\<gtr\>][\<gtr\>]$$//;/^$(FANGLE_PREFIX)/!d'
 721     \\
 722
 723     <item> \ \ \ \ \ -e 's/^$(FANGLE_PREFIX)/\\.\\//' )
 724   </nf-chunk||>
 725
 726   The target below, <verbatim|echo_fangle_sources> is a helpful debugging
 727   target and shows the names of the files that would be extracted.
 728
 729   <\nf-chunk|Makefile.inc-targets>
 730     <item>.PHONY: echo_fangle_sources
 731
 732     <item>echo_fangle_sources: ; @echo $(FANGLE_SOURCES)
 733   </nf-chunk||>
 734
 735   We define a convenient target called <verbatim|fangle_sources> so that
 736   <verbatim|make -f fangle_sources> will re-extract the source if the
 737   literate document has been updated.\
 738
 739   <\nf-chunk|Makefile.inc-targets>
 740     <item>.PHONY: fangle_sources
 741
 742     <item>fangle_sources: $(FANGLE_SOURCE_STAMP)
 743   </nf-chunk||>
 744
 745   And also a convenient target to remove extracted sources.
 746
 747   <\nf-chunk|Makefile.inc-targets>
 748     <item>.PHONY: clean_fangle_sources
 749
 750     <item>clean_fangle_sources: ; \\
 751
 752     <item> \ \ \ \ \ \ \ rm -f -- $(FANGLE_SOURCE_STAMP) $(FANGLE_SOURCES)
 753   </nf-chunk||>
 754
 755   We now look at the extraction of the source files.
 756
 757   This makefile macro <verbatim|if_extension> takes 4 arguments: the filename
 758   <verbatim|$(1)>, some extensions to match <verbatim|$(2)> and a shell
 759   command to return if the filename does match the exensions <verbatim|$(3)>,
 760   and a shell command to return if it does not match the extensions
 761   <verbatim|$(4)>.
 762
 763   <\nf-chunk|Makefile.inc-vars>
 764     <item>if_extension=$(if $(findstring $(suffix $(1)),$(2)),$(3),$(4))
 765   </nf-chunk||>
 766
 767   For some source files like C files, we want to output the line number and
 768   filename of the original <LaTeX> document from which the source
 769   came<\footnote>
 770     I plan to replace this option with a separate mapping file so as not to
 771     pollute the generated source, and also to allow a code pretty-printing
 772     reformatter like <verbatim|indent> be able to re-format the file and
 773     adjust for changes through comparing the character streams.
 774   </footnote>.
 775
 776   To make this easier we define the file extensions for which we want to do
 777   this.
 778
 779   <\nf-chunk|Makefile.inc-vars>
 780     <item>C_EXTENSIONS=.c .h
 781   </nf-chunk||>
 782
 783   We can then use the <verbatim|if_extensions> macro to define a macro which
 784   expands out to the <verbatim|-L> option if fangle is being invoked in a C
 785   source file, so that C compile errors will refer to the line number in the
 786   <TeX> document.\
 787
 788   <\nf-chunk|Makefile.inc-vars>
 789     <item>TABS=8
 790
 791     <item>nf_line=-L -T$(TABS)
 792
 793     <item>fangle=$(FANGLE) $(call if_extension,$(2),$(C_EXTENSIONS),$(nf_line))
 794     -R"$(2)" $(1)
 795   </nf-chunk||>
 796
 797   We can use a similar trick to define an indent macro which takes just the
 798   filename as an argument and can return a pipeline stage calling the indent
 799   command. Indent can be turned off with <verbatim|make fangle_sources
 800   indent=>
 801
 802   <\nf-chunk|Makefile.inc-vars>
 803     <item>indent_options=-npro -kr -i8 -ts8 -sob -l80 -ss -ncs
 804
 805     <item>indent=$(call if_extension,$(1),$(C_EXTENSIONS), \| indent
 806     $(indent_options))
 807   </nf-chunk||>
 808
 809   We now define the pattern for extracting a file. The files are written
 810   using noweb's <verbatim|cpif> so that the file timestamp will not be
 811   touched if the contents haven't changed. This avoids the need to rebuild
 812   the entire project because of a typographical change in the documentation,
 813   or if none or a few C source files have changed.
 814
 815   <\nf-chunk|Makefile.inc-vars>
 816     <item>fangle_extract=@mkdir -p $(dir $(1)) && \\
 817
 818     <item> \ $(call fangle,$(2),$(1)) \<gtr\> "$(1).tmp" && \\
 819
 820     <item> \ cat "$(1).tmp" $(indent) \| cpif "$(1)" \\
 821
 822     <item> \ && rm -f -- "$(1).tmp" \|\| \\
 823
 824     <item> \ (echo error fangling $(1) from $(2) ; exit 1)
 825   </nf-chunk||>
 826
 827   We define a target which will extract or update all sources. To do this we
 828   first defined a makefile template that can do this for any source file in
 829   the <LaTeX> document.
 830
 831   <\nf-chunk|Makefile.inc-vars>
 832     <item>define FANGLE_template
 833
 834     <item> \ $(1): $(2)
 835
 836     <item><nf-tab>$$(call fangle_extract,$(1),$(2))
 837
 838     <item> \ FANGLE_TARGETS+=$(1)
 839
 840     <item>endef
 841   </nf-chunk||>
 842
 843   We then enumerate the discovered <verbatim|FANGLE_SOURCES> to generate a
 844   makefile rule for each one using the makefile template we defined above.
 845
 846   <\nf-chunk|Makefile.inc-targets>
 847     <item>$(foreach source,$(FANGLE_SOURCES),\\
 848
 849     <item> \ $(eval $(call FANGLE_template,$(source),$(FANGLE_SOURCE))) \\
 850
 851     <item>)
 852   </nf-chunk||>
 853
 854   These will all be built with <verbatim|FANGLE_SOURCE_STAMP>.
 855
 856   We also remove the generated sources on a make distclean.
 857
 858   <\nf-chunk|Makefile.inc-targets>
 859     <item>_distclean: clean_fangle_sources
 860   </nf-chunk||>
 861
 862   <section|Extracting Documentation>
 863
 864   We then identify the intermediate stages of the documentation and their
 865   build and clean targets.
 866
 867   <\nf-chunk|Makefile.inc-default-targets>
 868     <item>.PHONEY : clean_pdf
 869   </nf-chunk||>
 870
 871   <subsection|Formatting <TeX>>
 872
 873   <subsubsection|Running pdflatex>
 874
 875   We produce a pdf file from the tex file.
 876
 877   <\nf-chunk|Makefile.inc-vars>
 878     <item>FANGLE_PDF+=$(TEX_SOURCE:.tex=.pdf)
 879   </nf-chunk||>
 880
 881   We run pdflatex twice to be sure that the contents and aux files are up to
 882   date. We certainly are <em|required> to run pdflatex at least twice if
 883   these files do not exist.
 884
 885   <\nf-chunk|Makefile.inc-targets>
 886     <item>.SUFFIXES: .tex .pdf
 887
 888     <item>.tex.pdf:
 889
 890     <item><nf-tab>pdflatex $\<less\> && pdflatex $\<less\>
 891
 892     <item>
 893
 894     <item>clean_pdf_tex:
 895
 896     <item><nf-tab>rm -f -- $(FANGLE_PDF) $(TEX_SOURCE:.tex=.toc) \\
 897
 898     <item><nf-tab> \ $(TEX_SOURCE:.tex=.log) $(TEX_SOURCE:.tex=.aux)
 899
 900     <item>clean_pdf: clean_pdf_tex
 901   </nf-chunk||>
 902
 903   <subsection|Formatting <TeXmacs>>
 904
 905   <TeXmacs> can produce a PDF file directly.
 906
 907   <\nf-chunk|Makefile.inc-vars>
 908     <item>FANGLE_PDF+=$(LITERATE_SOURCE:.tm=.pdf)
 909   </nf-chunk||>
 910
 911   <\todo>
 912     Outputting the PDF may not be enough to update the links and page
 913     references. I think
 914
 915     we need to update twice, generate a pdf, update twice mode and generate a
 916     new PDF.
 917
 918     Basically the PDF export of <TeXmacs> is pretty rotten and doesn't work
 919     properly from the CLI
 920   </todo>
 921
 922   <\nf-chunk|Makefile.inc-targets>
 923     <item>.SUFFIXES: .tm .pdf
 924
 925     <item>.tm.pdf:
 926
 927     <item><nf-tab>texmacs -s -c $\<less\> $@ -q
 928
 929     <item>
 930
 931     <item>clean_pdf_texmacs:
 932
 933     <item><nf-tab>rm -f -- $(FANGLE_PDF)
 934
 935     <item>clean_pdf: clean_pdf_texmacs
 936   </nf-chunk||>
 937
 938   <subsection|Building the Documentation as a Whole>
 939
 940   Currently we only build pdf as a final format, but <verbatim|FANGLE_DOCS>
 941   may later hold other output formats.
 942
 943   <\nf-chunk|Makefile.inc-vars>
 944     <item>FANGLE_DOCS=$(FANGLE_PDF)
 945   </nf-chunk||>
 946
 947   We also define <verbatim|fangle_docs> as a convenient phony target.
 948
 949   <\nf-chunk|Makefile.inc-targets>
 950     <item>.PHONY: fangle_docs
 951
 952     <item>fangle_docs: $(FANGLE_DOCS)
 953
 954     <item>docs: fangle_docs
 955   </nf-chunk||>
 956
 957   And define a convenient <verbatim|clean_fangle_docs> which we add to the
 958   regular clean target
 959
 960   <\nf-chunk|Makefile.inc-targets>
 961     <item>.PHONEY: clean_fangle_docs
 962
 963     <item>clean_fangle_docs: clean_tex clean_pdf
 964
 965     <item>clean: clean_fangle_docs
 966
 967     <item>
 968
 969     <item>distclean_fangle_docs: clean_tex clean_fangle_docs
 970
 971     <item>distclean: clean distclean_fangle_docs
 972   </nf-chunk||>
 973
 974   <section|Other helpers>
 975
 976   If <filename|Makefile.inc> is included into <filename|Makefile>, then
 977   extracted files can be updated with this command:
 978
 979   <verbatim|make fangle_sources>
 980
 981   otherwise, with:
 982
 983   <verbatim|make -f Makefile.inc fangle_sources>
 984
 985   <section|Boot-strapping the extraction>
 986
 987   As well as having the makefile extract or update the source files as part
 988   of it's operation, it also seems convenient to have the makefile
 989   re-extracted itself from <em|this> document.
 990
 991   It would also be convenient to have the code that extracts the makefile
 992   from this document to also be part of this document, however we have to
 993   start somewhere and this unfortunately requires us to type at least a few
 994   words by hand to start things off.
 995
 996   Therefore we will have a minimal root fragment, which, when extracted, can
 997   cope with extracting the rest of the source. This shell script fragment can
 998   do that. It's name is <verbatim|*> <emdash> out of regard for <name|Noweb>,
 999   but when extracted might better be called <verbatim|autoupdate>.
1000
1001   <todo|De-lyxify>
1002
1003   <\nf-chunk|*>
1004     <item>#! /bin/sh
1005
1006     <item>
1007
1008     <item>MAKE_SRC="${1:-${NW_LYX:-../../noweb-lyx/noweb-lyx3.lyx}}"
1009
1010     <item>MAKE_SRC=`dirname "$MAKE_SRC"`/`basename "$MAKE_SRC" .lyx`
1011
1012     <item>NOWEB_SRC="${2:-${NOWEB_SRC:-$MAKE_SRC.lyx}}"
1013
1014     <item>lyx -e latex $MAKE_SRC
1015
1016     <item>
1017
1018     <item>fangle -R./Makefile.inc ${MAKE_SRC}.tex \\
1019
1020     <item> \ \| sed "/FANGLE_SOURCE=/s/^/#/;T;aNOWEB_SOURCE=$FANGLE_SRC" \\
1021
1022     <item> \ \| cpif ./Makefile.inc
1023
1024     <item>
1025
1026     <item>make -f ./Makefile.inc fangle_sources
1027   </nf-chunk|sh|>
1028
1029   The general Makefile can be invoked with <filename|./autoboot> and can also
1030   be included into any automake file to automatically re-generate the source
1031   files.
1032
1033   The <em|autoboot> can be extracted with this command:
1034
1035   <\verbatim>
1036     lyx -e latex fangle.lyx && \\
1037
1038     \ \ fangle fangle.lyx \<gtr\> ./autoboot
1039   </verbatim>
1040
1041   This looks simple enough, but as mentioned, fangle has to be had from
1042   somewhere before it can be extracted.
1043
1044   On a unix system this will extract <filename|fangle.module> and the
1045   <filename|fangle> awk script, and run some basic tests.\
1046
1047   <todo|cross-ref to test chapter when it is a chapter all on its own>
1048
1049   <section|Incorporating Makefile.inc into existing projects>
1050
1051   If you are writing a literate module of an existing non-literate program
1052   you may find it easier to use a slight recursive make instead of directly
1053   including <verbatim|Makefile.inc> in the projects makefile.\
1054
1055   This way there is less chance of definitions in <verbatim|Makefile.inc>
1056   interfering with definitions in the main makefile, or with definitions in
1057   other <verbatim|Makefile.inc> from other literate modules of the same
1058   project.
1059
1060   To do this we add some <em|glue> to the project makefile that invokes
1061   Makefile.inc in the right way. The glue works by adding a <verbatim|.PHONY>
1062   target to call the recursive make, and adding this target as an additional
1063   pre-requisite to the existing targets.
1064
1065   <paragraph|Example>Sub-module of existing system
1066
1067   In this example, we are building <verbatim|module.so> as a literate module
1068   of a larger project.
1069
1070   We will show the sort glue that can be inserted into the projects Makefile
1071   <emdash> or more likely <emdash> a regular Makefile included in or invoked
1072   by the projects Makefile.
1073
1074   <\nf-chunk|makefile-glue>
1075     <item>module_srcdir=modules/module
1076
1077     <item>MODULE_SOURCE=module.tm
1078
1079     <item>MODULE_STAMP=$(MODULE_SOURCE).stamp
1080   </nf-chunk||>
1081
1082   The existing build system may already have a build target for
1083   <filename|module.o>, but we just add another pre-requisite to that. In this
1084   case we use <filename|module.tm.stamp> as a pre-requisite, the stamp file's
1085   modified time indicating when all sources were extracted<\footnote>
1086     If the projects build system does not know how to build the module from
1087     the extracted sources, then just add build actions here as normal.
1088   </footnote>.
1089
1090   <\nf-chunk|makefile-glue>
1091     <item>$(module_srcdir)/module.o: $(module_srcdir)/$(MODULE_STAMP)
1092   </nf-chunk|make|>
1093
1094   The target for this new pre-requisite will be generated by a recursive make
1095   using <filename|Makefile.inc> which will make sure that the source is up to
1096   date, before it is built by the main projects makefile.
1097
1098   <\nf-chunk|makefile-glue>
1099     <item>$(module_srcdir)/$(MODULE_STAMP): $(module_srcdir)/$(MODULE_SOURCE)
1100
1101     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc fangle_sources
1102     LITERATE_SOURCE=$(MODULE_SOURCE)
1103   </nf-chunk||>
1104
1105   We can do similar glue for the docs, clean and distclean targets. In this
1106   example the main prject was using a double colon for these targets, so we
1107   must use the same in our glue.
1108
1109   <\nf-chunk|makefile-glue>
1110     <item>docs:: docs_module
1111
1112     <item>.PHONY: docs_module
1113
1114     <item>docs_module:
1115
1116     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc docs
1117     LITERATE_SOURCE=$(MODULE_SOURCE)
1118
1119     <item>
1120
1121     <item>clean:: clean_module
1122
1123     <item>.PHONEY: clean_module
1124
1125     <item>clean_module:
1126
1127     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc clean
1128     LITERATE_SOURCE=$(MODULE_SOURCE)
1129
1130     <item>
1131
1132     <item>distclean:: distclean_module
1133
1134     <item>.PHONY: distclean_module
1135
1136     <item>distclean_module:
1137
1138     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc distclean
1139     LITERATE_SOURCE=$(MODULE_SOURCE)
1140   </nf-chunk||>
1141
1142   We could do similarly for install targets to install the generated docs.
1143
1144   <part|Source Code>
1145
1146   <chapter|Fangle awk source code>
1147
1148   We use the copyright notice from chapter <reference|License>.
1149
1150   <\nf-chunk|./fangle>
1151     <item>#! /usr/bin/awk -f
1152
1153     <item># <nf-ref|gpl3-copyright|>
1154   </nf-chunk|awk|>
1155
1156   We also use code from <person|Arnold Robbins> public domain getopt (1993
1157   revision) defined in <reference|getopt>, and naturally want to attribute
1158   this appropriately.
1159
1160   <\nf-chunk|./fangle>
1161     <item># NOTE: Arnold Robbins public domain getopt for awk is also used:
1162
1163     <item><nf-ref|getopt.awk-header|>
1164
1165     <item><nf-ref|getopt.awk-getopt()|>
1166
1167     <item>
1168   </nf-chunk||>
1169
1170   And include the following chunks (which are explained further on) to make
1171   up the program:
1172
1173   <\nf-chunk|./fangle>
1174     <item><nf-ref|helper-functions|>
1175
1176     <item><nf-ref|mode-tracker|>
1177
1178     <item><nf-ref|parse_chunk_args|>
1179
1180     <item><nf-ref|chunk-storage-functions|>
1181
1182     <item><nf-ref|output_chunk_names()|>
1183
1184     <item><nf-ref|output_chunks()|>
1185
1186     <item><nf-ref|write_chunk()|>
1187
1188     <item><nf-ref|expand_chunk_args()|>
1189
1190     <item>
1191
1192     <item><nf-ref|begin|>
1193
1194     <item><nf-ref|recognize-chunk|>
1195
1196     <item><nf-ref|end|>
1197   </nf-chunk||>
1198
1199   <section|AWK tricks>
1200
1201   The portable way to erase an array in awk is to split the empty string, so
1202   we define a fangle macro that can split an array, like this:
1203
1204   <\nf-chunk|awk-delete-array>
1205     <item>split("", <nf-arg|ARRAY>);
1206   </nf-chunk|awk|<tuple|ARRAY>>
1207
1208   For debugging it is sometimes convenient to be able to dump the contents of
1209   an array to <verbatim|stderr>, and so this macro is also useful.
1210
1211   <\nf-chunk|dump-array>
1212     <item>print "\\nDump: <nf-arg|ARRAY>\\n--------\\n" \<gtr\>
1213     "/dev/stderr";
1214
1215     <item>for (_x in <nf-arg|ARRAY>) {
1216
1217     <item> \ print _x "=" <nf-arg|ARRAY>[_x] "\\n" \<gtr\> "/dev/stderr";
1218
1219     <item>}
1220
1221     <item>print "========\\n" \<gtr\> "/dev/stderr";
1222   </nf-chunk|awk|<tuple|ARRAY>>
1223
1224   <section|Catching errors>
1225
1226   Fatal errors are issued with the error function:
1227
1228   <\nf-chunk|error()>
1229     <item>function error(message)
1230
1231     <item>{
1232
1233     <item> \ print "ERROR: " FILENAME ":" FNR " " message \<gtr\>
1234     "/dev/stderr";
1235
1236     <item> \ exit 1;
1237
1238     <item>}
1239   </nf-chunk|awk|>
1240
1241   and likewise for non-fatal warnings:
1242
1243   <\nf-chunk|error()>
1244     <item>function warning(message)
1245
1246     <item>{
1247
1248     <item> \ print "WARNING: " FILENAME ":" FNR " " message \<gtr\>
1249     "/dev/stderr";
1250
1251     <item> \ warnings++;
1252
1253     <item>}
1254   </nf-chunk|awk|>
1255
1256   and debug output too:
1257
1258   <\nf-chunk|error()>
1259     <item>function debug_log(message)
1260
1261     <item>{
1262
1263     <item> \ print "DEBUG: " FILENAME ":" FNR " " message \<gtr\>
1264     "/dev/stderr";
1265
1266     <item>}
1267   </nf-chunk|awk|>
1268
1269   <todo|append=helper-functions>
1270
1271   <\nf-chunk|helper-functions>
1272     <item><nf-ref|error()|>
1273   </nf-chunk||>
1274
1275   <chapter|<TeXmacs> args>
1276
1277   <TeXmacs> functions with arguments<\footnote>
1278     or function declarations with parameters
1279   </footnote> appear like this:
1280
1281   <math|<math-tt|blah(><wide*|<wide|<math-tt|I came, I saw, I
1282   conquered>|\<wide-overbrace\>><rsup|argument 1><wide|<math-tt|<key|^K>>,
1283   |\<wide-overbrace\>><rsup|sep.><wide|and then went home
1284   asd|\<wide-overbrace\>><rsup|argument 3><wide|<math-tt|<key|^K>><math-tt|)>|\<wide-overbrace\>><rsup|term.>|\<wide-underbrace\>><rsub|arguments>>
1285
1286   Arguments commence after the opening parenthesis. The first argument runs
1287   up till the next <key|^K>.\
1288
1289   If the following character is a <key|,> then another argument follows. If
1290   the next character after the <key|,> is a space character, then it is also
1291   eaten. The fangle stylesheet emits <key|^K><key|,><key|space> as
1292   separators, but the fangle untangler will forgive a missing space.
1293
1294   If the following character is <key|)> then this is a terminator and there
1295   are no more arguments.
1296
1297   <\nf-chunk|constants>
1298     <item>ARG_SEPARATOR=sprintf("%c", 11);
1299   </nf-chunk||>
1300
1301   To process the <verbatim|text> in this fashion, we split the string on
1302   <key|^K>
1303
1304   \;
1305
1306   <\nf-chunk|get_chunk_args>
1307     <item>function get_texmacs_chunk_args(text, args, \ \ a, done) {
1308
1309     <item> \ split(text, args, ARG_SEPARATOR);
1310
1311     <item>
1312
1313     <item> \ done=0
1314
1315     <item> \ for (a=1; (a in args); a++) if (a\<gtr\>1) {
1316
1317     <item> \ \ \ if (args[a] == "" \|\| substr(args[a], 1, 1) == ")") done=1;
1318
1319     <item> \ \ \ if (done) {
1320
1321     <item> \ \ \ \ \ delete args[a];
1322
1323     <item> \ \ \ \ \ break;
1324
1325     <item> \ \ \ }
1326
1327     <item>
1328
1329     <item> \ \ \ if (substr(args[a], 1, 2) == ", ") args[a]=substr(args[a],
1330     3);
1331
1332     <item> \ \ \ else if (substr(args[a], 1, 1) == ",")
1333     args[a]=substr(args[a], 2); \
1334
1335     <item> \ }
1336
1337     <item>}
1338   </nf-chunk||>
1339
1340   <chapter|<LaTeX> and lstlistings>
1341
1342   <todo|Split LyX and TeXmacs parts>
1343
1344   For <LyX> and <LaTeX>, the <verbatim|lstlistings> package is used to format
1345   the lines of code chunks. You may recal from chapter XXX that arguments to
1346   a chunk definition are pure <LaTeX> code. This means that fangle needs to
1347   be able to parse <LaTeX> a little.
1348
1349   <LaTeX> arguments to <verbatim|lstlistings> macros are a comma seperated
1350   list of key-value pairs, and values containing commas are enclosed in
1351   <verbatim|{> braces <verbatim|}> (which is to be expected for <LaTeX>).
1352
1353   A sample expressions is:
1354
1355   <verbatim|name=thomas, params={a, b}, something, something-else>
1356
1357   but we see that this is just a simpler form of this expression:
1358
1359   <verbatim|name=freddie, foo={bar=baz, quux={quirk, a=fleeg}}, etc>
1360
1361   We may consider that we need a function that can parse such <LaTeX>
1362   expressions and assign the values to an AWK associated array, perhaps using
1363   a recursive parser into a multi-dimensional hash<\footnote>
1364     as AWK doesn't have nested-hash support
1365   </footnote>, resulting in:
1366
1367   <tabular|<tformat|<cwith|2|6|1|2|cell-lborder|0.5pt>|<cwith|2|6|1|2|cell-rborder|0.5pt>|<cwith|2|6|1|2|cell-bborder|0.5pt>|<cwith|2|6|1|2|cell-tborder|0.5pt>|<cwith|1|1|1|2|cell-lborder|0.5pt>|<cwith|1|1|1|2|cell-rborder|0.5pt>|<cwith|1|1|1|2|cell-bborder|0.5pt>|<cwith|1|1|1|2|cell-tborder|0.5pt>|<table|<row|<cell|key>|<cell|value>>|<row|<cell|a[name]>|<cell|freddie>>|<row|<cell|a[foo,
1368   bar]>|<cell|baz>>|<row|<cell|a[foo, quux,
1369   quirk]>|<cell|>>|<row|<cell|a[foo, quux,
1370   a]>|<cell|fleeg>>|<row|<cell|a[etc]>|<cell|>>>>>
1371
1372   Yet, also, on reflection it seems that sometimes such nesting is not
1373   desirable, as the braces are also used to delimit values that contain
1374   commas --- we may consider that
1375
1376   <verbatim|name={williamson, freddie}>
1377
1378   should assign <verbatim|williamson, freddie> to <verbatim|name>.
1379
1380   In fact we are not so interested in the detail so as to be bothered by
1381   this, which turns out to be a good thing for two reasons. Firstly <TeX> has
1382   a malleable parser with no strict syntax, and secondly whether or not
1383   <verbatim|williamson> and <verbatim|freddie> should count as two items will
1384   be context dependant anyway.
1385
1386   We need to parse this latex for only one reason; which is that we are
1387   extending lstlistings to add some additional arguments which will be used
1388   to express chunk parameters and other chunk options.
1389
1390   <section|Additional lstlstings parameters>
1391
1392   Further on we define a <verbatim|\\Chunk> <LaTeX> macro whose arguments
1393   will consist of a the chunk name, optionally followed by a comma and then a
1394   comma separated list of arguments. In fact we will just need to prefix
1395   <verbatim|name=> to the arguments to in order to create valid lstlistings
1396   arguments.\
1397
1398   There will be other arguments supported too;\
1399
1400   <\description-long>
1401     <item*|params>As an extension to many literate-programming styles, fangle
1402     permits code chunks to take parameters and thus operate somewhat like C
1403     pre-processor macros, or like C++ templates. Chunk parameters are
1404     declared with a chunk argument called params, which holds a semi-colon
1405     separated list of parameters, like this:
1406
1407     <verbatim|achunk,language=C,params=name;address>
1408
1409     <item*|addto>a named chunk that this chunk is to be included into. This
1410     saves the effort of having to declare another listing of the named chunk
1411     merely to include this one.
1412   </description-long>
1413
1414   Function get_chunk_args() will accept two paramters, text being the text to
1415   parse, and values being an array to receive the parsed values as described
1416   above. The optional parameter path is used during recursion to build up the
1417   multi-dimensional array path.
1418
1419   <\nf-chunk|./fangle>
1420     <item><nf-ref|get_chunk_args()|>
1421   </nf-chunk||>
1422
1423   <\nf-chunk|get_chunk_args()>
1424     <item>function get_tex_chunk_args(text, values,
1425
1426     <item> \ # optional parameters
1427
1428     <item> \ path, # hierarchical precursors
1429
1430     <item> \ # local vars
1431
1432     <item> \ a, name)
1433   </nf-chunk||>
1434
1435   The strategy is to parse the name, and then look for a value. If the value
1436   begins with a brace <verbatim|{>, then we recurse and consume as much of
1437   the text as necessary, returning the remaining text when we encounter a
1438   leading close-brace <verbatim|}>. This being the strategy --- and executed
1439   in a loop --- we realise that we must first look for the closing brace
1440   (perhaps preceded by white space) in order to terminate the recursion, and
1441   returning remaining text.
1442
1443   <\nf-chunk|get_chunk_args()>
1444     <item>{
1445
1446     <item> \ split("", values);
1447
1448     <item> \ while(length(text)) {
1449
1450     <item> \ \ \ if (match(text, "^ *}(.*)", a)) {
1451
1452     <item> \ \ \ \ \ return a[1];
1453
1454     <item> \ \ \ }
1455
1456     <item> \ \ \ <nf-ref|parse-chunk-args|>
1457
1458     <item> \ }
1459
1460     <item> \ return text;
1461
1462     <item>}
1463   </nf-chunk||>
1464
1465   We can see that the text could be inspected with this regex:
1466
1467   <\nf-chunk|parse-chunk-args>
1468     <item>if (! match(text, " *([^,=]*[^,= ]) *(([,=]) *(([^,}]*) *,*
1469     *(.*))\|)$", a)) {
1470
1471     <item> \ return text;
1472
1473     <item>}
1474   </nf-chunk||>
1475
1476   and that <verbatim|a> will have the following values:
1477
1478   <tabular|<tformat|<cwith|2|7|1|2|cell-lborder|0.5pt>|<cwith|2|7|1|2|cell-rborder|0.5pt>|<cwith|2|7|1|2|cell-bborder|0.5pt>|<cwith|2|7|1|2|cell-tborder|0.5pt>|<cwith|1|1|1|2|cell-lborder|0.5pt>|<cwith|1|1|1|2|cell-rborder|0.5pt>|<cwith|1|1|1|2|cell-bborder|0.5pt>|<cwith|1|1|1|2|cell-tborder|0.5pt>|<table|<row|<cell|a[n]>|<cell|assigned
1479   text>>|<row|<cell|1>|<cell|freddie>>|<row|<cell|2>|<cell|=freddie,
1480   foo={bar=baz, quux={quirk, a=fleeg}}, etc>>|<row|<cell|3>|<cell|=>>|<row|<cell|4>|<cell|freddie,
1481   foo={bar=baz, quux={quirk, a=fleeg}}, etc>>|<row|<cell|5>|<cell|freddie>>|<row|<cell|6>|<cell|,
1482   foo={bar=baz, quux={quirk, a=fleeg}}, etc>>>>>
1483
1484   <verbatim|a[3]> will be either <verbatim|=> or <verbatim|,> and signify
1485   whether the option named in <verbatim|a[1]> has a value or not
1486   (respectively).
1487
1488   If the option does have a value, then if the expression
1489   <verbatim|substr(a[4],1,1)> returns a brace <verbatim|{> it will signify
1490   that we need to recurse:
1491
1492   <\nf-chunk|parse-chunk-args>
1493     <item>name=a[1];
1494
1495     <item>if (a[3] == "=") {
1496
1497     <item> \ if (substr(a[4],1,1) == "{") {
1498
1499     <item> \ \ \ text = get_tex_chunk_args(substr(a[4],2), values, path name
1500     SUBSEP);
1501
1502     <item> \ } else {
1503
1504     <item> \ \ \ values[path name]=a[5];
1505
1506     <item> \ \ \ text = a[6];
1507
1508     <item> \ }
1509
1510     <item>} else {
1511
1512     <item> \ values[path name]="";
1513
1514     <item> \ text = a[2];
1515
1516     <item>}
1517   </nf-chunk||>
1518
1519   We can test this function like this:
1520
1521   <\nf-chunk|gca-test.awk>
1522     <item><nf-ref|get_chunk_args()|>
1523
1524     <item>BEGIN {
1525
1526     <item> \ SUBSEP=".";
1527
1528     <item>
1529
1530     <item> \ print get_tex_chunk_args("name=freddie, foo={bar=baz,
1531     quux={quirk, a=fleeg}}, etc", a);
1532
1533     <item> \ for (b in a) {
1534
1535     <item> \ \ \ print "a[" b "] =\<gtr\> " a[b];
1536
1537     <item> \ }
1538
1539     <item>}
1540   </nf-chunk||>
1541
1542   which should give this output:
1543
1544   <\nf-chunk|gca-test.awk-results>
1545     <item>a[foo.quux.quirk] =\<gtr\>\
1546
1547     <item>a[foo.quux.a] =\<gtr\> fleeg
1548
1549     <item>a[foo.bar] =\<gtr\> baz
1550
1551     <item>a[etc] =\<gtr\>\
1552
1553     <item>a[name] =\<gtr\> freddie
1554   </nf-chunk||>
1555
1556   <section|Parsing chunk arguments><label|Chunk Arguments>
1557
1558   Arguments to paramterized chunks are expressed in round brackets as a comma
1559   separated list of optional arguments. For example, a chunk that is defined
1560   with:
1561
1562   <verbatim|\\Chunk{achunk, params=name ; address}>
1563
1564   could be invoked as:
1565
1566   <verbatim|\\chunkref{achunk}(John Jones, jones@example.com)>
1567
1568   An argument list may be as simple as in <verbatim|\\chunkref{pull}(thing,
1569   otherthing)> or as complex as:
1570
1571   <verbatim|\\chunkref{pull}(things[x, y], get_other_things(a, "(all)"))>
1572
1573   --- which for all it's commas and quotes and parenthesis represents only
1574   two parameters: <verbatim|things[x, y]> and <verbatim|get_other_things(a,
1575   "(all)")>.
1576
1577   If we simply split parameter list on commas, then the comma in
1578   <verbatim|things[x,y]> would split into two seperate arguments:
1579   <verbatim|things[x> and <verbatim|y]>--- neither of which make sense on
1580   their own.
1581
1582   One way to prevent this would be by refusing to split text between matching
1583   delimiters, such as <verbatim|[>, <verbatim|]>, <verbatim|(>, <verbatim|)>,
1584   <verbatim|{>, <verbatim|}> and most likely also <verbatim|">, <verbatim|">
1585   and <verbatim|'>, <verbatim|'>. Of course this also makes it impossible to
1586   pass such mis-matched code fragments as parameters, but I think that it
1587   would be hard for readers to cope with authors who would pass such code
1588   unbalanced fragments as chunk parameters<\footnote>
1589     I know that I couldn't cope with users doing such things, and although
1590     the GPL3 license prevents me from actually forbidding anyone from trying,
1591     if they want it to work they'll have to write the code themselves and not
1592     expect any support from me.
1593   </footnote>.
1594
1595   Unfortunately, the full set of matching delimiters may vary from language
1596   to language. In certain C++ template contexts, <verbatim|\<less\>> and
1597   <verbatim|\<gtr\>> would count as delimiters, and yet in other contexts
1598   they would not.
1599
1600   This puts me in the unfortunate position of having to parse-somewhat all
1601   programming languages without knowing what they are!
1602
1603   However, if this universal mode-tracking is possible, then parsing the
1604   arguments would be trivial. Such a mode tracker is described in chapter
1605   <reference|modes> and used here with simplicity.
1606
1607   <\nf-chunk|parse_chunk_args>
1608     <item>function parse_chunk_args(language, text, values, mode,
1609
1610     <item> \ # local vars
1611
1612     <item> \ c, context, rest)
1613
1614     <item>{
1615
1616     <item> \ <nf-ref|new-mode-tracker|<tuple|context|language|mode>>
1617
1618     <item> \ rest = mode_tracker(context, text, values);
1619
1620     <item> \ # extract values
1621
1622     <item> \ for(c=1; c \<less\>= context[0, "values"]; c++) {
1623
1624     <item> \ \ \ values[c] = context[0, "values", c];
1625
1626     <item> \ }
1627
1628     <item> \ return rest;
1629
1630     <item>}
1631   </nf-chunk||>
1632
1633   <section|Expanding parameters in the text>
1634
1635   Within the body of the chunk, the parameters are referred to with:
1636   <verbatim|${name}> and <verbatim|${address}>. There is a strong case that a
1637   <LaTeX> style notation should be used, like <verbatim|\\param{name}> which
1638   would be expressed in the listing as <verbatim|=\<less\>\\param{name}\<gtr\>>
1639   and be rendered as <verbatim|<nf-arg|name>>. Such notation would make me go
1640   blind, but I do intend to adopt it.
1641
1642   We therefore need a function <verbatim|expand_chunk_args> which will take a
1643   block of text, a list of permitted parameters, and the arguments which must
1644   substitute for the parameters.\
1645
1646   Here we split the text on <verbatim|${> which means that all parts except
1647   the first will begin with a parameter name which will be terminated by
1648   <verbatim|}>. The split function will consume the literal <verbatim|${> in
1649   each case.
1650
1651   <\nf-chunk|expand_chunk_args()>
1652     <item>function expand_chunk_args(text, params, args, \
1653
1654     <item> \ p, text_array, next_text, v, t, l)
1655
1656     <item>{
1657
1658     <item> \ if (split(text, text_array, "\\\\${")) {
1659
1660     <item> \ \ \ <nf-ref|substitute-chunk-args|>
1661
1662     <item> \ }
1663
1664     <item>
1665
1666     <item> \ return text;
1667
1668     <item>}
1669   </nf-chunk||>
1670
1671   First, we produce an associative array of substitution values indexed by
1672   parameter names. This will serve as a cache, allowing us to look up the
1673   replacement values as we extract each name.
1674
1675   <\nf-chunk|substitute-chunk-args>
1676     <item>for(p in params) {
1677
1678     <item> \ v[params[p]]=args[p];
1679
1680     <item>}
1681   </nf-chunk||>
1682
1683   We accumulate substituted text in the variable text. As the first part of
1684   the split function is the part before the delimiter --- which is
1685   <verbatim|${> in our case --- this part will never contain a parameter
1686   reference, so we assign this directly to the result kept in
1687   <verbatim|$text>.
1688
1689   <\nf-chunk|substitute-chunk-args>
1690     <item>text=text_array[1];
1691   </nf-chunk||>
1692
1693   We then iterate over the remaining values in the array, and substitute each
1694   reference for it's argument.
1695
1696   <\nf-chunk|substitute-chunk-args>
1697     <item>for(t=2; t in text_array; t++) {
1698
1699     <item> \ <nf-ref|substitute-chunk-arg|>
1700
1701     <item>}
1702   </nf-chunk||>
1703
1704   After the split on <verbatim|${> a valid parameter reference will consist
1705   of valid parameter name terminated by a close-brace <verbatim|}>. A valid
1706   character name begins with the underscore or a letter, and may contain
1707   letters, digits or underscores.
1708
1709   A valid looking reference that is not actually the name of a parameter will
1710   be and not substituted. This is good because there is nothing to substitute
1711   anyway, and it avoids clashes when writing code for languages where
1712   <verbatim|${...}> is a valid construct --- such constructs will not be
1713   interfered with unless the parameter name also matches.
1714
1715   <\nf-chunk|substitute-chunk-arg>
1716     <item>if (match(text_array[t], "^([a-zA-Z_][a-zA-Z0-9_]*)}", l) &&
1717
1718     <item> \ \ \ l[1] in v)\
1719
1720     <item>{
1721
1722     <item> \ text = text v[l[1]] substr(text_array[t], length(l[1])+2);
1723
1724     <item>} else {
1725
1726     <item> \ text = text "${" text_array[t];
1727
1728     <item>}
1729   </nf-chunk||>
1730
1731   <chapter|Language Modes & Quoting><label|modes>
1732
1733   <verbatim|lstlistings> and <verbatim|fangle> both recognize source
1734   languages, and perform some basic parsing and syntax highlighting in the
1735   rendered document<\footnote>
1736     although lstlisting supports many more languages
1737   </footnote>. <verbatim|lstlistings> can detect strings and comments within
1738   a language definition and perform suitable rendering, such as italics for
1739   comments, and visible-spaces within strings.
1740
1741   Fangle similarly can recognize strings, and comments, etc, within a
1742   language, so that any chunks included with <verbatim|\\chunkref{a-chunk}>
1743   or <nf-ref|a-chunk|> can be suitably escape or quoted.
1744
1745   <section|Modes to keep code together>
1746
1747   As an example, the C language has a few parse modes, which affect the
1748   interpretation of characters.
1749
1750   One parse mode is the string mode. The string mode is commenced by an
1751   un-escaped quotation mark <verbatim|"> and terminated by the same. Within
1752   the string mode, only one additional mode can be commenced, it is the
1753   backslash mode <verbatim|\\>, which is always terminated by the following
1754   character.
1755
1756   Another mode is <verbatim|[> which is terminated by a <verbatim|]> (unless
1757   it occurs in a string).
1758
1759   Consider this fragment of C code:
1760
1761   <math|<math-tt|do_something><wide|<around*|(|<math-tt|things><wide|<around|[|<math-tt|x>,
1762   <math-tt|y>|]>|\<wide-overbrace\>><rsup|2. <math-tt|[> mode><math-tt|,
1763   get_other_things><wide|<around|(|<math-tt|a>,
1764   <wide*|<text|"><math-tt|<around|(|all|)>><text|">|\<wide-underbrace\>><rsub|4.
1765   <text|"> mode>|)>|\<wide-overbrace\>><rsup|3. <math-tt|(>
1766   mode>|)>|\<wide-overbrace\>><rsup|1. <math-tt|(> mode>>
1767
1768   \;
1769
1770   Mode nesting prevents the close parenthesis in the quoted string (part 4)
1771   from terminating the parenthesis mode (part 3).
1772
1773   Each language has a set of modes, the default mode being the null mode.
1774   Each mode can lead to other modes.
1775
1776   <section|Modes affect included chunks>
1777
1778   For instance, consider this chunk with <verbatim|language=perl>:
1779
1780   <\nf-chunk|test:example-perl>
1781     <item>print "hello world $0\\n";
1782   </nf-chunk|perl|>
1783
1784   If it were included in a chunk with <verbatim|language=sh>, like this:
1785
1786   <\nf-chunk|test:example-sh>
1787     <item>perl -e "<nf-ref|test:example-perl|>"
1788   </nf-chunk|sh|>
1789
1790   we might want fangle would to generate output like this:
1791
1792   <\nf-chunk|test:example-sh.result>
1793     <item>perl -e "print \\"hello world \\$0\\\\n\\";"
1794   </nf-chunk|sh|>
1795
1796   See that the double quote <verbatim|">, back-slash <verbatim|\\> and
1797   <verbatim|$> have been quoted with a back-slash to protect them from shell
1798   interpretation.
1799
1800   If that were then included in a chunk with language=make, like this:
1801
1802   <\nf-chunk|test:example-makefile>
1803     <item>target: pre-req
1804
1805     <item><nf-tab><nf-ref|test:example-sh|>
1806   </nf-chunk|make|>
1807
1808   We would need the output to look like this --- note the <verbatim|$$> as
1809   the single <verbatim|$> has been makefile-quoted with another <verbatim|$>.
1810
1811   <\nf-chunk|test:example-makefile.result>
1812     <item>target: pre-req
1813
1814     <item><nf-tab>perl -e "print \\"hello world \\$$0\\\\n\\";"
1815   </nf-chunk|make|>
1816
1817   <section|Modes operation>
1818
1819   In order to make this work, we must define a mode-tracker supporting each
1820   language, that can detect the various quoting modes, and provide a
1821   transformation that may be applied to any included text so that included
1822   text will be interpreted correctly after any interpolation that it may be
1823   subject to at run-time.
1824
1825   For example, the sed transformation for text to be inserted into shell
1826   double-quoted strings would be something like:
1827
1828   <verbatim|s/\\\\/\\\\\\\\/g;s/$/\\\\$/g;s/"/\\\\"/g;>
1829
1830   which would protect <verbatim|\\ $ ">
1831
1832   The mode tracker must also nested mode-changes, as in this shell example:
1833
1834   <verbatim|echo "hello `id ...`">
1835
1836   <phantom|<verbatim|echo "hello `id >><math|\<uparrow\>>
1837
1838   Any shell special characters inserted at the point marked
1839   <math|\<uparrow\>> would need to be escaped if their plain-text meaning is
1840   to be preserved, including <verbatim|`> <verbatim|\|> <verbatim|*> among
1841   others. The set of characters that need escaping in the back-ticks
1842   <verbatim|`> is not the same as the set that need escaing in the
1843   double-quotes <verbatim|">. However, in shell syntax, a <verbatim|"> at the
1844   point marked <math|\<uparrow\>> does not close the leading <verbatim|"> and
1845   so would not need additional escaping because of the nesting of the two
1846   modes.
1847
1848   <todo|MAYBE>Escaping need not occur if the format and mode of the included
1849   chunk matches that of the including chunk.
1850
1851   As each chunk is output a new mode tracker for that language is initialized
1852   in it's normal state. As text is output for that chunk the output mode is
1853   tracked. When a new chunk is included, a transformation appropriate to that
1854   mode is selected and pushed onto a stack of transformations. Any text to be
1855   output is passed through this stack of transformations.
1856
1857   It remains to consider if the chunk-include function should return it's
1858   generated text so that the caller can apply any transformations (and
1859   formatting), or if it should apply the stack of transformations itself.
1860
1861   Note that the transformed included text should have the property of not
1862   being able to change the mode in the current chunk.
1863
1864   <todo|Note chunk parameters should probably also be transformed>
1865
1866   <section|Quoting scenarios>
1867
1868   <subsection|Direct quoting>
1869
1870   He we give examples of various quoting scenarios and discuss what the
1871   expected outcome might be and how this could be obtained.
1872
1873   <\with|par-columns|2>
1874     <\nf-chunk|test:q:1>
1875       <item>echo "$(<nf-ref|test:q:1-inc|>)"
1876     </nf-chunk|sh|>
1877
1878     <\nf-chunk|test:q:1-inc>
1879       <item>echo "hello"
1880     </nf-chunk|sh|>
1881   </with>
1882
1883   Should this examples produce <verbatim|echo "$(echo "hello")"> or
1884   <verbatim|echo "\\$(echo \\"hello\\")"> ?
1885
1886   This depends on what the author intended, but we must provde a way to
1887   express that intent.
1888
1889   We might argue that as both chunks have <verbatim|lang=sh> the intent must
1890   have been to quote the included chunk <emdash> but consider that this might
1891   be shell script that writes shell script.
1892
1893   If <nf-ref|test:q:1-inc|> had <verbatim|lang=text> then it certainly would
1894   have been right to quote it, which leads us to ask: in what ways can we
1895   reduce quoting if lang of the included chunk is compatible with the lang of
1896   the including chunk?
1897
1898   If we take a completely nested approach then even though <verbatim|$(> mode
1899   might do no quoting of it's own, <verbatim|"> mode will still do it's own
1900   quoting. We need a model where the nested <verbatim|$(> mode will prevent
1901   <verbatim|"> from quoting.
1902
1903   This leads rise to the <em|tunneling> feature. In bash, the <verbatim|$(>
1904   gives rise to a new top-level parsing scenario, so we need to enter the
1905   <em|null> mode, and also ignore any quoting and then undo-this when the
1906   <verbatim|$(> mode is terminated by the corresponding close <verbatim|)>.
1907
1908   We shall say that tunneling is when a mode in a language ignores other
1909   modes in the same language and arrives back at an earlier <em|null> mode of
1910   the same language.
1911
1912   In example <nf-ref|test:q:1|> above, the nesting of modes is: <em|null>,
1913   <verbatim|">, <verbatim|$(>
1914
1915   When mode <verbatim|$(> is commenced, the stack of nest modes will be
1916   traversed. If the <em|null> mode can be found in the same language, without
1917   the language varying, then a tunnel will be established so that the
1918   intervening modes, <verbatim|"> in this case, can be skipped when the modes
1919   are enumerated to quote the texted being emitted.
1920
1921   In such a case, the correct result would be:
1922
1923   <\nf-chunk|test:q:1.result>
1924     <item>echo "$(echo "hello")"
1925   </nf-chunk|sh|>
1926
1927   <section|Language Mode Definitions>
1928
1929   All modes definitions are stored in a single multi-dimensional hash. The
1930   first index is the language, and the second index is the mode-identifier.
1931   The third indexes hold properties: terminators, and optionally, submodes,
1932   delimiters, and tunnel targets.
1933
1934   A useful set of mode definitions for a nameless general C-type language is
1935   shown here. (Don't be confused by the double backslash escaping needed in
1936   awk. One set of escaping is for the string, and the second set of escaping
1937   is for the regex).
1938
1939   <\todo>
1940     TODO: Add =\<less\>\\mode{}\<gtr\> command which will allow us to signify
1941     that a string is
1942
1943     \ regex and thus fangle will quote it for us.
1944   </todo>
1945
1946   Submodes are entered by the characters \ <verbatim|"> <verbatim|'>
1947   <verbatim|{> <verbatim|(> <verbatim|[> <verbatim|/*>
1948
1949   <\nf-chunk|common-mode-definitions>
1950     <item>modes[<nf-arg|language>, "", \ "submodes"]="\\\\\\\\\|\\"\|'\|{\|\\\\(\|\\\\[";
1951   </nf-chunk||<tuple|language>>
1952
1953   In the default mode, a comma surrounded by un-important white space is a
1954   delimiter of language items<\footnote>
1955     whatever a <em|language item> might be
1956   </footnote>.
1957
1958   <\nf-chunk|common-mode-definitions>
1959     <item>modes[<nf-arg|language>, "", \ "delimiters"]=" *, *";
1960   </nf-chunk||language>
1961
1962   and should pass this test:<todo|Why do the tests run in ?(? mode and not ??
1963   mode>
1964
1965   <\nf-chunk|test:mode-definitions>
1966     <item>parse_chunk_args("c-like", "1,2,3", a, "");
1967
1968     <item>if (a[1] != "1") e++;
1969
1970     <item>if (a[2] != "2") e++;
1971
1972     <item>if (a[3] != "3") e++;
1973
1974     <item>if (length(a) != 3) e++;
1975
1976     <item><nf-ref|pca-test.awk:summary|>
1977
1978     <item>
1979
1980     <item>parse_chunk_args("c-like", "joe, red", a, "");
1981
1982     <item>if (a[1] != "joe") e++;
1983
1984     <item>if (a[2] != "red") e++;
1985
1986     <item>if (length(a) != 2) e++;
1987
1988     <item><nf-ref|pca-test.awk:summary|>
1989
1990     <item>
1991
1992     <item>parse_chunk_args("c-like", "${colour}", a, "");
1993
1994     <item>if (a[1] != "${colour}") e++;
1995
1996     <item>if (length(a) != 1) e++;
1997
1998     <item><nf-ref|pca-test.awk:summary|>
1999   </nf-chunk||>
2000
2001   Nested modes are identified by a backslash, a double or single quote,
2002   various bracket styles or a <verbatim|/*> comment.
2003
2004   For each of these sub-modes modes we must also identify at a mode
2005   terminator, and any sub-modes or delimiters that may be entered<\footnote>
2006     Because we are using the sub-mode characters as the mode identifier it
2007     means we can't currently have a mode character dependant on it's context;
2008     i.e. <verbatim|{> can't behave differently when it is inside
2009     <verbatim|[>.
2010   </footnote>.
2011
2012   <subsection|Backslash>
2013
2014   The backslash mode has no submodes or delimiters, and is terminated by any
2015   character. Note that we are not so much interested in evaluating or
2016   interpolating content as we are in delineating content. It is no matter
2017   that a double backslash (<verbatim|\\\\>) may represent a single backslash
2018   while a backslash-newline may represent white space, but it does matter
2019   that the newline in a backslash newline should not be able to terminate a C
2020   pre-processor statement; and so the newline will be consumed by the
2021   backslash however it is to be interpreted.
2022
2023   <\nf-chunk|common-mode-definitions>
2024     <item>modes[<nf-arg|language>, "\\\\", "terminators"]=".";
2025   </nf-chunk||>
2026
2027   <subsection|Strings>
2028
2029   Common languages support two kinds of strings quoting, double quotes and
2030   single quotes.
2031
2032   In a string we have one special mode, which is the backslash. This may
2033   escape an embedded quote and prevent us thinking that it should terminate
2034   the string.
2035
2036   <\nf-chunk|mode:common-string>
2037     <item>modes[<nf-arg|language>, <nf-arg|quote>, "submodes"]="\\\\\\\\";
2038   </nf-chunk||<tuple|language|quote>>
2039
2040   Otherwise, the string will be terminated by the same character that
2041   commenced it.
2042
2043   <\nf-chunk|mode:common-string>
2044     <item>modes[<nf-arg|language>, <nf-arg|quote>,
2045     "terminators"]=<nf-arg|quote>;
2046   </nf-chunk||language>
2047
2048   In C type languages, certain escape sequences exist in strings. We need to
2049   define mechanism to enclode any chunks included in this mode using those
2050   escape sequences. These are expressed in two parts, s meaning search, and r
2051   meaning replace.
2052
2053   The first substitution is to replace a backslash with a double backslash.
2054   We do this first as other substitutions may introduce a backslash which we
2055   would not then want to escape again here.
2056
2057   Note: Backslashes need double-escaping in the search pattern but not in the
2058   replacement string, hence we are replacing a literal <verbatim|\\> with a
2059   literal <verbatim|\\\\>.
2060
2061   <\nf-chunk|mode:common-string>
2062     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2063     ++escapes[<nf-arg|language>, <nf-arg|quote>], "s"]="\\\\\\\\";
2064
2065     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2066     \ \ escapes[<nf-arg|language>, <nf-arg|quote>], "r"]="\\\\\\\\";
2067   </nf-chunk||language>
2068
2069   If the quote character occurs in the text, it should be preceded by a
2070   backslash, otherwise it would terminate the string unexpectedly.
2071
2072   <\nf-chunk|mode:common-string>
2073     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2074     ++escapes[<nf-arg|language>, <nf-arg|quote>], "s"]=<nf-arg|quote>;
2075
2076     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2077     \ \ escapes[<nf-arg|language>, <nf-arg|quote>], "r"]="\\\\"
2078     <nf-arg|quote>;
2079   </nf-chunk||language>
2080
2081   Any newlines in the string, must be replaced by <verbatim|\\n>.
2082
2083   <\nf-chunk|mode:common-string>
2084     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2085     ++escapes[<nf-arg|language>, <nf-arg|quote>], "s"]="\\n";
2086
2087     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2088     \ \ escapes[<nf-arg|language>, <nf-arg|quote>], "r"]="\\\\n";
2089   </nf-chunk||language>
2090
2091   For the common modes, we define this string handling for double and single
2092   quotes.
2093
2094   <\nf-chunk|common-mode-definitions>
2095     <item><nf-ref|mode:common-string|<tuple|<nf-arg|language>|"\\"">>
2096
2097     <item><nf-ref|mode:common-string|<tuple|<nf-arg|language>|"'">>
2098   </nf-chunk||>
2099
2100   Working strings should pass this test:
2101
2102   <\nf-chunk|test:mode-definitions>
2103     <item>parse_chunk_args("c-like", "say \\"I said, \\\\\\"Hello, how are
2104     you\\\\\\".\\", for me", a, "");
2105
2106     <item>if (a[1] != "say \\"I said, \\\\\\"Hello, how are you\\\\\\".\\"")
2107     e++;
2108
2109     <item>if (a[2] != "for me") e++;
2110
2111     <item>if (length(a) != 2) e++;
2112
2113     <item><nf-ref|pca-test.awk:summary|>
2114   </nf-chunk||>
2115
2116   <subsection|Parentheses, Braces and Brackets>
2117
2118   Where quotes are closed by the same character, parentheses, brackets and
2119   braces are closed by an alternate character.
2120
2121   <\nf-chunk|mode:common-brackets>
2122     <item>modes[<nf-arg|language>, <nf-arg|open>, \ "submodes"
2123     ]="\\\\\\\\\|\\"\|{\|\\\\(\|\\\\[\|'\|/\\\\*";
2124
2125     <item>modes[<nf-arg|language>, <nf-arg|open>, \ "delimiters"]=" *, *";
2126
2127     <item>modes[<nf-arg|language>, <nf-arg|open>,
2128     \ "terminators"]=<nf-arg|close>;
2129   </nf-chunk||<tuple|language|open|close>>
2130
2131   Note that the open is NOT a regex but the close token IS. <todo|When we can
2132   quote regex we won't have to put the slashes in here>
2133
2134   <\nf-chunk|common-mode-definitions>
2135     <item><nf-ref|mode:common-brackets|<tuple|<nf-arg|language>|"{"|"}">>
2136
2137     <item><nf-ref|mode:common-brackets|<tuple|<nf-arg|language>|"["|"\\\\]">>
2138
2139     <item><nf-ref|mode:common-brackets|<tuple|<nf-arg|language>|"("|"\\\\)">>
2140   </nf-chunk||>
2141
2142   <subsection|Customizing Standard Modes>
2143
2144   <\nf-chunk|mode:add-submode>
2145     <item>modes[<nf-arg|language>, <nf-arg|mode>, "submodes"] =
2146     modes[<nf-arg|language>, <nf-arg|mode>, "submodes"] "\|"
2147     <nf-arg|submode>;
2148   </nf-chunk||<tuple|language|mode|submode>>
2149
2150   <\nf-chunk|mode:add-escapes>
2151     <item>escapes[<nf-arg|language>, <nf-arg|mode>,
2152     ++escapes[<nf-arg|language>, <nf-arg|mode>], "s"]=<nf-arg|search>;
2153
2154     <item>escapes[<nf-arg|language>, <nf-arg|mode>,
2155     \ \ escapes[<nf-arg|language>, <nf-arg|mode>], "r"]=<nf-arg|replace>;
2156   </nf-chunk||<tuple|language|mode|search|replace>>
2157
2158   \;
2159
2160   <subsection|Comments>
2161
2162   We can define <verbatim|/* comment */> style comments and
2163   <verbatim|//comment> style comments to be added to any language:
2164
2165   <\nf-chunk|mode:multi-line-comments>
2166     <item><nf-ref|mode:add-submode|<tuple|<nf-arg|language>|""|"/\\\\*">>
2167
2168     <item>modes[<nf-arg|language>, "/*", "terminators"]="\\\\*/";
2169   </nf-chunk||<tuple|language>>
2170
2171   <\nf-chunk|mode:single-line-slash-comments>
2172     <item><nf-ref|mode:add-submode|<tuple|<nf-arg|language>|""|"//">>
2173
2174     <item>modes[<nf-arg|language>, "//", "terminators"]="\\n";
2175
2176     <item><nf-ref|mode:add-escapes|<tuple|<nf-arg|language>|"//"|"\\n"|"\\n//">>
2177   </nf-chunk||language>
2178
2179   We can also define <verbatim|# comment> style comments (as used in awk and
2180   shell scripts) in a similar manner.
2181
2182   <todo|I'm having to use # for hash and \textbackslash{} for \ and have
2183   hacky work-arounds in the parser for now>
2184
2185   <\nf-chunk|mode:add-hash-comments>
2186     <item><nf-ref|mode:add-submode|<tuple|<nf-arg|language>|""|"#">>
2187
2188     <item>modes[<nf-arg|language>, "#", "terminators"]="\\n";
2189
2190     <item><nf-ref|mode:add-escapes|<tuple|<nf-arg|language>|"#"|"\\n"|"\\n#">>
2191   </nf-chunk||<tuple|language>>
2192
2193   In C, the <verbatim|#> denotes pre-processor directives which can be
2194   multi-line
2195
2196   <\nf-chunk|mode:add-hash-defines>
2197     <item><nf-ref|mode:add-submode|<tuple|<nf-arg|language>|""|"#">>
2198
2199     <item>modes[<nf-arg|language>, "#", "submodes" ]="\\\\\\\\";
2200
2201     <item>modes[<nf-arg|language>, "#", "terminators"]="\\n";
2202
2203     <item><nf-ref|mode:add-escapes|<tuple|<nf-arg|language>|"#"|"\\n"|"\\\\\\\\\\n">>
2204   </nf-chunk||<tuple|language>>
2205
2206   <\nf-chunk|mode:quote-dollar-escape>
2207     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2208     ++escapes[<nf-arg|language>, <nf-arg|quote>], "s"]="\\\\$";
2209
2210     <item>escapes[<nf-arg|language>, <nf-arg|quote>,
2211     \ \ escapes[<nf-arg|language>, <nf-arg|quote>], "r"]="\\\\$";
2212   </nf-chunk||<tuple|language|quote>>
2213
2214   We can add these definitions to various languages
2215
2216   <\nf-chunk|mode-definitions>
2217     <item><nf-ref|common-mode-definitions|<tuple|"c-like">>
2218
2219     <item>
2220
2221     <item><nf-ref|common-mode-definitions|<tuple|"c">>
2222
2223     <item><nf-ref|mode:multi-line-comments|<tuple|"c">>
2224
2225     <item><nf-ref|mode:single-line-slash-comments|<tuple|"c">>
2226
2227     <item><nf-ref|mode:add-hash-defines|<tuple|"c">>
2228
2229     <item>
2230
2231     <item><nf-ref|common-mode-definitions|<tuple|"awk">>
2232
2233     <item><nf-ref|mode:add-hash-comments|<tuple|"awk">>
2234
2235     <item><nf-ref|mode:add-naked-regex|<tuple|"awk">>
2236   </nf-chunk||>
2237
2238   The awk definitions should allow a comment block like this:
2239
2240   <nf-chunk|test:comment-quote|<item># Comment:
2241   <nf-ref|test:comment-text|>|awk|>
2242
2243   <\nf-chunk|test:comment-text>
2244     <item>Now is the time for
2245
2246     <item>the quick brown fox to bring lemonade
2247
2248     <item>to the party
2249   </nf-chunk||>
2250
2251   to come out like this:
2252
2253   <\nf-chunk|test:comment-quote:result>
2254     <item># Comment: Now is the time for
2255
2256     <item>#the quick brown fox to bring lemonade
2257
2258     <item>#to the party
2259   </nf-chunk||>
2260
2261   The C definition for such a block should have it come out like this:
2262
2263   <\nf-chunk|test:comment-quote:C-result>
2264     <item># Comment: Now is the time for\\
2265
2266     <item>the quick brown fox to bring lemonade\\
2267
2268     <item>to the party
2269   </nf-chunk||>
2270
2271   <subsection|Regex>
2272
2273   This pattern is incomplete, but meant to detect naked regular expressions
2274   in awk and perl; e.g. <verbatim|/.*$/>, however required capabilities are
2275   not present.
2276
2277   Current it only detects regexes anchored with ^ as used in fangle.
2278
2279   For full regex support, modes need to be named not after their starting
2280   character, but some other more fully qualified name.
2281
2282   <\nf-chunk|mode:add-naked-regex>
2283     <item><nf-ref|mode:add-submode|<tuple|<nf-arg|language>|""|"/\\\\^">>
2284
2285     <item>modes[<nf-arg|language>, "/^", "terminators"]="/";
2286   </nf-chunk||<tuple|language>>
2287
2288   <subsection|Perl>
2289
2290   <\nf-chunk|mode-definitions>
2291     <item><nf-ref|common-mode-definitions|<tuple|"perl">>
2292
2293     <item><nf-ref|mode:multi-line-comments|<tuple|"perl">>
2294
2295     <item><nf-ref|mode:add-hash-comments|<tuple|"perl">>
2296   </nf-chunk||>
2297
2298   Still need to add add <verbatim|s/>, submode <verbatim|/>, terminate both
2299   with <verbatim|//>. This is likely to be impossible as perl regexes can
2300   contain perl.
2301
2302   <subsection|sh>
2303
2304   Shell single-quote strings are different to other strings and have no
2305   escape characters. The only special character is the single quote
2306   <verbatim|'> which always closes the string. Therefore we cannot use
2307   <nf-ref|common-mode-definitions|<tuple|"sh">> but we will invoke most of
2308   it's definition apart from single-quote strings.\
2309
2310   <\nf-chunk|mode-definitions>
2311     <item>modes["sh", "", \ "submodes"]="\\\\\\\\\|\\"\|'\|{\|\\\\(\|\\\\[\|\\\\$\\\\(";
2312
2313     <item>modes["sh", "\\\\", "terminators"]=".";
2314
2315     <item>
2316
2317     <item>modes["sh", "\\"", "submodes"]="\\\\\\\\\|\\\\$\\\\(";
2318
2319     <item>modes["sh", "\\"", "terminators"]="\\"";
2320
2321     <item>escapes["sh", "\\"", ++escapes["sh", "\\""], "s"]="\\\\\\\\";
2322
2323     <item>escapes["sh", "\\"", \ \ escapes["sh", "\\""], "r"]="\\\\\\\\";
2324
2325     <item>escapes["sh", "\\"", ++escapes["sh", "\\""], "s"]="\\"";
2326
2327     <item>escapes["sh", "\\"", \ \ escapes["sh", "\\""], "r"]="\\\\" "\\"";
2328
2329     <item>escapes["sh", "\\"", ++escapes["sh", "\\""], "s"]="\\n";
2330
2331     <item>escapes["sh", "\\"", \ \ escapes["sh", "\\""], "r"]="\\\\n";
2332
2333     <item>
2334
2335     <item>modes["sh", "'", "terminators"]="'";
2336
2337     <item>escapes["sh", "'", ++escapes["sh", "'"], "s"]="'";
2338
2339     <item>escapes["sh", "'", \ \ escapes["sh", "'"], "r"]="'\\\\'" "'";
2340
2341     <item><nf-ref|mode:common-brackets|<tuple|"sh"|"$("|"\\\\)">>
2342
2343     <item><nf-ref|mode:add-tunnel|<tuple|"sh"|"$("|"">>
2344
2345     <item><nf-ref|mode:common-brackets|<tuple|"sh"|"{"|"}">>
2346
2347     <item><nf-ref|mode:common-brackets|<tuple|"sh"|"["|"\\\\]">>
2348
2349     <item><nf-ref|mode:common-brackets|<tuple|"sh"|"("|"\\\\)">>
2350
2351     <item><nf-ref|mode:add-hash-comments|<tuple|"sh">>
2352
2353     <item><nf-ref|mode:quote-dollar-escape|<tuple|"sh"|"\\"">>
2354   </nf-chunk|awk|>
2355
2356   The definition of add-tunnel is:
2357
2358   <\nf-chunk|mode:add-tunnel>
2359     <item>escapes[<nf-arg|language>, <nf-arg|mode>,
2360     ++escapes[<nf-arg|language>, <nf-arg|mode>], "tunnel"]=<nf-arg|tunnel>;
2361   </nf-chunk||<tuple|language|mode|tunnel>>
2362
2363   <subsection|Make>
2364
2365   For makefiles, we currently recognize 2 modes: the <em|null> mode and
2366   <nf-tab> mode, which is tabbed mode and contains the makefile recipie. In
2367   the <em|null> mode the only escape is <verbatim|$> which must be converted
2368   to <verbatim|$$>.\
2369
2370   Tabbed mode is harder to manage, as the GNU Make Manual says in the section
2371   on <hlink|splitting lines|http://www.gnu.org/s/hello/manual/make/Splitting-Lines.html>.
2372   There is no way to escape a multi-line text that occurs as part of a
2373   makefile recipe.
2374
2375   Despite this sad fact, if the newline's in the shell script all occur at
2376   points of top-level shell syntax, then we could replace them with
2377   <verbatim| ;\\n<nf-tab>>and largely get the right effect.
2378
2379   <\with|par-columns|2>
2380     <\nf-chunk|test:make:1>
2381       <item>all:
2382
2383       <item><nf-tab>echo making
2384
2385       <item><nf-tab><nf-ref|test:make:1-inc|$@>
2386
2387       \;
2388     </nf-chunk|make|>
2389
2390     <\nf-chunk|test:make:1-inc>
2391       <item>if test "<nf-arg|target>" = "all"
2392
2393       <item>then echo yes, all
2394
2395       <item>else echo not all
2396
2397       <item>fi
2398     </nf-chunk|sh|<tuple|target>>
2399   </with>
2400
2401   The two chunks about could reasonably produce this:
2402
2403   <\nf-chunk|test:make:1.result>
2404     <item>all:
2405
2406     <item><nf-tab>echo making
2407
2408     <item><nf-tab>if test "$@" = "all" ;\\
2409
2410     <item><nf-tab>then echo yes, all ;\\
2411
2412     <item><nf-tab>else echo not all ;\\
2413
2414     <item><nf-tab>fi
2415   </nf-chunk|make|>
2416
2417   But will more likely produce this:
2418
2419   <\nf-chunk|test:make:1.result-actual>
2420     <item>all:
2421
2422     <item><nf-tab>echo making
2423
2424     <item><nf-tab>if test "$$@" = "all" ;\\
2425
2426     <item><nf-tab> then echo yes, all ;\\
2427
2428     <item><nf-tab> else echo not all ;\\
2429
2430     <item><nf-tab> fi
2431   </nf-chunk|make|>
2432
2433   The chunk argument <verbatim|$@> has been quoted (which would have been
2434   fine if we were passing the name of a shell variable), and the other shell
2435   lines are (harmlessly) indented by 1 space as part of fangle
2436   indent-matching which should have taken into account the expanded tab size,
2437   and should generally take into account the expanded prefix of the line
2438   whose indent it is trying to match, but which in this case we want to have
2439   no effect at all!
2440
2441   <\todo>
2442     The $@ was passed from a make fragment. In what cases should it be
2443     converted to $$@?
2444
2445     Do we need to track the language of sources of arguments?
2446   </todo>
2447
2448   A more ugly work-around until this problem can be solved would be to use
2449   this notation:
2450
2451   <\nf-chunk|test:make:2>
2452     <item>all:
2453
2454     <item><nf-tab>echo making
2455
2456     <item><nf-tab>ARG="$@"; <nf-ref|test:make:1-inc|$ARG>
2457   </nf-chunk|make|>
2458
2459   which produces this output which is more useful (because it works):
2460
2461   <\nf-chunk|test:make:2.result>
2462     <item>all:
2463
2464     <item><nf-tab>echo making test
2465
2466     <item><nf-tab>ARG="$@"; if test "$$ARG" = "all" ;\\
2467
2468     <item><nf-tab> \ \ \ \ \ \ \ \ \ \ then echo yes, all ;\\
2469
2470     <item><nf-tab> \ \ \ \ \ \ \ \ \ \ else echo not all ;\\
2471
2472     <item><nf-tab> \ \ \ \ \ \ \ \ \ \ fi
2473   </nf-chunk|make|>
2474
2475   If, however, the shell fragment contained strings with literal newline
2476   characters then there would be no easy way to escape these and preserve the
2477   value of the string.
2478
2479   A different style of makefile construction might be used <emdash> the
2480   recipe could be stored in a <hlink|target specific
2481   variable|http://www.gnu.org/s/hello/manual/make/Target_002dspecific.html>
2482   which contains the recipe with a more normal escape mechanism.
2483
2484   \;
2485
2486   <\nf-chunk|mode-definitions>
2487     <item>modes["make", "", \ "submodes"]="<nf-tab>";
2488
2489     <item>modes["make", "<nf-tab>", "terminators"]="\\\\n";
2490
2491     <item>escapes["make", "<nf-tab>", ++escapes["make", "<nf-tab>"],
2492     "s"]="\\\\n";
2493
2494     <item>escapes["make", "<nf-tab>", escapes["make", "<nf-tab>"], "r"]="
2495     ;\\\\\\n<nf-tab>";
2496
2497     <item>escapes["make", "<nf-tab>", ++escapes["make", "<nf-tab>"],
2498     "s"]="\\\\$";
2499
2500     <item>escapes["make", "<nf-tab>", escapes["make", "<nf-tab>"], "r"]="$$";
2501   </nf-chunk|awk|>
2502
2503   Note also that the tab character is hard-wired into the pattern, and that
2504   the make variable <verbatim|.RECIPEPREFIX> might change this to something
2505   else.
2506
2507   <section|Some tests>
2508
2509   Also, the parser must return any spare text at the end that has not been
2510   processed due to a mode terminator being found.
2511
2512   <\nf-chunk|test:mode-definitions>
2513     <item>rest = parse_chunk_args("c-like", "1, 2, 3) spare", a, "(");
2514
2515     <item>if (a[1] != 1) e++;
2516
2517     <item>if (a[2] != 2) e++;
2518
2519     <item>if (a[3] != 3) e++;
2520
2521     <item>if (length(a) != 3) e++;
2522
2523     <item>if (rest != " spare") e++;
2524
2525     <item><nf-ref|pca-test.awk:summary|>
2526   </nf-chunk||>
2527
2528   We must also be able to parse the example given earlier.
2529
2530   <\nf-chunk|test:mode-definitions>
2531     <item>parse_chunk_args("c-like", "things[x, y], get_other_things(a,
2532     \\"(all)\\"), 99", a, "(");
2533
2534     <item>if (a[1] != "things[x, y]") e++;
2535
2536     <item>if (a[2] != "get_other_things(a, \\"(all)\\")") e++;
2537
2538     <item>if (a[3] != "99") e++;
2539
2540     <item>if (length(a) != 3) e++;
2541
2542     <item><nf-ref|pca-test.awk:summary|>
2543   </nf-chunk||>
2544
2545   <section|A non-recursive mode tracker>
2546
2547   <subsection|Constructor>
2548
2549   The mode tracker holds its state in a stack based on a numerically indexed
2550   hash. This function, when passed an empty hash, will intialize it.
2551
2552   <\nf-chunk|new_mode_tracker()>
2553     <item>function new_mode_tracker(context, language, mode) {
2554
2555     <item> \ context[""] = 0;
2556
2557     <item> \ context[0, "language"] = language;
2558
2559     <item> \ context[0, "mode"] = mode;
2560
2561     <item>}
2562   </nf-chunk||>
2563
2564   Because awk functions cannot return an array, we must create the array
2565   first and pass it in, so we have a fangle macro to do this:
2566
2567   <\nf-chunk|new-mode-tracker>
2568     <item><nf-ref|awk-delete-array|<tuple|context>>
2569
2570     <item>new_mode_tracker(<nf-arg|context>, <nf-arg|language>,
2571     <nf-arg|mode>);
2572   </nf-chunk|awk|<tuple|context|language|mode>>
2573
2574   <subsection|Management>
2575
2576   And for tracking modes, we dispatch to a mode-tracker action based on the
2577   current language
2578
2579   <\nf-chunk|mode_tracker>
2580     <item>function push_mode_tracker(context, language, mode,
2581
2582     <item> \ # local vars
2583
2584     <item> \ top)
2585
2586     <item>{
2587
2588     <item> \ if (! ("" in context)) {
2589
2590     <item> \ \ \ <nf-ref|new-mode-tracker|<tuple|context|language|mode>>
2591
2592     <item> \ \ \ return;
2593
2594     <item> \ } else {
2595
2596     <item> \ \ \ top = context[""];
2597
2598     <item># \ \ \ if (context[top, "language"] == language && mode=="") mode
2599     = context[top, "mode"];
2600
2601     <item> \ \ \ if (context[top, "language"] == language && context[top,
2602     "mode"] == mode) return top;
2603
2604     <item> \ \ \ old_top = top;
2605
2606     <item> \ \ \ top++;
2607
2608     <item> \ \ \ context[top, "language"] = language;
2609
2610     <item> \ \ \ context[top, "mode"] = mode;
2611
2612     <item> \ \ \ context[""] = top;
2613
2614     <item> \ }
2615
2616     <item> \ return old_top;
2617
2618     <item>}
2619   </nf-chunk|awk|>
2620
2621   <\nf-chunk|mode_tracker>
2622     <item>function dump_mode_tracker(context, \
2623
2624     <item> \ c, d)
2625
2626     <item>{
2627
2628     <item> \ for(c=0; c \<less\>= context[""]; c++) {
2629
2630     <item> \ \ \ printf(" %2d \ \ %s:%s\\n", c, context[c, "language"],
2631     context[c, "mode"]) \<gtr\> "/dev/stderr";
2632
2633     <item> \ \ \ for(d=1; ( (c, "values", d) in context); d++) {
2634
2635     <item> \ \ \ \ \ printf(" \ \ %2d %s\\n", d, context[c, "values", d])
2636     \<gtr\> "/dev/stderr";
2637
2638     <item> \ \ \ }
2639
2640     <item> \ }
2641
2642     <item>}
2643   </nf-chunk||>
2644
2645   <\nf-chunk|mode_tracker>
2646     <item>function pop_mode_tracker(context, context_origin)
2647
2648     <item>{
2649
2650     <item> \ if ( (context_origin) && ("" in context) && context[""] !=
2651     (1+context_origin) && context[""] != context_origin) return 0;
2652
2653     <item> \ context[""] = context_origin;
2654
2655     <item> \ return 1;
2656
2657     <item>}
2658   </nf-chunk||>
2659
2660   This implies that any chunk must be syntactically whole; for instance, this
2661   is fine:
2662
2663   <\nf-chunk|test:whole-chunk>
2664     <item>if (1) {
2665
2666     <item> \ <nf-ref|test:say-hello|>
2667
2668     <item>}
2669   </nf-chunk||>
2670
2671   <\nf-chunk|test:say-hello>
2672     <item>print "hello";
2673   </nf-chunk||>
2674
2675   But this is not fine; the chunk <nf-ref|test:hidden-else|> is not properly
2676   cromulent.
2677
2678   <\nf-chunk|test:partial-chunk>
2679     <item>if (1) {
2680
2681     <item> \ <nf-ref|test:hidden-else|>
2682
2683     <item>}
2684   </nf-chunk||>
2685
2686   <\nf-chunk|test:hidden-else>
2687     <item> \ print "I'm fine";
2688
2689     <item>} else {
2690
2691     <item> \ print "I'm not";
2692   </nf-chunk||>
2693
2694   These tests will check for correct behaviour:
2695
2696   <\nf-chunk|test:cromulence>
2697     <item>echo Cromulence test
2698
2699     <item>passtest $FANGLE -Rtest:whole-chunk $TXT_SRC &\<gtr\>/dev/null \|\|
2700     ( echo "Whole chunk failed" && exit 1 )
2701
2702     <item>failtest $FANGLE -Rtest:partial-chunk $TXT_SRC &\<gtr\>/dev/null
2703     \|\| ( echo "Partial chunk failed" && exit 1 )
2704   </nf-chunk||>
2705
2706   <subsection|Tracker>
2707
2708   We must avoid recursion as a language construct because we intend to employ
2709   mode-tracking to track language mode of emitted code, and the code is
2710   emitted from a function which is itself recursive, so instead we implement
2711   psuedo-recursion using our own stack based on a hash.
2712
2713   <\nf-chunk|mode_tracker()>
2714     <item>function mode_tracker(context, text, values,\
2715
2716     <item> \ # optional parameters
2717
2718     <item> \ # local vars
2719
2720     <item> \ mode, submodes, language,
2721
2722     <item> \ cindex, c, a, part, item, name, result, new_values, new_mode,\
2723
2724     <item> \ delimiters, terminators)
2725
2726     <item>{
2727   </nf-chunk|awk|>
2728
2729   We could be re-commencing with a valid context, so we need to setup the
2730   state according to the last context.
2731
2732   <\nf-chunk|mode_tracker()>
2733     <item> \ cindex = context[""] + 0;
2734
2735     <item> \ mode = context[cindex, "mode"];
2736
2737     <item> \ language = context[cindex, "language" ];
2738   </nf-chunk||>
2739
2740   First we construct a single large regex combining the possible sub-modes
2741   for the current mode along with the terminators for the current mode.
2742
2743   <\nf-chunk|parse_chunk_args-reset-modes>
2744     <item> \ submodes=modes[language, mode, "submodes"];
2745
2746     <item>
2747
2748     <item> \ if ((language, mode, "delimiters") in modes) {
2749
2750     <item> \ \ \ delimiters = modes[language, mode, "delimiters"];
2751
2752     <item> \ \ \ if (length(submodes)\<gtr\>0) submodes = submodes "\|";
2753
2754     <item> \ \ \ submodes=submodes delimiters;
2755
2756     <item> \ } else delimiters="";
2757
2758     <item> \ if ((language, mode, "terminators") in modes) {
2759
2760     <item> \ \ \ terminators = modes[language, mode, "terminators"];
2761
2762     <item> \ \ \ if (length(submodes)\<gtr\>0) submodes = submodes "\|";
2763
2764     <item> \ \ \ submodes=submodes terminators;
2765
2766     <item> \ } else terminators="";
2767   </nf-chunk||>
2768
2769   If we don't find anything to match on --- probably because the language is
2770   not supported --- then we return the entire text without matching anything.
2771
2772   <\nf-chunk|parse_chunk_args-reset-modes>
2773     <item> if (! length(submodes)) return text;
2774   </nf-chunk||>
2775
2776   <\nf-chunk|mode_tracker()>
2777     <item><nf-ref|parse_chunk_args-reset-modes|>
2778   </nf-chunk||>
2779
2780   We then iterate the text (until there is none left) looking for sub-modes
2781   or terminators in the regex.
2782
2783   <\nf-chunk|mode_tracker()>
2784     <item> \ while((cindex \<gtr\>= 0) && length(text)) {
2785
2786     <item> \ \ \ if (match(text, "(" submodes ")", a)) {
2787   </nf-chunk||>
2788
2789   A bug that creeps in regularly during development is bad regexes of zero
2790   length which result in an infinite loop (as no text is consumed), so I
2791   catch that right away with this test.
2792
2793   <\nf-chunk|mode_tracker()>
2794     <item> \ \ \ \ \ if (RLENGTH\<less\>1) {
2795
2796     <item> \ \ \ \ \ \ \ error(sprintf("Internal error, matched zero length
2797     submode, should be impossible - likely regex computation error\\n" \\
2798
2799     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ "Language=%s\\nmode=%s\\nmatch=%s\\n",
2800     language, mode, submodes));
2801
2802     <item> \ \ \ \ \ }
2803   </nf-chunk||>
2804
2805   part is defined as the text up to the sub-mode or terminator, and this is
2806   appended to item --- which is the current text being gathered. If a mode
2807   has a delimiter, then item is reset each time a delimiter is found.
2808
2809   <math|<wide|<with|mode|prog|"><wide*|hello|\<wide-underbrace\>><rsub|item>,
2810   <wide*|there|\<wide-underbrace\>><rsub|item><with|mode|prog|">|\<wide-overbrace\>><rsup|item>,
2811   \ <wide|he said.|\<wide-overbrace\>><rsup|item>>
2812
2813   <\nf-chunk|mode_tracker()>
2814     <item> \ \ \ \ \ part = substr(text, 1, RSTART -1);
2815
2816     <item> \ \ \ \ \ item = item part;
2817   </nf-chunk||>
2818
2819   We must now determine what was matched. If it was a terminator, then we
2820   must restore the previous mode.
2821
2822   <\nf-chunk|mode_tracker()>
2823     <item> \ \ \ \ \ if (match(a[1], "^" terminators "$")) {
2824
2825     <item>#printf("%2d EXIT \ MODE [%s] by [%s] [%s]\\n", cindex, mode, a[1],
2826     text) \<gtr\> "/dev/stderr"
2827
2828     <item> \ \ \ \ \ \ \ context[cindex, "values", ++context[cindex,
2829     "values"]] = item;
2830
2831     <item> \ \ \ \ \ \ \ delete context[cindex];
2832
2833     <item> \ \ \ \ \ \ \ context[""] = --cindex;
2834
2835     <item> \ \ \ \ \ \ \ if (cindex\<gtr\>=0) {
2836
2837     <item> \ \ \ \ \ \ \ \ \ mode = context[cindex, "mode"];
2838
2839     <item> \ \ \ \ \ \ \ \ \ language = context[cindex, "language"];
2840
2841     <item> \ \ \ \ \ \ \ \ \ <nf-ref|parse_chunk_args-reset-modes|>
2842
2843     <item> \ \ \ \ \ \ \ }
2844
2845     <item> \ \ \ \ \ \ \ item = item a[1];
2846
2847     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2848     length(a[1]));
2849
2850     <item> \ \ \ \ \ }
2851   </nf-chunk||>
2852
2853   If a delimiter was matched, then we must store the current item in the
2854   parsed values array, and reset the item.
2855
2856   <\nf-chunk|mode_tracker()>
2857     <item> \ \ \ \ \ else if (match(a[1], "^" delimiters "$")) {
2858
2859     <item> \ \ \ \ \ \ \ if (cindex==0) {
2860
2861     <item> \ \ \ \ \ \ \ \ \ context[cindex, "values", ++context[cindex,
2862     "values"]] = item;
2863
2864     <item> \ \ \ \ \ \ \ \ \ item = "";
2865
2866     <item> \ \ \ \ \ \ \ } else {
2867
2868     <item> \ \ \ \ \ \ \ \ \ item = item a[1];
2869
2870     <item> \ \ \ \ \ \ \ }
2871
2872     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2873     length(a[1]));
2874
2875     <item> \ \ \ \ \ }
2876   </nf-chunk||>
2877
2878   otherwise, if a new submode is detected (all submodes have terminators), we
2879   must create a nested parse context until we find the terminator for this
2880   mode.
2881
2882   <\nf-chunk|mode_tracker()>
2883     <item> else if ((language, a[1], "terminators") in modes) {
2884
2885     <item> \ \ \ \ \ \ \ #check if new_mode is defined
2886
2887     <item> \ \ \ \ \ \ \ item = item a[1];
2888
2889     <item>#printf("%2d ENTER MODE [%s] in [%s]\\n", cindex, a[1], text)
2890     \<gtr\> "/dev/stderr"
2891
2892     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2893     length(a[1]));
2894
2895     <item> \ \ \ \ \ \ \ context[""] = ++cindex;
2896
2897     <item> \ \ \ \ \ \ \ context[cindex, "mode"] = a[1];
2898
2899     <item> \ \ \ \ \ \ \ context[cindex, "language"] = language;
2900
2901     <item> \ \ \ \ \ \ \ mode = a[1];
2902
2903     <item> \ \ \ \ \ \ \ <nf-ref|parse_chunk_args-reset-modes|>
2904
2905     <item> \ \ \ \ \ } else {
2906
2907     <item> \ \ \ \ \ \ \ error(sprintf("Submode '%s' set unknown mode in
2908     text: %s\\nLanguage %s Mode %s\\n", a[1], text, language, mode));
2909
2910     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2911     length(a[1]));
2912
2913     <item> \ \ \ \ \ }
2914
2915     <item> \ \ \ }
2916   </nf-chunk||>
2917
2918   In the final case, we parsed to the end of the string. If the string was
2919   entire, then we should have no nested mode context, but if the string was
2920   just a fragment we may have a mode context which must be preserved for the
2921   next fragment. Todo: Consideration ought to be given if sub-mode strings
2922   are split over two fragments.
2923
2924   <\nf-chunk|mode_tracker()>
2925     <item>else {
2926
2927     <item> \ \ \ \ \ context[cindex, "values", ++context[cindex, "values"]] =
2928     item text;
2929
2930     <item> \ \ \ \ \ text = "";
2931
2932     <item> \ \ \ \ \ item = "";
2933
2934     <item> \ \ \ }
2935
2936     <item> \ }
2937
2938     <item>
2939
2940     <item> \ context["item"] = item;
2941
2942     <item>
2943
2944     <item> \ if (length(item)) context[cindex, "values", ++context[cindex,
2945     "values"]] = item;
2946
2947     <item> \ return text;
2948
2949     <item>}
2950   </nf-chunk||>
2951
2952   <subsubsection|One happy chunk>
2953
2954   All the mode tracker chunks are referred to here:
2955
2956   <\nf-chunk|mode-tracker>
2957     <item><nf-ref|new_mode_tracker()|>
2958
2959     <item><nf-ref|mode_tracker()|>
2960   </nf-chunk||>
2961
2962   <subsubsection|Tests>
2963
2964   We can test this function like this:
2965
2966   <\nf-chunk|pca-test.awk>
2967     <item><nf-ref|error()|>
2968
2969     <item><nf-ref|mode-tracker|>
2970
2971     <item><nf-ref|parse_chunk_args()|>
2972
2973     <item>BEGIN {
2974
2975     <item> \ SUBSEP=".";
2976
2977     <item> \ <nf-ref|mode-definitions|>
2978
2979     <item>
2980
2981     <item> \ <nf-ref|test:mode-definitions|>
2982
2983     <item>}
2984   </nf-chunk|awk|>
2985
2986   <\nf-chunk|pca-test.awk:summary>
2987     <item>if (e) {
2988
2989     <item> \ printf "Failed " e
2990
2991     <item> \ for (b in a) {
2992
2993     <item> \ \ \ print "a[" b "] =\<gtr\> " a[b];
2994
2995     <item> \ }
2996
2997     <item>} else {
2998
2999     <item> \ print "Passed"
3000
3001     <item>}
3002
3003     <item>split("", a);
3004
3005     <item>e=0;
3006   </nf-chunk|awk|>
3007
3008   which should give this output:
3009
3010   <\nf-chunk|pca-test.awk-results>
3011     <item>a[foo.quux.quirk] =\<gtr\>\
3012
3013     <item>a[foo.quux.a] =\<gtr\> fleeg
3014
3015     <item>a[foo.bar] =\<gtr\> baz
3016
3017     <item>a[etc] =\<gtr\>\
3018
3019     <item>a[name] =\<gtr\> freddie
3020   </nf-chunk||>
3021
3022   <section|Escaping and Quoting>
3023
3024   For the time being and to get around <TeXmacs> inability to export a
3025   <kbd|TAB> character, the right arrow <with|mode|math|\<mapsto\>> whose
3026   UTF-8 sequence is ...
3027
3028   <todo|complete>
3029
3030   Another special character is used, the left-arrow
3031   <with|mode|math|\<mapsfrom\>> with UTF-8 sequence 0xE2 0x86 0xA4 is used to
3032   strip any preceding white space as a way of un-tabbing and removing indent
3033   that has been applied <emdash> this is important for bash here documents,
3034   and the like. It's a filthy hack.
3035
3036   <todo|remove the hack>
3037
3038   <\nf-chunk|mode_tracker>
3039     \;
3040
3041     <item>function untab(text) {
3042
3043     <item> \ gsub("[[:space:]]*\\xE2\\x86\\xA4","", text);
3044
3045     <item> \ return text;
3046
3047     <item>}
3048   </nf-chunk||>
3049
3050   Each nested mode can optionally define a set of transforms to be applied to
3051   any text that is included from another language.
3052
3053   This code can perform transforms from index c downwards.
3054
3055   <\nf-chunk|mode_tracker>
3056     <item>function transform_escape(context, text, top,
3057
3058     <item> \ c, cp, cpl, s, r)
3059
3060     <item>{
3061
3062     <item> \ for(c = top; c \<gtr\>= 0; c--) {
3063
3064     <item> \ \ \ if ( (context[c, "language"], context[c, "mode"]) in
3065     escapes) {
3066
3067     <item> \ \ \ \ \ cpl = escapes[context[c, "language"], context[c,
3068     "mode"]];
3069
3070     <item> \ \ \ \ \ for (cp = 1; cp \<less\>= cpl; cp ++) {
3071
3072     <item> \ \ \ \ \ \ \ s = escapes[context[c, "language"], context[c,
3073     "mode"], cp, "s"];
3074
3075     <item> \ \ \ \ \ \ \ r = escapes[context[c, "language"], context[c,
3076     "mode"], cp, "r"];
3077
3078     <item> \ \ \ \ \ \ \ if (length(s)) {
3079
3080     <item> \ \ \ \ \ \ \ \ \ gsub(s, r, text);
3081
3082     <item> \ \ \ \ \ \ \ }
3083
3084     <item> \ \ \ \ \ \ \ if ( (context[c, "language"], context[c, "mode"],
3085     cp, "t") in escapes ) {
3086
3087     <item> \ \ \ \ \ \ \ \ \ quotes[src, "t"] = escapes[context[c,
3088     "language"], context[c, "mode"], cp, "t"];
3089
3090     <item> \ \ \ \ \ \ \ }
3091
3092     <item> \ \ \ \ \ }
3093
3094     <item> \ \ \ }
3095
3096     <item> \ }
3097
3098     <item> \ return text;
3099
3100     <item>}
3101
3102     <item>function dump_escaper(quotes, r, cc) {
3103
3104     <item> \ for(cc=1; cc\<less\>=c; cc++) {
3105
3106     <item> \ \ \ printf("%2d s[%s] r[%s]\\n", cc, quotes[cc, "s"], quotes[cc,
3107     "r"]) \<gtr\> "/dev/stderr"
3108
3109     <item> \ }
3110
3111     <item>}
3112   </nf-chunk|awk|>
3113
3114   <\nf-chunk|test:escapes>
3115     <item>echo escapes test
3116
3117     <item>passtest $FANGLE -Rtest:comment-quote $TXT_SRC &\<gtr\>/dev/null
3118     \|\| ( echo "Comment-quote failed" && exit 1 )
3119   </nf-chunk|sh|>
3120
3121   <chapter|Recognizing Chunks>
3122
3123   Fangle recognizes noweb chunks, but as we also want better <LaTeX>
3124   integration we will recognize any of these:
3125
3126   <\itemize>
3127     <item>notangle chunks matching the pattern
3128     <verbatim|^\<less\>\<less\>.*?\<gtr\>\<gtr\>=>
3129
3130     <item>chunks beginning with <verbatim|\\begin{lstlistings}>, possibly
3131     with <verbatim|\\Chunk{...}> on the previous line
3132
3133     <item>an older form I have used, beginning with
3134     <verbatim|\\begin{Chunk}[options]> --- also more suitable for plain
3135     <LaTeX> users<\footnote>
3136       Is there such a thing as plain <LaTeX>?
3137     </footnote>.
3138   </itemize>
3139
3140   <section|Chunk start>
3141
3142   The variable chunking is used to signify that we are processing a code
3143   chunk and not document. In such a state, input lines will be assigned to
3144   the current chunk; otherwise they are ignored.
3145
3146   <subsection|<TeXmacs>>
3147
3148   We don't handle <TeXmacs> files natively yet, but rather instead emit
3149   unicode character sequences to mark up the text-export file which we do
3150   process.
3151
3152   These hacks detect the unicode character sequences and retro-fit in the old
3153   <TeX> parsing.
3154
3155   We convert <math|\<mapsto\>> into a tab character.
3156
3157   <\nf-chunk|recognize-chunk>
3158     \;
3159
3160     <item>#/\\n/ {
3161
3162     <item># \ gsub("\\n*$","");
3163
3164     <item># \ gsub("\\n", " ");
3165
3166     <item>#}
3167
3168     <item>#===
3169
3170     <item>/\\xE2\\x86\\xA6/ {
3171
3172     <item> \ gsub("\\\\xE2\\\\x86\\\\xA6", "\\x09");
3173
3174     <item>}
3175   </nf-chunk||>
3176
3177   <TeXmacs> back-tick handling is obscure, and a cut-n-paste back-tick from a
3178   shell window comes out as a unicode sequence<\footnote>
3179     that won't export to html, except as a NULL character (literal 0x00)
3180   </footnote> that is fixed-up here.
3181
3182   <\nf-chunk|recognize-chunk>
3183     <item>
3184
3185     <item>/\\xE2\\x80\\x98/ {
3186
3187     <item> \ gsub("\\\\xE2\\\\x80\\\\x98", "`");
3188
3189     <item>}
3190   </nf-chunk||>
3191
3192   In the <TeXmacs> output, the start of a chunk will appear like this:
3193
3194   <verbatim| \ 5b\<less\>example-chunk<key|^K>[1](arg1,<key|^K>
3195   arg2<key|^K><key|^K>), lang=C\<gtr\> <math|\<equiv\>>>
3196
3197   We detect the the start of a <TeXmacs> chunk by detecting the
3198   <math|\<equiv\>> symbol which occurs near the end of the line. We obtain
3199   the chunk name, the chunk parameters, and the chunk language.
3200
3201   <\nf-chunk|recognize-chunk>
3202     <item>
3203
3204     <item>/\\xE2\\x89\\xA1/ {
3205
3206     <item> \ if (match($0, "^ *([^[ ]* \|)\<less\>([^[
3207     ]*)\\\\[[0-9]*\\\\][(](.*)[)].*, lang=([^ ]*)\<gtr\>", line)) {
3208
3209     <item> \ \ \ next_chunk_name=line[2];
3210
3211     <item> \ \ \ get_texmacs_chunk_args(line[3], next_chunk_params);
3212
3213     <item> \ \ \ gsub(ARG_SEPARATOR ",? ?", ";", line[3]);
3214
3215     <item> \ \ \ params = "params=" line[3];
3216
3217     <item> \ \ \ if ((line[4])) {
3218
3219     <item> \ \ \ \ \ params = params ",language=" line[4]
3220
3221     <item> \ \ \ }
3222
3223     <item> \ \ \ get_tex_chunk_args(params, next_chunk_opts);
3224
3225     <item> \ \ \ new_chunk(next_chunk_name, next_chunk_opts,
3226     next_chunk_params);
3227
3228     <item> \ \ \ texmacs_chunking = 1;
3229
3230     <item> \ } else {
3231
3232     <item> \ \ \ # warning(sprintf("Unexpected chunk match: %s\\n", $_))
3233
3234     <item> \ }
3235
3236     <item> \ next;
3237
3238     <item>}
3239   </nf-chunk||>
3240
3241   <subsection|lstlistings>
3242
3243   Our current scheme is to recognize the new lstlisting chunks, but these may
3244   be preceded by a <verbatim|\\Chunk> command which in <LyX> is a more
3245   convenient way to pass the chunk name to the
3246   <verbatim|\\begin{lstlistings}> command, and a more visible way to specify
3247   other <verbatim|lstset> settings.
3248
3249   The arguments to the <verbatim|\\Chunk> command are a name, and then a
3250   comma-seperated list of key-value pairs after the manner of
3251   <verbatim|\\lstset>. (In fact within the <LaTeX> <verbatim|\\Chunk> macro
3252   (section <reference|sub:The-chunk-command>) the text <verbatim|name=> is
3253   prefixed to the argument which is then literally passed to
3254   <verbatim|\\lstset>).
3255
3256   <\nf-chunk|recognize-chunk>
3257     <item>/^\\\\Chunk{/ {
3258
3259     <item> \ if (match($0, "^\\\\\\\\Chunk{ *([^ ,}]*),?(.*)}", line)) {
3260
3261     <item> \ \ \ next_chunk_name = line[1];
3262
3263     <item> \ \ \ get_tex_chunk_args(line[2], next_chunk_opts);
3264
3265     <item> \ }
3266
3267     <item> \ next;
3268
3269     <item>}
3270   </nf-chunk|awk|>
3271
3272   We also make a basic attempt to parse the name out of the
3273   <verbatim|\\lstlistings[name=chunk-name]> text, otherwise we fall back to
3274   the name found in the previous chunk command. This attempt is very basic
3275   and doesn't support commas or spaces or square brackets as part of the
3276   chunkname. We also recognize <verbatim|\\begin{Chunk}> which is convenient
3277   for some users<\footnote>
3278     but not yet supported in the <LaTeX> macros
3279   </footnote>.
3280
3281   <\nf-chunk|recognize-chunk>
3282     <item>/^\\\\begin{lstlisting}\|^\\\\begin{Chunk}/ {
3283
3284     <item> \ if (match($0, "}.*[[,] *name= *{? *([^], }]*)", line)) {
3285
3286     <item> \ \ \ new_chunk(line[1]);
3287
3288     <item> \ } else {
3289
3290     <item> \ \ \ new_chunk(next_chunk_name, next_chunk_opts);
3291
3292     <item> \ }
3293
3294     <item> \ chunking=1;
3295
3296     <item> \ next;
3297
3298     <item>}
3299   </nf-chunk||>
3300
3301   <section|Chunk Body>
3302
3303   <subsection|<TeXmacs>>
3304
3305   A chunk body in <TeXmacs> ends with <verbatim|\|________>... if it is the
3306   final chunklet of a chunk, or if there are further chunklets it ends with
3307   <verbatim|\|\\/\\/\\/>... which is a depiction of a jagged line of torn
3308   paper.
3309
3310   <\nf-chunk|recognize-chunk>
3311     <item>/^ *\\\|____________*/ && texmacs_chunking {
3312
3313     <item> \ active_chunk="";
3314
3315     <item> \ texmacs_chunking=0;
3316
3317     <item> \ chunking=0;
3318
3319     <item>}
3320
3321     <item>/^ *\\\|\\/\\\\/ && texmacs_chunking {
3322
3323     <item> \ texmacs_chunking=0;
3324
3325     <item> \ chunking=0;
3326
3327     <item> \ active_chunk="";
3328
3329     <item>}
3330   </nf-chunk||>
3331
3332   It has been observed that not every line of output when a <TeXmacs> chunk
3333   is active is a line of chunk. This may no longer be true, but we set a
3334   variable <verbatim|texmacs_chunk> if the current line is a chunk line.
3335
3336   Initially we set this to zero...
3337
3338   <\nf-chunk|recognize-chunk>
3339     <item>texmacs_chunk=0;
3340   </nf-chunk||>
3341
3342   ...and then we look to see if the current line is a chunk line.
3343
3344   <TeXmacs> lines look like this: <verbatim| \ 3 \| main() {> so we detect
3345   the lines by leading white space, digits, more whiter space and a vertical
3346   bar followed by at least once space.
3347
3348   If we find such a line, we remove this line-header and set
3349   <verbatim|texmacs_chunk=1> as well as <verbatim|chunking=1>
3350
3351   <\nf-chunk|recognize-chunk>
3352     <item>/^ *[1-9][0-9]* *\\\| / {
3353
3354     <item> \ if (texmacs_chunking) {
3355
3356     <item> \ \ \ chunking=1;
3357
3358     <item> \ \ \ texmacs_chunk=1;
3359
3360     <item> \ \ \ gsub("^ *[1-9][0-9]* *\\\\\| ", "")
3361
3362     <item> \ }
3363
3364     <item>}
3365   </nf-chunk||>
3366
3367   When <TeXmacs> chunking, lines that commence with <verbatim|\\/> or
3368   <verbatim|__> are not chunk content but visual framing, and are skipped.
3369
3370   <\nf-chunk|recognize-chunk>
3371     <item>/^ *\\.\\/\\\\/ && texmacs_chunking {
3372
3373     <item> \ next;
3374
3375     <item>}
3376
3377     <item>/^ *__*$/ && texmacs_chunking {
3378
3379     <item> \ next;
3380
3381     <item>}
3382   </nf-chunk||>
3383
3384   Any other line when <TeXmacs> chunking is considered to be a line-wrapped
3385   line.
3386
3387   <\nf-chunk|recognize-chunk>
3388     <item>texmacs_chunking {
3389
3390     <item> \ if (! texmacs_chunk) {
3391
3392     <item> \ \ \ # must be a texmacs continued line
3393
3394     <item> \ \ \ chunking=1;
3395
3396     <item> \ \ \ texmacs_chunk=1;
3397
3398     <item> \ }
3399
3400     <item>}
3401   </nf-chunk||>
3402
3403   This final chunklet seems bogus and probably stops <LyX> working.
3404
3405   <\nf-chunk|recognize-chunk>
3406     <item>! texmacs_chunk {
3407
3408     <item># \ texmacs_chunking=0;
3409
3410     <item> \ chunking=0;
3411
3412     <item>}
3413   </nf-chunk||>
3414
3415   <subsection|Noweb>
3416
3417   We recognize notangle style chunks too:
3418
3419   <\nf-chunk|recognize-chunk>
3420     <item>/^[\<less\>]\<less\>.*[\<gtr\>]\<gtr\>=/ {
3421
3422     <item> \ if (match($0, "^[\<less\>]\<less\>(.*)[\<gtr\>]\<gtr\>= *$",
3423     line)) {
3424
3425     <item> \ \ \ chunking=1;
3426
3427     <item> \ \ \ notangle_mode=1;
3428
3429     <item> \ \ \ new_chunk(line[1]);
3430
3431     <item> \ \ \ next;
3432
3433     <item> \ }
3434
3435     <item>}
3436   </nf-chunk|awk|>
3437
3438   <section|Chunk end>
3439
3440   Likewise, we need to recognize when a chunk ends.
3441
3442   <subsection|lstlistings>
3443
3444   The <verbatim|e> in <verbatim|[e]nd{lislisting}> is surrounded by square
3445   brackets so that when this document is processed, this chunk doesn't
3446   terminate early when the lstlistings package recognizes it's own
3447   end-string!<\footnote>
3448     This doesn't make sense as the regex is anchored with ^, which this line
3449     does not begin with!
3450   </footnote>
3451
3452   <\nf-chunk|recognize-chunk>
3453     <item>/^\\\\[e]nd{lstlisting}\|^\\\\[e]nd{Chunk}/ {
3454
3455     <item> \ chunking=0;
3456
3457     <item> \ active_chunk="";
3458
3459     <item> \ next;
3460
3461     <item>}
3462   </nf-chunk||>
3463
3464   <subsection|noweb>
3465
3466   <\nf-chunk|recognize-chunk>
3467     <item>/^@ *$/ {
3468
3469     <item> \ chunking=0;
3470
3471     <item> \ active_chunk="";
3472
3473     <item>}
3474   </nf-chunk||>
3475
3476   All other recognizers are only of effect if we are chunking; there's no
3477   point in looking at lines if they aren't part of a chunk, so we just ignore
3478   them as efficiently as we can.
3479
3480   <\nf-chunk|recognize-chunk>
3481     <item>! chunking { next; }
3482   </nf-chunk||>
3483
3484   <section|Chunk contents>
3485
3486   Chunk contents are any lines read while <verbatim|chunking> is true. Some
3487   chunk contents are special in that they refer to other chunks, and will be
3488   replaced by the contents of these chunks when the file is generated.
3489
3490   <label|sub:ORS-chunk-text>We add the output record separator <verbatim|ORS>
3491   to the line now, because we will set <verbatim|ORS> to the empty string
3492   when we generate the output<\footnote>
3493     So that we can partial print lines using <verbatim|print> instead of
3494     <verbatim|printf>. <todo|This does't make sense>
3495   </footnote>.
3496
3497   <\nf-chunk|recognize-chunk>
3498     <item>length(active_chunk) {
3499
3500     <item> \ <nf-ref|process-chunk-tabs|>
3501
3502     <item> \ <nf-ref|process-chunk|>
3503
3504     <item>}
3505   </nf-chunk||>
3506
3507   If a chunk just consisted of plain text, we could handle the chunk like
3508   this:
3509
3510   <\nf-chunk|process-chunk-simple>
3511     <item>chunk_line(active_chunk, $0 ORS);
3512   </nf-chunk||>
3513
3514   but in fact a chunk can include references to other chunks. Chunk includes
3515   are traditionally written as <verbatim|\<less\>\<less\>chunk-name\<gtr\>\<gtr\>>
3516   but we support other variations, some of which are more suitable for
3517   particular editing systems.
3518
3519   However, we also process tabs at this point. A tab at input can be replaced
3520   by a number of spaces defined by the <verbatim|tabs> variable, set by the
3521   <verbatim|-T> option. Of course this is poor tab behaviour, we should
3522   probably have the option to use proper counted tab-stops and process this
3523   on output.
3524
3525   <\nf-chunk|process-chunk-tabs>
3526     <item>if (length(tabs)) {
3527
3528     <item> \ gsub("\\t", tabs);
3529
3530     <item>}
3531   </nf-chunk||>
3532
3533   <subsection|lstlistings><label|sub:lst-listings-includes>
3534
3535   If <verbatim|\\lstset{escapeinside={=\<less\>}{\<gtr\>}}> is set, then we
3536   can use <verbatim|<nf-ref|chunk-name|>> in listings. The sequence
3537   <verbatim|=\<less\>> was chosen because:
3538
3539   <\enumerate>
3540     <item>it is a better mnemonic than <verbatim|\<less\>\<less\>chunk-name\<gtr\>\<gtr\>>
3541     in that the <verbatim|=> sign signifies equivalence or substitutability.
3542
3543     <item>and because <verbatim|=\<less\>> is not valid in C or any language
3544     I can think of.
3545
3546     <item>and also because lstlistings doesn't like <verbatim|\<gtr\>\<gtr\>>
3547     as an end delimiter for the <em|texcl> escape, so we must make do with a
3548     single <verbatim|\<gtr\>> which is better complemented by
3549     <verbatim|=\<less\>> than by <verbatim|\<less\>\<less\>>.
3550   </enumerate>
3551
3552   Unfortunately the <verbatim|=\<less\>...\<gtr\>> that we use re-enters a
3553   <LaTeX> parsing mode in which some characters are special, e.g. <verbatim|#
3554   \\> and so these cause trouble if used in arguments to
3555   <verbatim|\\chunkref>. At some point I must fix the <LaTeX> command
3556   <verbatim|\\chunkref> so that it can accept these literally, but until
3557   then, when writing chunkref argumemts that need these characters, I must
3558   use the forms <verbatim|\\textbackslash{}> and <verbatim|\\#>; so I also
3559   define a hacky chunk <verbatim|delatex> to be used further on whose purpose
3560   it is to remove these from any arguments parsed by fangle.
3561
3562   <\nf-chunk|delatex>
3563     <item># FILTHY HACK
3564
3565     <item>gsub("\\\\\\\\#", "#", ${text});
3566
3567     <item>gsub("\\\\\\\\textbackslash{}", "\\\\", ${text});
3568
3569     <item>gsub("\\\\\\\\\\\\^", "^", ${text});
3570   </nf-chunk||<tuple|text>>
3571
3572   As each chunk line may contain more than one chunk include, we will split
3573   out chunk includes in an iterative fashion<\footnote>
3574     Contrary to our use of split when substituting parameters in chapter
3575     <reference|Here-we-split>
3576   </footnote>.
3577
3578   First, as long as the chunk contains a <verbatim|\\chunkref> command we
3579   take as much as we can up to the first <verbatim|\\chunkref> command.
3580
3581   <TeXmacs> text output uses <math|\<langle\>>...<math|\<rangle\>> which
3582   comes out as unicode sequences <verbatim|0xC2> <verbatim|0xAB> ...
3583   <verbatim|0xC2> <verbatim|0xBB>. Modern awk will interpret
3584   <verbatim|[^\\xC2\\xBB]> as a single unicode character if <verbatim|LANG>
3585   is set correctly to the sub-type <verbatim|UTF-8>, e.g.
3586   <verbatim|LANG=en_GB.UTF-8>, otherwise <verbatim|[^\\xC2\\xBB]> will be
3587   treated as a two character negated match <emdash> but this should not
3588   interfere with the function.
3589
3590   <\nf-chunk|process-chunk>
3591     <item>chunk = $0;
3592
3593     <item>indent = 0;
3594
3595     <item>while(match(chunk,"(\\xC2\\xAB)([^\\xC2\\xBB]*)
3596     [^\\xC2\\xBB]*\\xC2\\xBB", line) \|\|
3597
3598     <item> \ \ \ \ \ match(chunk,\
3599
3600     <item> \ \ \ \ \ \ \ \ \ \ \ "([=]\<less\>\\\\\\\\chunkref{([^}\<gtr\>]*)}(\\\\(.*\\\\)\|)\<gtr\>\|\<less\>\<less\>([a-zA-Z_][-a-zA-Z0-9_]*)\<gtr\>\<gtr\>)",\
3601
3602     <item> \ \ \ \ \ \ \ \ \ \ \ line)\\
3603
3604     <item>) {
3605
3606     <item> \ chunklet = substr(chunk, 1, RSTART - 1);
3607   </nf-chunk||>
3608
3609   We keep track of the indent count, by counting the number of literal
3610   characters found. We can then preserve this indent on each output line when
3611   multi-line chunks are expanded.
3612
3613   We then process this first part literal text, and set the chunk which is
3614   still to be processed to be the text after the <verbatim|\\chunkref>
3615   command, which we will process next as we continue around the loop.
3616
3617   <\nf-chunk|process-chunk>
3618     <item> \ indent += length(chunklet);
3619
3620     <item> \ chunk_line(active_chunk, chunklet);
3621
3622     <item> \ chunk = substr(chunk, RSTART + RLENGTH);
3623   </nf-chunk||>
3624
3625   We then consider the type of chunk command we have found, whether it is the
3626   fangle style command beginning with <verbatim|=\<less\>> the older notangle
3627   style beginning with <verbatim|\<less\>\<less\>>.
3628
3629   Fangle chunks may have parameters contained within square brackets. These
3630   will be matched in <verbatim|line[3]> and are considered at this stage of
3631   processing to be part of the name of the chunk to be included.
3632
3633   <\nf-chunk|process-chunk>
3634     <item> \ if (substr(line[1], 1, 1) == "=") {
3635
3636     <item> \ \ \ # chunk name up to }
3637
3638     <item> \ \ \ \ \ \ \ <nf-ref|delatex|<tuple|line[3]>>
3639
3640     <item> \ \ \ chunk_include(active_chunk, line[2] line[3], indent);
3641
3642     <item> \ } else if (substr(line[1], 1, 1) == "\<less\>") {
3643
3644     <item> \ \ \ chunk_include(active_chunk, line[4], indent);
3645
3646     <item> \ } else if (line[1] == "\\xC2\\xAB") {
3647
3648     <item> \ \ \ chunk_include(active_chunk, line[2], indent);
3649
3650     <item> \ } else {
3651
3652     <item> \ \ \ error("Unknown chunk fragment: " line[1]);
3653
3654     <item> \ }
3655   <|nf-chunk>
3656     \;
3657   </nf-chunk|>
3658
3659   The loop will continue until there are no more chunkref statements in the
3660   text, at which point we process the final part of the chunk.
3661
3662   <\nf-chunk|process-chunk>
3663     <item>}
3664
3665     <item>chunk_line(active_chunk, chunk);
3666   </nf-chunk||>
3667
3668   <label|lone-newline>We add the newline character as a chunklet on it's own,
3669   to make it easier to detect new lines and thus manage indentation when
3670   processing the output.
3671
3672   <\nf-chunk|process-chunk>
3673     <item>chunk_line(active_chunk, "\\n");
3674   <|nf-chunk>
3675     \;
3676   </nf-chunk|>
3677
3678   We will also permit a chunk-part number to follow in square brackets, so
3679   that <verbatim|<nf-ref|chunk-name[1]|>> will refer to the first part only.
3680   This can make it easy to include a C function prototype in a header file,
3681   if the first part of the chunk is just the function prototype without the
3682   trailing semi-colon. The header file would include the prototype with the
3683   trailing semi-colon, like this:
3684
3685   <verbatim|<nf-ref|chunk-name[1]|>>
3686
3687   This is handled in section <reference|sub:Chunk-parts>
3688
3689   We should perhaps introduce a notion of language specific chunk options; so
3690   that perhaps we could specify:
3691
3692   <verbatim|=\<less\>\\chunkref{chunk-name[function-declaration]}>
3693
3694   which applies a transform <verbatim|function-declaration> to the chunk ---
3695   which in this case would extract a function prototype from a function.
3696   <todo|Do it>
3697
3698   <chapter|Processing Options>
3699
3700   At the start, first we set the default options.
3701
3702   <\nf-chunk|default-options>
3703     <item>debug=0;
3704
3705     <item>linenos=0;
3706
3707     <item>notangle_mode=0;
3708
3709     <item>root="*";
3710
3711     <item>tabs = "";
3712   </nf-chunk||>
3713
3714   Then we use getopt the standard way, and null out ARGV afterwards in the
3715   normal AWK fashion.
3716
3717   <\nf-chunk|read-options>
3718     <item>Optind = 1 \ \ \ # skip ARGV[0]
3719
3720     <item>while(getopt(ARGC, ARGV, "R:LdT:hr")!=-1) {
3721
3722     <item> \ <nf-ref|handle-options|>
3723
3724     <item>}
3725
3726     <item>for (i=1; i\<less\>Optind; i++) { ARGV[i]=""; }
3727   </nf-chunk||>
3728
3729   This is how we handle our options:
3730
3731   <\nf-chunk|handle-options>
3732     <item>if (Optopt == "R") root = Optarg;
3733
3734     <item>else if (Optopt == "r") root="";
3735
3736     <item>else if (Optopt == "L") linenos = 1;
3737
3738     <item>else if (Optopt == "d") debug = 1;
3739
3740     <item>else if (Optopt == "T") tabs = indent_string(Optarg+0);
3741
3742     <item>else if (Optopt == "h") help();
3743
3744     <item>else if (Optopt == "?") help();
3745   </nf-chunk||>
3746
3747   We do all of this at the beginning of the program
3748
3749   <\nf-chunk|begin>
3750     <item>BEGIN {
3751
3752     <item> \ <nf-ref|constants|>
3753
3754     <item> \ <nf-ref|mode-definitions|>
3755
3756     <item> \ <nf-ref|default-options|>
3757
3758     <item>
3759
3760     <item> \ <nf-ref|read-options|>
3761
3762     <item>}
3763   </nf-chunk||>
3764
3765   And have a simple help function
3766
3767   <\nf-chunk|help()>
3768     <item>function help() {
3769
3770     <item> \ print "Usage:"
3771
3772     <item> \ print " \ fangle [-L] -R\<less\>rootname\<gtr\> [source.tex
3773     ...]"
3774
3775     <item> \ print " \ fangle -r [source.tex ...]"
3776
3777     <item> \ print " \ If the filename, source.tex is not specified then
3778     stdin is used"
3779
3780     <item> \ print
3781
3782     <item> \ print "-L causes the C statement: #line \<less\>lineno\<gtr\>
3783     \\"filename\\"" to be issued"
3784
3785     <item> \ print "-R causes the named root to be written to stdout"
3786
3787     <item> \ print "-r lists all roots in the file (even those used
3788     elsewhere)"
3789
3790     <item> \ exit 1;
3791
3792     <item>}
3793   </nf-chunk||>
3794
3795   <chapter|Generating the Output>
3796
3797   We generate output by calling output_chunk, or listing the chunk names.
3798
3799   <\nf-chunk|generate-output>
3800     <item>if (length(root)) output_chunk(root);
3801
3802     <item>else output_chunk_names();
3803   </nf-chunk||>
3804
3805   We also have some other output debugging:
3806
3807   <\nf-chunk|debug-output>
3808     <item>if (debug) {
3809
3810     <item> \ print "------ chunk names "
3811
3812     <item> \ output_chunk_names();
3813
3814     <item> \ print "====== chunks"
3815
3816     <item> \ output_chunks();
3817
3818     <item> \ print "++++++ debug"
3819
3820     <item> \ for (a in chunks) {
3821
3822     <item> \ \ \ print a "=" chunks[a];
3823
3824     <item> \ }
3825
3826     <item>}
3827   </nf-chunk||>
3828
3829   We do both of these at the end. We also set <verbatim|ORS=""> because each
3830   chunklet is not necessarily a complete line, and we already added
3831   <verbatim|ORS> to each input line in section
3832   <reference|sub:ORS-chunk-text>.
3833
3834   <\nf-chunk|end>
3835     <item>END {
3836
3837     <item> \ <nf-ref|debug-output|>
3838
3839     <item> \ ORS="";
3840
3841     <item> \ <nf-ref|generate-output|>
3842
3843     <item>}
3844   </nf-chunk||>
3845
3846   We write chunk names like this. If we seem to be running in notangle
3847   compatibility mode, then we enclose the name like this
3848   <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>> the same way notangle does:
3849
3850   <\nf-chunk|output_chunk_names()>
3851     <item>function output_chunk_names( \ \ c, prefix, suffix)\
3852
3853     <item>{
3854
3855     <item> \ if (notangle_mode) {
3856
3857     <item> \ \ \ prefix="\<less\>\<less\>";
3858
3859     <item> \ \ \ suffix="\<gtr\>\<gtr\>";
3860
3861     <item> \ }
3862
3863     <item> \ for (c in chunk_names) {
3864
3865     <item> \ \ \ print prefix c suffix "\\n";
3866
3867     <item> \ }
3868
3869     <item>}
3870   </nf-chunk||>
3871
3872   This function would write out all chunks
3873
3874   <\nf-chunk|output_chunks()>
3875     <item>function output_chunks( \ a)\
3876
3877     <item>{
3878
3879     <item> \ for (a in chunk_names) {
3880
3881     <item> \ \ \ output_chunk(a);
3882
3883     <item> \ }
3884
3885     <item>}
3886
3887     <item>
3888
3889     <item>function output_chunk(chunk) {
3890
3891     <item> \ newline = 1;
3892
3893     <item> \ lineno_needed = linenos;
3894
3895     <item>
3896
3897     <item> \ write_chunk(chunk);
3898
3899     <item>}
3900
3901     <item>
3902   </nf-chunk||>
3903
3904   <section|Assembling the Chunks>
3905
3906   <verbatim|chunk_path> holds a string consisting of the names of all the
3907   chunks that resulted in this chunk being output. It should probably also
3908   contain the source line numbers at which each inclusion also occured.
3909
3910   We first initialize the mode tracker for this chunk.
3911
3912   <\nf-chunk|write_chunk()>
3913     <item>function write_chunk(chunk_name) {
3914
3915     <item> \ <nf-ref|awk-delete-array|<tuple|context>>
3916
3917     <item> \ return write_chunk_r(chunk_name, context);
3918
3919     <item>}
3920
3921     <item>
3922
3923     <item>function write_chunk_r(chunk_name, context, indent, tail,
3924
3925     <item> \ # optional vars
3926
3927     <item> \ <with|font-shape|italic|chunk_path>, chunk_args,\
3928
3929     <item> \ # local vars
3930
3931     <item> \ context_origin,
3932
3933     <item> \ chunk_params, part, max_part, part_line, frag, max_frag, text,\
3934
3935     <item> \ chunklet, only_part, call_chunk_args, new_context)
3936
3937     <item>{
3938
3939     <item> \ if (debug) debug_log("write_chunk_r(" chunk_name ")");
3940   </nf-chunk|awk|>
3941
3942   <subsection|Chunk Parts><label|sub:Chunk-parts>
3943
3944   As mentioned in section <reference|sub:lstlistings-includes>, a chunk name
3945   may contain a part specifier in square brackets, limiting the parts that
3946   should be emitted.
3947
3948   <\nf-chunk|write_chunk()>
3949     <item> \ if (match(chunk_name, "^(.*)\\\\[([0-9]*)\\\\]$",
3950     chunk_name_parts)) {
3951
3952     <item> \ \ \ chunk_name = chunk_name_parts[1];
3953
3954     <item> \ \ \ only_part = chunk_name_parts[2];
3955
3956     <item> \ }
3957   </nf-chunk||>
3958
3959   We then create a mode tracker
3960
3961   <\nf-chunk|write_chunk()>
3962     <item> \ context_origin = push_mode_tracker(context, chunks[chunk_name,
3963     "language"], "");
3964   </nf-chunk||>
3965
3966   We extract into <verbatim|chunk_params> the names of the parameters that
3967   this chunk accepts, whose values were (optionally) passed in
3968   <verbatim|chunk_args>.
3969
3970   <\nf-chunk|write_chunk()>
3971     <item> \ split(chunks[chunk_name, "params"], chunk_params, " *; *");
3972   </nf-chunk||>
3973
3974   To assemble a chunk, we write out each part.
3975
3976   <\nf-chunk|write_chunk()>
3977     <item> \ if (! (chunk_name in chunk_names)) {
3978
3979     <item> \ \ \ error(sprintf(_"The root module
3980     \<less\>\<less\>%s\<gtr\>\<gtr\> was not defined.\\nUsed by: %s",\\
3981
3982     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ chunk_name, chunk_path));
3983
3984     <item> \ }
3985
3986     <item>
3987
3988     <item> \ max_part = chunks[chunk_name, "part"];
3989
3990     <item> \ for(part = 1; part \<less\>= max_part; part++) {
3991
3992     <item> \ \ \ if (! only_part \|\| part == only_part) {
3993
3994     <item> \ \ \ \ \ <nf-ref|write-part|>
3995
3996     <item> \ \ \ }
3997
3998     <item> \ }
3999
4000     <item> \ if (! pop_mode_tracker(context, context_origin)) {
4001
4002     <item> \ \ \ dump_mode_tracker(context);
4003
4004     <item> \ \ \ error(sprintf(_"Module %s did not close context
4005     properly.\\nUsed by: %s\\n", chunk_name, chunk_path));
4006
4007     <item> \ }
4008
4009     <item>}
4010   </nf-chunk||>
4011
4012   A part can either be a chunklet of lines, or an include of another chunk.
4013
4014   Chunks may also have parameters, specified in LaTeX style with braces after
4015   the chunk name --- looking like this in the document: chunkname{param1,
4016   param2}. Arguments are passed in square brackets:
4017   <verbatim|\\chunkref{chunkname}[arg1, arg2]>.
4018
4019   Before we process each part, we check that the source position hasn't
4020   changed unexpectedly, so that we can know if we need to output a new
4021   file-line directive.
4022
4023   <\nf-chunk|write-part>
4024     <item><nf-ref|check-source-jump|>
4025
4026     <item>
4027
4028     <item>chunklet = chunks[chunk_name, "part", part];
4029
4030     <item>if (chunks[chunk_name, "part", part, "type"] == part_type_chunk) {
4031
4032     <item> \ <nf-ref|write-included-chunk|>
4033
4034     <item>} else if (chunklet SUBSEP "line" in chunks) {
4035
4036     <item> \ <nf-ref|write-chunklets|>
4037
4038     <item>} else {
4039
4040     <item> \ # empty last chunklet
4041
4042     <item>}
4043   </nf-chunk||>
4044
4045   To write an included chunk, we must detect any optional chunk arguments in
4046   parenthesis. Then we recurse calling <verbatim|write_chunk()>.
4047
4048   <\nf-chunk|write-included-chunk>
4049     <item>if (match(chunklet, "^([^\\\\[\\\\(]*)\\\\((.*)\\\\)$",
4050     chunklet_parts)) {
4051
4052     <item> \ chunklet = chunklet_parts[1];
4053
4054     <item># hack
4055
4056     <item>gsub(sprintf("%c",11), "", chunklet);
4057
4058     <item>gsub(sprintf("%c",11), "", chunklet_parts[2]);
4059
4060     <item> \ parse_chunk_args("c-like", chunklet_parts[2], call_chunk_args,
4061     "(");
4062
4063     <item> \ for (c in call_chunk_args) {
4064
4065     <item> \ \ \ call_chunk_args[c] = expand_chunk_args(call_chunk_args[c],
4066     chunk_params, chunk_args);
4067
4068     <item> \ }
4069
4070     <item>} else {
4071
4072     <item> \ split("", call_chunk_args);
4073
4074     <item>}
4075
4076     <item>
4077
4078     <item>write_chunk_r(chunklet, context,
4079
4080     <item> \ \ \ \ \ \ \ \ \ \ \ chunks[chunk_name, "part", part, "indent"]
4081     indent,
4082
4083     <item> \ \ \ \ \ \ \ \ \ \ \ chunks[chunk_name, "part", part, "tail"],
4084
4085     <item> \ \ \ \ \ \ \ \ \ \ \ chunk_path "\\n \ \ \ \ \ \ \ \ "
4086     chunk_name,
4087
4088     <item> \ \ \ \ \ \ \ \ \ \ \ call_chunk_args);
4089   </nf-chunk||>
4090
4091   Before we output a chunklet of lines, we first emit the file and line
4092   number if we have one, and if it is safe to do so.
4093
4094   Chunklets are generally broken up by includes, so the start of a chunklet
4095   is a good place to do this. Then we output each line of the chunklet.
4096
4097   When it is not safe, such as in the middle of a multi-line macro
4098   definition, <verbatim|lineno_suppressed> is set to true, and in such a case
4099   we note that we want to emit the line statement when it is next safe.
4100
4101   <\nf-chunk|write-chunklets>
4102     <item>max_frag = chunks[chunklet, "line"];
4103
4104     <item>for(frag = 1; frag \<less\>= max_frag; frag++) {
4105
4106     <item> \ <nf-ref|write-file-line|>
4107   </nf-chunk||>
4108
4109   We then extract the chunklet text and expand any arguments.
4110
4111   <\nf-chunk|write-chunklets>
4112     <item>
4113
4114     <item> \ text = chunks[chunklet, frag];
4115
4116     <item>\
4117
4118     <item> \ /* check params */
4119
4120     <item> \ text = expand_chunk_args(text, chunk_params, chunk_args);
4121   </nf-chunk||>
4122
4123   If the text is a single newline (which we keep separate - see
4124   <reference|lone-newline>) then we increment the line number. In the case
4125   where this is the last line of a chunk and it is not a top-level chunk we
4126   replace the newline with an empty string --- because the chunk that
4127   included this chunk will have the newline at the end of the line that
4128   included this chunk.
4129
4130   We also note by <verbatim|newline = 1> that we have started a new line, so
4131   that indentation can be managed with the following piece of text.
4132
4133   <\nf-chunk|write-chunklets>
4134     <item>
4135
4136     <item> if (text == "\\n") {
4137
4138     <item> \ \ \ lineno++;
4139
4140     <item> \ \ \ if (part == max_part && frag == max_frag &&
4141     length(chunk_path)) {
4142
4143     <item> \ \ \ \ \ text = "";
4144
4145     <item> \ \ \ \ \ break;
4146
4147     <item> \ \ \ } else {
4148
4149     <item> \ \ \ \ \ newline = 1;
4150
4151     <item> \ \ \ }
4152   </nf-chunk||>
4153
4154   If this text does not represent a newline, but we see that we are the first
4155   piece of text on a newline, then we prefix our text with the current
4156   indent.\
4157
4158   <\note>
4159     <verbatim|newline> is a global output-state variable, but the
4160     <verbatim|indent> is not.
4161   </note>
4162
4163   <\nf-chunk|write-chunklets>
4164     <item> \ } else if (length(text) \|\| length(tail)) {
4165
4166     <item> \ \ \ if (newline) text = indent text;
4167
4168     <item> \ \ \ newline = 0;
4169
4170     <item> \ }
4171
4172     <item>
4173   </nf-chunk||>
4174
4175   Tail will soon no longer be relevant once mode-detection is in place.
4176
4177   <\nf-chunk|write-chunklets>
4178     <item> \ text = text tail;
4179
4180     <item> \ mode_tracker(context, text);
4181
4182     <item> \ print untab(transform_escape(context, text, context_origin));
4183   </nf-chunk||>
4184
4185   If a line ends in a backslash --- suggesting continuation --- then we
4186   supress outputting file-line as it would probably break the continued
4187   lines.
4188
4189   <\nf-chunk|write-chunklets>
4190     <item> \ if (linenos) {
4191
4192     <item> \ \ \ lineno_suppressed = substr(lastline, length(lastline)) ==
4193     "\\\\";
4194
4195     <item> \ }
4196
4197     <item>}
4198   </nf-chunk||>
4199
4200   Of course there is no point in actually outputting the source filename and
4201   line number (file-line) if they don't say anything new! We only need to
4202   emit them if they aren't what is expected, or if we we not able to emit one
4203   when they had changed.
4204
4205   <\nf-chunk|write-file-line>
4206     <item>if (newline && lineno_needed && ! lineno_suppressed) {
4207
4208     <item> \ filename = a_filename;
4209
4210     <item> \ lineno = a_lineno;
4211
4212     <item> \ print "#line " lineno " \\"" filename "\\"\\n"
4213
4214     <item> \ lineno_needed = 0;
4215
4216     <item>}
4217   </nf-chunk||>
4218
4219   We check if a new file-line is needed by checking if the source line
4220   matches what we (or a compiler) would expect.
4221
4222   <\nf-chunk|check-source-jump>
4223     <item>if (linenos && (chunk_name SUBSEP "part" SUBSEP part SUBSEP
4224     "FILENAME" in chunks)) {
4225
4226     <item> \ a_filename = chunks[chunk_name, "part", part, "FILENAME"];
4227
4228     <item> \ a_lineno = chunks[chunk_name, "part", part, "LINENO"];
4229
4230     <item> \ if (a_filename != filename \|\| a_lineno != lineno) {
4231
4232     <item> \ \ \ lineno_needed++;
4233
4234     <item> \ }
4235
4236     <item>}
4237   </nf-chunk||>
4238
4239   <chapter|Storing Chunks>
4240
4241   Awk has pretty limited data structures, so we will use two main hashes.
4242   Uninterrupted sequences of a chunk will be stored in chunklets and the
4243   chunklets used in a chunk will be stored in <verbatim|chunks>.
4244
4245   <\nf-chunk|constants>
4246     <item>part_type_chunk=1;
4247
4248     <item>SUBSEP=",";
4249   </nf-chunk||>
4250
4251   The params mentioned are not chunk parameters for parameterized chunks, as
4252   mentioned in <reference|Chunk Arguments>, but the lstlistings style
4253   parameters used in the <verbatim|\\Chunk> command<\footnote>
4254     The <verbatim|params> parameter is used to hold the parameters for
4255     parameterized chunks
4256   </footnote>.
4257
4258   <\nf-chunk|chunk-storage-functions>
4259     <item>function new_chunk(chunk_name, opts, args,
4260
4261     <item> \ # local vars
4262
4263     <item> \ p, append )
4264
4265     <item>{
4266
4267     <item> \ # HACK WHILE WE CHANGE TO ( ) for PARAM CHUNKS
4268
4269     <item> \ gsub("\\\\(\\\\)$", "", chunk_name);
4270
4271     <item> \ if (! (chunk_name in chunk_names)) {
4272
4273     <item> \ \ \ if (debug) print "New chunk " chunk_name;
4274
4275     <item> \ \ \ chunk_names[chunk_name];
4276
4277     <item> \ \ \ for (p in opts) {
4278
4279     <item> \ \ \ \ \ chunks[chunk_name, p] = opts[p];
4280
4281     <item> \ \ \ \ \ if (debug) print "chunks[" chunk_name "," p "] = "
4282     opts[p];
4283
4284     <item> \ \ \ }
4285
4286     <item> \ \ \ for (p in args) {
4287
4288     <item> \ \ \ \ \ chunks[chunk_name, "params", p] = args[p];
4289
4290     <item> \ \ \ }
4291
4292     <item> \ \ \ if ("append" in opts) {
4293
4294     <item> \ \ \ \ \ append=opts["append"];
4295
4296     <item> \ \ \ \ \ if (! (append in chunk_names)) {
4297
4298     <item> \ \ \ \ \ \ \ warning("Chunk " chunk_name " is appended to chunk "
4299     append " which is not defined yet");
4300
4301     <item> \ \ \ \ \ \ \ new_chunk(append);
4302
4303     <item> \ \ \ \ \ }
4304
4305     <item> \ \ \ \ \ chunk_include(append, chunk_name);
4306
4307     <item> \ \ \ \ \ chunk_line(append, ORS);
4308
4309     <item> \ \ \ }
4310
4311     <item> \ }
4312
4313     <item> \ active_chunk = chunk_name;
4314
4315     <item> \ prime_chunk(chunk_name);
4316
4317     <item>}
4318   </nf-chunk||>
4319
4320   <\nf-chunk|chunk-storage-functions>
4321     <item>
4322
4323     <item>function prime_chunk(chunk_name)
4324
4325     <item>{
4326
4327     <item> \ chunks[chunk_name, "part", ++chunks[chunk_name, "part"] ] = \\
4328
4329     <item> \ \ \ \ \ \ \ \ chunk_name SUBSEP "chunklet" SUBSEP ""
4330     ++chunks[chunk_name, "chunklet"];
4331
4332     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"],
4333     "FILENAME"] = FILENAME;
4334
4335     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "LINENO"]
4336     = FNR + 1;
4337
4338     <item>}
4339
4340     <item>
4341
4342     <item>function chunk_line(chunk_name, line){
4343
4344     <item> \ chunks[chunk_name, "chunklet", chunks[chunk_name, "chunklet"],
4345
4346     <item> \ \ \ \ \ \ \ \ ++chunks[chunk_name, "chunklet",
4347     chunks[chunk_name, "chunklet"], "line"] \ ] = line;
4348
4349     <item>}
4350
4351     <item>
4352   </nf-chunk||>
4353
4354   Chunk include represents a <em|chunkref> statement, and stores the
4355   requirement to include another chunk. The parameter indent represents the
4356   quanity of literal text characters that preceded this <em|chunkref>
4357   statement and therefore by how much additional lines of the included chunk
4358   should be indented.
4359
4360   <\nf-chunk|chunk-storage-functions>
4361     <item>function chunk_include(chunk_name, chunk_ref, indent, tail)
4362
4363     <item>{
4364
4365     <item> \ chunks[chunk_name, "part", ++chunks[chunk_name, "part"] ] =
4366     chunk_ref;
4367
4368     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "type" ]
4369     = part_type_chunk;
4370
4371     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "indent"
4372     ] = indent_string(indent);
4373
4374     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "tail" ]
4375     = tail;
4376
4377     <item> \ prime_chunk(chunk_name);
4378
4379     <item>}
4380
4381     <item>
4382   </nf-chunk||>
4383
4384   The indent is calculated by indent_string, which may in future convert some
4385   spaces into tab characters. This function works by generating a printf
4386   padded format string, like <verbatim|%22s> for an indent of 22, and then
4387   printing an empty string using that format.
4388
4389   <\nf-chunk|chunk-storage-functions>
4390     <item>function indent_string(indent) {
4391
4392     <item> \ return sprintf("%" indent "s", "");
4393
4394     <item>}
4395   </nf-chunk||>
4396
4397   <chapter|getopt><label|cha:getopt>
4398
4399   I use Arnold Robbins public domain getopt (1993 revision). This is probably
4400   the same one that is covered in chapter 12 of “Edition 3 of GAWK:
4401   Effective AWK Programming: A User's Guide for GNU Awk” but as that is
4402   licensed under the GNU Free Documentation License, Version 1.3, which
4403   conflicts with the GPL3, I can't use it from there (or it's accompanying
4404   explanations), so I do my best to explain how it works here.
4405
4406   The getopt.awk header is:
4407
4408   <\nf-chunk|getopt.awk-header>
4409     <item># getopt.awk --- do C library getopt(3) function in awk
4410
4411     <item>#
4412
4413     <item># Arnold Robbins, arnold@skeeve.com, Public Domain
4414
4415     <item>#
4416
4417     <item># Initial version: March, 1991
4418
4419     <item># Revised: May, 1993
4420
4421     <item>
4422   </nf-chunk||>
4423
4424   The provided explanation is:
4425
4426   <\nf-chunk|getopt.awk-notes>
4427     <item># External variables:
4428
4429     <item># \ \ \ Optind -- index in ARGV of first nonoption argument
4430
4431     <item># \ \ \ Optarg -- string value of argument to current option
4432
4433     <item># \ \ \ Opterr -- if nonzero, print our own diagnostic
4434
4435     <item># \ \ \ Optopt -- current option letter
4436
4437     <item>
4438
4439     <item># Returns:
4440
4441     <item># \ \ \ -1 \ \ \ \ at end of options
4442
4443     <item># \ \ \ ? \ \ \ \ \ for unrecognized option
4444
4445     <item># \ \ \ \<less\>c\<gtr\> \ \ \ a character representing the current
4446     option
4447
4448     <item>
4449
4450     <item># Private Data:
4451
4452     <item># \ \ \ _opti \ -- index in multi-flag option, e.g., -abc
4453
4454     <item>
4455   </nf-chunk||>
4456
4457   The function follows. The final two parameters, <verbatim|thisopt> and
4458   <verbatim|i> are local variables and not parameters --- as indicated by the
4459   multiple spaces preceding them. Awk doesn't care, the multiple spaces are a
4460   convention to help us humans.
4461
4462   <\nf-chunk|getopt.awk-getopt()>
4463     <item>function getopt(argc, argv, options, \ \ \ thisopt, i)
4464
4465     <item>{
4466
4467     <item> \ \ \ if (length(options) == 0) \ \ \ # no options given
4468
4469     <item> \ \ \ \ \ \ \ return -1
4470
4471     <item> \ \ \ if (argv[Optind] == "--") { \ # all done
4472
4473     <item> \ \ \ \ \ \ \ Optind++
4474
4475     <item> \ \ \ \ \ \ \ _opti = 0
4476
4477     <item> \ \ \ \ \ \ \ return -1
4478
4479     <item> \ \ \ } else if (argv[Optind] !~ /^-[^: \\t\\n\\f\\r\\v\\b]/) {
4480
4481     <item> \ \ \ \ \ \ \ _opti = 0
4482
4483     <item> \ \ \ \ \ \ \ return -1
4484
4485     <item> \ \ \ }
4486
4487     <item> \ \ \ if (_opti == 0)
4488
4489     <item> \ \ \ \ \ \ \ _opti = 2
4490
4491     <item> \ \ \ thisopt = substr(argv[Optind], _opti, 1)
4492
4493     <item> \ \ \ Optopt = thisopt
4494
4495     <item> \ \ \ i = index(options, thisopt)
4496
4497     <item> \ \ \ if (i == 0) {
4498
4499     <item> \ \ \ \ \ \ \ if (Opterr)
4500
4501     <item> \ \ \ \ \ \ \ \ \ \ \ printf("%c -- invalid option\\n",
4502
4503     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ thisopt)
4504     \<gtr\> "/dev/stderr"
4505
4506     <item> \ \ \ \ \ \ \ if (_opti \<gtr\>= length(argv[Optind])) {
4507
4508     <item> \ \ \ \ \ \ \ \ \ \ \ Optind++
4509
4510     <item> \ \ \ \ \ \ \ \ \ \ \ _opti = 0
4511
4512     <item> \ \ \ \ \ \ \ } else
4513
4514     <item> \ \ \ \ \ \ \ \ \ \ \ _opti++
4515
4516     <item> \ \ \ \ \ \ \ return "?"
4517
4518     <item> \ \ \ }
4519   </nf-chunk||>
4520
4521   At this point, the option has been found and we need to know if it takes
4522   any arguments.
4523
4524   <\nf-chunk|getopt.awk-getopt()>
4525     <item> \ \ \ if (substr(options, i + 1, 1) == ":") {
4526
4527     <item> \ \ \ \ \ \ \ # get option argument
4528
4529     <item> \ \ \ \ \ \ \ if (length(substr(argv[Optind], _opti + 1)) \<gtr\>
4530     0)
4531
4532     <item> \ \ \ \ \ \ \ \ \ \ \ Optarg = substr(argv[Optind], _opti + 1)
4533
4534     <item> \ \ \ \ \ \ \ else
4535
4536     <item> \ \ \ \ \ \ \ \ \ \ \ Optarg = argv[++Optind]
4537
4538     <item> \ \ \ \ \ \ \ _opti = 0
4539
4540     <item> \ \ \ } else
4541
4542     <item> \ \ \ \ \ \ \ Optarg = ""
4543
4544     <item> \ \ \ if (_opti == 0 \|\| _opti \<gtr\>= length(argv[Optind])) {
4545
4546     <item> \ \ \ \ \ \ \ Optind++
4547
4548     <item> \ \ \ \ \ \ \ _opti = 0
4549
4550     <item> \ \ \ } else
4551
4552     <item> \ \ \ \ \ \ \ _opti++
4553
4554     <item> \ \ \ return thisopt
4555
4556     <item>}
4557   </nf-chunk||>
4558
4559   A test program is built in, too
4560
4561   <\nf-chunk|getopt.awk-begin>
4562     <item>BEGIN {
4563
4564     <item> \ \ \ Opterr = 1 \ \ \ # default is to diagnose
4565
4566     <item> \ \ \ Optind = 1 \ \ \ # skip ARGV[0]
4567
4568     <item> \ \ \ # test program
4569
4570     <item> \ \ \ if (_getopt_test) {
4571
4572     <item> \ \ \ \ \ \ \ while ((_go_c = getopt(ARGC, ARGV, "ab:cd")) != -1)
4573
4574     <item> \ \ \ \ \ \ \ \ \ \ \ printf("c = \<less\>%c\<gtr\>, optarg =
4575     \<less\>%s\<gtr\>\\n",
4576
4577     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ _go_c,
4578     Optarg)
4579
4580     <item> \ \ \ \ \ \ \ printf("non-option arguments:\\n")
4581
4582     <item> \ \ \ \ \ \ \ for (; Optind \<less\> ARGC; Optind++)
4583
4584     <item> \ \ \ \ \ \ \ \ \ \ \ printf("\\tARGV[%d] = \<less\>%s\<gtr\>\\n",
4585
4586     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Optind,
4587     ARGV[Optind])
4588
4589     <item> \ \ \ }
4590
4591     <item>}
4592   </nf-chunk||>
4593
4594   The entire getopt.awk is made out of these chunks in order
4595
4596   <\nf-chunk|getopt.awk>
4597     <item><nf-ref|getopt.awk-header|>
4598
4599     <item>
4600
4601     <item><nf-ref|getopt.awk-notes|>
4602
4603     <item><nf-ref|getopt.awk-getopt()|>
4604
4605     <item><nf-ref|getopt.awk-begin|>
4606   </nf-chunk||>
4607
4608   Although we only want the header and function:
4609
4610   <\nf-chunk|getopt>
4611     <item># try: locate getopt.awk for the full original file
4612
4613     <item># as part of your standard awk installation
4614
4615     <item><nf-ref|getopt.awk-header|>
4616
4617     <item>
4618
4619     <item><nf-ref|getopt.awk-getopt()|>
4620   </nf-chunk||>
4621
4622   <chapter|Fangle LaTeX source code><label|latex-source>
4623
4624   <section|fangle module>
4625
4626   Here we define a <LyX> <verbatim|.module> file that makes it convenient to
4627   use <LyX> for writing such literate programs.
4628
4629   This file <verbatim|./fangle.module> can be installed in your personal
4630   <verbatim|.lyx/layouts> folder. You will need to Tools Reconfigure so that
4631   <LyX> notices it. It adds a new format Chunk, which should precede every
4632   listing and contain the chunk name.
4633
4634   <\nf-chunk|./fangle.module>
4635     <item>#\\DeclareLyXModule{Fangle Literate Listings}
4636
4637     <item>#DescriptionBegin
4638
4639     <item># \ Fangle literate listings allow one to write
4640
4641     <item># \ \ literate programs after the fashion of noweb, but without
4642     having
4643
4644     <item># \ \ to use noweave to generate the documentation. Instead the
4645     listings
4646
4647     <item># \ \ package is extended in conjunction with the noweb package to
4648     implement
4649
4650     <item># \ \ to code formating directly as latex.
4651
4652     <item># \ The fangle awk script
4653
4654     <item>#DescriptionEnd
4655
4656     <item>
4657
4658     <item><nf-ref|gpl3-copyright.hashed|>
4659
4660     <item>
4661
4662     <item>Format 11
4663
4664     <item>
4665
4666     <item>AddToPreamble
4667
4668     <item><nf-ref|./fangle.sty|>
4669
4670     <item>EndPreamble
4671
4672     <item>
4673
4674     <item><nf-ref|chunkstyle|>
4675
4676     <item>
4677
4678     <item><nf-ref|chunkref|>
4679   </nf-chunk|lyx-module|>
4680
4681   Because <LyX> modules are not yet a language supported by fangle or
4682   lstlistings, we resort to this fake awk chunk below in order to have each
4683   line of the GPL3 license commence with a #
4684
4685   <\nf-chunk|gpl3-copyright.hashed>
4686     <item>#<nf-ref|gpl3-copyright|>
4687
4688     <item>
4689   </nf-chunk|awk|>
4690
4691   <subsection|The Chunk style>
4692
4693   The purpose of the <name|chunk> style is to make it easier for <LyX> users
4694   to provide the name to <verbatim|lstlistings>. Normally this requires
4695   right-clicking on the listing, choosing settings, advanced, and then typing
4696   <verbatim|name=chunk-name>. This has the further disadvantage that the name
4697   (and other options) are not generally visible during document editing.
4698
4699   The chunk style is defined as a <LaTeX> command, so that all text on the
4700   same line is passed to the <verbatim|LaTeX> command <verbatim|Chunk>. This
4701   makes it easy to parse using <verbatim|fangle>, and easy to pass these
4702   options on to the listings package. The first word in a chunk section
4703   should be the chunk name, and will have <verbatim|name=> prepended to it.
4704   Any other words are accepted arguments to <verbatim|lstset>.
4705
4706   We set PassThru to 1 because the user is actually entering raw latex.
4707
4708   <\nf-chunk|chunkstyle>
4709     <item>Style Chunk
4710
4711     <item> \ LatexType \ \ \ \ \ \ \ \ \ \ \ \ Command
4712
4713     <item> \ LatexName \ \ \ \ \ \ \ \ \ \ \ \ Chunk
4714
4715     <item> \ Margin \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ First_Dynamic
4716
4717     <item> \ LeftMargin \ \ \ \ \ \ \ \ \ \ \ Chunk:xxx
4718
4719     <item> \ LabelSep \ \ \ \ \ \ \ \ \ \ \ \ \ xx
4720
4721     <item> \ LabelType \ \ \ \ \ \ \ \ \ \ \ \ Static
4722
4723     <item> \ LabelString \ \ \ \ \ \ \ \ \ \ "Chunk:"
4724
4725     <item> \ Align \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Left
4726
4727     <item> \ PassThru \ \ \ \ \ \ \ \ \ \ \ \ \ 1
4728
4729     <item>
4730   </nf-chunk||>
4731
4732   To make the label very visible we choose a larger font coloured red.
4733
4734   <\nf-chunk|chunkstyle>
4735     <item> \ LabelFont
4736
4737     <item> \ \ \ Family \ \ \ \ \ \ \ \ \ \ \ \ \ Sans
4738
4739     <item> \ \ \ Size \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Large
4740
4741     <item> \ \ \ Series \ \ \ \ \ \ \ \ \ \ \ \ \ Bold
4742
4743     <item> \ \ \ Shape \ \ \ \ \ \ \ \ \ \ \ \ \ \ Italic
4744
4745     <item> \ \ \ Color \ \ \ \ \ \ \ \ \ \ \ \ \ \ red
4746
4747     <item> \ EndFont
4748
4749     <item>End
4750   </nf-chunk||>
4751
4752   <subsection|The chunkref style>
4753
4754   We also define the Chunkref style which can be used to express cross
4755   references to chunks.
4756
4757   <\nf-chunk|chunkref>
4758     <item>InsetLayout Chunkref
4759
4760     <item> \ LyxType \ \ \ \ \ \ \ \ \ \ \ \ \ \ charstyle
4761
4762     <item> \ LatexType \ \ \ \ \ \ \ \ \ \ \ \ Command
4763
4764     <item> \ LatexName \ \ \ \ \ \ \ \ \ \ \ \ chunkref
4765
4766     <item> \ PassThru \ \ \ \ \ \ \ \ \ \ \ \ \ 1
4767
4768     <item> \ LabelFont \ \ \ \ \ \ \ \ \ \ \ \
4769
4770     <item> \ \ \ Shape \ \ \ \ \ \ \ \ \ \ \ \ \ \ Italic
4771
4772     <item> \ \ \ Color \ \ \ \ \ \ \ \ \ \ \ \ \ \ red
4773
4774     <item> \ EndFont
4775
4776     <item>End
4777   </nf-chunk||>
4778
4779   <section|Latex Macros><label|sec:Latex-Macros>
4780
4781   We require the listings, noweb and xargs packages. As noweb defines it's
4782   own <verbatim|\\code> environment, we re-define the one that <LyX> logical
4783   markup module expects here.
4784
4785   <\nf-chunk|./fangle.sty>
4786     <item>\\usepackage{listings}%
4787
4788     <item>\\usepackage{noweb}%
4789
4790     <item>\\usepackage{xargs}%
4791
4792     <item>\\renewcommand{\\code}[1]{\\texttt{#1}}%
4793   </nf-chunk|tex|>
4794
4795   We also define a <verbatim|CChunk> macro, for use as:
4796   <verbatim|\\begin{CChunk}> which will need renaming to
4797   <verbatim|\\begin{Chunk}> when I can do this without clashing with
4798   <verbatim|\\Chunk>.
4799
4800   <\nf-chunk|./fangle.sty>
4801     <item>\\lstnewenvironment{Chunk}{\\relax}{\\relax}%
4802   </nf-chunk||>
4803
4804   We also define a suitable <verbatim|\\lstset> of parameters that suit the
4805   literate programming style after the fashion of <name|noweave>.
4806
4807   <\nf-chunk|./fangle.sty>
4808     <item>\\lstset{numbers=left, stepnumber=5, numbersep=5pt,
4809
4810     <item> \ \ \ \ \ \ \ breaklines=false,basicstyle=\\ttfamily,
4811
4812     <item> \ \ \ \ \ \ \ numberstyle=\\tiny, language=C}%
4813   </nf-chunk||>
4814
4815   We also define a notangle-like mechanism for escaping to <LaTeX> from the
4816   listing, and by which we can refer to other listings. We declare the
4817   <verbatim|=\<less\>...\<gtr\>> sequence to contain <LaTeX> code, and
4818   include another like this chunk: <verbatim|<nf-ref|chunkname|>>. However,
4819   because <verbatim|=\<less\>...\<gtr\>> is already defined to contain
4820   <LaTeX> code for this document --- this is a fangle document after all ---
4821   the code fragment below effectively contains the <LaTeX> code:
4822   <verbatim|}{>. To avoid problems with document generation, I had to declare
4823   an lstlistings property: <verbatim|escapeinside={}> for this listing only;
4824   which in <LyX> was done by right-clicking the listings inset, choosing
4825   settings-\<gtr\>advanced. Therefore <verbatim|=\<less\>> isn't interpreted
4826   literally here, in a listing when the escape sequence is already defined as
4827   shown... we need to somehow escape this representation...
4828
4829   <\nf-chunk|./fangle.sty>
4830     <item>\\lstset{escapeinside={=\<less\>}{\<gtr\>}}%
4831   </nf-chunk||>
4832
4833   Although our macros will contain the <verbatim|@> symbol, they will be
4834   included in a <verbatim|\\makeatletter> section by <LyX>; however we keep
4835   the commented out <verbatim|\\makeatletter> as a reminder. The listings
4836   package likes to centre the titles, but noweb titles are specially
4837   formatted and must be left aligned. The simplest way to do this turned out
4838   to be by removing the definition of <verbatim|\\lst@maketitle>. This may
4839   interact badly if other listings want a regular title or caption. We
4840   remember the old maketitle in case we need it.
4841
4842   <\nf-chunk|./fangle.sty>
4843     <item>%\\makeatletter
4844
4845     <item>%somehow re-defining maketitle gives us a left-aligned title
4846
4847     <item>%which is extactly what our specially formatted title needs!
4848
4849     <item>\\global\\let\\fangle@lst@maketitle\\lst@maketitle%
4850
4851     <item>\\global\\def\\lst@maketitle{}%
4852   </nf-chunk||>
4853
4854   <subsection|The chunk command><label|sub:The-chunk-command>
4855
4856   Our chunk command accepts one argument, and calls <verbatim|\\ltset>.
4857   Although <verbatim|\\ltset> will note the name, this is erased when the
4858   next <verbatim|\\lstlisting> starts, so we make a note of this in
4859   <verbatim|\\lst@chunkname> and restore in in lstlistings Init hook.
4860
4861   <\nf-chunk|./fangle.sty>
4862     <item>\\def\\Chunk#1{%
4863
4864     <item> \ \\lstset{title={\\fanglecaption},name=#1}%
4865
4866     <item> \ \\global\\edef\\lst@chunkname{\\lst@intname}%
4867
4868     <item>}%
4869
4870     <item>\\def\\lst@chunkname{\\empty}%
4871   </nf-chunk||>
4872
4873   <subsubsection|Chunk parameters>
4874
4875   Fangle permits parameterized chunks, and requires the paramters to be
4876   specified as listings options. The fangle script uses this, and although we
4877   don't do anything with these in the <LaTeX> code right now, we need to stop
4878   the listings package complaining.
4879
4880   <\nf-chunk|./fangle.sty>
4881     <item>\\lst@Key{params}\\relax{\\def\\fangle@chunk@params{#1}}%
4882   </nf-chunk||>
4883
4884   As it is common to define a chunk which then needs appending to another
4885   chunk, and annoying to have to declare a single line chunk to manage the
4886   include, we support an append= option.
4887
4888   <\nf-chunk|./fangle.sty>
4889     <item>\\lst@Key{append}\\relax{\\def\\fangle@chunk@append{#1}}%
4890   </nf-chunk||>
4891
4892   <subsection|The noweb styled caption>
4893
4894   We define a public macro <verbatim|\\fanglecaption> which can be set as a
4895   regular title. By means of <verbatim|\\protect>, It expands to
4896   <verbatim|\\fangle@caption> at the appopriate time when the caption is
4897   emitted.
4898
4899   <nf-chunk|./fangle.sty|\\def\\fanglecaption{\\protect\\fangle@caption}%||>
4900
4901   <\big-figure>
4902     22c <math|\<langle\>>some-chunk 19b<math|\<rangle\>><math|\<equiv\>>+
4903     \ \ <math|\<vartriangleleft\>>22b 24d<math|\<vartriangleright\>>
4904
4905     \;
4906
4907     In this example, the current chunk is 22c, and therefore the third chunk
4908     on page 22.
4909
4910     It's name is some-chunk.\
4911
4912     The first chunk with this name (19b) occurs as the second chunk on page
4913     19.
4914
4915     The previous chunk (22d) with the same name is the second chunk on page
4916     22.
4917
4918     The next chunk (24d) is the fourth chunk on page 24.
4919   </big-figure|Noweb Heading<label|noweb heading>>
4920
4921   The general noweb output format compactly identifies the current chunk, and
4922   references to the first chunk, and the previous and next chunks that have
4923   the same name.
4924
4925   This means that we need to keep a counter for each chunk-name, that we use
4926   to count chunks of the same name.
4927
4928   <subsection|The chunk counter>
4929
4930   It would be natural to have a counter for each chunk name, but TeX would
4931   soon run out of counters<\footnote>
4932     ...soon did run out of counters and so I had to re-write the LaTeX macros
4933     to share a counter as described here.
4934   </footnote>, so we have one counter which we save at the end of a chunk and
4935   restore at the beginning of a chunk.
4936
4937   <\nf-chunk|./fangle.sty>
4938     <item>\\newcounter{fangle@chunkcounter}%
4939   </nf-chunk||>
4940
4941   We construct the name of this variable to store the counter to be the text
4942   <verbatim|lst-chunk-> prefixed onto the chunks own name, and store it in
4943   <verbatim|\\chunkcount>.\
4944
4945   We save the counter like this:
4946
4947   <nf-chunk|save-counter|\\global\\expandafter\\edef\\csname
4948   \\chunkcount\\endcsname{\\arabic{fangle@chunkcounter}}%||>
4949
4950   and restore the counter like this:
4951
4952   <nf-chunk|restore-counter|\\setcounter{fangle@chunkcounter}{\\csname
4953   \\chunkcount\\endcsname}%||>
4954
4955   If there does not already exist a variable whose name is stored in
4956   <verbatim|\\chunkcount>, then we know we are the first chunk with this
4957   name, and then define a counter.\
4958
4959   Although chunks of the same name share a common counter, they must still be
4960   distinguished. We use is the internal name of the listing, suffixed by the
4961   counter value. So the first chunk might be <verbatim|something-1> and the
4962   second chunk be <verbatim|something-2>, etc.
4963
4964   We also calculate the name of the previous chunk if we can (before we
4965   increment the chunk counter). If this is the first chunk of that name, then
4966   <verbatim|\\prevchunkname> is set to <verbatim|\\relax> which the noweb
4967   package will interpret as not existing.
4968
4969   <\nf-chunk|./fangle.sty>
4970     <item>\\def\\fangle@caption{%
4971
4972     <item> \ \\edef\\chunkcount{lst-chunk-\\lst@intname}%
4973
4974     <item> \ \\@ifundefined{\\chunkcount}{%
4975
4976     <item> \ \ \ \\expandafter\\gdef\\csname \\chunkcount\\endcsname{0}%
4977
4978     <item> \ \ \ \\setcounter{fangle@chunkcounter}{\\csname
4979     \\chunkcount\\endcsname}%
4980
4981     <item> \ \ \ \\let\\prevchunkname\\relax%
4982
4983     <item> \ }{%
4984
4985     <item> \ \ \ \\setcounter{fangle@chunkcounter}{\\csname
4986     \\chunkcount\\endcsname}%
4987
4988     <item> \ \ \ \\edef\\prevchunkname{\\lst@intname-\\arabic{fangle@chunkcounter}}%
4989
4990     <item> \ }%
4991   </nf-chunk||>
4992
4993   After incrementing the chunk counter, we then define the name of this
4994   chunk, as well as the name of the first chunk.
4995
4996   <\nf-chunk|./fangle.sty>
4997     <item> \ \\addtocounter{fangle@chunkcounter}{1}%
4998
4999     <item> \ \\global\\expandafter\\edef\\csname
5000     \\chunkcount\\endcsname{\\arabic{fangle@chunkcounter}}%
5001
5002     <item> \ \\edef\\chunkname{\\lst@intname-\\arabic{fangle@chunkcounter}}%
5003
5004     <item> \ \\edef\\firstchunkname{\\lst@intname-1}%
5005   </nf-chunk||>
5006
5007   We now need to calculate the name of the next chunk. We do this by
5008   temporarily skipping the counter on by one; however there may not actually
5009   be another chunk with this name! We detect this by also defining a label
5010   for each chunk based on the chunkname. If there is a next chunkname then it
5011   will define a label with that name. As labels are persistent, we can at
5012   least tell the second time <LaTeX> is run. If we don't find such a defined
5013   label then we define <verbatim|\\nextchunkname> to <verbatim|\\relax>.
5014
5015   <\nf-chunk|./fangle.sty>
5016     <item> \ \\addtocounter{fangle@chunkcounter}{1}%
5017
5018     <item> \ \\edef\\nextchunkname{\\lst@intname-\\arabic{fangle@chunkcounter}}%
5019
5020     <item> \ \\@ifundefined{r@label-\\nextchunkname}{\\let\\nextchunkname\\relax}{}%
5021   </nf-chunk||>
5022
5023   The noweb package requires that we define a <verbatim|\\sublabel> for every
5024   chunk, with a unique name, which is then used to print out it's navigation
5025   hints.
5026
5027   We also define a regular label for this chunk, as was mentioned above when
5028   we calculated <verbatim|\\nextchunkname>. This requires <LaTeX> to be run
5029   at least twice after new chunk sections are added --- but noweb requried
5030   that anyway.
5031
5032   <\nf-chunk|./fangle.sty>
5033     <item> \ \\sublabel{\\chunkname}%
5034
5035     <item>% define this label for every chunk instance, so we
5036
5037     <item>% can tell when we are the last chunk of this name
5038
5039     <item> \ \\label{label-\\chunkname}%
5040   </nf-chunk||>
5041
5042   We also try and add the chunk to the list of listings, but I'm afraid we
5043   don't do very well. We want each chunk name listing once, with all of it's
5044   references.
5045
5046   <\nf-chunk|./fangle.sty>
5047     <item> \ \\addcontentsline{lol}{lstlisting}{\\lst@name~[\\protect\\subpageref{\\chunkname}]}%
5048   </nf-chunk||>
5049
5050   We then call the noweb output macros in the same way that noweave generates
5051   them, except that we don't need to call <verbatim|\\nwstartdeflinemarkup>
5052   or <verbatim|\\nwenddeflinemarkup> <emdash> and if we do, it messes up the
5053   output somewhat.
5054
5055   <\nf-chunk|./fangle.sty>
5056     <item> \ \\nwmargintag{%
5057
5058     <item> \ \ \ {%
5059
5060     <item> \ \ \ \ \ \\nwtagstyle{}%
5061
5062     <item> \ \ \ \ \ \\subpageref{\\chunkname}%
5063
5064     <item> \ \ \ }%
5065
5066     <item> \ }%
5067
5068     <item>%
5069
5070     <item> \ \\moddef{%
5071
5072     <item> \ \ \ {\\lst@name}%
5073
5074     <item> \ \ \ {%
5075
5076     <item> \ \ \ \ \ \\nwtagstyle{}\\/%
5077
5078     <item> \ \ \ \ \ \\@ifundefined{fangle@chunk@params}{}{%
5079
5080     <item> \ \ \ \ \ \ \ (\\fangle@chunk@params)%
5081
5082     <item> \ \ \ \ \ }%
5083
5084     <item> \ \ \ \ \ [\\csname \\chunkcount\\endcsname]~%
5085
5086     <item> \ \ \ \ \ \\subpageref{\\firstchunkname}%
5087
5088     <item> \ \ \ }%
5089
5090     <item> \ \ \ \\@ifundefined{fangle@chunk@append}{}{%
5091
5092     <item> \ \ \ \\ifx{}\\fangle@chunk@append{x}\\else%
5093
5094     <item> \ \ \ \ \ \ \ ,~add~to~\\fangle@chunk@append%
5095
5096     <item> \ \ \ \\fi%
5097
5098     <item> \ \ \ }%
5099
5100     <item>\\global\\def\\fangle@chunk@append{}%
5101
5102     <item>\\lstset{append=x}%
5103
5104     <item> \ }%
5105
5106     <item>%
5107
5108     <item> \ \\ifx\\relax\\prevchunkname\\endmoddef\\else\\plusendmoddef\\fi%
5109
5110     <item>% \ \\nwstartdeflinemarkup%
5111
5112     <item> \ \\nwprevnextdefs{\\prevchunkname}{\\nextchunkname}%
5113
5114     <item>% \ \\nwenddeflinemarkup%
5115
5116     <item>}%
5117   </nf-chunk||>
5118
5119   Originally this was developed as a <verbatim|listings> aspect, in the Init
5120   hook, but it was found easier to affect the title without using a hook
5121   <emdash> <verbatim|\\lst@AddToHookExe{PreSet}> is still required to set the
5122   listings name to the name passed to the <verbatim|\\Chunk> command, though.
5123
5124   <\nf-chunk|./fangle.sty>
5125     <item>%\\lst@BeginAspect{fangle}
5126
5127     <item>%\\lst@Key{fangle}{true}[t]{\\lstKV@SetIf{#1}{true}}
5128
5129     <item>\\lst@AddToHookExe{PreSet}{\\global\\let\\lst@intname\\lst@chunkname}
5130
5131     <item>\\lst@AddToHook{Init}{}%\\fangle@caption}
5132
5133     <item>%\\lst@EndAspect
5134   </nf-chunk||>
5135
5136   <subsection|Cross references>
5137
5138   We define the <verbatim|\\chunkref> command which makes it easy to generate
5139   visual references to different code chunks, e.g.
5140
5141   <block|<tformat|<table|<row|<cell|Macro>|<cell|Appearance>>|<row|<cell|<verbatim|\\chunkref{preamble}>>|<cell|>>|<row|<cell|<verbatim|\\chunkref[3]{preamble}>>|<cell|>>|<row|<cell|<verbatim|\\chunkref{preamble}[arg1,
5142   arg2]>>|<cell|>>>>>
5143
5144   Chunkref can also be used within a code chunk to include another code
5145   chunk. The third optional parameter to chunkref is a comma sepatarated list
5146   of arguments, which will replace defined parameters in the chunkref.
5147
5148   <\note>
5149     Darn it, if I have: <verbatim|=\<less\>\\chunkref{new-mode-tracker}[{chunks[chunk_name,
5150     "language"]},{mode}]\<gtr\>> the inner braces (inside [ ]) cause _ to
5151     signify subscript even though we have <verbatim|lst@ReplaceIn>
5152   </note>
5153
5154   <\nf-chunk|./fangle.sty>
5155     <item>\\def\\chunkref@args#1,{%
5156
5157     <item> \ \\def\\arg{#1}%
5158
5159     <item> \ \\lst@ReplaceIn\\arg\\lst@filenamerpl%
5160
5161     <item> \ \\arg%
5162
5163     <item> \ \\@ifnextchar){\\relax}{, \\chunkref@args}%
5164
5165     <item>}%
5166
5167     <item>\\newcommand\\chunkref[2][0]{%
5168
5169     <item> \ \\@ifnextchar({\\chunkref@i{#1}{#2}}{\\chunkref@i{#1}{#2}()}%
5170
5171     <item>}%
5172
5173     <item>\\def\\chunkref@i#1#2(#3){%
5174
5175     <item> \ \\def\\zero{0}%
5176
5177     <item> \ \\def\\chunk{#2}%
5178
5179     <item> \ \\def\\chunkno{#1}%
5180
5181     <item> \ \\def\\chunkargs{#3}%
5182
5183     <item> \ \\ifx\\chunkno\\zero%
5184
5185     <item> \ \ \ \\def\\chunkname{#2-1}%
5186
5187     <item> \ \\else%
5188
5189     <item> \ \ \ \\def\\chunkname{#2-\\chunkno}%
5190
5191     <item> \ \\fi%
5192
5193     <item> \ \\let\\lst@arg\\chunk%
5194
5195     <item> \ \\lst@ReplaceIn\\chunk\\lst@filenamerpl%
5196
5197     <item> \ \\LA{%\\moddef{%
5198
5199     <item> \ \ \ {\\chunk}%
5200
5201     <item> \ \ \ {%
5202
5203     <item> \ \ \ \ \ \\nwtagstyle{}\\/%
5204
5205     <item> \ \ \ \ \ \\ifx\\chunkno\\zero%
5206
5207     <item> \ \ \ \ \ \\else%
5208
5209     <item> \ \ \ \ \ [\\chunkno]%
5210
5211     <item> \ \ \ \ \ \\fi%
5212
5213     <item> \ \ \ \ \ \\ifx\\chunkargs\\empty%
5214
5215     <item> \ \ \ \ \ \\else%
5216
5217     <item> \ \ \ \ \ \ \ (\\chunkref@args #3,)%
5218
5219     <item> \ \ \ \ \ \\fi%
5220
5221     <item> \ \ \ \ \ ~\\subpageref{\\chunkname}%
5222
5223     <item> \ \ \ }%
5224
5225     <item> \ }%
5226
5227     <item> \ \\RA%\\endmoddef%
5228
5229     <item>}%
5230   </nf-chunk||>
5231
5232   <subsection|The end>
5233
5234   <\nf-chunk|./fangle.sty>
5235     <item>%
5236
5237     <item>%\\makeatother
5238   </nf-chunk||>
5239
5240   <chapter|Extracting fangle>
5241
5242   <section|Extracting from Lyx>
5243
5244   To extract from <LyX>, you will need to configure <LyX> as explained in
5245   section <reference|Configuring-the-build>.
5246
5247   <label|lyx-build-script>And this lyx-build scrap will extract fangle for
5248   me.
5249
5250   <\nf-chunk|lyx-build>
5251     <item>#! /bin/sh
5252
5253     <item>set -x
5254
5255     <item>
5256
5257     <item><nf-ref|lyx-build-helper|>
5258
5259     <item>cd $PROJECT_DIR \|\| exit 1
5260
5261     <item>
5262
5263     <item>/usr/local/bin/fangle -R./fangle $TEX_SRC \<gtr\> ./fangle
5264
5265     <item>/usr/local/bin/fangle -R./fangle.module $TEX_SRC \<gtr\>
5266     ./fangle.module
5267
5268     <item>
5269
5270     <item>export FANGLE=./fangle
5271
5272     <item>export TMP=${TMP:-/tmp}
5273
5274     <item><nf-ref|test:*|>
5275   </nf-chunk|sh|>
5276
5277   With a lyx-build-helper
5278
5279   <\nf-chunk|lyx-build-helper>
5280     <item>PROJECT_DIR="$LYX_r"
5281
5282     <item>LYX_SRC="$PROJECT_DIR/${LYX_i%.tex}.lyx"
5283
5284     <item>TEX_DIR="$LYX_p"
5285
5286     <item>TEX_SRC="$TEX_DIR/$LYX_i"
5287
5288     <item>TXT_SRC="$TEX_SRC"
5289   </nf-chunk|sh|>
5290
5291   <section|Extracting documentation>
5292
5293   <\nf-chunk|./gen-www>
5294     <item>#python -m elyxer --css lyx.css $LYX_SRC \| \\
5295
5296     <item># \ iconv -c -f utf-8 -t ISO-8859-1//TRANSLIT \| \\
5297
5298     <item># \ sed 's/UTF-8"\\(.\\)\<gtr\>/ISO-8859-1"\\1\<gtr\>/' \<gtr\>
5299     www/docs/fangle.html
5300
5301     <item>
5302
5303     <item>python -m elyxer --css lyx.css --iso885915 --html --destdirectory
5304     www/docs/fangle.e \\
5305
5306     <item> \ \ \ \ \ \ fangle.lyx \<gtr\> www/docs/fangle.e/fangle.html
5307
5308     <item>
5309
5310     <item>( mkdir -p www/docs/fangle && cd www/docs/fangle && \\
5311
5312     <item> \ lyx -e latex ../../../fangle.lyx && \\
5313
5314     <item> \ htlatex ../../../fangle.tex "xhtml,fn-in" && \\
5315
5316     <item> \ sed -i -e 's/\<less\>!--l\\. [0-9][0-9]* *--\<gtr\>//g'
5317     fangle.html
5318
5319     <item>)
5320
5321     <item>
5322
5323     <item>( mkdir -p www/docs/literate && cd www/docs/literate && \\
5324
5325     <item> \ lyx -e latex ../../../literate.lyx && \\
5326
5327     <item> \ htlatex ../../../literate.tex "xhtml,fn-in" && \\
5328
5329     <item> \ sed -i -e 's/\<less\>!--l\\. [0-9][0-9]* *--\<gtr\>$//g'
5330     literate.html
5331
5332     <item>)
5333   </nf-chunk||>
5334
5335   <section|Extracting from the command line>
5336
5337   First you will need the tex output, then you can extract:
5338
5339   <\nf-chunk|lyx-build-manual>
5340     <item>lyx -e latex fangle.lyx
5341
5342     <item>fangle -R./fangle fangle.tex \<gtr\> ./fangle
5343
5344     <item>fangle -R./fangle.module fangle.tex \<gtr\> ./fangle.module
5345   </nf-chunk|sh|>
5346
5347   \;
5348
5349   <part|Tests>
5350
5351   <chapter|Tests>
5352
5353   <\nf-chunk|test:*>
5354     <item>#! /bin/bash
5355
5356     <item>
5357
5358     <item>export SRC="${SRC:-./fangle.tm}"
5359
5360     <item>export FANGLE="${FANGLE:-./fangle}"
5361
5362     <item>export TMP="${TMP:-/tmp}"
5363
5364     <item>export TESTDIR="$TMP/$USER/fangle.tests"
5365
5366     <item>export TXT_SRC="${TXT_SRC:-$TESTDIR/fangle.txt}"
5367
5368     <item>
5369
5370     <item>mkdir -p "$TESTDIR"
5371
5372     <item>
5373
5374     <item>tm -s -c "$SRC" "$TXT_SRC" -q
5375
5376     <item>
5377
5378     <item><nf-ref|test:helpers|>
5379
5380     <item>run_tests() {
5381
5382     <item> \ <nf-ref|test:run-tests|>
5383
5384     <item>}
5385
5386     <item>
5387
5388     <item># test current fangle
5389
5390     <item>echo Testing current fangle
5391
5392     <item>run_tests
5393
5394     <item>
5395
5396     <item># extract new fangle
5397
5398     <item>echo testing new fangle
5399
5400     <item>$FANGLE -R./fangle "$TXT_SRC" \<gtr\> "$TESTDIR/fangle"
5401
5402     <item>export FANGLE="$TESTDIR/fangle"
5403
5404     <item>run_tests
5405
5406     <item>
5407
5408     <item># Now check that it can extract a fangle that also passes the
5409     tests!
5410
5411     <item>echo testing if new fangle can generate itself
5412
5413     <item>$FANGLE -R./fangle "$TXT_SRC" \<gtr\> "$TESTDIR/fangle.new"
5414
5415     <item>passtest diff -bwu "$FANGLE" "$TESTDIR/fangle.new"
5416
5417     <item>export FANGLE="$TESTDIR/fangle.new"
5418
5419     <item>run_tests
5420   </nf-chunk||>
5421
5422   <\nf-chunk|test:run-tests>
5423     <item># run tests
5424
5425     <item>$FANGLE -Rpca-test.awk $TXT_SRC \| awk -f - \|\| exit 1
5426
5427     <item><nf-ref|test:cromulence|>
5428
5429     <item><nf-ref|test:escapes|>
5430
5431     <item><nf-ref|test:test-chunk|<tuple|test:example-sh>>
5432
5433     <item><nf-ref|test:test-chunk|<tuple|test:example-makefile>>
5434
5435     <item><nf-ref|test:test-chunk|<tuple|test:q:1>>
5436
5437     <item><nf-ref|test:test-chunk|<tuple|test:make:1>>
5438
5439     <item><nf-ref|test:test-chunk|<tuple|test:make:2>>
5440
5441     <item><nf-ref|test:chunk-params|>
5442   </nf-chunk|sh|>
5443
5444   <\nf-chunk|test:helpers>
5445     <item>passtest() {
5446
5447     <item> \ if "$@"
5448
5449     <item> \ then echo "Passed $TEST"
5450
5451     <item> \ else echo "Failed $TEST"
5452
5453     <item> \ \ \ \ \ \ return 1
5454
5455     <item> \ fi
5456
5457     <item>}
5458
5459     <item>
5460
5461     <item>failtest() {
5462
5463     <item> \ if ! "$@"
5464
5465     <item> \ then echo "Passed $TEST"
5466
5467     <item> \ else echo "Failed $TEST"
5468
5469     <item> \ \ \ \ \ \ return 1
5470
5471     <item> \ fi
5472
5473     <item>}
5474   </nf-chunk||>
5475
5476   This chunk will render a named chunk and compare it to another rendered
5477   nameed chunk
5478
5479   <\nf-chunk|test:test-chunk>
5480     <item><nf-ref|test:test-chunk-result|<tuple|<nf-arg|chunk>|<nf-arg|chunk>.result>>
5481   </nf-chunk|sh|<tuple|chunk>>
5482
5483   <\nf-chunk|test:test-chunk-result>
5484     <item>TEST="<nf-arg|result>" passtest diff -u --label "<nf-arg|chunk>"
5485     \<less\>( $FANGLE -R<nf-arg|chunk> $TXT_SRC ) \\
5486
5487     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ --label
5488     "<nf-arg|result>" \<less\>( $FANGLE -R<nf-arg|result> $TXT_SRC )
5489   </nf-chunk|sh|<tuple|chunk|result>>
5490
5491   <chapter|Chunk Parameters>
5492
5493   <section|<LyX>>
5494
5495   <\nf-chunk|test:lyx:chunk-params:sub>
5496     <item>I see a ${THING},
5497
5498     <item>a ${THING} of colour ${colour},\
5499
5500     <item>and looking closer =\<less\>\\chunkref{test:lyx:chunk-params:sub:sub}(${colour})\<gtr\>
5501   </nf-chunk||<tuple|THING|colour>>
5502
5503   <\nf-chunk|test:lyx:chunk-params:sub:sub>
5504     <item>a funny shade of ${colour}
5505   </nf-chunk||<tuple|colour>>
5506
5507   <\nf-chunk|test:lyx:chunk-params:text>
5508     <item>What do you see? "=\<less\>\\chunkref{test:lyx:chunk-params:sub}(joe,
5509     red)\<gtr\>"
5510
5511     <item>Well, fancy!
5512   </nf-chunk||>
5513
5514   Should generate output:
5515
5516   <\nf-chunk|test:lyx:chunk-params:result>
5517     <item>What do you see? "I see a joe,
5518
5519     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ a joe of colour red,\
5520
5521     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ and looking closer a funny shade
5522     of red"
5523
5524     <item>Well, fancy!
5525   </nf-chunk||>
5526
5527   And this chunk will perform the test:
5528
5529   <\nf-chunk|test:chunk-params>
5530     <item><nf-ref|test:test-chunk-result|<tuple|test:lyx:chunk-params:text|test:lyx:chunk-params:result>>
5531     \|\| exit 1
5532   </nf-chunk||>
5533
5534   <section|<TeXmacs>>
5535
5536   <\nf-chunk|test:chunk-params:sub>
5537     <item>I see a <nf-arg|THING>,
5538
5539     <item>a <nf-arg|THING> of colour <nf-arg|colour>,\
5540
5541     <item>and looking closer <nf-ref|test:chunk-params:sub:sub|<tuple|<nf-arg|colour>>>
5542   </nf-chunk||<tuple|THING|colour>>
5543
5544   <\nf-chunk|test:chunk-params:sub:sub>
5545     <item>a funny shade of <nf-arg|colour>
5546   </nf-chunk||<tuple|colour>>
5547
5548   <\nf-chunk|test:chunk-params:text>
5549     <item>What do you see? "<nf-ref|test:chunk-params:sub|<tuple|joe|red>>"
5550
5551     <item>Well, fancy!
5552   </nf-chunk||>
5553
5554   Should generate output:
5555
5556   <\nf-chunk|test:chunk-params:result>
5557     <item>What do you see? "I see a joe,
5558
5559     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ a joe of colour red,\
5560
5561     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ and looking closer a funny shade
5562     of red"
5563
5564     <item>Well, fancy!
5565   </nf-chunk||>
5566
5567   And this chunk will perform the test:
5568
5569   <\nf-chunk|test:chunk-params>
5570     <item><nf-ref|test:test-chunk-result|<tuple|test:chunk-params:text|test:chunk-params:result>>
5571     \|\| exit 1
5572   </nf-chunk||>
5573
5574   <chapter|Compile-log-lyx><label|Compile-log-lyx>
5575
5576   <\nf-chunk|Chunk:./compile-log-lyx>
5577     <item>#! /bin/sh
5578
5579     <item># can't use gtkdialog -i, cos it uses the "source" command which
5580     ubuntu sh doesn't have
5581
5582     <item>
5583
5584     <item>main() {
5585
5586     <item> \ errors="/tmp/compile.log.$$"
5587
5588     <item># \ if grep '^[^ ]*:\\( In \\\|[0-9][0-9]*: [^ ]*:\\)' \<gtr\>
5589     $errors
5590
5591     <item>if grep '^[^ ]*(\\([0-9][0-9]*\\)) *: *\\(error\\\|warning\\)'
5592     \<gtr\> $errors
5593
5594     <item> \ then
5595
5596     <item> \ \ \ sed -i -e 's/^[^ ]*[/\\\\]\\([^/\\\\]*\\)(\\([ 0-9][
5597     0-9]*\\)) *: */\\1:\\2\|\\2\|/' $errors
5598
5599     <item> \ \ \ COMPILE_DIALOG='
5600
5601     <item> \<less\>vbox\<gtr\>
5602
5603     <item> \ \<less\>text\<gtr\>
5604
5605     <item> \ \ \ \<less\>label\<gtr\>Compiler errors:\<less\>/label\<gtr\>
5606
5607     <item> \ \<less\>/text\<gtr\>
5608
5609     <item> \ \<less\>tree exported_column="0"\<gtr\>
5610
5611     <item> \ \ \ \<less\>variable\<gtr\>LINE\<less\>/variable\<gtr\>
5612
5613     <item> \ \ \ \<less\>height\<gtr\>400\<less\>/height\<gtr\>\<less\>width\<gtr\>800\<less\>/width\<gtr\>
5614
5615     <item> \ \ \ \<less\>label\<gtr\>File \| Line \|
5616     Message\<less\>/label\<gtr\>
5617
5618     <item> \ \ \ \<less\>action\<gtr\>'". $SELF ; "'lyxgoto
5619     $LINE\<less\>/action\<gtr\>
5620
5621     <item> \ \ \ \<less\>input\<gtr\>'"cat $errors"'\<less\>/input\<gtr\>
5622
5623     <item> \ \<less\>/tree\<gtr\>
5624
5625     <item> \ \<less\>hbox\<gtr\>
5626
5627     <item> \ \ \<less\>button\<gtr\>\<less\>label\<gtr\>Build\<less\>/label\<gtr\>
5628
5629     <item> \ \ \ \ \<less\>action\<gtr\>lyxclient -c "LYXCMD:build-program"
5630     &\<less\>/action\<gtr\>
5631
5632     <item> \ \ \<less\>/button\<gtr\>
5633
5634     <item> \ \ \<less\>button ok\<gtr\>\<less\>/button\<gtr\>
5635
5636     <item> \ \<less\>/hbox\<gtr\>
5637
5638     <item> \<less\>/vbox\<gtr\>
5639
5640     <item>'
5641
5642     <item> \ \ \ export COMPILE_DIALOG
5643
5644     <item> \ \ \ ( gtkdialog --program=COMPILE_DIALOG ; rm $errors ) &
5645
5646     <item> \ else
5647
5648     <item> \ \ \ rm $errors
5649
5650     <item> \ fi
5651
5652     <item>}
5653
5654     <item>
5655
5656     <item>lyxgoto() {
5657
5658     <item> \ file="${LINE%:*}"
5659
5660     <item> \ line="${LINE##*:}"
5661
5662     <item> \ extraline=`cat $file \| head -n $line \| tac \| sed
5663     '/^\\\\\\\\begin{lstlisting}/q' \| wc -l`
5664
5665     <item> \ extraline=`expr $extraline - 1`
5666
5667     <item> \ lyxclient -c "LYXCMD:command-sequence server-goto-file-row $file
5668     $line ; char-forward ; repeat $extraline paragraph-down ;
5669     paragraph-up-select"
5670
5671     <item>}
5672
5673     <item>
5674
5675     <item>SELF="$0"
5676
5677     <item>if test -z "$COMPILE_DIALOG"
5678
5679     <item>then main "$@"\
5680
5681     <item>fi
5682   </nf-chunk|sh|>
5683
5684   \;
5685 </body>
5686
5687 <\initial>
5688   <\collection>
5689     <associate|info-flag|short>
5690     <associate|page-medium|paper>
5691     <associate|page-screen-height|982016tmpt>
5692     <associate|page-screen-margin|false>
5693     <associate|page-screen-width|1686528tmpt>
5694     <associate|page-show-hf|true>
5695     <associate|preamble|false>
5696     <associate|sfactor|5>
5697   </collection>
5698 </initial>
5699
5700 <\references>
5701 </references>
5702
5703 <\auxiliary>
5704 </auxiliary>