fangle.tm

   1 <TeXmacs|1.0.7.10>
   2
   3 <style|<tuple|book|fangle|header-book|tmdoc-keyboard>>
   4
   5 <\body>
   6   <hide-preamble|<assign|LyX|<macro|L<space|-0.1667em><move|Y|0fn|-0.25em><space|-0.125em>X>><assign|par-first|0fn><assign|par-par-sep|0.5fn>>
   7
   8   <doc-data|<doc-title|fangle>|<doc-author-data|<author-name|Sam
   9   Liddicott>|<\author-address>
  10     sam@liddicott.com
  11   </author-address>>|<doc-date|August 2009>>
  12
  13   <section*|Introduction>
  14
  15   <name|Fangle> is a tool for fangled literate programming. Newfangled is
  16   defined as <em|New and often needlessly novel> by
  17   <name|TheFreeDictionary.com>.
  18
  19   In this case, fangled means yet another not-so-new<footnote|but improved.>
  20   method for literate programming.
  21
  22   <name|Literate Programming> has a long history starting with the great
  23   <name|Donald Knuth> himself, whose literate programming tools seem to make
  24   use of as many escape sequences for semantic markup as <TeX> (also by
  25   <name|Donald Knuth>).
  26
  27   <name|Norman Ramsey> wrote the <name|Noweb> set of tools
  28   (<verbatim|notangle>, <verbatim|noweave> and <verbatim|noroots>) and
  29   helpfully reduced the number of magic character sequences to pretty much
  30   just <verbatim|\<less\>\<less\>>, <verbatim|\<gtr\>\<gtr\>> and
  31   <verbatim|@>, and in doing so brought the wonders of literate programming
  32   within my reach.
  33
  34   While using the <LyX> editor for <LaTeX> editing I had various troubles
  35   with the noweb tools, some of which were my fault, some of which were
  36   noweb's fault and some of which were <LyX>'s fault.
  37
  38   <name|Noweb> generally brought literate programming to the masses through
  39   removing some of the complexity of the original literate programming, but
  40   this would be of no advantage to me if the <LyX> / <LaTeX> combination
  41   brought more complications in their place.
  42
  43   <name|Fangle> was thus born (originally called <name|Newfangle>) as an awk
  44   replacement for notangle, adding some important features, like better
  45   integration with <LyX> and <LaTeX> (and later <TeXmacs>), multiple output
  46   format conversions, and fixing notangle bugs like indentation when using -L
  47   for line numbers.
  48
  49   Significantly, fangle is just one program which replaces various programs
  50   in <name|Noweb>. Noweave is done away with and implemented directly as
  51   <LaTeX> macros, and noroots is implemented as a function of the untangler
  52   fangle.
  53
  54   Fangle is written in awk for portability reasons, awk being available for
  55   most platforms. A Python version<\footnote>
  56     hasn't anyone implemented awk in python yet?
  57   </footnote> was considered for the benefit of <LyX> but a scheme version
  58   for <TeXmacs> will probably materialise first; as <TeXmacs> macro
  59   capabilities help make edit-time and format-time rendering of fangle chunks
  60   simple enough for my weak brain.
  61
  62   As an extension to many literate-programming styles, Fangle permits code
  63   chunks to take parameters and thus operate somewhat like C pre-processor
  64   macros, or like C++ templates. Named parameters (or even local
  65   <em|variables> in the caller's scope) are anticipated, as parameterized
  66   chunks <emdash> useful though they are <emdash> are hard to comprehend in
  67   the literate document.
  68
  69   <section*|License><new-page*><label|License>
  70
  71   Fangle is licensed under the GPL 3 (or later).
  72
  73   This doesn't mean that sources generated by fangle must be licensed under
  74   the GPL 3.
  75
  76   This doesn't mean that you can't use or distribute fangle with sources of
  77   an incompatible license, but it means you must make the source of fangle
  78   available too.
  79
  80   As fangle is currently written in awk, an interpreted language, this should
  81   not be too hard.
  82
  83   <\nf-chunk|gpl3-copyright>
  84     <item># fangle - fully featured notangle replacement in awk
  85
  86     <item>#
  87
  88     <item># Copyright (C) 2009-2010 Sam Liddicott
  89     \<less\>sam@liddicott.com\<gtr\>
  90
  91     <item>#
  92
  93     <item># This program is free software: you can redistribute it and/or
  94     modify
  95
  96     <item># it under the terms of the GNU General Public License as published
  97     by
  98
  99     <item># the Free Software Foundation, either version 3 of the License, or
 100
 101     <item># (at your option) any later version.
 102
 103     <item>#
 104
 105     <item># This program is distributed in the hope that it will be useful,
 106
 107     <item># but WITHOUT ANY WARRANTY; without even the implied warranty of
 108
 109     <item># MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. \ See the
 110
 111     <item># GNU General Public License for more details.
 112
 113     <item>#
 114
 115     <item># You should have received a copy of the GNU General Public License
 116
 117     <item># along with this program. \ If not, see
 118     \<less\>http://www.gnu.org/licenses/\<gtr\>.
 119   </nf-chunk|text|>
 120
 121   <\table-of-contents|toc>
 122   </table-of-contents>
 123
 124   <part|Using Fangle>
 125
 126   <chapter|Introduction to Literate Programming>
 127
 128   Todo: Should really follow on from a part-0 explanation of what literate
 129   programming is.
 130
 131   <chapter|Running Fangle>
 132
 133   Fangle is a replacement for <name|noweb>, which consists of
 134   <verbatim|notangle>, <verbatim|noroots> and <verbatim|noweave>.
 135
 136   Like <verbatim|notangle> and <verbatim|noroots>, <verbatim|fangle> can read
 137   multiple named files, or from stdin.
 138
 139   <section|Listing roots>
 140
 141   The -r option causes fangle to behave like noroots.
 142
 143   <code*|fangle -r filename.tex>
 144
 145   will print out the fangle roots of a tex file.\
 146
 147   Unlike the <verbatim|noroots> command, the printed roots are not enclosed
 148   in angle brackets e.g. <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>>,
 149   unless at least one of the roots is defined using the <verbatim|notangle>
 150   notation <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>=>.
 151
 152   Also, unlike noroots, it prints out all roots --- not just those that are
 153   not used elsewhere. I find that a root not being used doesn't make it
 154   particularly top level <emdash> and so-called top level roots could also be
 155   included in another root as well.\
 156
 157   My convention is that top level roots to be extracted begin with
 158   <verbatim|./> and have the form of a filename.
 159
 160   Makefile.inc, discussed in <reference|makefile.inc>, can automatically
 161   extract all such sources prefixed with <verbatim|./>.
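
       As a small sketch of this convention (the <verbatim|grep> filter is an
       illustration and not part of fangle, and it assumes the roots are printed
       without surrounding angle brackets), the roots intended for extraction
       could be listed with:

       <\verbatim>
         fangle -r fangle.tex \| grep '^\\./'
       </verbatim>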
 162
 163   <section|Extracting roots>
 164
 165   notangle's <verbatim|-R> and <verbatim|-L> options are supported.
 166
 167   If you are using <LyX> or <LaTeX>, the standard way to extract a file would
 168   be:
 169
 170   <verbatim|fangle -R./Makefile.inc fangle.tex \<gtr\> ./Makefile.inc>
 171
 172   If you are using <TeXmacs>, the standard way to extract a file would
 173   similarly be:
 174
 175   <verbatim|fangle -R./Makefile.inc fangle.txt \<gtr\> ./Makefile.inc>
 176
 177   <TeXmacs> users would obtain the text file with a <em|verbatim> export from
 178   <TeXmacs> which can be done on the command line with <verbatim|texmacs -s
 179   -c fangle.tm fangle.txt -q>
 180
 181   Unlike the <verbatim|notangle> command, the <verbatim|-L> option
 182   to generate C pre-processor <verbatim|#line> style line-number
 183   directives does not break indenting of the generated file.
 184
 185   Also, thanks to mode tracking (described in <reference|modes>) the
 186   <verbatim|-L> option does not interrupt (and break) multi-line C macros
 187   either.
 188
 189   This does mean that sometimes the compiler might calculate the source line
 190   wrongly when generating error messages in such cases, but there isn't any
 191   other way around if multi-line macros include other chunks.
 192
 193   Future releases will include a mapping file so that line/character
 194   references from the C compiler can be converted to the correct part of the
 195   source document.
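
       As an illustration of combining the two options, and assuming the document
       defines a root chunk with the hypothetical name <verbatim|./hello.c>, a C
       file could be extracted with line-number directives like this:

       <\verbatim>
         fangle -L -R./hello.c fangle.tex \<gtr\> ./hello.c
       </verbatim>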
 196
 197   <section|Formatting the document>
 198
 199   The noweave replacement is built into the editing and formatting environment
 200   for <TeXmacs>, <LyX> (which uses <LaTeX>), and even for raw <LaTeX>.
 201
 202   Use of fangle with <TeXmacs>, <LyX> and <LaTeX> is explained in the next
 203   few chapters.
 204
 205   <chapter|Using Fangle with <LaTeX>>
 206
 207   Because the noweave replacement is implemented in <LaTeX>, there is no
 208   processing stage required before running the <LaTeX> command. Of course,
 209   <LaTeX> may need running two or more times, so that the code chunk
 210   references can be fully calculated.
 211
 212   The formatting is managed by a set of macros shown in
 213   <reference|latex-source>, and can be included with:
 214
 215   <verbatim|\\usepackage{fangle}>
 216
 217   Norman Ramsey's original <filename|noweb.sty> package is currently required
 218   as it is used for formatting the code chunk captions.
 219
 220   The <filename|listings.sty> package is required, and is used for formatting
 221   the code chunks and syntax highlighting.
 222
 223   The <filename|xargs.sty> package is also required, and makes writing
 224   <LaTeX> macros so much more pleasant.
 225
 226   <todo|Add examples of use of Macros>
 227
 228   <chapter|Using Fangle with <LyX>>
 229
 230   <LyX> uses the same <LaTeX> macros shown in <reference|latex-source> as
 231   part of a <LyX> module file <filename|fangle.module>, which automatically
 232   includes the macros in the document pre-amble provided that the fangle
 233   <LyX> module is used in the document.
 234
 235   <section|Installing the <LyX> module>
 236
 237   Copy <filename|fangle.module> to your <LyX> layouts directory, which for
 238   unix users will be <filename|~/.lyx/layouts>
 239
 240   In order to make the new literate styles available, you will need to
 241   reconfigure <LyX> by clicking Tools-\<gtr\>Reconfigure, and then re-start
 242   <LyX>.
 243
 244   <section|Obtaining a decent mono font>
 245
 246   The syntax high-lighting features of <name|lstlistings> make use of bold;
 247   however a mono-space tt font is used to typeset the listings. Obtaining a
 248   <with|font-family|tt|<strong|bold> tt font> can be impossibly difficult and
 249   amazingly easy. I spent many hours at it, following complicated
 250   instructions from those who had spent many hours over it, and was finally
 251   delivered the simple solution on the lyx mailing list.
 252
 253   <subsection|txfonts>
 254
 255   The simple way was to add this to my preamble:
 256
 257   <\verbatim>
 258     \\usepackage{txfonts}
 259
 260     \\renewcommand{\\ttdefault}{txtt}
 261   </verbatim>
 262
 263   \;
 264
 265   <subsection|ams pmb>
 266
 267   The next simplest way was to use ams poor-mans-bold, by adding this to the
 268   pre-amble:
 269
 270   <\verbatim>
 271     \\usepackage{amsbsy}
 272
 273     %\\renewcommand{\\ttdefault}{txtt}
 274
 275     %somehow make \\pmb be the command for bold, forgot how, sorry, above
 276     line not work
 277   </verbatim>
 278
 279   It works, but looks wretched on the dvi viewer.
 280
 281   <subsection|Luximono>
 282
 283   The lstlistings documentation suggests using Luximono.
 284
 285   Luximono was installed according to the instructions in Ubuntu Forums
 286   thread 1159181<\footnote>
 287     http://ubuntuforums.org/showthread.php?t=1159181
 288   </footnote> with tips from miknight<\footnote>
 289     http://miknight.blogspot.com/2005/11/how-to-install-luxi-mono-font-in.html
 290   </footnote> stating that <verbatim|sudo updmap --enable MixedMap ul9.map>
 291   is required. It looks fine in PDF and PS view but still looks rotten in dvi
 292   view.
 293
 294   <section|Formatting your Lyx document>
 295
 296   It is not necessary to base your literate document on any of the original
 297   <LyX> literate classes; so select a regular class for your document type.
 298
 299   Add the new module <em|Fangle Literate Listings> and also <em|Logical
 300   Markup> which is very useful.
 301
 302   In the drop-down style listbox you should notice a new style defined,
 303   called <em|Chunk>.
 304
 305   When you wish to insert a literate chunk, you enter its plain name in the
 306   Chunk style, instead of the old <name|noweb> method that uses
 307   <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>=> type tags. In the line (or
 308   paragraph) following the chunk name, you insert a listing with:
 309   Insert-\<gtr\>Program Listing.
 310
 311   Inside the white listing box you can type (or paste using
 312   <kbd|shift+ctrl+V>) your listing. There is no need to use <kbd|ctrl+enter>
 313   at the end of lines as with some older <LyX> literate techniques --- just
 314   press enter as normal.
 315
 316   <subsection|Customising the listing appearance>
 317
 318   The code is formatted using the <name|lstlistings> package. The chunk style
 319   doesn't just define the chunk name, but can also define any other chunk
 320   options supported by the lstlistings package <verbatim|\\lstset> command.
 321   In fact, what you type in the chunk style is raw latex. If you want to set
 322   the chunk language without having to right-click the listing, just add
 323   <verbatim|,language=C> after the chunk name. (Currently the language will
 324   affect all subsequent listings, so you may need to specify
 325   <verbatim|,language=> quite a lot).
 326
 327   <todo|so fix the bug>
 328
 329   Of course you can do this by editing the listings box advanced properties
 330   by right-clicking on the listings box, but that takes longer, and you can't
 331   see at-a-glance what the advanced settings are while editing the document;
 332   also advanced settings apply only to that box --- the chunk settings apply
 333   through the rest of the document<\footnote>
 334     It ought to apply only to subsequent chunks of the same name. I'll fix
 335     that later
 336   </footnote>.
 337
 338   <todo|So make sure they only apply to chunks of that name>
 339
 340   <subsection|Global customisations>
 341
 342   As lstlistings is used to set the code chunks, its <verbatim|\\lstset>
 343   command can be used in the pre-amble to set some document wide settings.
 344
 345   If your source has many words with long sequences of capital letters, then
 346   <verbatim|columns=fullflexible> may be a good idea, or the capital letters
 347   will get crowded. (I think lstlistings ought to use a slightly smaller font
 348   for capital letters so that they still fit).
 349
 350   The font family <verbatim|\\ttfamily> looks more normal for code, but has
 351   no bold (an alternate typewriter font is used).\
 352
 353   With <verbatim|\\ttfamily>, I must also specify
 354   <verbatim|columns=fullflexible> or the wrong letter spacing is used.
 355
 356   In my <LaTeX> pre-amble I usually specialise my code format with:
 357
 358   <\nf-chunk|document-preamble>
 359     <item>\\lstset{
 360
 361     <item>numbers=left, stepnumber=1, numbersep=5pt,
 362
 363     <item>breaklines=false,
 364
 365     <item>basicstyle=\\footnotesize\\ttfamily,
 366
 367     <item>numberstyle=\\tiny,
 368
 369     <item>language=C,
 370
 371     <item>columns=fullflexible,
 372
 373     <item>numberfirstline=true
 374
 375     <item>}
 376   </nf-chunk|tex|>
 377
 378   \;
 379
 380   <section|Configuring the build script>
 381
 382   You can invoke code extraction and building from the <LyX> menu option
 383   Document-\<gtr\>Build Program.
 384
 385   First, make sure you don't have a conversion defined for Lyx-\<gtr\>Program.
 386
 387   From the menu Tools-\<gtr\>Preferences, add a conversion from
 388   Latex(Plain)-\<gtr\>Program as:
 389
 390   <\verbatim>
 391     set -x ; fangle -Rlyx-build $$i \|\
 392
 393     \ \ env LYX_b=$$b LYX_i=$$i LYX_o=$$o LYX_p=$$p LYX_r=$$r bash
 394   </verbatim>
 395
 396   (But don't cut-n-paste it from this document or you may be pasting a
 397   multi-line string which will break your lyx preferences file).\
 398
 399   I hope that one day, <LyX> will set these into the environment when calling
 400   the build script.
 401
 402   You may also want to consider adding options to this conversion...
 403
 404   <verbatim|parselog=/usr/share/lyx/scripts/listerrors>
 405
 406   ...but if you do you will lose your stderr<\footnote>
 407     There is some bash plumbing to get a copy of stderr but this footnote is
 408     too small
 409   </footnote>.
 410
 411   Now, a shell script chunk called <filename|lyx-build> will be extracted and
 412   run whenever you choose the Document-\<gtr\>Build Program menu item.
 413
 414   This document was originally managed using <LyX>, and the lyx-build script for
 415   this document is shown here for historical reference.\
 416
 417   <\verbatim>
 418     lyx -e latex fangle.lyx && \\
 419
 420     \ \ fangle fangle.lyx \<gtr\> ./autoboot
 421   </verbatim>
 422
 423   This looks simple enough, but as mentioned, fangle has to be had from
 424   somewhere before it can be extracted.
 425
 426   <subsection|...>
 427
 428   When the lyx-build chunk is executed, the current directory will be a
 429   temporary directory, and <verbatim|LYX_SOURCE> will refer to the tex file
 430   in this temporary directory. This is unfortunate as our makefile wants to
 431   run from the project directory where the Lyx file is kept.
 432
 433   We can extract the project directory from <verbatim|$$r>, and derive the
 434   probable Lyx filename from the noweb file that Lyx generated.
 435
 436   <\nf-chunk|lyx-build-helper>
 437     <item>PROJECT_DIR="$LYX_r"
 438
 439     <item>LYX_SRC="$PROJECT_DIR/${LYX_i%.tex}.lyx"
 440
 441     <item>TEX_DIR="$LYX_p"
 442
 443     <item>TEX_SRC="$TEX_DIR/$LYX_i"
 444   </nf-chunk|sh|>
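
       The <verbatim|${LYX_i%.tex}> expansion above is standard shell suffix
       removal. A minimal stand-alone sketch, using an assumed example value in
       place of whatever <LyX> would pass in:

       <\verbatim>
         # example value only; the real one comes from the LyX converter

         LYX_i=fangle.tex

         # strip the trailing .tex and append .lyx; this prints fangle.lyx

         echo "${LYX_i%.tex}.lyx"
       </verbatim>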
 445
 446   And then we can define a lyx-build fragment similar to the autoboot
 447   fragment:
 448
 449   <\nf-chunk|lyx-build>
 450     <item>#! /bin/sh
 451
 452     <item>=\<less\>\\chunkref{lyx-build-helper}\<gtr\>
 453
 454     <item>cd $PROJECT_DIR \|\| exit 1
 455
 456     <item>
 457
 458     <item>#/usr/bin/fangle -filter ./notanglefix-filter \\
 459
 460     <item># \ -R./Makefile.inc "../../noweb-lyx/noweb-lyx3.lyx" \\
 461
 462     <item># \ \| sed '/NOWEB_SOURCE=/s/=.*/=samba4-dfs.lyx/' \\
 463
 464     <item># \ \<gtr\> ./Makefile.inc
 465
 466     <item>#
 467
 468     <item>#make -f ./Makefile.inc fangle_sources
 469   </nf-chunk|sh|>
 470
 471   \;
 472
 473   <chapter|Using Fangle with <TeXmacs>>
 474
 475   <todo|Write this chapter>
 476
 477   <chapter|Fangle with Makefiles><label|makefile.inc>
 478
 479   Here we describe a <filename|Makefile.inc> that you can include in your own
 480   Makefiles, or glue as a recursive make to other projects.
 481
 482   <filename|Makefile.inc> will cope with extracting all the other source
 483   files from this or any specified literate document and keeping them up to
 484   date.\
 485
 486   It may also be included by a <verbatim|Makefile> or <verbatim|Makefile.am>
 487   defined in a literate document to automatically deal with the extraction of
 488   source files and documents during normal builds.
 489
 490   Thus, if <verbatim|Makefile.inc> is included into a main project makefile
 491   it adds rules for the source files, capable of extracting the source files
 492   from the literate document.
 493
 494   <section|A word about makefile formats>
 495
 496   Whitespace formatting is very important in a Makefile. The first character
 497   of each action line must be a TAB.\
 498
 499   <\verbatim>
 500     target: pre-requisite
 501
 502     <nf-tab>action
 503
 504     <nf-tab>action
 505   </verbatim>
 506
 507   This requires that the literate programming environment have the ability to
 508   represent a TAB character in a way that fangle will generate an actual TAB
 509   character.
 510
 511   We also adopt a convention that code chunks whose names begin with
 512   <verbatim|./> should always be automatically extracted from the document.
 513   Code chunks whose names do not begin with <verbatim|./> are for internal
 514   reference. Such chunks may be extracted directly, but will not be
 515   automatically extracted by this Makefile.
 516
 517   <section|Extracting Sources>
 518
 519   Our makefile has two parts; variables must be defined before the targets
 520   that use them.
 521
 522   As we progress through this chapter, explaining concepts, we will be adding
 523   lines to <nf-ref|Makefile.inc-vars|> and <nf-ref|Makefile.inc-targets|>
 524   which are included in <nf-ref|./Makefile.inc|> below.
 525
 526   <\nf-chunk|./Makefile.inc>
 527     <item><nf-ref|Makefile.inc-vars|>
 528
 529     <item><nf-ref|Makefile.inc-targets|>
 530   </nf-chunk|make|>
 531
 532   We first define a placeholder for <verbatim|LITERATE_SOURCE> to hold the
 533   name of this document. This will normally be passed on the command line.
 534
 535   <\nf-chunk|Makefile.inc-vars>
 536     <item>LITERATE_SOURCE=
 537   </nf-chunk||>
 538
 539   Fangle cannot process <LyX> or <TeXmacs> documents directly, so the first
 540   stage is to convert these to more suitable text based formats<\footnote>
 541     <LyX> and <TeXmacs> formats are text-based, but not suitable for fangle
 542   </footnote>.
 543
 544   <subsection|Converting from <LyX> to <LaTeX>><label|Converting-from-Lyx>
 545
 546   The first stage will always be to convert the <LyX> file to a <LaTeX> file.
 547   Fangle must run on a <TeX> file because the <LyX> command
 548   <verbatim|server-goto-file-line><\footnote>
 549     The Lyx command <verbatim|server-goto-file-line> is used to position the
 550     Lyx cursor at the compiler errors.
 551   </footnote> requires that the line number provided be a line of the <TeX>
 552   file and always maps this to the line in the <LyX> document. We use
 553   <verbatim|server-goto-file-line> when moving the cursor to error lines
 554   during compile failures.
 555
 556   The command <verbatim|lyx -e latex fangle.lyx> will produce
 557   <verbatim|fangle.tex>, a <TeX> file; so we define a make target to be the
 558   same as the <LyX> file but with the <verbatim|.tex> extension.
 559
 560   The <verbatim|EXTRA_DIST> is for automake support so that the <TeX> files
 561   will automatically be distributed with the source, to help those who don't
 562   have <LyX> installed.
 563
 564   <\nf-chunk|Makefile.inc-vars>
 565     <item>TEX_SOURCE=$(LYX_SOURCE:.lyx=.tex)
 566
 567     <item>EXTRA_DIST+=$(TEX_SOURCE)
 568   </nf-chunk||>
 569
 570   We then specify that the <TeX> source is to be generated from the <LyX>
 571   source.
 572
 573   <\nf-chunk|Makefile.inc-targets>
 574     <item>$(TEX_SOURCE): $(LYX_SOURCE)
 575
 576     <item><nf-tab>lyx -e latex $\<less\>
 577
 578     <item>clean_tex:
 579
 580     <item><nf-tab>rm -f -- $(TEX_SOURCE)
 581
 582     <item>clean: clean_tex
 583   </nf-chunk||>
 584
 585   <subsection|Converting from <TeXmacs>><label|Converting-from-Lyx>
 586
 587   Fangle cannot process <TeXmacs> files directly<\footnote>
 588     but this is planned when <TeXmacs> uses xml as its native format
 589   </footnote>, but must first convert them to text files.
 590
 591   The command <verbatim|texmacs -c fangle.tm fangle.txt -q> will produce
 592   <verbatim|fangle.txt>, a text file; so we define a make target to be the
 593   same as the <TeXmacs> file but with the <verbatim|.txt> extension.
 594
 595   The <verbatim|EXTRA_DIST> is for automake support so that the text files
 596   will automatically be distributed with the source, to help those who don't
 597   have <TeXmacs> installed.
 598
 599   <\nf-chunk|Makefile.inc-vars>
 600     <item>TXT_SOURCE=$(LITERATE_SOURCE:.tm=.txt)
 601
 602     <item>EXTRA_DIST+=$(TXT_SOURCE)
 603   </nf-chunk||>
 604
 605   <todo|Add loop around each $\<less\> so multiple targets can be specified>
 606
 607   <\nf-chunk|Makefile.inc-targets>
 608     <item>$(TXT_SOURCE): $(LITERATE_SOURCE)
 609
 610     <item><nf-tab>texmacs -c $\<less\> $(TXT_SOURCE) -q
 611
 612     <item>clean_txt:
 613
 614     <item><nf-tab>rm -f -- $(TXT_SOURCE)
 615
 616     <item>clean: clean_txt
 617   </nf-chunk||>
 618
 619   <section|Extracting Program Source>
 620
 621   The program source is extracted using fangle, which is designed to operate
 622   on text or <LaTeX> documents<\footnote>
 623     <LaTeX> documents are just slightly special text documents
 624   </footnote>.
 625
 626   <\nf-chunk|Makefile.inc-vars>
 627     <item>FANGLE_SOURCE=$(TEX_SOURCE) $(TXT_SOURCE)
 628   </nf-chunk||>
 629
 630   The literate document can result in any number of source files, but not all
 631   of these will be changed each time the document is updated. We certainly
 632   don't want to update the timestamps of these files and cause the whole
 633   source tree to be recompiled just because the literate explanation was
 634   revised. We use <verbatim|CPIF> from the <em|Noweb> tools to avoid updating
 635   the file if the content has not changed, but should probably write our own.
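
       As a rough idea of what such a replacement might look like, here is a
       minimal sketch of a shell function with similar behaviour. The name
       <verbatim|cpif_sketch> and the temporary file naming are assumptions made
       for illustration; this is not the <name|Noweb> implementation.

       <\verbatim>
         # copy stdin to the file named by $1, but only if the content differs

         cpif_sketch() {

         \ \ tmp="$1.new.$$"

         \ \ cat \<gtr\> "$tmp" \|\| return 1

         \ \ if [ -f "$1" ] && cmp -s "$tmp" "$1"; then

         \ \ \ \ # unchanged: discard the copy and keep the old timestamp

         \ \ \ \ rm -f -- "$tmp"

         \ \ else

         \ \ \ \ # new or changed: replace the target

         \ \ \ \ mv -- "$tmp" "$1"

         \ \ fi

         }
       </verbatim>

       It would be used at the end of a pipeline, for example
       <verbatim|fangle -R./hello.c fangle.tex \| cpif_sketch ./hello.c>, where
       <verbatim|./hello.c> is a hypothetical root.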
 636
 637   However, if a source file is not updated, then the fangle file will always
 638   have a newer time-stamp and the makefile would always re-attempt to extract
 639   a newer source file which would be a waste of time.
 640
 641   Because of this, we use a stamp file which is always updated each time the
 642   sources are fully extracted from the <LaTeX> document. If the stamp file is
 643   newer than the document, then we can avoid an attempt to re-extract any of
 644   the sources. Because this stamp file is only updated when extraction is
 645   complete, it is safe for the user to interrupt the build-process
 646   mid-extraction.
 647
 648   We use <verbatim|echo> rather than <verbatim|touch> to update the stamp
 649   file because the <verbatim|touch> command does not work very well over an
 650   <verbatim|sshfs> mount that I was using.
 651
 652   <\nf-chunk|Makefile.inc-vars>
 653     <item>FANGLE_SOURCE_STAMP=$(FANGLE_SOURCE).stamp
 654   </nf-chunk||>
 655
 656   <\nf-chunk|Makefile.inc-targets>
 657     <item>$(FANGLE_SOURCE_STAMP): $(FANGLE_SOURCE) \\
 658
 659     <item><nf-tab> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ $(FANGLE_SOURCES) ; \\
 660
 661     <item><nf-tab>echo -n \<gtr\> $(FANGLE_SOURCE_STAMP)
 662
 663     <item>clean_stamp:
 664
 665     <item><nf-tab>rm -f $(FANGLE_SOURCE_STAMP)
 666
 667     <item>clean: clean_stamp
 668   </nf-chunk||>
 669
 670   <section|Extracting Source Files>
 671
 672   We compute <verbatim|FANGLE_SOURCES> to hold the names of all the source
 673   files defined in the document. We compute this only once, by means of
 674   <verbatim|:=> in assignment. The sed deletes any
 675   <verbatim|\<less\>\<less\>> and <verbatim|\<gtr\>\<gtr\>> which may
 676   surround the root names (for compatibility with Noweb's noroots command).
 677
 678   As we use chunk names beginning with <filename|./> to denote top level
 679   fragments that should be extracted, we filter out all fragments that do not
 680   begin with <filename|./>
 681
 682   <\note>
 683     <verbatim|FANGLE_PREFIX> is set to <verbatim|./> by default, but whatever
 684     it may be overridden to, the prefix is replaced by a literal
 685     <verbatim|./> before extraction so that files will be extracted in the
 686     current directory whatever the prefix. This helps namespace or
 687     sub-project prefixes like <verbatim|documents:> for chunks like
 688     <verbatim|documents:docbook/intro.xml>
 689   </note>
 690
 691   <todo|This doesn't work though, because it loses the full name and doesn't
 692   know what to extract!>
 693
 694   <\nf-chunk|Makefile.inc-vars>
 695     <item>FANGLE_PREFIX:=\\.\\/
 696
 697     <item>FANGLE_SOURCES:=$(shell \\
 698
 699     <item> \ fangle -r $(FANGLE_SOURCE) \|\\
 700
 701     <item> \ sed -e 's/^[\<less\>][\<less\>]//;s/[\<gtr\>][\<gtr\>]$$//;/^$(FANGLE_PREFIX)/!d'
 702     \\
 703
 704     <item> \ \ \ \ \ -e 's/^$(FANGLE_PREFIX)/\\.\\//' )
 705   </nf-chunk||>
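
       To see what the <verbatim|sed> expression does, it can be tried by hand on
       some sample root names. In this sketch the three names are made up, the
       make-level <verbatim|$$> is written as the single <verbatim|$> that the
       shell sees, and <verbatim|FANGLE_PREFIX> is expanded to its default value:

       <\verbatim>
         printf '%s\\n' '\<less\>\<less\>./hello.c\<gtr\>\<gtr\>' './Makefile.inc' 'helper-chunk' \|

         \ \ sed -e 's/^[\<less\>][\<less\>]//;s/[\<gtr\>][\<gtr\>]$//;/^\\.\\//!d' \\

         \ \ \ \ \ \ -e 's/^\\.\\//\\.\\//'
       </verbatim>

       This prints <verbatim|./hello.c> and <verbatim|./Makefile.inc>; the angle
       brackets are stripped, and <verbatim|helper-chunk> is dropped because it
       does not begin with <verbatim|./>.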
 706
 707   The target below, <verbatim|echo_fangle_sources>, is a helpful debugging
 708   target and shows the names of the files that would be extracted.
 709
 710   <\nf-chunk|Makefile.inc-targets>
 711     <item>.PHONY: echo_fangle_sources
 712
 713     <item>echo_fangle_sources: ; @echo $(FANGLE_SOURCES)
 714   </nf-chunk||>
 715
 716   We define a convenient target called <verbatim|fangle_sources> so that
 717   <verbatim|make fangle_sources> will re-extract the source if the
 718   literate document has been updated.\
 719
 720   <\nf-chunk|Makefile.inc-targets>
 721     <item>.PHONY: fangle_sources
 722
 723     <item>fangle_sources: $(FANGLE_SOURCE_STAMP)
 724   </nf-chunk||>
 725
 726   And also a convenient target to remove extracted sources.
 727
 728   <\nf-chunk|Makefile.inc-targets>
 729     <item>.PHONY: clean_fangle_sources
 730
 731     <item>clean_fangle_sources: ; \\
 732
 733     <item> \ \ \ \ \ \ \ rm -f -- $(FANGLE_SOURCE_STAMP) $(FANGLE_SOURCES)
 734   </nf-chunk||>
 735
 736   We now look at the extraction of the source files.
 737
 738   This makefile macro <verbatim|if_extension> takes 4 arguments: the filename
 739   <verbatim|$(1)>, some extensions to match <verbatim|$(2)> and a shell
 740   command to return if the filename does match the extensions <verbatim|$(3)>,
 741   and a shell command to return if it does not match the extensions
 742   <verbatim|$(4)>.
 743
 744   <\nf-chunk|Makefile.inc-vars>
 745     <item>if_extension=$(if $(findstring $(suffix $(1)),$(2)),$(3),$(4))
 746   </nf-chunk||>
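
       As a quick check of how the macro behaves, here is a small stand-alone
       makefile sketch. The <verbatim|demo> target and the two file names are
       invented for the example; <verbatim|C_EXTENSIONS> is given the same value
       that it is assigned later in this chapter.

       <\verbatim>
         if_extension=$(if $(findstring $(suffix $(1)),$(2)),$(3),$(4))

         C_EXTENSIONS=.c .h

         demo:

         <nf-tab>@echo main.c: $(call if_extension,main.c,$(C_EXTENSIONS),match,no-match)

         <nf-tab>@echo notes.txt: $(call if_extension,notes.txt,$(C_EXTENSIONS),match,no-match)
       </verbatim>

       Saved as a <verbatim|Makefile>, <verbatim|make demo> prints
       <verbatim|main.c: match> and <verbatim|notes.txt: no-match>. Note that
       <verbatim|findstring> is a substring test, so a suffix such as
       <verbatim|.c> would also match an extension list containing
       <verbatim|.cpp>.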
 747
 748   For some source files like C files, we want to output the line number and
 749   filename of the original <LaTeX> document from which the source
 750   came<\footnote>
 751     I plan to replace this option with a separate mapping file so as not to
 752     pollute the generated source, and also to allow a code pretty-printing
 753     reformatter like <verbatim|indent> be able to re-format the file and
 754     adjust for changes through comparing the character streams.
 755   </footnote>.
 756
 757   To make this easier we define the file extensions for which we want to do
 758   this.
 759
 760   <\nf-chunk|Makefile.inc-vars>
 761     <item>C_EXTENSIONS=.c .h
 762   </nf-chunk||>
 763
 764   We can then use the <verbatim|if_extension> macro to define a macro which
 765   expands out to the <verbatim|-L> option if fangle is being invoked on a C
 766   source file, so that C compile errors will refer to the line number in the
 767   <TeX> document.\
 768
 769   <\nf-chunk|Makefile.inc-vars>
 770     <item>TABS=8
 771
 772     <item>nf_line=-L -T$(TABS)
 773
 774     <item>fangle=fangle $(call if_extension,$(2),$(C_EXTENSIONS),$(nf_line))
 775     -R"$(2)" $(1)
 776   </nf-chunk||>
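
       As a worked example, assume a literate source <verbatim|fangle.tex> and
       two hypothetical roots, <verbatim|./hello.c> and
       <verbatim|./Makefile.inc>. Calling the macro as
       <verbatim|$(call fangle,fangle.tex,./hello.c)> and then as
       <verbatim|$(call fangle,fangle.tex,./Makefile.inc)> expands (apart from
       whitespace) to the two commands below; only the C file gets the
       <verbatim|-L> option:

       <\verbatim>
         fangle -L -T8 -R"./hello.c" fangle.tex

         fangle -R"./Makefile.inc" fangle.tex
       </verbatim>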
 777
 778   We can use a similar trick to define an indent macro which takes just the
 779   filename as an argument and can return a pipeline stage calling the indent
 780   command. Indent can be turned off with <verbatim|make fangle_sources
 781   indent=>
 782
 783   <\nf-chunk|Makefile.inc-vars>
 784     <item>indent_options=-npro -kr -i8 -ts8 -sob -l80 -ss -ncs
 785
 786     <item>indent=$(call if_extension,$(1),$(C_EXTENSIONS), \| indent
 787     $(indent_options))
 788   </nf-chunk||>
 789
 790   We now define the pattern for extracting a file. The files are written
 791   using noweb's <verbatim|cpif> so that the file timestamp will not be
 792   touched if the contents haven't changed. This avoids the need to rebuild
 793   the entire project because of a typographical change in the documentation,
 794   or when few or none of the C source files have changed.
 795
 796   <\nf-chunk|Makefile.inc-vars>
 797     <item>fangle_extract=@mkdir -p $(dir $(1)) && \\
 798
 799     <item> \ $(call fangle,$(2),$(1)) \<gtr\> "$(1).tmp" && \\
 800
 801     <item> \ cat "$(1).tmp" $(indent) \| cpif "$(1)" \\
 802
 803     <item> \ && rm -- "$(1).tmp" \|\| \\
 804
 805     <item> \ (echo error newfangling $(1) from $(2) ; exit 1)
 806   </nf-chunk||>
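  The update-only-if-changed behaviour that <verbatim|cpif> provides can be sketched in plain shell. This is only an illustration of the idea and not noweb's implementation: the function name update_if_changed is hypothetical, and it assumes that mktemp and cmp are available.

```shell
#! /bin/sh
# Hypothetical stand-in for cpif: copy stdin to the target file, but
# leave the target (and its timestamp) alone when nothing has changed.
update_if_changed() {
  target=$1
  tmp=$(mktemp) || return 1
  cat > "$tmp"                      # capture the new contents from stdin
  if [ -f "$target" ] && cmp -s "$tmp" "$target"; then
    rm -f -- "$tmp"                 # identical: target is not touched
  else
    mv -- "$tmp" "$target"          # new or different: replace the target
  fi
}

# Usage (file names are examples only):
#   fangle -Rhello.c doc.tex | update_if_changed hello.c
```

  Because an unchanged target keeps its modification time, make sees no reason to rebuild anything that depends on it.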
 807
 808   We define a target which will extract or update all sources. To do this we
 809   first define a makefile template that can do this for any source file in
 810   the <LaTeX> document.
 811
 812   <\nf-chunk|Makefile.inc-vars>
 813     <item>define FANGLE_template
 814
 815     <item> \ $(1): $(2)
 816
 817     <item><nf-tab>$$(call fangle_extract,$(1),$(2))
 818
 819     <item> \ FANGLE_TARGETS+=$(1)
 820
 821     <item>endef
 822   </nf-chunk||>
 823
 824   We then enumerate the discovered <verbatim|FANGLE_SOURCES> to generate a
 825   makefile rule for each one using the makefile template we defined above.
 826
 827   <\nf-chunk|Makefile.inc-targets>
 828     <item>$(foreach source,$(FANGLE_SOURCES),\\
 829
 830     <item> \ $(eval $(call FANGLE_template,$(source),$(FANGLE_SOURCE))) \\
 831
 832     <item>)
 833   </nf-chunk||>
 834
 835   These will all be built with <verbatim|FANGLE_SOURCE_STAMP>.
 836
 837   We also remove the generated sources on a make distclean.
 838
 839   <\nf-chunk|Makefile.inc-targets>
 840     <item>_distclean: clean_fangle_sources
 841   </nf-chunk||>
 842
 843   <section|Extracting Documentation>
 844
 845   We then identify the intermediate stages of the documentation and their
 846   build and clean targets.
 847
 848   <subsection|Formatting <TeX>>
 849
 850   <subsubsection|Running pdflatex>
 851
 852   We produce a pdf file from the tex file.
 853
 854   <\nf-chunk|Makefile.inc-vars>
 855     <item>FANGLE_PDF=$(TEX_SOURCE:.tex=.pdf)
 856   </nf-chunk||>
 857
 858   We run pdflatex twice to be sure that the contents and aux files are up to
 859   date. We certainly are <em|required> to run pdflatex at least twice if
 860   these files do not exist.
 861
 862   <\nf-chunk|Makefile.inc-targets>
 863     <item>$(FANGLE_PDF): $(TEX_SOURCE)
 864
 865     <item><nf-tab>pdflatex $\<less\> && pdflatex $\<less\>
 866
 867     <item>
 868
 869     <item>clean_pdf:
 870
 871     <item><nf-tab>rm -f -- $(FANGLE_PDF) $(TEX_SOURCE:.tex=.toc) \\
 872
 873     <item><nf-tab> \ $(TEX_SOURCE:.tex=.log) $(TEX_SOURCE:.tex=.aux)
 874   </nf-chunk||>
 875
 876   <subsection|Formatting <TeXmacs>>
 877
 878   <TeXmacs> can produce a PDF file directly.
 879
 880   <\nf-chunk|Makefile.inc-vars>
 881     <item>FANGLE_PDF=$(TEXMACS_SOURCE:.tm=.pdf)
 882   </nf-chunk||>
 883
 884   <\todo>
 885     Outputting the PDF may not be enough to update the links and page
 886     references. I think
 887
 888     we need to update twice, generate a PDF, update twice more and generate a
 889     new PDF.
 890
 891     Basically the PDF export of <TeXmacs> is pretty rotten and doesn't work
 892     properly from the CLI.
 893   </todo>
 894
 895   <\nf-chunk|Makefile.inc-targets>
 896     <item>$(FANGLE_PDF): $(TEXMACS_SOURCE)
 897
 898     <item><nf-tab>texmacs -c $(TEXMACS_SOURCE) $@ -q
 899
 900     <item>
 901
 902     <item>clean_pdf:
 903
 904     <item><nf-tab>rm -f -- $(FANGLE_PDF)
 905   </nf-chunk||>
 906
 907   <subsection|Building the Documentation as a Whole>
 908
 909   Currently we only build pdf as a final format, but <verbatim|FANGLE_DOCS>
 910   may later hold other output formats.
 911
 912   <\nf-chunk|Makefile.inc-vars>
 913     <item>FANGLE_DOCS=$(FANGLE_PDF)
 914   </nf-chunk||>
 915
 916   We also define <verbatim|fangle_docs> as a convenient phony target.
 917
 918   <\nf-chunk|Makefile.inc-targets>
 919     <item>.PHONY: fangle_docs
 920
 921     <item>fangle_docs: $(FANGLE_DOCS)
 922
 923     <item>docs: fangle_docs
 924   </nf-chunk||>
 925
 926   We also define a convenient <verbatim|clean_fangle_docs> target, which we add to the
 927   regular clean target.
 928
 929   <\nf-chunk|Makefile.inc-targets>
 930     <item>.PHONY: clean_fangle_docs
 931
 932     <item>clean_fangle_docs: clean_tex clean_pdf
 933
 934     <item>clean: clean_fangle_docs
 935
 936     <item>
 937
 938     <item>distclean_fangle_docs: clean_tex clean_fangle_docs
 939
 940     <item>distclean: clean distclean_fangle_docs
 941   </nf-chunk||>
 942
 943   <section|Other helpers>
 944
 945   If <filename|Makefile.inc> is included into <filename|Makefile>, then
 946   extracted files can be updated with this command:
 947
 948   <verbatim|make fangle_sources>
 949
 950   otherwise, with:
 951
 952   <verbatim|make -f Makefile.inc fangle_sources>
 953
 954   <section|Boot-strapping the extraction>
 955
 956   As well as having the makefile extract or update the source files as part
 957   of its operation, it also seems convenient to have the makefile
 958   re-extract itself from <em|this> document.
 959
 960   It would also be convenient for the code that extracts the makefile
 961   from this document to be part of this document as well. However, we have to
 962   start somewhere, and this unfortunately requires us to type at least a few
 963   words by hand to start things off.
 964
 965   Therefore we will have a minimal root fragment, which, when extracted, can
 966   cope with extracting the rest of the source. This shell script fragment can
 967   do that. Its name is <verbatim|*> <emdash> out of regard for <name|Noweb>,
 968   but when extracted might better be called <verbatim|autoupdate>.
 969
 970   <todo|De-lyxify>
 971
 972   <\nf-chunk|*>
 973     <item>#! /bin/sh
 974
 975     <item>
 976
 977     <item>MAKE_SRC="${1:-${NW_LYX:-../../noweb-lyx/noweb-lyx3.lyx}}"
 978
 979     <item>MAKE_SRC=`dirname "$MAKE_SRC"`/`basename "$MAKE_SRC" .lyx`
 980
 981     <item>NOWEB_SRC="${2:-${NOWEB_SRC:-$MAKE_SRC.lyx}}"
 982
 983     <item>lyx -e latex $MAKE_SRC
 984
 985     <item>
 986
 987     <item>fangle -R./Makefile.inc ${MAKE_SRC}.tex \\
 988
 989     <item> \ \| sed "/FANGLE_SOURCE=/s/^/#/;T;aNOWEB_SOURCE=$FANGLE_SRC" \\
 990
 991     <item> \ \| cpif ./Makefile.inc
 992
 993     <item>
 994
 995     <item>make -f ./Makefile.inc fangle_sources
 996   </nf-chunk|sh|>
 997
 998   The general Makefile can be invoked with <filename|./autoboot> and can also
 999   be included into any automake file to automatically re-generate the source
1000   files.
1001
1002   The <em|autoboot> can be extracted with this command:
1003
1004   <\verbatim>
1005     lyx -e latex fangle.lyx && \\
1006
1007     \ \ fangle fangle.tex \<gtr\> ./autoboot
1008   </verbatim>
1009
1010   This looks simple enough, but as mentioned, fangle has to be had from
1011   somewhere before it can be extracted.
1012
1013   On a unix system this will extract <filename|fangle.module> and the
1014   <filename|fangle> awk script, and run some basic tests.\
1015
1016   <todo|cross-ref to test chapter when it is a chapter all on its own>
1017
1018   <section|Incorporating Makefile.inc into existing projects>
1019
1020   If you are writing a literate module of an existing non-literate program
1021   you may find it easier to use a slightly recursive make instead of directly
1022   including <verbatim|Makefile.inc> in the project's makefile.\
1023
1024   This way there is less chance of definitions in <verbatim|Makefile.inc>
1025   interfering with definitions in the main makefile, or with definitions in
1026   other <verbatim|Makefile.inc> from other literate modules of the same
1027   project.
1028
1029   To do this we add some <em|glue> to the project makefile that invokes
1030   Makefile.inc in the right way. The glue works by adding a <verbatim|.PHONY>
1031   target to call the recursive make, and adding this target as an additional
1032   pre-requisite to the existing targets.
1033
1034   <paragraph|Example>Sub-module of existing system
1035
1036   In this example, we are building <verbatim|module.so> as a literate module
1037   of a larger project.
1038
1039   We will show the sort of glue that can be inserted into the project's Makefile
1040   <emdash> or more likely <emdash> a regular Makefile included in or invoked
1041   by the project's Makefile.
1042
1043   <\nf-chunk|makefile-glue>
1044     <item>module_srcdir=modules/module
1045
1046     <item>MODULE_SOURCE=module.tm
1047
1048     <item>MODULE_STAMP=$(MODULE_SOURCE).stamp
1049   </nf-chunk||>
1050
1051   The existing build system may already have a build target for
1052   <filename|module.o>, but we just add another pre-requisite to that. In this
1053   case we use <filename|module.tm.stamp> as a pre-requisite, the stamp file's
1054   modified time indicating when all sources were extracted<\footnote>
1055     If the project's build system does not know how to build the module from
1056     the extracted sources, then just add build actions here as normal.
1057   </footnote>.
1058
1059   <\nf-chunk|makefile-glue>
1060     <item>$(module_srcdir)/module.o: $(module_srcdir)/$(MODULE_STAMP)
1061   </nf-chunk|make|>
1062
1063   The target for this new pre-requisite will be generated by a recursive make
1064   using <filename|Makefile.inc> which will make sure that the source is up to
1065   date, before it is built by the main project's makefile.
1066
1067   <\nf-chunk|makefile-glue>
1068     <item>$(module_srcdir)/$(MODULE_STAMP): $(module_srcdir)/$(MODULE_SOURCE)
1069
1070     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc fangle_sources
1071     LITERATE_SOURCE=$(MODULE_SOURCE)
1072   </nf-chunk||>
1073
1074   We can do similar glue for the docs, clean and distclean targets. In this
1075   example the main project was using a double colon for these targets, so we
1076   must use the same in our glue.
1077
1078   <\nf-chunk|makefile-glue>
1079     <item>docs:: docs_module
1080
1081     <item>.PHONY: docs_module
1082
1083     <item>docs_module:
1084
1085     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc docs
1086     LITERATE_SOURCE=$(MODULE_SOURCE)
1087
1088     <item>
1089
1090     <item>clean:: clean_module
1091
1092     <item>.PHONY: clean_module
1093
1094     <item>clean_module:
1095
1096     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc clean
1097     LITERATE_SOURCE=$(MODULE_SOURCE)
1098
1099     <item>
1100
1101     <item>distclean:: distclean_module
1102
1103     <item>.PHONY: distclean_module
1104
1105     <item>distclean_module:
1106
1107     <item><nf-tab>$(MAKE) -C $(module_srcdir) -f Makefile.inc distclean
1108     LITERATE_SOURCE=$(MODULE_SOURCE)
1109   </nf-chunk||>
1110
1111   We could do similarly for install targets to install the generated docs.
1112
1113   <part|Source Code>
1114
1115   <chapter|Fangle awk source code>
1116
1117   We use the copyright notice from chapter <reference|License>.
1118
1119   <\nf-chunk|./fangle>
1120     <item>#! /usr/bin/awk -f
1121
1122     <item># <nf-ref|gpl3-copyright|>
1123   </nf-chunk|awk|>
1124
1125   We also use code from <person|Arnold Robbins>' public domain getopt (1993
1126   revision) defined in <reference|getopt>, and naturally want to attribute
1127   this appropriately.
1128
1129   <\nf-chunk|./fangle>
1130     <item># NOTE: Arnold Robbins public domain getopt for awk is also used:
1131
1132     <item><nf-ref|getopt.awk-header|>
1133
1134     <item><nf-ref|getopt.awk-getopt()|>
1135
1136     <item>
1137   </nf-chunk||>
1138
1139   And include the following chunks (which are explained further on) to make
1140   up the program:
1141
1142   <\nf-chunk|./fangle>
1143     <item><nf-ref|helper-functions|>
1144
1145     <item><nf-ref|mode-tracker|>
1146
1147     <item><nf-ref|parse_chunk_args|>
1148
1149     <item><nf-ref|chunk-storage-functions|>
1150
1151     <item><nf-ref|output_chunk_names()|>
1152
1153     <item><nf-ref|output_chunks()|>
1154
1155     <item><nf-ref|write_chunk()|>
1156
1157     <item><nf-ref|expand_chunk_args()|>
1158
1159     <item>
1160
1161     <item><nf-ref|begin|>
1162
1163     <item><nf-ref|recognize-chunk|>
1164
1165     <item><nf-ref|end|>
1166   </nf-chunk||>
1167
1168   <section|AWK tricks>
1169
1170   The portable way to erase an array in awk is to split the empty string, so
1171   we define a fangle macro that can erase an array, like this:
1172
1173   <\nf-chunk|awk-delete-array>
1174     <item>split("", <nf-arg|ARRAY>);
1175   </nf-chunk|awk|<tuple|ARRAY>>
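  The effect of the macro can be checked with a small shell wrapper around awk. This is a sketch for illustration only: the function name erase_demo and the array name arr are made up, and the macro body is written out by hand instead of being expanded by fangle.

```shell
#! /bin/sh
# Count the elements of an array before and after erasing it by
# splitting the empty string into it.
erase_demo() {
  awk 'BEGIN {
    split("a b c", arr, " ");        # arr[1]="a", arr[2]="b", arr[3]="c"
    before = 0; for (k in arr) before++;
    split("", arr);                  # the awk-delete-array idiom
    after = 0; for (k in arr) after++;
    print before, after;
  }'
}

erase_demo
```

  The idiom works because split always clears its target array before filling it, and the empty string yields no fields.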
1176
1177   For debugging it is sometimes convenient to be able to dump the contents of
1178   an array to <verbatim|stderr>, and so this macro is also useful.
1179
1180   <\nf-chunk|dump-array>
1181     <item>print "\\nDump: <nf-arg|ARRAY>\\n--------\\n" \<gtr\>
1182     "/dev/stderr";
1183
1184     <item>for (_x in <nf-arg|ARRAY>) {
1185
1186     <item> \ print _x "=" <nf-arg|ARRAY>[_x] "\\n" \<gtr\> "/dev/stderr";
1187
1188     <item>}
1189
1190     <item>print "========\\n" \<gtr\> "/dev/stderr";
1191   </nf-chunk|awk|<tuple|ARRAY>>
1192
1193   <section|Catching errors>
1194
1195   Fatal errors are issued with the error function:
1196
1197   <\nf-chunk|error()>
1198     <item>function error(message)
1199
1200     <item>{
1201
1202     <item> \ print "ERROR: " FILENAME ":" FNR " " message \<gtr\>
1203     "/dev/stderr";
1204
1205     <item> \ exit 1;
1206
1207     <item>}
1208   </nf-chunk|awk|>
1209
1210   and likewise for non-fatal warnings:
1211
1212   <\nf-chunk|error()>
1213     <item>function warning(message)
1214
1215     <item>{
1216
1217     <item> \ print "WARNING: " FILENAME ":" FNR " " message \<gtr\>
1218     "/dev/stderr";
1219
1220     <item> \ warnings++;
1221
1222     <item>}
1223   </nf-chunk|awk|>
1224
1225   and debug output too:
1226
1227   <\nf-chunk|error()>
1228     <item>function debug_log(message)
1229
1230     <item>{
1231
1232     <item> \ print "DEBUG: " FILENAME ":" FNR " " message \<gtr\>
1233     "/dev/stderr";
1234
1235     <item>}
1236   </nf-chunk|awk|>
1237
1238   <todo|append=helper-functions>
1239
1240   <\nf-chunk|helper-functions>
1241     <item><nf-ref|error()|>
1242   </nf-chunk||>
1243
1244   <chapter|<TeXmacs> args>
1245
1246   <TeXmacs> functions with arguments<\footnote>
1247     or function declarations with parameters
1248   </footnote> appear like this:
1249
1250   <math|<math-tt|blah(><wide*|<wide|<math-tt|I came, I saw, I
1251   conquered>|\<wide-overbrace\>><rsup|argument 1><wide|<math-tt|<key|^K>>,
1252   |\<wide-overbrace\>><rsup|sep.><wide|and then went home
1253   asd|\<wide-overbrace\>><rsup|argument 2><wide|<math-tt|<key|^K>><math-tt|)>|\<wide-overbrace\>><rsup|term.>|\<wide-underbrace\>><rsub|arguments>>
1254
1255   Arguments commence after the opening parenthesis. The first argument runs
1256   up to the next <key|^K>.\
1257
1258   If the following character is a <key|,> then another argument follows. If
1259   the next character after the <key|,> is a space character, then it is also
1260   eaten. The fangle stylesheet emits <key|^K><key|,><key|space> as
1261   separators, but the fangle untangler will forgive a missing space.
1262
1263   If the following character is <key|)> then this is a terminator and there
1264   are no more arguments.
1265
1266   <\nf-chunk|constants>
1267     <item>ARG_SEPARATOR=sprintf("%c", 11);
1268   </nf-chunk||>
1269
1270   To process the <verbatim|text> in this fashion, we split the string on
1271   <key|^K>.
1272
1273   \;
1274
1275   <\nf-chunk|get_chunk_args>
1276     <item>function get_texmacs_chunk_args(text, args, \ \ a, done) {
1277
1278     <item> \ split(text, args, ARG_SEPARATOR);
1279
1280     <item>
1281
1282     <item> \ done=0
1283
1284     <item> \ for (a=1; (a in args); a++) if (a\<gtr\>1) {
1285
1286     <item> \ \ \ if (args[a] == "" \|\| substr(args[a], 1, 1) == ")") done=1;
1287
1288     <item> \ \ \ if (done) {
1289
1290     <item> \ \ \ \ \ delete args[a];
1291
1292     <item> \ \ \ \ \ break;
1293
1294     <item> \ \ \ }
1295
1296     <item>
1297
1298     <item> \ \ \ if (substr(args[a], 1, 2) == ", ") args[a]=substr(args[a],
1299     3);
1300
1301     <item> \ \ \ else if (substr(args[a], 1, 1) == ",")
1302     args[a]=substr(args[a], 2); \
1303
1304     <item> \ }
1305
1306     <item>}
1307   </nf-chunk||>
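  As a quick check of the rules above, the function can be exercised from a shell wrapper. This is a sketch: the wrapper name texmacs_args_demo and the sample text are invented for the example, and the function body is copied by hand from the chunk above.

```shell
#! /bin/sh
# Feed get_texmacs_chunk_args a two-argument list in the form emitted
# by the stylesheet: arg1 ^K ", " arg2 ^K ")".
texmacs_args_demo() {
  awk 'function get_texmacs_chunk_args(text, args,   a, done) {
    split(text, args, ARG_SEPARATOR);
    done = 0;
    for (a = 1; (a in args); a++) if (a > 1) {
      # an empty part or a leading ")" terminates the argument list
      if (args[a] == "" || substr(args[a], 1, 1) == ")") done = 1;
      if (done) { delete args[a]; break; }
      # strip the "," or ", " that follows the separator
      if (substr(args[a], 1, 2) == ", ") args[a] = substr(args[a], 3);
      else if (substr(args[a], 1, 1) == ",") args[a] = substr(args[a], 2);
    }
  }
  BEGIN {
    ARG_SEPARATOR = sprintf("%c", 11);   # ^K
    text = "I came, I saw" ARG_SEPARATOR ", and went home" ARG_SEPARATOR ")";
    get_texmacs_chunk_args(text, args);
    for (i = 1; (i in args); i++) print i ": " args[i];
  }'
}

texmacs_args_demo
```

  Note that the comma inside the first argument survives, because only <key|^K> separates arguments.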
1308
1309   <chapter|<LaTeX> and lstlistings>
1310
1311   <todo|Split LyX and TeXmacs parts>
1312
1313   For <LyX> and <LaTeX>, the <verbatim|lstlistings> package is used to format
1314   the lines of code chunks. You may recall from chapter XXX that arguments to
1315   a chunk definition are pure <LaTeX> code. This means that fangle needs to
1316   be able to parse <LaTeX> a little.
1317
1318   <LaTeX> arguments to <verbatim|lstlistings> macros are a comma-separated
1319   list of key-value pairs, and values containing commas are enclosed in
1320   <verbatim|{> braces <verbatim|}> (which is to be expected for <LaTeX>).
1321
1322   A sample expression is:
1323
1324   <verbatim|name=thomas, params={a, b}, something, something-else>
1325
1326   but we see that this is just a simpler form of this expression:
1327
1328   <verbatim|name=freddie, foo={bar=baz, quux={quirk, a=fleeg}}, etc>
1329
1330   We may consider that we need a function that can parse such <LaTeX>
1331   expressions and assign the values to an AWK associative array, perhaps using
1332   a recursive parser into a multi-dimensional hash<\footnote>
1333     as AWK doesn't have nested-hash support
1334   </footnote>, resulting in:
1335
1336   <tabular|<tformat|<cwith|2|6|1|2|cell-lborder|0.5pt>|<cwith|2|6|1|2|cell-rborder|0.5pt>|<cwith|2|6|1|2|cell-bborder|0.5pt>|<cwith|2|6|1|2|cell-tborder|0.5pt>|<cwith|1|1|1|2|cell-lborder|0.5pt>|<cwith|1|1|1|2|cell-rborder|0.5pt>|<cwith|1|1|1|2|cell-bborder|0.5pt>|<cwith|1|1|1|2|cell-tborder|0.5pt>|<table|<row|<cell|key>|<cell|value>>|<row|<cell|a[name]>|<cell|freddie>>|<row|<cell|a[foo,
1337   bar]>|<cell|baz>>|<row|<cell|a[foo, quux,
1338   quirk]>|<cell|>>|<row|<cell|a[foo, quux,
1339   a]>|<cell|fleeg>>|<row|<cell|a[etc]>|<cell|>>>>>
1340
1341   Yet, also, on reflection it seems that sometimes such nesting is not
1342   desirable, as the braces are also used to delimit values that contain
1343   commas --- we may consider that
1344
1345   <verbatim|name={williamson, freddie}>
1346
1347   should assign <verbatim|williamson, freddie> to <verbatim|name>.
1348
1349   In fact we are not so interested in the detail so as to be bothered by
1350   this, which turns out to be a good thing for two reasons. Firstly <TeX> has
1351   a malleable parser with no strict syntax, and secondly whether or not
1352   <verbatim|williamson> and <verbatim|freddie> should count as two items will
1353   be context-dependent anyway.
1354
1355   We need to parse this <LaTeX> for only one reason: we are
1356   extending lstlistings to add some additional arguments which will be used
1357   to express chunk parameters and other chunk options.
1358
1359   <section|Additional lstlistings parameters>
1360
1361   Further on we define a <verbatim|\\Chunk> <LaTeX> macro whose arguments
1362   will consist of the chunk name, optionally followed by a comma and then a
1363   comma-separated list of arguments. In fact we will just need to prefix
1364   <verbatim|name=> to the arguments in order to create valid lstlistings
1365   arguments.\
1366
1367   There will be other arguments supported too:\
1368
1369   <\description-long>
1370     <item*|params>As an extension to many literate-programming styles, fangle
1371     permits code chunks to take parameters and thus operate somewhat like C
1372     pre-processor macros, or like C++ templates. Chunk parameters are
1373     declared with a chunk argument called params, which holds a semi-colon
1374     separated list of parameters, like this:
1375
1376     <verbatim|achunk,language=C,params=name;address>
1377
1378     <item*|addto>a named chunk that this chunk is to be included into. This
1379     saves the effort of having to declare another listing of the named chunk
1380     merely to include this one.
1381   </description-long>
1382
1383   Function get_chunk_args() will accept two parameters, text being the text to
1384   parse, and values being an array to receive the parsed values as described
1385   above. The optional parameter path is used during recursion to build up the
1386   multi-dimensional array path.
1387
1388   <\nf-chunk|./fangle>
1389     <item>=\<less\>\\chunkref{get_chunk_args()}\<gtr\>
1390   </nf-chunk||>
1391
1392   <\nf-chunk|get_chunk_args()>
1393     <item>function get_tex_chunk_args(text, values,
1394
1395     <item> \ # optional parameters
1396
1397     <item> \ path, # hierarchical precursors
1398
1399     <item> \ # local vars
1400
1401     <item> \ a, name)
1402   </nf-chunk||>
1403
1404   The strategy is to parse the name, and then look for a value. If the value
1405   begins with a brace <verbatim|{>, then we recurse and consume as much of
1406   the text as necessary, returning the remaining text when we encounter a
1407   leading close-brace <verbatim|}>. This being the strategy --- and executed
1408   in a loop --- we realise that we must first look for the closing brace
1409   (perhaps preceded by white space) in order to terminate the recursion and
1410   return the remaining text.
1411
1412   <\nf-chunk|get_chunk_args()>
1413     <item>{
1414
1415     <item> \ split("", values);
1416
1417     <item> \ while(length(text)) {
1418
1419     <item> \ \ \ if (match(text, "^ *}(.*)", a)) {
1420
1421     <item> \ \ \ \ \ return a[1];
1422
1423     <item> \ \ \ }
1424
1425     <item> \ \ \ =\<less\>\\chunkref{parse-chunk-args}\<gtr\>
1426
1427     <item> \ }
1428
1429     <item> \ return text;
1430
1431     <item>}
1432   </nf-chunk||>
1433
1434   We can see that the text could be inspected with this regex:
1435
1436   <\nf-chunk|parse-chunk-args>
1437     <item>if (! match(text, " *([^,=]*[^,= ]) *(([,=]) *(([^,}]*) *,*
1438     *(.*))\|)$", a)) {
1439
1440     <item> \ return text;
1441
1442     <item>}
1443   </nf-chunk||>
1444
1445   and that <verbatim|a> will have the following values:
1446
1447   <tabular|<tformat|<cwith|2|7|1|2|cell-lborder|0.5pt>|<cwith|2|7|1|2|cell-rborder|0.5pt>|<cwith|2|7|1|2|cell-bborder|0.5pt>|<cwith|2|7|1|2|cell-tborder|0.5pt>|<cwith|1|1|1|2|cell-lborder|0.5pt>|<cwith|1|1|1|2|cell-rborder|0.5pt>|<cwith|1|1|1|2|cell-bborder|0.5pt>|<cwith|1|1|1|2|cell-tborder|0.5pt>|<table|<row|<cell|a[n]>|<cell|assigned
1448   text>>|<row|<cell|1>|<cell|name>>|<row|<cell|2>|<cell|=freddie,
1449   foo={bar=baz, quux={quirk, a=fleeg}}, etc>>|<row|<cell|3>|<cell|=>>|<row|<cell|4>|<cell|freddie,
1450   foo={bar=baz, quux={quirk, a=fleeg}}, etc>>|<row|<cell|5>|<cell|freddie>>|<row|<cell|6>|<cell|foo={bar=baz,
1451   quux={quirk, a=fleeg}}, etc>>>>>
1452
1453   <verbatim|a[3]> will be either <verbatim|=> or <verbatim|,> and signify
1454   whether the option named in <verbatim|a[1]> has a value or not
1455   (respectively).
1456
1457   If the option does have a value, then if the expression
1458   <verbatim|substr(a[4],1,1)> returns a brace <verbatim|{> it will signify
1459   that we need to recurse:
1460
1461   <\nf-chunk|parse-chunk-args>
1462     <item>name=a[1];
1463
1464     <item>if (a[3] == "=") {
1465
1466     <item> \ if (substr(a[4],1,1) == "{") {
1467
1468     <item> \ \ \ text = get_tex_chunk_args(substr(a[4],2), values, path name
1469     SUBSEP);
1470
1471     <item> \ } else {
1472
1473     <item> \ \ \ values[path name]=a[5];
1474
1475     <item> \ \ \ text = a[6];
1476
1477     <item> \ }
1478
1479     <item>} else {
1480
1481     <item> \ values[path name]="";
1482
1483     <item> \ text = a[2];
1484
1485     <item>}
1486   </nf-chunk||>
1487
1488   We can test this function like this:
1489
1490   <\nf-chunk|gca-test.awk>
1491     <item>=\<less\>\\chunkref{get_chunk_args()}\<gtr\>
1492
1493     <item>BEGIN {
1494
1495     <item> \ SUBSEP=".";
1496
1497     <item>
1498
1499     <item> \ print get_tex_chunk_args("name=freddie, foo={bar=baz,
1500     quux={quirk, a=fleeg}}, etc", a);
1501
1502     <item> \ for (b in a) {
1503
1504     <item> \ \ \ print "a[" b "] =\<gtr\> " a[b];
1505
1506     <item> \ }
1507
1508     <item>}
1509   </nf-chunk||>
1510
1511   which should give this output:
1512
1513   <\nf-chunk|gca-test.awk-results>
1514     <item>a[foo.quux.quirk] =\<gtr\>\
1515
1516     <item>a[foo.quux.a] =\<gtr\> fleeg
1517
1518     <item>a[foo.bar] =\<gtr\> baz
1519
1520     <item>a[etc] =\<gtr\>\
1521
1522     <item>a[name] =\<gtr\> freddie
1523   </nf-chunk||>
1524
1525   <section|Parsing chunk arguments><label|Chunk Arguments>
1526
1527   Arguments to parameterized chunks are expressed in round brackets as a comma
1528   separated list of optional arguments. For example, a chunk that is defined
1529   with:
1530
1531   <verbatim|\\Chunk{achunk, params=name ; address}>
1532
1533   could be invoked as:
1534
1535   <verbatim|\\chunkref{achunk}(John Jones, jones@example.com)>
1536
1537   An argument list may be as simple as in <verbatim|\\chunkref{pull}(thing,
1538   otherthing)> or as complex as:
1539
1540   <verbatim|\\chunkref{pull}(things[x, y], get_other_things(a, "(all)"))>
1541
1542   --- which for all its commas and quotes and parentheses represents only
1543   two parameters: <verbatim|things[x, y]> and <verbatim|get_other_things(a,
1544   "(all)")>.
1545
1546   If we simply split the parameter list on commas, then the comma in
1547   <verbatim|things[x, y]> would split it into two separate arguments:
1548   <verbatim|things[x> and <verbatim|y]> --- neither of which makes sense on
1549   its own.
1550
1551   One way to prevent this would be by refusing to split text between matching
1552   delimiters, such as <verbatim|[>, <verbatim|]>, <verbatim|(>, <verbatim|)>,
1553   <verbatim|{>, <verbatim|}> and most likely also <verbatim|">, <verbatim|">
1554   and <verbatim|'>, <verbatim|'>. Of course this also makes it impossible to
1555   pass such mis-matched code fragments as parameters, but I think that it
1556   would be hard for readers to cope with authors who would pass such
1557   unbalanced code fragments as chunk parameters<\footnote>
1558     I know that I couldn't cope with users doing such things, and although
1559     the GPL3 license prevents me from actually forbidding anyone from trying,
1560     if they want it to work they'll have to write the code themselves and not
1561     expect any support from me.
1562   </footnote>.
1563
1564   Unfortunately, the full set of matching delimiters may vary from language
1565   to language. In certain C++ template contexts, <verbatim|\<less\>> and
1566   <verbatim|\<gtr\>> would count as delimiters, and yet in other contexts
1567   they would not.
1568
1569   This puts me in the unfortunate position of having to somewhat parse all
1570   programming languages without knowing what they are!
1571
1572   However, if this universal mode-tracking is possible, then parsing the
1573   arguments would be trivial. Such a mode tracker is described in chapter
1574   <reference|modes> and used here with simplicity.
1575
1576   <\nf-chunk|parse_chunk_args>
1577     <item>function parse_chunk_args(language, text, values, mode,
1578
1579     <item> \ # local vars
1580
1581     <item> \ c, context, rest)
1582
1583     <item>{
1584
1585     <item> \ =\<less\>\\chunkref{new-mode-tracker}(context, language,
1586     mode)\<gtr\>
1587
1588     <item> \ rest = mode_tracker(context, text, values);
1589
1590     <item> \ # extract values
1591
1592     <item> \ for(c=1; c \<less\>= context[0, "values"]; c++) {
1593
1594     <item> \ \ \ values[c] = context[0, "values", c];
1595
1596     <item> \ }
1597
1598     <item> \ return rest;
1599
1600     <item>}
1601   </nf-chunk||>
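  To make the idea of refusing to split between matching delimiters concrete, here is a much simplified sketch. It is not fangle's mode tracker: the names split_args_demo and split_args are hypothetical, it knows only round, square and curly brackets plus double-quoted strings with backslash escapes, and it ignores the language entirely.

```shell
#! /bin/sh
# Split an argument list on commas, but only on commas that are outside
# any bracket pair and outside any double-quoted string.
split_args_demo() {
  awk 'function split_args(text, out,   i, c, depth, quote, n, cur) {
    n = 0; depth = 0; quote = ""; cur = "";
    for (i = 1; i <= length(text); i++) {
      c = substr(text, i, 1);
      if (quote != "") {
        if (c == "\\") {             # backslash mode: keep the next char
          cur = cur c substr(text, i + 1, 1); i++;
          continue;
        }
        if (c == quote) quote = "";  # the closing quote ends string mode
      } else if (c == "\"") {
        quote = c;                   # enter string mode
      } else if (index("([{", c)) {
        depth++;                     # enter a bracket mode
      } else if (index(")]}", c)) {
        depth--;                     # leave a bracket mode
      } else if (c == "," && depth == 0) {
        out[++n] = cur; cur = "";    # a top-level comma ends an argument
        continue;
      }
      cur = cur c;
    }
    out[++n] = cur;
    for (i = 1; i <= n; i++) sub(/^ +/, "", out[i]);
    return n;
  }
  BEGIN {
    n = split_args("things[x, y], get_other_things(a, \"(all)\")", args);
    for (i = 1; i <= n; i++) print i ": " args[i];
  }'
}

split_args_demo
```

  A single nesting counter is enough for this example, but it cannot tell which delimiters a particular language really uses, which is the job of the mode tracker.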
1602
1603   <section|Expanding parameters in the text>
1604
1605   Within the body of the chunk, the parameters are referred to with:
1606   <verbatim|${name}> and <verbatim|${address}>. There is a strong case that a
1607   <LaTeX> style notation should be used, like <verbatim|\\param{name}> which
1608   would be expressed in the listing as <verbatim|=\<less\>\\param{name}\<gtr\>>
1609   and be rendered as <verbatim|<nf-arg|name>>. Such notation would make me go
1610   blind, but I do intend to adopt it.
1611
1612   We therefore need a function <verbatim|expand_chunk_args> which will take a
1613   block of text, a list of permitted parameters, and the arguments which must
1614   substitute for the parameters.\
1615
1616   Here we split the text on <verbatim|${> which means that all parts except
1617   the first will begin with a parameter name which will be terminated by
1618   <verbatim|}>. The split function will consume the literal <verbatim|${> in
1619   each case.
1620
1621   <\nf-chunk|expand_chunk_args()>
1622     <item>function expand_chunk_args(text, params, args, \
1623
1624     <item> \ p, text_array, next_text, v, t, l)
1625
1626     <item>{
1627
1628     <item> \ if (split(text, text_array, "\\\\${")) {
1629
1630     <item> \ \ \ <nf-ref|substitute-chunk-args|>
1631
1632     <item> \ }
1633
1634     <item>
1635
1636     <item> \ return text;
1637
1638     <item>}
1639   </nf-chunk||>
1640
1641   First, we produce an associative array of substitution values indexed by
1642   parameter names. This will serve as a cache, allowing us to look up the
1643   replacement values as we extract each name.
1644
1645   <\nf-chunk|substitute-chunk-args>
1646     <item>for(p in params) {
1647
1648     <item> \ v[params[p]]=args[p];
1649
1650     <item>}
1651   </nf-chunk||>
1652
1653   We accumulate substituted text in the variable text. As the first element
1654   produced by the split function is the part before the delimiter --- which is
1655   <verbatim|${> in our case --- this part will never contain a parameter
1656   reference, so we assign this directly to the result kept in
1657   <verbatim|text>.
1658
1659   <\nf-chunk|substitute-chunk-args>
1660     <item>text=text_array[1];
1661   </nf-chunk||>
1662
1663   We then iterate over the remaining values in the array<\footnote>
1664     I don't know why I think that it will enumerate the array in order, but
1665     it seems to work
1666   </footnote><todo|fix or prove it>, and replace each reference with its
1667   argument.
1668
1669   <\nf-chunk|substitute-chunk-args>
1670     <item>for(t=2; t in text_array; t++) {
1671
1672     <item> \ =\<less\>\\chunkref{substitute-chunk-arg}\<gtr\>
1673
1674     <item>}
1675   </nf-chunk||>
1676
1677   After the split on <verbatim|${> a valid parameter reference will consist
1678   of a valid parameter name terminated by a close-brace <verbatim|}>. A valid
1679   parameter name begins with the underscore or a letter, and may contain
1680   letters, digits or underscores.
1681
1682   A valid looking reference that is not actually the name of a parameter will
1683   be left alone and not substituted. This is good because there is nothing to substitute
1684   anyway, and it avoids clashes when writing code for languages where
1685   <verbatim|${...}> is a valid construct --- such constructs will not be
1686   interfered with unless the parameter name also matches.
1687
1688   <\nf-chunk|substitute-chunk-arg>
1689     <item>if (match(text_array[t], "^([a-zA-Z_][a-zA-Z0-9_]*)}", l) &&
1690
1691     <item> \ \ \ l[1] in v)\
1692
1693     <item>{
1694
1695     <item> \ text = text v[l[1]] substr(text_array[t], length(l[1])+2);
1696
1697     <item>} else {
1698
1699     <item> \ text = text "${" text_array[t];
1700
1701     <item>}
1702   </nf-chunk||>
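  The substitution rules can be tried out with the following sketch. It is a portable restatement and not the chunk above: it uses RSTART/RLENGTH where fangle's code uses the three-argument match of gawk, and the wrapper name expand_demo and the sample text are invented for the example.

```shell
#! /bin/sh
# Expand ${name} and ${address}, and leave the unknown ${HOME} untouched.
expand_demo() {
  awk 'function expand_chunk_args(text, params, args,   p, parts, n, t, v, name, rest) {
    for (p in params) v[params[p]] = args[p];   # parameter name -> argument
    n = split(text, parts, /[$][{]/);
    text = parts[1];                 # the text before the first ${
    for (t = 2; t <= n; t++) {
      if (match(parts[t], /^[a-zA-Z_][a-zA-Z0-9_]*[}]/)) {
        name = substr(parts[t], 1, RLENGTH - 1);
        rest = substr(parts[t], RLENGTH + 1);
        if (name in v) { text = text v[name] rest; continue; }
      }
      text = text "${" parts[t];     # not a parameter: restore the ${
    }
    return text;
  }
  BEGIN {
    split("name;address", params, ";");
    split("John Jones;jones@example.com", args, ";");
    print expand_chunk_args("Dear ${name} at ${address}, see ${HOME}", params, args);
  }'
}

expand_demo
```

  The loop here runs over the numeric indices returned by split, so the parts are visited in order regardless of how awk enumerates arrays.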
1703
1704   <chapter|Language Modes & Quoting><label|modes>
1705
1706   <section|Modes>
1707
1708   <verbatim|lstlistings> and <verbatim|fangle> both recognize source
1709   languages, and perform some basic parsing. <verbatim|lstlistings> can
1710   detect strings and comments within a language definition and perform
1711   suitable rendering, such as italics for comments, and visible-spaces within
1712   strings.
1713
1714   Fangle can similarly recognize strings, comments, etc. within a
1715   language, so that any chunks included with <verbatim|\\chunkref> can be
1716   suitably escaped or quoted.
1717
1718   <subsection|Modes to keep code together>
1719
1720   As an example, in the C language there are a few parse modes, affecting the
1721   interpretation of characters.
1722
1723   One parse mode is the string mode. The string mode is commenced by an
1724   un-escaped quotation mark <verbatim|"> and terminated by the same. Within
1725   the string mode, only one additional mode can be commenced: the
1726   backslash mode <verbatim|\\>, which is always terminated after the following
1727   character.
1728
1729   Another mode is <verbatim|[> which is terminated by a <verbatim|]> (unless
1730   it occurs in a string).
1731
1732   Consider this fragment of C code:
1733
1734   \;
1735
1736   <math|things<wide|<around|[|x, y|]>|\<wide-overbrace\>><rsup|1. [ mode>,
1737   get_other_things<wide|<around|(|a, <wide*|<text|"><around|(|all|)><text|">|\<wide-underbrace\>><rsub|3.
1738   " mode>|)>|\<wide-overbrace\>><rsup|2. ( mode>>
1739
1740   \;
1741
1742   Mode nesting prevents the close parenthesis in the quoted string (part 3)
1743   from terminating the parenthesis mode (part 2).
1744
1745   Each language has a set of modes, the default mode being the null mode.
1746   Each mode can lead to other modes.
1747
1748   <subsection|Modes affect included chunks>
1749
1750   For instance, consider this chunk with language=perl:
1751
1752   <nf-chunk|example-perl|print "hello world $0\\n";|perl|>
1753
1754   If it were included in a chunk with <verbatim|language=sh>, like this:
1755
1756   <nf-chunk|example-sh|perl -e "=\<less\>\\chunkref{example-perl}\<gtr\>"|sh|>
1757
1758   fangle would <em|want> to generate output like this:
1759
1760   <verbatim|perl -e "print \\"hello world \\$0\\\\n\\";" >
1761
1762   See that the double quote <verbatim|">, back-slash <verbatim|\\> and
1763   <verbatim|$> have been quoted with a back-slash to protect them from shell
1764   interpretation.
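
  The quoting shown above can be reproduced with a small sketch. It is written in Python, not fangle's awk, and <verbatim|escape_for_sh_double_quotes> is a made-up name; it protects only the three characters discussed here and assumes nothing else in the text is special to the shell.

```python
def escape_for_sh_double_quotes(text):
    # Backslash goes first so the backslashes added below are not escaped again.
    for ch in ("\\", '"', "$"):
        text = text.replace(ch, "\\" + ch)
    return text

perl = 'print "hello world $0\\n";'      # the example-perl chunk
print('perl -e "' + escape_for_sh_double_quotes(perl) + '"')
# prints: perl -e "print \"hello world \$0\\n\";"
```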
1765
1766   If that were then included in a chunk with language=make, like this:
1767
1768   <\nf-chunk|example-makefile>
1769     <item>target: pre-req
1770
1771     <item><htab|5mm>=\<less\>\\chunkref{example-sh}\<gtr\>
1772   </nf-chunk|make|>
1773
1774   We would need the output to look like this --- note the <verbatim|$$>:
1775
1776   <\verbatim>
1777     target: pre-req
1778
1779     \ \ \ \ \ \ \ \ perl -e "print \\"hello world \\$$0\\\\n\\";"
1780   </verbatim>
1781
1782   In order to make this work, we need to define a mode-tracker supporting
1783   each language, that can detect the various quoting modes, and provide a
1784   transformation that must be applied to any included text so that included
1785   text will be interpreted correctly after any interpolation that it may be
1786   subject to at run-time.
1787
1788   For example, the sed transformation for text to be inserted into shell
1789   double-quoted strings would be something like:
1790
1791   <verbatim|s/\\\\/\\\\\\\\/g;s/\\$/\\\\$/g;s/"/\\\\"/g;>
1792
1793   which protects <verbatim|\\ $ ">.
1794
1795   <todo|I don't think this example is true>The mode tracker must also track
1796   nested mode-changes, as in this sh example.
1797
1798   <verbatim|echo "hello `id ...`">
1799
1800   <phantom|<verbatim|echo "hello `id >><math|\<uparrow\>>
1801
1802   Any characters inserted at the point marked <math|\<uparrow\>> would need
1803   to be escaped, including <verbatim|`> <verbatim|\|> <verbatim|*> among
1804   others. First it would need escaping for the back-ticks <verbatim|`>, and
1805   then for the double-quotes <verbatim|">.
1806
1807   <todo|MAYBE>Escaping need not occur if the format and mode of the included
1808   chunk matches that of the including chunk.
1809
1810   As each chunk is output a new mode tracker for that language is initialized
1811   in its normal state. As text is output for that chunk the output mode is
1812   tracked. When a new chunk is included, a transformation appropriate to that
1813   mode is selected and pushed onto a stack of transformations. Any text to be
1814   output is first passed through this stack of transformations.
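
  A minimal sketch of such a stack, in Python rather than awk, follows. The function names are hypothetical, and the order of application (innermost transformation first) is my assumption about what the design needs; it is the order that reproduces the makefile example above.

```python
def sh_double_quote(text):
    # Protect backslash, double quote and dollar inside a shell "..." string.
    for ch in ("\\", '"', "$"):
        text = text.replace(ch, "\\" + ch)
    return text

def make_recipe(text):
    # make expands "$", so a literal dollar has to be written "$$".
    return text.replace("$", "$$")

def emit(text, transforms):
    # The transformation pushed last belongs to the innermost including
    # chunk, so it is applied first (assumed ordering).
    for transform in reversed(transforms):
        text = transform(text)
    return text

stack = [make_recipe, sh_double_quote]    # pushed in order of inclusion
print(emit('print "hello world $0\\n";', stack))
# prints: print \"hello world \$$0\\n\";
```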
1815
1816   It remains to consider if the chunk-include function should return its
1817   generated text so that the caller can apply any transformations (and
1818   formatting), or if it should apply the stack of transformations itself.
1819
1820   Note that the transformed text should have the property of not being able
1821   to change the mode in the current chunk.
1822
1823   <todo|Note chunk parameters should probably also be transformed>
1824
1825   <section|Language Mode Definitions>
1826
1827   All modes are stored in a single multi-dimensional hash. The first index is
1828   the language, and the second index is the mode-identifier. The third
1829   index is the property: terminators and, optionally, submodes and delimiters.
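
  To picture the layout, here is a rough Python analogue using a dictionary keyed by tuples. This is only an illustration: the values are example regexes, <verbatim|lookup> is a made-up helper, and fangle itself uses an awk array with composite subscripts.

```python
# Hypothetical mirror of the awk array modes[language, mode, property].
modes = {
    ("c-like", "", "delimiters"): " *, *",
    ("c-like", "(", "delimiters"): " *, *",
    ("c-like", "(", "terminators"): r"\)",
    ("c-like", '"', "terminators"): '"',
}

def lookup(language, mode, prop):
    # An absent key means the mode does not define that property.
    return modes.get((language, mode, prop), "")

print(lookup("c-like", "(", "terminators"))        # the regex \)
print(repr(lookup("c-like", '"', "delimiters")))   # '' as strings have none
```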
1830
1831   A useful set of mode definitions for a nameless general C-type language is
1832   shown here. (Don't be confused by the double backslash escaping needed in
1833   awk. One set of escaping is for the string, and the second set of escaping
1834   is for the regex).
1835
1836   <\todo>
1837     TODO: Add =\<less\>\\mode{}\<gtr\> command which will allow us to signify
1838     that a string is
1839
1840     \ regex and thus fangle will quote it for us.
1841   </todo>
1842
1843   Submodes are entered by the characters \ <verbatim|"> <verbatim|'>
1844   <verbatim|{> <verbatim|(> <verbatim|[> <verbatim|/*>
1845
1846   <\nf-chunk|common-mode-definitions>
1847     <item>modes[${language}, "", \ "submodes"]="\\\\\\\\\|\\"\|'\|{\|\\\\(\|\\\\[";
1848   </nf-chunk||<tuple|language>>
1849
1850   In the default mode, a comma surrounded by un-important white space is a
1851   delimiter of language items<\footnote>
1852     whatever a <em|language item> might be
1853   </footnote>.
1854
1855   <\nf-chunk|common-mode-definitions>
1856     <item>modes[${language}, "", \ "delimiters"]=" *, *";
1857   </nf-chunk||language>
1858
1859   and should pass this test:<todo|Why do the tests run in "(" mode and not ""
1860   mode>
1861
1862   <\nf-chunk|test:mode-definitions>
1863     <item>parse_chunk_args("c-like", "1,2,3", a, "");
1864
1865     <item>if (a[1] != "1") e++;
1866
1867     <item>if (a[2] != "2") e++;
1868
1869     <item>if (a[3] != "3") e++;
1870
1871     <item>if (length(a) != 3) e++;
1872
1873     <item>=\<less\>\\chunkref{pca-test.awk:summary}\<gtr\>
1874
1875     <item>
1876
1877     <item>parse_chunk_args("c-like", "joe, red", a, "");
1878
1879     <item>if (a[1] != "joe") e++;
1880
1881     <item>if (a[2] != "red") e++;
1882
1883     <item>if (length(a) != 2) e++;
1884
1885     <item>=\<less\>\\chunkref{pca-test.awk:summary}\<gtr\>
1886
1887     <item>
1888
1889     <item>parse_chunk_args("c-like", "${colour}", a, "");
1890
1891     <item>if (a[1] != "${colour}") e++;
1892
1893     <item>if (length(a) != 1) e++;
1894
1895     <item>=\<less\>\\chunkref{pca-test.awk:summary}\<gtr\>
1896   </nf-chunk||>
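
  The effect of the delimiter pattern on these three inputs can be seen in isolation with a short Python sketch. It deliberately ignores sub-modes, so it is not a replacement for <verbatim|parse_chunk_args>; <verbatim|split_items> is a name invented here.

```python
import re

DELIMITER = re.compile(" *, *")    # same pattern as the "delimiters" entry

def split_items(text):
    # Split on a comma together with any spaces around it.
    return DELIMITER.split(text)

print(split_items("1,2,3"))        # ['1', '2', '3']
print(split_items("joe, red"))     # ['joe', 'red']
print(split_items("${colour}"))    # ['${colour}']
```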
1897
1898   Nested modes are identified by a backslash, a double or single quote,
1899   various bracket styles or a <verbatim|/*> comment.
1900
1901   For each of these sub-modes we must also identify a mode
1902   terminator, and any sub-modes or delimiters that may be entered<\footnote>
1903     Because we are using the sub-mode characters as the mode identifier it
1904     means we can't currently have a mode character dependent on its context;
1905     i.e. <verbatim|{> can't behave differently when it is inside
1906     <verbatim|[>.
1907   </footnote>.
1908
1909   <subsection|Backslash>
1910
1911   The backslash mode has no submodes or delimiters, and is terminated by any
1912   character. Note that we are not so much interested in evaluating or
1913   interpolating content as we are in delineating content. It is no matter
1914   that a double backslash (<verbatim|\\\\>) may represent a single backslash
1915   while a backslash-newline may represent white space, but it does matter
1916   that the newline in a backslash newline should not be able to terminate a C
1917   pre-processor statement; and so the newline will be consumed by the
1918   backslash however it is to be interpreted.
1919
1920   <\nf-chunk|common-mode-definitions>
1921     <item>modes[${language}, "\\\\", "terminators"]=".";
1922   </nf-chunk||>
1923
1924   <subsection|Strings>
1925
1926   Common languages support two kinds of string quoting, double quotes and
1927   single quotes.
1928
1929   In a string we have one special mode, which is the backslash. This may
1930   escape an embedded quote and prevent us thinking that it should terminate
1931   the string.
1932
1933   <\nf-chunk|mode:common-string>
1934     <item>modes[${language}, ${quote}, "submodes"]="\\\\\\\\";
1935   </nf-chunk||<tuple|language|quote>>
1936
1937   Otherwise, the string will be terminated by the same character that
1938   commenced it.
1939
1940   <\nf-chunk|mode:common-string>
1941     <item>modes[${language}, ${quote}, "terminators"]=${quote};
1942   </nf-chunk||language>
1943
1944   In C type languages, certain escape sequences exist in strings. We need to
1945   define a mechanism to encode any chunks included in this mode using those
1946   escape sequences. These are expressed in two parts, s meaning search, and r
1947   meaning replace.
1948
1949   The first substitution is to replace a backslash with a double backslash.
1950   We do this first as other substitutions may introduce a backslash which we
1951   would not then want to escape again here.
1952
1953   Note: Backslashes need double-escaping in the search pattern but not in the
1954   replacement string, hence we are replacing a literal <verbatim|\\> with a
1955   literal <verbatim|\\\\>.
1956
1957   <\nf-chunk|mode:common-string>
1958     <item>escapes[${language}, ${quote}, ++escapes[${language}, ${quote}],
1959     "s"]="\\\\\\\\";
1960
1961     <item>escapes[${language}, ${quote}, \ \ escapes[${language}, ${quote}],
1962     "r"]="\\\\\\\\";
1963   </nf-chunk||language>
1964
1965   If the quote character occurs in the text, it should be preceded by a
1966   backslash, otherwise it would terminate the string unexpectedly.
1967
1968   <\nf-chunk|mode:common-string>
1969     <item>escapes[${language}, ${quote}, ++escapes[${language}, ${quote}],
1970     "s"]=${quote};
1971
1972     <item>escapes[${language}, ${quote}, \ \ escapes[${language}, ${quote}],
1973     "r"]="\\\\" ${quote};
1974   </nf-chunk||language>
1975
1976   Any newlines in the string must be replaced by <verbatim|\\n>.
1977
1978   <\nf-chunk|mode:common-string>
1979     <item>escapes[${language}, ${quote}, ++escapes[${language}, ${quote}],
1980     "s"]="\\n";
1981
1982     <item>escapes[${language}, ${quote}, \ \ escapes[${language}, ${quote}],
1983     "r"]="\\\\n";
1984   </nf-chunk||language>
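
  The importance of the ordering can be checked with a small Python sketch. It uses literal string replacement where fangle uses awk regexes, and the names <verbatim|C_STRING_ESCAPES> and <verbatim|apply_escapes> are assumptions made for this illustration only.

```python
# (search, replace) pairs in the order the chunks above register them:
# backslash, then the quote character, then newline.
C_STRING_ESCAPES = [("\\", "\\\\"), ('"', '\\"'), ("\n", "\\n")]

def apply_escapes(text, escapes):
    for search, replace in escapes:
        text = text.replace(search, replace)    # literal, not regex
    return text

print(apply_escapes('say "hi"\n', C_STRING_ESCAPES))
# prints: say \"hi\"\n

# Applying the backslash rule last would escape the backslashes that the
# other two rules had just introduced.
wrong_order = list(reversed(C_STRING_ESCAPES))
print(apply_escapes('say "hi"\n', wrong_order))
```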
1985
1986   For the common modes, we define this string handling for double and single
1987   quotes.
1988
1989   <\nf-chunk|common-mode-definitions>
1990     <item>=\<less\>\\chunkref{mode:common-string}(${language},
1991     "\\textbackslash{}"")\<gtr\>
1992
1993     <item>=\<less\>\\chunkref{mode:common-string}(${language}, "'")\<gtr\>
1994   </nf-chunk||>
1995
1996   Working strings should pass this test:
1997
1998   <\nf-chunk|test:mode-definitions>
1999     <item>parse_chunk_args("c-like", "say \\"I said, \\\\\\"Hello, how are
2000     you\\\\\\".\\", for me", a, "");
2001
2002     <item>if (a[1] != "say \\"I said, \\\\\\"Hello, how are you\\\\\\".\\"")
2003     e++;
2004
2005     <item>if (a[2] != "for me") e++;
2006
2007     <item>if (length(a) != 2) e++;
2008
2009     <item>=\<less\>\\chunkref{pca-test.awk:summary}\<gtr\>
2010   </nf-chunk||>
2011
2012   <subsection|Parentheses, Braces and Brackets>
2013
2014   Where quotes are closed by the same character, parentheses, brackets and
2015   braces are closed by an alternate character.
2016
2017   <\nf-chunk|mode:common-brackets>
2018     <item>modes[<nf-arg|language>, <nf-arg|open>, \ "submodes"
2019     ]="\\\\\\\\\|\\"\|{\|\\\\(\|\\\\[\|'\|/\\\\*";
2020
2021     <item>modes[<nf-arg|language>, <nf-arg|open>, \ "delimiters"]=" *, *";
2022
2023     <item>modes[<nf-arg|language>, <nf-arg|open>,
2024     \ "terminators"]=<nf-arg|close>;
2025   </nf-chunk||<tuple|language|open|close>>
2026
2027   Note that the open is NOT a regex but the close token IS. <todo|When we can
2028   quote regex we won't have to put the slashes in here>
2029
2030   <\nf-chunk|common-mode-definitions>
2031     <item>=\<less\>\\chunkref{mode:common-brackets}(${language}, "{",
2032     "}")\<gtr\>
2033
2034     <item>=\<less\>\\chunkref{mode:common-brackets}(${language}, "[",
2035     "\\textbackslash{}\\textbackslash{}]")\<gtr\>
2036
2037     <item>=\<less\>\\chunkref{mode:common-brackets}(${language}, "(",
2038     "\\textbackslash{}\\textbackslash{})")\<gtr\>
2039   </nf-chunk||>
2040
2041   <subsection|Customizing Standard Modes>
2042
2043   <\nf-chunk|mode:add-submode>
2044     <item>modes[${language}, ${mode}, "submodes"] = modes[${language},
2045     ${mode}, "submodes"] "\|" ${submode};
2046   </nf-chunk||<tuple|language|mode|submode>>
2047
2048   <\nf-chunk|mode:add-escapes>
2049     <item>escapes[${language}, ${mode}, ++escapes[${language}, ${mode}],
2050     "s"]=${search};
2051
2052     <item>escapes[${language}, ${mode}, \ \ escapes[${language}, ${mode}],
2053     "r"]=${replace};
2054   </nf-chunk||<tuple|language|mode|search|replace>>
2055
2056   \;
2057
2058   <subsection|Comments>
2059
2060   We can define <verbatim|/* comment */> style comments and
2061   <verbatim|//comment> style comments to be added to any language:
2062
2063   <\nf-chunk|mode:multi-line-comments>
2064     <item>=\<less\>\\chunkref{mode:add-submode}(${language}, "",
2065     "/\\textbackslash{}\\textbackslash{}*")\<gtr\>
2066
2067     <item>modes[${language}, "/*", "terminators"]="\\\\*/";
2068   </nf-chunk||<tuple|language>>
2069
2070   <\nf-chunk|mode:single-line-slash-comments>
2071     <item>=\<less\>\\chunkref{mode:add-submode}(${language}, "", "//")\<gtr\>
2072
2073     <item>modes[${language}, "//", "terminators"]="\\n";
2074
2075     <item>=\<less\>\\chunkref{mode:add-escapes}(${language}, "//",
2076     "\\textbackslash{}n", "\\textbackslash{}n//")\<gtr\>
2077   </nf-chunk||language>
2078
2079   We can also define <verbatim|# comment> style comments (as used in awk and
2080   shell scripts) in a similar manner.
2081
2082   <todo|I'm having to use <verbatim|\\#> for hash and <verbatim|\\textbackslash{}> for <verbatim|\\>, and have
2083   hacky work-arounds in the parser for now>
2084
2085   <\nf-chunk|mode:add-hash-comments>
2086     <item>=\<less\>\\chunkref{mode:add-submode}(${language}, "",
2087     "\\#")\<gtr\>
2088
2089     <item>modes[${language}, "#", "terminators"]="\\n";
2090
2091     <item>=\<less\>\\chunkref{mode:add-escapes}(${language}, "\\#",
2092     "\\textbackslash{}n", "\\textbackslash{}n\\#")\<gtr\>
2093   </nf-chunk||<tuple|language>>
2094
2095   In C, the <verbatim|#> denotes pre-processor directives, which can be
2096   multi-line:
2097
2098   <\nf-chunk|mode:add-hash-defines>
2099     <item>=\<less\>\\chunkref{mode:add-submode}(${language}, "",
2100     "\\#")\<gtr\>
2101
2102     <item>modes[${language}, "#", "submodes" ]="\\\\\\\\";
2103
2104     <item>modes[${language}, "#", "terminators"]="\\n";
2105
2106     <item>=\<less\>\\chunkref{mode:add-escapes}(${language}, "\\#",
2107     "\\textbackslash{}n", "\\textbackslash{}\\textbackslash{}\\textbackslash{}\\textbackslash{}\\textbackslash{}n")\<gtr\>
2108   </nf-chunk||<tuple|language>>
2109
2110   <\nf-chunk|mode:quote-dollar-escape>
2111     <item>escapes[${language}, ${quote}, ++escapes[${language}, ${quote}],
2112     "s"]="\\\\$";
2113
2114     <item>escapes[${language}, ${quote}, \ \ escapes[${language}, ${quote}],
2115     "r"]="\\\\$";
2116   </nf-chunk||<tuple|language|quote>>
2117
2118   We can add these definitions to various languages:
2119
2120   <\nf-chunk|mode-definitions>
2121     <item><nf-ref|common-mode-definitions|<tuple|"c-like">>
2122
2123     <item>
2124
2125     <item><nf-ref|common-mode-definitions|<tuple|"c">>
2126
2127     <item>=\<less\>\\chunkref{mode:multi-line-comments}("c")\<gtr\>
2128
2129     <item>=\<less\>\\chunkref{mode:single-line-slash-comments}("c")\<gtr\>
2130
2131     <item>=\<less\>\\chunkref{mode:add-hash-defines}("c")\<gtr\>
2132
2133     <item>
2134
2135     <item>=\<less\>\\chunkref{common-mode-definitions}("awk")\<gtr\>
2136
2137     <item>=\<less\>\\chunkref{mode:add-hash-comments}("awk")\<gtr\>
2138
2139     <item>=\<less\>\\chunkref{mode:add-naked-regex}("awk")\<gtr\>
2140   </nf-chunk||>
2141
2142   The awk definitions should allow a comment block like this:
2143
2144   <nf-chunk|test:comment-quote|<item># Comment:
2145   =\<less\>\\chunkref{test:comment-text}\<gtr\>|awk|>
2146
2147   <\nf-chunk|test:comment-text>
2148     <item>Now is the time for
2149
2150     <item>the quick brown fox to bring lemonade
2151
2152     <item>to the party
2153   </nf-chunk||>
2154
2155   to come out like this:
2156
2157   <\nf-chunk|test:comment-quote:result>
2158     <item># Comment: Now is the time for
2159
2160     <item>#the quick brown fox to bring lemonade
2161
2162     <item>#to the party
2163   </nf-chunk||>
2164
2165   The C definition for such a block should have it come out like this:
2166
2167   <\nf-chunk|test:comment-quote:C-result>
2168     <item># Comment: Now is the time for\\
2169
2170     <item>the quick brown fox to bring lemonade\\
2171
2172     <item>to the party
2173   </nf-chunk||>
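
  Both expected results amount to replacing each newline of the included text with something that keeps the following line inside the same mode. A minimal Python sketch of that idea follows; <verbatim|continue_mode> is a hypothetical name, and the replacement strings are read off the two results above rather than taken from fangle's escape tables.

```python
def continue_mode(text, replacement):
    # Every newline inside the included text is replaced by `replacement`.
    return text.replace("\n", replacement)

body = ("Now is the time for\n"
        "the quick brown fox to bring lemonade\n"
        "to the party")

# awk or sh comment: start each continuation line with another "#".
print("# Comment: " + continue_mode(body, "\n#"))

# C pre-processor line: end each line with a backslash instead.
print("# Comment: " + continue_mode(body, "\\\n"))
```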
2174
2175   <subsection|Regex>
2176
2177   This pattern is incomplete. It is meant to detect naked regular expressions
2178   in awk and perl, e.g. <verbatim|/.*$/>, but the required capabilities are
2179   not yet present.
2180
2181   Currently it only detects regexes anchored with ^ as used in fangle.
2182
2183   For full regex support, modes need to be named not after their starting
2184   character, but some other more fully qualified name.
2185
2186   <\nf-chunk|mode:add-naked-regex>
2187     <item>=\<less\>\\chunkref{mode:add-submode}(${language}, "",
2188     "/\\textbackslash{}\\textbackslash{}\\^")\<gtr\>
2189
2190     <item>modes[${language}, "/^", "terminators"]="/";
2191   </nf-chunk||<tuple|language>>
2192
2193   <subsection|Perl>
2194
2195   <\nf-chunk|mode-definitions>
2196     <item>=\<less\>\\chunkref{common-mode-definitions}("perl")\<gtr\>
2197
2198     <item>=\<less\>\\chunkref{mode:multi-line-comments}("perl")\<gtr\>
2199
2200     <item>=\<less\>\\chunkref{mode:add-hash-comments}("perl")\<gtr\>
2201   </nf-chunk||>
2202
2203   Still need to add <verbatim|s/>, submode <verbatim|/>, terminate both
2204   with <verbatim|//>. This is likely to be impossible as perl regexes can
2205   contain perl.
2206
2207   <subsection|sh>
2208
2209   <\nf-chunk|mode-definitions>
2210     <item>=\<less\>\\chunkref{common-mode-definitions}("sh")\<gtr\>
2211
2212     <item>#\<less\>\\chunkref{mode:common-string}("sh",
2213     "\\textbackslash{}"")\<gtr\>
2214
2215     <item>#\<less\>\\chunkref{mode:common-string}("sh", "'")\<gtr\>
2216
2217     <item>=\<less\>\\chunkref{mode:add-hash-comments}("sh")\<gtr\>
2218
2219     <item>=\<less\>\\chunkref{mode:quote-dollar-escape}("sh", "\\"")\<gtr\>
2220   </nf-chunk||>
2221
2222   <section|Some tests>
2223
2224   Also, the parser must return any spare text at the end that has not been
2225   processed due to a mode terminator being found.
2226
2227   <\nf-chunk|test:mode-definitions>
2228     <item>rest = parse_chunk_args("c-like", "1, 2, 3) spare", a, "(");
2229
2230     <item>if (a[1] != 1) e++;
2231
2232     <item>if (a[2] != 2) e++;
2233
2234     <item>if (a[3] != 3) e++;
2235
2236     <item>if (length(a) != 3) e++;
2237
2238     <item>if (rest != " spare") e++;
2239
2240     <item>=\<less\>\\chunkref{pca-test.awk:summary}\<gtr\>
2241   </nf-chunk||>
2242
2243   We must also be able to parse the example given earlier.
2244
2245   <\nf-chunk|test:mode-definitions>
2246     <item>parse_chunk_args("c-like", "things[x, y], get_other_things(a,
2247     \\"(all)\\"), 99", a, "(");
2248
2249     <item>if (a[1] != "things[x, y]") e++;
2250
2251     <item>if (a[2] != "get_other_things(a, \\"(all)\\")") e++;
2252
2253     <item>if (a[3] != "99") e++;
2254
2255     <item>if (length(a) != 3) e++;
2256
2257     <item>=\<less\>\\chunkref{pca-test.awk:summary}\<gtr\>
2258   </nf-chunk||>
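
  For comparison, here is a compact Python sketch that passes the same cases. It is not a port of fangle's parser: it walks the text a character at a time with an explicit stack instead of building regexes, and <verbatim|split_args> and <verbatim|CLOSERS> are names invented for this illustration.

```python
CLOSERS = {"(": ")", "[": "]", "{": "}", '"': '"', "'": "'"}

def split_args(text, mode=""):
    """Return (items, rest); `mode` is the bracket already open, or ""."""
    stack = [mode]                  # stack[-1] is the current mode
    items, item = [], ""
    i = 0
    while i < len(text):
        ch = text[i]
        top = stack[-1]
        in_string = top in ('"', "'")
        if ch == "\\":
            # Backslash mode: swallow the next character, whatever it is.
            item += text[i:i + 2]
            i += 2
            continue
        if top and ch == CLOSERS[top]:
            stack.pop()
            if not stack:           # the initial mode has been terminated
                items.append(item.strip())
                return items, text[i + 1:]
            item += ch
        elif not in_string and ch in CLOSERS:
            stack.append(ch)        # enter a nested mode
            item += ch
        elif ch == "," and len(stack) == 1:
            items.append(item.strip())   # top-level delimiter
            item = ""
        else:
            item += ch
        i += 1
    items.append(item.strip())
    return items, ""

print(split_args("1, 2, 3) spare", "("))
# prints: (['1', '2', '3'], ' spare')
```

  The quoted <verbatim|(all)> never touches the stack because brackets are ignored while the top of the stack is a string mode.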
2259
2260   <section|A non-recursive mode tracker>
2261
2262   <subsection|Constructor>
2263
2264   The mode tracker holds its state in a stack based on a numerically indexed
2265   hash. This function, when passed an empty hash, will initialize it.
2266
2267   <\nf-chunk|new_mode_tracker()>
2268     <item>function new_mode_tracker(context, language, mode) {
2269
2270     <item> \ context[""] = 0;
2271
2272     <item> \ context[0, "language"] = language;
2273
2274     <item> \ context[0, "mode"] = mode;
2275
2276     <item>}
2277   </nf-chunk||>
2278
2279   Because awk functions cannot return an array, we must create the array
2280   first and pass it in, so we have a fangle macro to do this:
2281
2282   <\nf-chunk|new-mode-tracker>
2283     <item><nf-ref|awk-delete-array|<tuple|context>>
2284
2285     <item>new_mode_tracker(${context}, ${language}, ${mode});
2286   </nf-chunk|awk|<tuple|context|language|mode>>
2287
2288   <subsection|Management>
2289
2290   And for tracking modes, we dispatch to a mode-tracker action based on the
2291   current language:
2292
2293   <\nf-chunk|mode_tracker>
2294     <item>function push_mode_tracker(context, language, mode,
2295
2296     <item> \ # local vars
2297
2298     <item> \ top)
2299
2300     <item>{
2301
2302     <item> \ if (! ("" in context)) {
2303
2304     <item> \ \ \ <nf-ref|new-mode-tracker|<tuple|context|language|mode>>
2305
2306     <item> \ } else {
2307
2308     <item> \ \ \ top = context[""];
2309
2310     <item> \ \ \ if (context[top, "language"] == language && mode=="") mode =
2311     context[top, "mode"];
2312
2313     <item> \ \ \ top++;
2314
2315     <item> \ \ \ context[top, "language"] = language;
2316
2317     <item> \ \ \ context[top, "mode"] = mode;
2318
2319     <item> \ \ \ context[""] = top;
2320
2321     <item> \ }
2322
2323     <item>}
2324   </nf-chunk|awk|>
2325
2326   <\nf-chunk|mode_tracker>
2327     <item>function dump_mode_tracker(context, \
2328
2329     <item> \ c, d)
2330
2331     <item>{
2332
2333     <item> \ for(c=0; c \<less\>= context[""]; c++) {
2334
2335     <item> \ \ \ printf(" %2d \ \ %s:%s\\n", c, context[c, "language"],
2336     context[c, "mode"]) \<gtr\> "/dev/stderr";
2337
2338     <item> \ \ \ for(d=1; ( (c, "values", d) in context); d++) {
2339
2340     <item> \ \ \ \ \ printf(" \ \ %2d %s\\n", d, context[c, "values", d])
2341     \<gtr\> "/dev/stderr";
2342
2343     <item> \ \ \ }
2344
2345     <item> \ }
2346
2347     <item>}
2348   </nf-chunk||>
2349
2350   <\nf-chunk|mode_tracker>
2351     <item>function finalize_mode_tracker(context)
2352
2353     <item>{
2354
2355     <item> \ if ( ("" in context) && context[""] != 0) return 0;
2356
2357     <item> \ return 1;
2358
2359     <item>}
2360   </nf-chunk||>
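
  The three functions above translate almost line for line into Python, which may make the stack layout easier to see. This is a sketch for illustration only: a dictionary stands in for the awk hash, and the empty-string key holds the index of the top of the stack, as in the awk code.

```python
def new_mode_tracker(context, language, mode):
    context.clear()
    context[""] = 0                     # index of the top of the stack
    context[0, "language"] = language
    context[0, "mode"] = mode

def push_mode_tracker(context, language, mode=""):
    if "" not in context:
        new_mode_tracker(context, language, mode)
        return
    top = context[""]
    # Same language and no explicit mode: inherit the mode now in force.
    if context[top, "language"] == language and mode == "":
        mode = context[top, "mode"]
    top += 1
    context[top, "language"] = language
    context[top, "mode"] = mode
    context[""] = top

def finalize_mode_tracker(context):
    # True only when no nested mode is left open.
    return not ("" in context and context[""] != 0)

context = {}
push_mode_tracker(context, "c")         # creates level 0
print(finalize_mode_tracker(context))   # True
push_mode_tracker(context, "c", "(")    # an open parenthesis at level 1
print(finalize_mode_tracker(context))   # False
```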
2361
2362   This implies that any chunk must be syntactically whole; for instance, this
2363   is fine:
2364
2365   <\nf-chunk|test:whole-chunk>
2366     <item>if (1) {
2367
2368     <item> \ =\<less\>\\chunkref{test:say-hello}\<gtr\>
2369
2370     <item>}
2371   </nf-chunk||>
2372
2373   <\nf-chunk|test:say-hello>
2374     <item>print "hello";
2375   </nf-chunk||>
2376
2377   But this is not fine; the chunk <nf-ref|test:hidden-else|> is not properly
2378   cromulent.
2379
2380   <\nf-chunk|test:partial-chunk>
2381     <item>if (1) {
2382
2383     <item> \ =\<less\>\\chunkref{test:hidden-else}\<gtr\>
2384
2385     <item>}
2386   </nf-chunk||>
2387
2388   <\nf-chunk|test:hidden-else>
2389     <item> \ print "I'm fine";
2390
2391     <item>} else {
2392
2393     <item> \ print "I'm not";
2394   </nf-chunk||>
2395
2396   These tests will check for correct behaviour:
2397
2398   <\nf-chunk|test:cromulence>
2399     <item>echo Cromulence test
2400
2401     <item>passtest $FANGLE -Rtest:whole-chunk $TEX_SRC &\<gtr\>/dev/null \|\|
2402     ( echo "Whole chunk failed" && exit 1 )
2403
2404     <item>failtest $FANGLE -Rtest:partial-chunk $TEX_SRC &\<gtr\>/dev/null
2405     \|\| ( echo "Partial chunk failed" && exit 1 )
2406   </nf-chunk||>
2407
2408   <subsection|Tracker>
2409
2410   We must avoid recursion as a language construct because we intend to employ
2411   mode-tracking to track the language mode of emitted code, and that code is
2412   emitted from a function which is itself recursive. Instead we implement
2413   pseudo-recursion using our own stack based on a hash.
2414
2415   <\nf-chunk|mode_tracker()>
2416     <item>function mode_tracker(context, text, values,\
2417
2418     <item> \ # optional parameters
2419
2420     <item> \ # local vars
2421
2422     <item> \ mode, submodes, language,
2423
2424     <item> \ cindex, c, a, part, item, name, result, new_values, new_mode,\
2425
2426     <item> \ delimiters, terminators)
2427
2428     <item>{
2429   </nf-chunk|awk|>
2430
2431   We could be re-commencing with a valid context, so we need to set up the
2432   state according to the last context.
2433
2434   <\nf-chunk|mode_tracker()>
2435     <item> \ cindex = context[""] + 0;
2436
2437     <item> \ mode = context[cindex, "mode"];
2438
2439     <item> \ language = context[cindex, "language" ];
2440   </nf-chunk||>
2441
2442   First we construct a single large regex combining the possible sub-modes
2443   for the current mode along with the terminators for the current mode.
2444
2445   <\nf-chunk|parse_chunk_args-reset-modes>
2446     <item> \ submodes=modes[language, mode, "submodes"];
2447
2448     <item>
2449
2450     <item> \ if ((language, mode, "delimiters") in modes) {
2451
2452     <item> \ \ \ delimiters = modes[language, mode, "delimiters"];
2453
2454     <item> \ \ \ if (length(submodes)\<gtr\>0) submodes = submodes "\|";
2455
2456     <item> \ \ \ submodes=submodes delimiters;
2457
2458     <item> \ } else delimiters="";
2459
2460     <item> \ if ((language, mode, "terminators") in modes) {
2461
2462     <item> \ \ \ terminators = modes[language, mode, "terminators"];
2463
2464     <item> \ \ \ if (length(submodes)\<gtr\>0) submodes = submodes "\|";
2465
2466     <item> \ \ \ submodes=submodes terminators;
2467
2468     <item> \ } else terminators="";
2469   </nf-chunk||>
2470
2471   If we don't find anything to match on --- probably because the language is
2472   not supported --- then we return the entire text without matching anything.
2473
2474   <\nf-chunk|parse_chunk_args-reset-modes>
2475     <item> if (! length(submodes)) return text;
2476   </nf-chunk||>
2477
2478   <\nf-chunk|mode_tracker()>
2479     <item>=\<less\>\\chunkref{parse_chunk_args-reset-modes}\<gtr\>
2480   </nf-chunk||>
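
  The construction of the combined regex can be sketched in Python as below. The mode entry is an illustrative set of Python regexes for a parenthesis mode, not fangle's actual table, and <verbatim|combined_pattern> is a hypothetical name. Note also that Python picks the first alternative that matches at the leftmost position, whereas POSIX-style awk regexes prefer the longest; the example is chosen so that this makes no difference.

```python
import re

def combined_pattern(entry):
    # Join whichever properties the mode defines with "|".
    parts = [entry[k] for k in ("submodes", "delimiters", "terminators")
             if entry.get(k)]
    return "(" + "|".join(parts) + ")" if parts else ""

# Illustrative values for a parenthesis mode.
paren_mode = {
    "submodes": r'\\|"|\{|\(|\[',
    "delimiters": " *, *",
    "terminators": r"\)",
}
pattern = combined_pattern(paren_mode)
match = re.search(pattern, 'a, "(all)")')
print(repr(match.group(1)))   # ', ' since the delimiter comes first
```

  An empty result corresponds to the unsupported-language case, where the text is returned unparsed.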
2481
2482   We then iterate over the text (until there is none left) looking for sub-modes
2483   or terminators in the regex.
2484
2485   <\nf-chunk|mode_tracker()>
2486     <item> \ while((cindex \<gtr\>= 0) && length(text)) {
2487
2488     <item> \ \ \ if (match(text, "(" submodes ")", a)) {
2489   </nf-chunk||>
2490
2491   A bug that creeps in regularly during development is bad regexes of zero
2492   length which result in an infinite loop (as no text is consumed), so I
2493   catch that right away with this test.
2494
2495   <\nf-chunk|mode_tracker()>
2496     <item> \ \ \ \ \ if (RLENGTH\<less\>1) {
2497
2498     <item> \ \ \ \ \ \ \ error(sprintf("Internal error, matched zero length
2499     submode, should be impossible - likely regex computation error\\n" \\
2500
2501     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ "Language=%s\\nmode=%s\\nmatch=%s\\n",
2502     language, mode, submodes));
2503
2504     <item> \ \ \ \ \ }
2505   </nf-chunk||>
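
  One way such a regex can arise, offered here as an assumption rather than a diagnosis of any particular bug, is a stray <verbatim|\|> that leaves an empty alternative. The Python sketch below shows the symptom; <verbatim|matches_nothing> is a made-up helper.

```python
import re

def matches_nothing(text, pattern):
    """True if the first match of `pattern` in `text` is zero-length."""
    m = re.search("(" + pattern + ")", text)
    return m is not None and m.end() == m.start()

print(matches_nothing("f(x)", r"\(|\)"))   # False: the "(" is matched
print(matches_nothing("f(x)", r"\(|"))     # True: the empty alternative wins
```

  A loop that advances by the length of the match would never move past the start of the text in the second case.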
2506
2507   <verbatim|part> is defined as the text up to the sub-mode or terminator, and this is
2508   appended to <verbatim|item>, which is the current text being gathered. If a mode
2509   has a delimiter, then <verbatim|item> is reset each time a delimiter is found.
2510
2511   <math|<wide|<with|mode|prog|"><wide*|hello|\<wide-underbrace\>><rsub|item>,
2512   <wide*|there|\<wide-underbrace\>><rsub|item><with|mode|prog|">|\<wide-overbrace\>><rsup|item>,
2513   \ <wide|he said.|\<wide-overbrace\>><rsup|item>>
2514
2515   <\nf-chunk|mode_tracker()>
2516     <item> \ \ \ \ \ part = substr(text, 1, RSTART -1);
2517
2518     <item> \ \ \ \ \ item = item part;
2519   </nf-chunk||>
2520
2521   We must now determine what was matched. If it was a terminator, then we
2522   must restore the previous mode.
2523
2524   <\nf-chunk|mode_tracker()>
2525     <item> \ \ \ \ \ if (match(a[1], "^" terminators "$")) {
2526
2527     <item>#printf("%2d EXIT \ MODE [%s] by [%s] [%s]\\n", cindex, mode, a[1],
2528     text) \<gtr\> "/dev/stderr"
2529
2530     <item> \ \ \ \ \ \ \ context[cindex, "values", ++context[cindex,
2531     "values"]] = item;
2532
2533     <item> \ \ \ \ \ \ \ delete context[cindex];
2534
2535     <item> \ \ \ \ \ \ \ context[""] = --cindex;
2536
2537     <item> \ \ \ \ \ \ \ if (cindex\<gtr\>=0) {
2538
2539     <item> \ \ \ \ \ \ \ \ \ mode = context[cindex, "mode"];
2540
2541     <item> \ \ \ \ \ \ \ \ \ language = context[cindex, "language"];
2542
2543     <item> \ \ \ \ \ \ \ \ \ =\<less\>\\chunkref{parse_chunk_args-reset-modes}\<gtr\>
2544
2545     <item> \ \ \ \ \ \ \ }
2546
2547     <item> \ \ \ \ \ \ \ item = item a[1];
2548
2549     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2550     length(a[1]));
2551
2552     <item> \ \ \ \ \ }
2553   </nf-chunk||>
2554
2555   If a delimiter was matched, then we must store the current item in the
2556   parsed values array, and reset the item.
2557
2558   <\nf-chunk|mode_tracker()>
2559     <item> \ \ \ \ \ else if (match(a[1], "^" delimiters "$")) {
2560
2561     <item> \ \ \ \ \ \ \ if (cindex==0) {
2562
2563     <item> \ \ \ \ \ \ \ \ \ context[cindex, "values", ++context[cindex,
2564     "values"]] = item;
2565
2566     <item> \ \ \ \ \ \ \ \ \ item = "";
2567
2568     <item> \ \ \ \ \ \ \ } else {
2569
2570     <item> \ \ \ \ \ \ \ \ \ item = item a[1];
2571
2572     <item> \ \ \ \ \ \ \ }
2573
2574     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2575     length(a[1]));
2576
2577     <item> \ \ \ \ \ }
2578   </nf-chunk||>
2579
2580   Otherwise, if a new submode is detected (all submodes have terminators), we
2581   must create a nested parse context until we find the terminator for this
2582   mode.
2583
2584   <\nf-chunk|mode_tracker()>
2585     <item> else if ((language, a[1], "terminators") in modes) {
2586
2587     <item> \ \ \ \ \ \ \ #check if new_mode is defined
2588
2589     <item> \ \ \ \ \ \ \ item = item a[1];
2590
2591     <item>#printf("%2d ENTER MODE [%s] in [%s]\\n", cindex, a[1], text)
2592     \<gtr\> "/dev/stderr"
2593
2594     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2595     length(a[1]));
2596
2597     <item> \ \ \ \ \ \ \ context[""] = ++cindex;
2598
2599     <item> \ \ \ \ \ \ \ context[cindex, "mode"] = a[1];
2600
2601     <item> \ \ \ \ \ \ \ context[cindex, "language"] = language;
2602
2603     <item> \ \ \ \ \ \ \ mode = a[1];
2604
2605     <item> \ \ \ \ \ \ \ =\<less\>\\chunkref{parse_chunk_args-reset-modes}\<gtr\>
2606
2607     <item> \ \ \ \ \ } else {
2608
2609     <item> \ \ \ \ \ \ \ error(sprintf("Submode '%s' set unknown mode in
2610     text: %s\\nLanguage %s Mode %s\\n", a[1], text, language, mode));
2611
2612     <item> \ \ \ \ \ \ \ text = substr(text, 1 + length(part) +
2613     length(a[1]));
2614
2615     <item> \ \ \ \ \ }
2616
2617     <item> \ \ \ }
2618   </nf-chunk||>
2619
2620   In the final case, we parsed to the end of the string. If the string was
2621   complete, then we should have no nested mode context, but if the string was
2622   just a fragment we may have a mode context which must be preserved for the
2623   next fragment. Todo: consider what should happen when a sub-mode string
2624   is split over two fragments.
2625
2626   <\nf-chunk|mode_tracker()>
2627     <item>else {
2628
2629     <item> \ \ \ \ \ context[cindex, "values", ++context[cindex, "values"]] =
2630     item text;
2631
2632     <item> \ \ \ \ \ text = "";
2633
2634     <item> \ \ \ \ \ item = "";
2635
2636     <item> \ \ \ }
2637
2638     <item> \ }
2639
2640     <item>
2641
2642     <item> \ context["item"] = item;
2643
2644     <item>
2645
2646     <item> \ if (length(item)) context[cindex, "values", ++context[cindex,
2647     "values"]] = item;
2648
2649     <item> \ return text;
2650
2651     <item>}
2652   </nf-chunk||>
2653
2654   <subsubsection|One happy chunk>
2655
2656   All the mode tracker chunks are referred to here:
2657
2658   <\nf-chunk|mode-tracker>
2659     <item><nf-ref|new_mode_tracker()|>
2660
2661     <item><nf-ref|mode_tracker()|>
2662   </nf-chunk||>
2663
2664   <subsubsection|Tests>
2665
2666   We can test this function like this:
2667
2668   <\nf-chunk|pca-test.awk>
2669     <item>=\<less\>\\chunkref{error()}\<gtr\>
2670
2671     <item>=\<less\>\\chunkref{mode-tracker}\<gtr\>
2672
2673     <item>=\<less\>\\chunkref{parse_chunk_args()}\<gtr\>
2674
2675     <item>BEGIN {
2676
2677     <item> \ SUBSEP=".";
2678
2679     <item> \ =\<less\>\\chunkref{mode-definitions}\<gtr\>
2680
2681     <item>
2682
2683     <item> \ =\<less\>\\chunkref{test:mode-definitions}\<gtr\>
2684
2685     <item>}
2686   </nf-chunk|awk|>
2687
2688   <\nf-chunk|pca-test.awk:summary>
2689     <item>if (e) {
2690
2691     <item> \ printf "Failed " e
2692
2693     <item> \ for (b in a) {
2694
2695     <item> \ \ \ print "a[" b "] =\<gtr\> " a[b];
2696
2697     <item> \ }
2698
2699     <item>} else {
2700
2701     <item> \ print "Passed"
2702
2703     <item>}
2704
2705     <item>split("", a);
2706
2707     <item>e=0;
2708   </nf-chunk|awk|>
2709
2710   which should give this output:
2711
2712   <\nf-chunk|pca-test.awk-results>
2713     <item>a[foo.quux.quirk] =\<gtr\>\
2714
2715     <item>a[foo.quux.a] =\<gtr\> fleeg
2716
2717     <item>a[foo.bar] =\<gtr\> baz
2718
2719     <item>a[etc] =\<gtr\>\
2720
2721     <item>a[name] =\<gtr\> freddie
2722   </nf-chunk||>
2723
2724   <section|Escaping and Quoting>
2725
2726   For the time being, and to get around <TeXmacs>'s inability to export a
2727   <kbd|TAB> character, the right arrow <with|mode|math|\<mapsto\>> whose
2728   UTF-8 sequence is ...
2729
2730   <todo|complete>
2731
2732   Another special character, the left arrow
2733   <with|mode|math|\<mapsfrom\>> with UTF-8 sequence 0xE2 0x86 0xA4, is used to
2734   strip any preceding white space as a way of un-tabbing and removing indent
2735   that has been applied; this is important for bash here documents
2736   and the like. It's a filthy hack.
2737
2738   <todo|remove the hack>
2739
2740   <\nf-chunk|mode_tracker>
2741     \;
2742
2743     <item>function untab(text) {
2744
2745     <item> \ gsub("[[:space:]]*\\xE2\\x86\\xA4","", text);
2746
2747     <item> \ return text;
2748
2749     <item>}
2750   </nf-chunk||>
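
As an illustration only, the same stripping can be sketched in Python rather than in the awk that fangle itself uses. The constant <verbatim|UNTAB_MARK> is a name invented for this sketch; the marker is U+21A4, whose UTF-8 encoding is the 0xE2 0x86 0xA4 sequence matched above.

```python
import re

# U+21A4 (LEFTWARDS ARROW FROM BAR); its UTF-8 encoding is 0xE2 0x86 0xA4.
# UNTAB_MARK is a hypothetical name used only in this sketch.
UNTAB_MARK = "\u21a4"


def untab(text: str) -> str:
    """Remove each marker together with any white space directly before it."""
    return re.sub(r"\s*" + UNTAB_MARK, "", text)
```

A here-document terminator that was indented along with its chunk, and then tagged with the marker, would come back out at column zero.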
2751
2752   Each nested mode can optionally define a set of transforms to be applied to
2753   any text that is included from another language.
2754
2755   This code performs the transforms, applying the first <verbatim|max> of them in order:
2756
2757   <\nf-chunk|mode_tracker>
2758     <item>function transform_escape(s, r, text,
2759
2760     <item> \ \ \ # optional
2761
2762     <item> \ \ \ max,\
2763
2764     <item> \ \ \ \ \ \ \ # local vars
2765
2766     <item> \ \ \ \ \ \ \ c)
2767
2768     <item>{
2769
2770     <item> \ for(c=1; c \<less\>= max && (c in s); c++) {
2771
2772     <item> \ \ \ gsub(s[c], r[c], text);
2773
2774     <item> \ }
2775
2776     <item> \ return text;
2777
2778     <item>}
2779   </nf-chunk|awk|>
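
A minimal Python sketch of the same idea follows, assuming the transforms arrive as two parallel lists of patterns and replacements. One deliberate difference from awk's <verbatim|gsub> is labelled in the code: the replacement text is taken literally here, with no special meaning for the ampersand or backslash.

```python
import re


def transform_escape(patterns, replacements, text, max_count):
    """Apply up to max_count (pattern, replacement) pairs to text, in order."""
    pairs = list(zip(patterns, replacements))[:max_count]
    for pattern, replacement in pairs:
        # Assumption of this sketch: a callable replacement keeps the
        # replacement text literal (awk's gsub would interpret & and \).
        text = re.sub(pattern, lambda _m, r=replacement: r, text)
    return text
```

Order matters: when quoting for a C string, for example, backslashes have to be doubled before quotes are escaped, or the backslashes introduced by the second transform would be doubled as well.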
2780
2781   This function must append, from index <verbatim|src> onwards, the escape transforms
2782   from the supplied context, and return <verbatim|src> plus the number of new transforms.
2783
2784   <\nf-chunk|mode_tracker>
2785     <item>function mode_escaper(context, s, r, src,
2786
2787     <item> \ c, cp, cpl)
2788
2789     <item>{
2790
2791     <item> \ for(c = context[""]; c \<gtr\>= 0; c--) {
2792
2793     <item> \ \ \ if ( (context[c, "language"], context[c, "mode"]) in
2794     escapes) {
2795
2796     <item> \ \ \ \ \ cpl = escapes[context[c, "language"], context[c,
2797     "mode"]];
2798
2799     <item> \ \ \ \ \ for (cp = 1; cp \<less\>= cpl; cp ++) {
2800
2801     <item> \ \ \ \ \ \ \ ++src;
2802
2803     <item> \ \ \ \ \ \ \ s[src] = escapes[context[c, "language"], context[c,
2804     "mode"], cp, "s"];
2805
2806     <item> \ \ \ \ \ \ \ r[src] = escapes[context[c, "language"], context[c,
2807     "mode"], cp, "r"];
2808
2809     <item> \ \ \ \ \ }
2810
2811     <item> \ \ \ }
2812
2813     <item> \ }
2814
2815     <item> \ return src;
2816
2817     <item>}
2818
2819     <item>function dump_escaper(c, s, r, cc) {
2820
2821     <item> \ for(cc=1; cc\<less\>=c; cc++) {
2822
2823     <item> \ \ \ printf("%2d s[%s] r[%s]\\n", cc, s[cc], r[cc]) \<gtr\>
2824     "/dev/stderr"
2825
2826     <item> \ }
2827
2828     <item>}
2829   </nf-chunk|awk|>
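
The walk from the innermost mode outwards can be sketched in Python as below. The data layout is an assumption made for the sketch: the context is a list of (language, mode) frames with the outermost first, and <verbatim|escapes> is a dictionary keyed by such pairs. Fangle's own awk arrays are organised differently.

```python
def mode_escaper(context, escapes, s, r):
    """Append the transforms of every active mode, innermost first.

    context: list of (language, mode) frames, outermost first (hypothetical layout)
    escapes: dict mapping (language, mode) to a list of (pattern, replacement)
    s, r:    parallel output lists, extended in place
    Returns the total number of transforms now held in s.
    """
    for language, mode in reversed(context):
        for pattern, replacement in escapes.get((language, mode), []):
            s.append(pattern)
            r.append(replacement)
    return len(s)
```

Collecting innermost first means that text included inside a quoted shell string within C code is first escaped for the shell string and only then for whatever encloses it.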
2830
2831   <\nf-chunk|test:escapes>
2832     <item>echo escapes test
2833
2834     <item>passtest $FANGLE -Rtest:comment-quote $TEX_SRC &\<gtr\>/dev/null
2835     \|\| ( echo "Comment-quote failed" && exit 1 )
2836   </nf-chunk|sh|>
2837
2838   <chapter|Recognizing Chunks>
2839
2840   Fangle recognizes noweb chunks, but as we also want better <LaTeX>
2841   integration we will recognize any of these:
2842
2843   <\itemize>
2844     <item>notangle chunks matching the pattern
2845     <verbatim|^\<less\>\<less\>.*?\<gtr\>\<gtr\>=>
2846
2847     <item>chunks beginning with <verbatim|\\begin{lstlisting}>, possibly
2848     with <verbatim|\\Chunk{...}> on the previous line
2849
2850     <item>an older form I have used, beginning with
2851     <verbatim|\\begin{Chunk}[options]> --- also more suitable for plain
2852     <LaTeX> users<\footnote>
2853       Is there such a thing as plain <LaTeX>?
2854     </footnote>.
2855   </itemize>
2856
2857   <section|Chunk start>
2858
2859   The variable <verbatim|chunking> is used to signify that we are processing a
2860   code chunk and not document text. In such a state, input lines will be assigned
2861   to the current chunk; otherwise they are ignored.
2862
2863   <subsection|<TeXmacs>>
2864
2865   We don't handle <TeXmacs> files natively yet, but instead emit
2866   unicode character sequences to mark up the text-export file which we do
2867   process.
2868
2869   These hacks detect the unicode character sequences and retro-fit in the old
2870   <TeX> parsing.
2871
2872   We convert <math|\<mapsto\>> into a tab character.
2873
2874   <\nf-chunk|recognize-chunk>
2875     \;
2876
2877     <item>#/\\n/ {
2878
2879     <item># \ gsub("\\n*$","");
2880
2881     <item># \ gsub("\\n", " ");
2882
2883     <item>#}
2884
2885     <item>#===
2886
2887     <item>/\\xE2\\x86\\xA6/ {
2888
2889     <item> \ gsub("\\\\xE2\\\\x86\\\\xA6", "\\x09");
2890
2891     <item>}
2892   </nf-chunk||>
2893
2894   <TeXmacs> back-tick handling is obscure, and a cut-n-paste back-tick from a
2895   shell window comes out as a unicode sequence<\footnote>
2896     that won't export to html, except as a NULL character (literal 0x00)
2897   </footnote> that is fixed-up here.
2898
2899   <\nf-chunk|recognize-chunk>
2900     <item>
2901
2902     <item>/\\xE2\\x80\\x98/ {
2903
2904     <item> \ gsub("\\\\xE2\\\\x80\\\\x98", "`");
2905
2906     <item>}
2907   </nf-chunk||>
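
Both fix-ups amount to a small table of character replacements. A hedged Python sketch follows; the names <verbatim|FIXUPS> and <verbatim|fix_texmacs_line> are invented for the illustration.

```python
# Characters that the text export emits in place of the ones we want.
# FIXUPS and fix_texmacs_line are hypothetical names used only in this sketch.
FIXUPS = {
    "\u21a6": "\t",  # RIGHTWARDS ARROW FROM BAR, UTF-8 0xE2 0x86 0xA6
    "\u2018": "`",   # LEFT SINGLE QUOTATION MARK, UTF-8 0xE2 0x80 0x98
}


def fix_texmacs_line(line: str) -> str:
    """Replace each stand-in character with the character it represents."""
    for marker, replacement in FIXUPS.items():
        line = line.replace(marker, replacement)
    return line
```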
2908
2909   In the <TeXmacs> output, the start of a chunk will appear like this:
2910
2911   <verbatim| \ 5b\<less\>example-chunk<key|^K>[1](arg1,<key|^K>
2912   arg2<key|^K><key|^K>), lang=C\<gtr\> <math|\<equiv\>>>
2913
2914   We detect the start of a <TeXmacs> chunk by detecting the
2915   <math|\<equiv\>> symbol which occurs near the end of the line. We obtain
2916   the chunk name, the chunk parameters, and the chunk language.
2917
2918   <\nf-chunk|recognize-chunk>
2919     <item>
2920
2921     <item>/\\xE2\\x89\\xA1/ {
2922
2923     <item> \ if (match($0, "^ *([^[ ]* \|)\<less\>([^[
2924     ]*)\\\\[[0-9]*\\\\][(](.*)[)].*, lang=([^ ]*)", line)) {
2925
2926     <item> \ \ \ next_chunk_name=line[2];
2927
2928     <item> \ \ \ get_texmacs_chunk_args(line[3], next_chunk_params);
2929
2930     <item> \ \ \ gsub(ARG_SEPARATOR ",? ?", ";", line[3]);
2931
2932     <item> \ \ \ params = "params=" line[3];
2933
2934     <item> \ \ \ if ((line[4])) {
2935
2936     <item> \ \ \ \ \ params = params ",language=" line[4]
2937
2938     <item> \ \ \ }
2939
2940     <item> \ \ \ get_tex_chunk_args(params, next_chunk_opts);
2941
2942     <item> \ \ \ new_chunk(next_chunk_name, next_chunk_opts,
2943     next_chunk_params);
2944
2945     <item> \ \ \ texmacs_chunking = 1;
2946
2947     <item> \ } else {
2948
2949     <item> \ \ \ warning(sprintf("Unexpected chunk match: %s\\n", $0))
2950
2951     <item> \ }
2952
2953     <item> \ next;
2954
2955     <item>}
2956   </nf-chunk||>
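
To make the three captured fields concrete, here is a Python sketch that parses a simplified header. It assumes the control-character separators shown above have been dropped and that arguments are separated by plain commas, so the pattern is a cut-down stand-in and not the exact expression fangle uses.

```python
import re

# Simplified stand-in for the header pattern: name, part number, args, language.
HEADER = re.compile(r"<([^\[ ]*)\[[0-9]*\]\((.*)\).*, lang=([^ >]*)")


def parse_header(line):
    """Return the name, parameter list and language of a chunk header, or None."""
    m = HEADER.search(line)
    if not m:
        return None
    name, args, lang = m.groups()
    params = [a.strip() for a in args.split(",") if a.strip()]
    return {"name": name, "params": params, "language": lang}
```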
2957
2958   <subsection|lstlistings>
2959
2960   Our current scheme is to recognize the new lstlisting chunks, but these may
2961   be preceded by a <verbatim|\\Chunk> command which in <LyX> is a more
2962   convenient way to pass the chunk name to the
2963   <verbatim|\\begin{lstlisting}> command, and a more visible way to specify
2964   other <verbatim|lstset> settings.
2965
2966   The arguments to the <verbatim|\\Chunk> command are a name, and then a
2967   comma-separated list of key-value pairs after the manner of
2968   <verbatim|\\lstset>. (In fact within the <LaTeX> <verbatim|\\Chunk> macro
2969   (section <reference|sub:The-chunk-command>) the text <verbatim|name=> is
2970   prefixed to the argument which is then literally passed to
2971   <verbatim|\\lstset>).
2972
2973   <\nf-chunk|recognize-chunk>
2974     <item>/^\\\\Chunk{/ {
2975
2976     <item> \ if (match($0, "^\\\\\\\\Chunk{ *([^ ,}]*),?(.*)}", line)) {
2977
2978     <item> \ \ \ next_chunk_name = line[1];
2979
2980     <item> \ \ \ get_tex_chunk_args(line[2], next_chunk_opts);
2981
2982     <item> \ }
2983
2984     <item> \ next;
2985
2986     <item>}
2987   </nf-chunk|awk|>
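
The split into a name and an option list can be sketched in Python as follows. The regular expression mirrors the awk one above; the option parsing is a naive comma split written for this sketch and stands in for <verbatim|get_tex_chunk_args>, which may well handle more (braced values, for instance).

```python
import re

# Same shape as the awk pattern: name up to the first space, comma or brace.
CHUNK_CMD = re.compile(r"^\\Chunk\{ *([^ ,}]*),?(.*)\}")


def parse_chunk_command(line):
    """Return (name, options) for a \\Chunk{...} line, or None if it is not one."""
    m = CHUNK_CMD.match(line)
    if not m:
        return None
    name, rest = m.groups()
    opts = {}
    # Naive key=value split; an assumption of this sketch, not fangle's parser.
    for pair in filter(None, (p.strip() for p in rest.split(","))):
        key, _, value = pair.partition("=")
        opts[key.strip()] = value.strip()
    return name, opts
```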
2988
2989   We also make a basic attempt to parse the name out of the
2990   <verbatim|\\begin{lstlisting}[name=chunk-name]> text; otherwise we fall back to
2991   the name found in the previous chunk command. This attempt is very basic
2992   and doesn't support commas or spaces or square brackets as part of the
2993   chunkname. We also recognize <verbatim|\\begin{Chunk}> which is convenient
2994   for some users<\footnote>
2995     but not yet supported in the <LaTeX> macros
2996   </footnote>.
2997
2998   <\nf-chunk|recognize-chunk>
2999     <item>/^\\\\begin{lstlisting}\|^\\\\begin{Chunk}/ {
3000
3001     <item> \ if (match($0, "}.*[[,] *name= *{? *([^], }]*)", line)) {
3002
3003     <item> \ \ \ new_chunk(line[1]);
3004
3005     <item> \ } else {
3006
3007     <item> \ \ \ new_chunk(next_chunk_name, next_chunk_opts);
3008
3009     <item> \ }
3010
3011     <item> \ chunking=1;
3012
3013     <item> \ next;
3014
3015     <item>}
3016   </nf-chunk||>
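
A small Python sketch of the name extraction, including the fallback, is given below. The pattern is a direct transliteration of the awk one; the function name and the <verbatim|fallback> argument are invented for the illustration.

```python
import re

# Transliteration of the awk pattern: find name= after the closing brace.
NAME = re.compile(r"\}.*[\[,] *name= *\{? *([^\], }]*)")


def listing_name(line, fallback):
    """Return the name= value of a listing line, else the remembered name."""
    m = NAME.search(line)
    return m.group(1) if m else fallback
```

As the text says, a name containing a comma, space or square bracket is cut short by this pattern.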
3017
3018   <section|Chunk Body>
3019
3020   <subsection|<TeXmacs>>
3021
3022   A chunk body in <TeXmacs> ends with <verbatim|\|________>... if it is the
3023   final chunklet of a chunk, or if there are further chunklets it ends with
3024   <verbatim|\|\\/\\/\\/>... which is a depiction of a jagged line of torn
3025   paper.
3026
3027   <\nf-chunk|recognize-chunk>
3028     <item>/^ *\\\|____________*/ && texmacs_chunking {
3029
3030     <item> \ active_chunk="";
3031
3032     <item> \ texmacs_chunking=0;
3033
3034     <item> \ chunking=0;
3035
3036     <item>}
3037
3038     <item>/^ *\\\|\\/\\\\/ && texmacs_chunking {
3039
3040     <item> \ texmacs_chunking=0;
3041
3042     <item> \ chunking=0;
3043
3044     <item> \ active_chunk="";
3045
3046     <item>}
3047   </nf-chunk||>
3048
3049   It has been observed that not every line of output when a <TeXmacs> chunk
3050   is active is a line of chunk. This may no longer be true, but we set a
3051   variable <verbatim|texmacs_chunk> if the current line is a chunk line.
3052
3053   Initially we set this to zero...
3054
3055   <\nf-chunk|recognize-chunk>
3056     <item>texmacs_chunk=0;
3057   </nf-chunk||>
3058
3059   ...and then we look to see if the current line is a chunk line.
3060
3061   <TeXmacs> lines look like this: <verbatim| \ 3 \| main() {> so we detect
3062   the lines by leading white space, digits, more white space and a vertical
3063   bar followed by at least one space.
3064
3065   If we find such a line, we remove this line-header and set
3066   <verbatim|texmacs_chunk=1> as well as <verbatim|chunking=1>.
3067
3068   <\nf-chunk|recognize-chunk>
3069     <item>/^ *[1-9][0-9]* *\\\| / {
3070
3071     <item> \ if (texmacs_chunking) {
3072
3073     <item> \ \ \ chunking=1;
3074
3075     <item> \ \ \ texmacs_chunk=1;
3076
3077     <item> \ \ \ gsub("^ *[1-9][0-9]* *\\\\\| ", "")
3078
3079     <item> \ }
3080
3081     <item>}
3082   </nf-chunk||>
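
The header test and its removal can be sketched together in Python. The function name is invented for this sketch; the pattern is the same one used in the awk rule above.

```python
import re

# Optional spaces, a line number, optional spaces, then a bar and one space.
LINE_HEADER = re.compile(r"^ *[1-9][0-9]* *\| ")


def strip_line_header(line):
    """Return (text, is_chunk_line); the header is removed when present."""
    text, count = LINE_HEADER.subn("", line, count=1)
    return text, count == 1
```

Only the single space after the bar is consumed, so any further white space is kept as the indentation of the code itself.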
3083
3084   When <TeXmacs> chunking, lines that commence with <verbatim|\\/> or
3085   <verbatim|__> are not chunk content but visual framing, and are skipped.
3086
3087   <\nf-chunk|recognize-chunk>
3088     <item>/^ *\\.\\/\\\\/ && texmacs_chunking {
3089
3090     <item> \ next;
3091
3092     <item>}
3093
3094     <item>/^ *__*$/ && texmacs_chunking {
3095
3096     <item> \ next;
3097
3098     <item>}
3099   </nf-chunk||>
3100
3101   Any other line when <TeXmacs> chunking is considered to be a line-wrapped
3102   line.
3103
3104   <\nf-chunk|recognize-chunk>
3105     <item>texmacs_chunking {
3106
3107     <item> \ if (! texmacs_chunk) {
3108
3109     <item> \ \ \ # must be a texmacs continued line
3110
3111     <item> \ \ \ chunking=1;
3112
3113     <item> \ \ \ texmacs_chunk=1;
3114
3115     <item> \ }
3116
3117     <item>}
3118   </nf-chunk||>
3119
3120   This final chunklet seems bogus and probably stops <LyX> working.
3121
3122   <\nf-chunk|recognize-chunk>
3123     <item>! texmacs_chunk {
3124
3125     <item># \ texmacs_chunking=0;
3126
3127     <item> \ chunking=0;
3128
3129     <item>}
3130   </nf-chunk||>
3131
3132   <subsection|Noweb>
3133
3134   We recognize notangle style chunks too:
3135
3136   <\nf-chunk|recognize-chunk>
3137     <item>/^[\<less\>]\<less\>.*[\<gtr\>]\<gtr\>=/ {
3138
3139     <item> \ if (match($0, "^[\<less\>]\<less\>(.*)[\<gtr\>]\<gtr\>= *$",
3140     line)) {
3141
3142     <item> \ \ \ chunking=1;
3143
3144     <item> \ \ \ notangle_mode=1;
3145
3146     <item> \ \ \ new_chunk(line[1]);
3147
3148     <item> \ \ \ next;
3149
3150     <item> \ }
3151
3152     <item>}
3153   </nf-chunk|awk|>
3154
3155   <section|Chunk end>
3156
3157   Likewise, we need to recognize when a chunk ends.
3158
3159   <subsection|lstlistings>
3160
3161   The <verbatim|e> in <verbatim|[e]nd{lstlisting}> is surrounded by square
3162   brackets so that when this document is processed, this chunk doesn't
3163   terminate early when the listings package recognizes its own
3164   end-string!<\footnote>
3165     This doesn't make sense as the regex is anchored with ^, which this line
3166     does not begin with!
3167   </footnote>
3168
3169   <\nf-chunk|recognize-chunk>
3170     <item>/^\\\\[e]nd{lstlisting}\|^\\\\[e]nd{Chunk}/ {
3171
3172     <item> \ chunking=0;
3173
3174     <item> \ active_chunk="";
3175
3176     <item> \ next;
3177
3178     <item>}
3179   </nf-chunk||>
3180
3181   <subsection|noweb>
3182
3183   <\nf-chunk|recognize-chunk>
3184     <item>/^@ *$/ {
3185
3186     <item> \ chunking=0;
3187
3188     <item> \ active_chunk="";
3189
3190     <item>}
3191   </nf-chunk||>
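
Taken together, the noweb start and end rules behave like the following Python sketch, which gathers chunk bodies from a list of lines. It is an illustration under simplifying assumptions: chunks are stored in a plain dictionary, and a repeated chunk name simply appends to the earlier body.

```python
import re

NOWEB_START = re.compile(r"^<<(.*)>>= *$")
NOWEB_END = re.compile(r"^@ *$")


def collect_noweb_chunks(lines):
    """Map each chunk name to the list of lines found in its definitions."""
    chunks = {}
    active = None  # name of the chunk being read, or None in documentation
    for line in lines:
        m = NOWEB_START.match(line)
        if m:
            active = m.group(1)
            chunks.setdefault(active, [])
        elif NOWEB_END.match(line):
            active = None
        elif active is not None:
            chunks[active].append(line)
    return chunks
```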
3192
3193   All other recognizers are only of effect if we are chunking; there's no
3194   point in looking at lines if they aren't part of a chunk, so we just ignore
3195   them as efficiently as we can.
3196
3197   <\nf-chunk|recognize-chunk>
3198     <item>! chunking { next; }
3199   </nf-chunk||>
3200
3201   <section|Chunk contents>
3202
3203   Chunk contents are any lines read while <verbatim|chunking> is true. Some
3204   chunk contents are special in that they refer to other chunks, and will be
3205   replaced by the contents of these chunks when the file is generated.
3206
3207   <label|sub:ORS-chunk-text>We add the output record separator <verbatim|ORS>
3208   to the line now, because we will set <verbatim|ORS> to the empty string
3209   when we generate the output<\footnote>
3210     So that we can print partial lines using <verbatim|print> instead of
3211     <verbatim|printf>. <todo|This doesn't make sense>
3212   </footnote>.
3213
3214   <\nf-chunk|recognize-chunk>
3215     <item>length(active_chunk) {
3216
3217     <item> \ <nf-ref|process-chunk-tabs|>
3218
3219     <item> \ <nf-ref|process-chunk|>
3220
3221     <item>}
3222   </nf-chunk||>
3223
3224   If a chunk just consisted of plain text, we could handle the chunk like
3225   this:
3226
3227   <\nf-chunk|process-chunk-simple>
3228     <item>chunk_line(active_chunk, $0 ORS);
3229   </nf-chunk||>
3230
3231   but in fact a chunk can include references to other chunks. Chunk includes
3232   are traditionally written as <verbatim|\<less\>\<less\>chunk-name\<gtr\>\<gtr\>>
3233   but we support other variations, some of which are more suitable for
3234   particular editing systems.
3235
3236   However, we also process tabs at this point. A tab at input can be replaced
3237   by a number of spaces defined by the <verbatim|tabs> variable, set by the
3238   <verbatim|-T> option. Of course this is poor tab behaviour; we should
3239   probably have the option to use proper counted tab-stops and process this
3240   on output.
3241
3242   <\nf-chunk|process-chunk-tabs>
3243     <item>if (length(tabs)) {
3244
3245     <item> \ gsub("\\t", tabs);
3246
3247     <item>}
3248   </nf-chunk||>
3249
3250   <subsection|lstlistings><label|sub:lst-listings-includes>
3251
3252   If <verbatim|\\lstset{escapeinside={=\<less\>}{\<gtr\>}}> is set, then we
3253   can use <verbatim|=\<less\>\\chunkref{chunk-name}\<gtr\>> in listings. The
3254   sequence <verbatim|=\<less\>> was chosen because:
3255
3256   <\enumerate>
3257     <item>it is a better mnemonic than <verbatim|\<less\>\<less\>chunk-name\<gtr\>\<gtr\>>
3258     in that the <verbatim|=> sign signifies equivalence or substitutability.
3259
3260     <item>and because <verbatim|=\<less\>> is not valid in C or any language
3261     I can think of.
3262
3263     <item>and also because lstlistings doesn't like <verbatim|\<gtr\>\<gtr\>>
3264     as an end delimiter for the <em|texcl> escape, so we must make do with a
3265     single <verbatim|\<gtr\>> which is better complemented by
3266     <verbatim|=\<less\>> than by <verbatim|\<less\>\<less\>>.
3267   </enumerate>
3268
3269   Unfortunately the <verbatim|=\<less\>...\<gtr\>> that we use re-enters a
3270   <LaTeX> parsing mode in which some characters are special, e.g. <verbatim|#
3271   \\> and so these cause trouble if used in arguments to
3272   <verbatim|\\chunkref>. At some point I must fix the <LaTeX> command
3273   <verbatim|\\chunkref> so that it can accept these literally, but until
3274   then, when writing chunkref arguments that need these characters, I must
3275   use the forms <verbatim|\\textbackslash{}> and <verbatim|\\#>; so I also
3276   define a hacky chunk <verbatim|delatex> to be used further on whose purpose
3277   it is to remove these from any arguments parsed by fangle.
3278
3279   <\nf-chunk|delatex>
3280     <item># FILTHY HACK
3281
3282     <item>gsub("\\\\\\\\#", "#", ${text});
3283
3284     <item>gsub("\\\\\\\\textbackslash{}", "\\\\", ${text});
3285
3286     <item>gsub("\\\\\\\\\\\\^", "^", ${text});
3287   </nf-chunk||<tuple|text>>
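
For readers who find the doubled backslashes hard to follow, the three substitutions are equivalent to this Python sketch, which uses plain string replacement in the same order.

```python
def delatex(text: str) -> str:
    """Undo the LaTeX spellings of #, backslash and ^ in a chunkref argument."""
    text = text.replace("\\#", "#")                  # \# becomes #
    text = text.replace("\\textbackslash{}", "\\")   # \textbackslash{} becomes \
    text = text.replace("\\^", "^")                  # \^ becomes ^
    return text
```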
3288
3289   As each chunk line may contain more than one chunk include, we will split
3290   out chunk includes in an iterative fashion<\footnote>
3291     Contrary to our use of split when substituting parameters in chapter
3292     <reference|Here-we-split>
3293   </footnote>.
3294
3295   First, as long as the chunk contains a <verbatim|\\chunkref> command we
3296   take as much as we can up to the first <verbatim|\\chunkref> command.
3297
3298   <TeXmacs> text output uses <math|\<langle\>>...<math|\<rangle\>> which
3299   comes out as the UTF-8 sequences <verbatim|0xC2> <verbatim|0xAB> ...
3300   <verbatim|0xC2> <verbatim|0xBB>.
3301
3302   <\nf-chunk|process-chunk>
3303     <item>chunk = $0;
3304
3305     <item>indent = 0;
3306
3307     <item>while(match(chunk,"(\\xC2\\xAB)([^\\xC2]*) [^\\xC2]*\\xC2\\xBB",
3308     line) \|\|
3309
3310     <item> \ \ \ \ \ match(chunk,\
3311
3312     <item> \ \ \ \ \ \ \ \ \ \ \ "([=]\<less\>\\\\\\\\chunkref{([^}\<gtr\>]*)}(\\\\(.*\\\\)\|)\<gtr\>\|\<less\>\<less\>([a-zA-Z_][-a-zA-Z0-9_]*)\<gtr\>\<gtr\>)",\
3313
3314     <item> \ \ \ \ \ \ \ \ \ \ \ line)\\
3315
3316     <item>) {
3317
3318     <item> \ chunklet = substr(chunk, 1, RSTART - 1);
3319   </nf-chunk||>
3320
3321   We keep track of the indent count, by counting the number of literal
3322   characters found. We can then preserve this indent on each output line when
3323   multi-line chunks are expanded.
3324
3325   We then process this first part of literal text, and set the chunk which is
3326   still to be processed to be the text after the <verbatim|\\chunkref>
3327   command, which we will process next as we continue around the loop.
3328
3329   <\nf-chunk|process-chunk>
3330     <item> \ indent += length(chunklet);
3331
3332     <item> \ chunk_line(active_chunk, chunklet);
3333
3334     <item> \ chunk = substr(chunk, RSTART + RLENGTH);
3335   </nf-chunk||>
3336
3337   We then consider the type of chunk command we have found, whether it is the
3338   fangle style command beginning with <verbatim|=\<less\>> or the older notangle
3339   style beginning with <verbatim|\<less\>\<less\>>.
3340
3341   Fangle chunks may have arguments contained within parentheses. These
3342   will be matched in <verbatim|line[3]> and are considered at this stage of
3343   processing to be part of the name of the chunk to be included.
3344
3345   <\nf-chunk|process-chunk>
3346     <item> \ if (substr(line[1], 1, 1) == "=") {
3347
3348     <item> \ \ \ # chunk name up to }
3349
3350     <item> \ \ \ \ \ \ \ =\<less\>\\chunkref{delatex}(line[3])\<gtr\>
3351
3352     <item> \ \ \ chunk_include(active_chunk, line[2] line[3], indent);
3353
3354     <item> \ } else if (substr(line[1], 1, 1) == "\<less\>") {
3355
3356     <item> \ \ \ chunk_include(active_chunk, line[4], indent);
3357
3358     <item> \ } else if (line[1] == "\\xC2\\xAB") {
3359
3360     <item> \ \ \ chunk_include(active_chunk, line[2], indent);
3361
3362     <item> \ } else {
3363
3364     <item> \ \ \ error("Unknown chunk fragment: " line[1]);
3365
3366     <item> \ }
3367   <|nf-chunk>
3368     \;
3369   </nf-chunk|>
3370
3371   The loop will continue until there are no more chunkref statements in the
3372   text, at which point we process the final part of the chunk.
3373
3374   <\nf-chunk|process-chunk>
3375     <item>}
3376
3377     <item>chunk_line(active_chunk, chunk);
3378   </nf-chunk||>
3379
3380   <label|lone-newline>We add the newline character as a chunklet on its own,
3381   to make it easier to detect new lines and thus manage indentation when
3382   processing the output.
3383
3384   <\nf-chunk|process-chunk>
3385     <item>chunk_line(active_chunk, "\\n");
3386   <|nf-chunk>
3387     \;
3388   </nf-chunk|>
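
The whole loop can be summarised by the Python sketch below, which splits one input line into literal chunklets and includes while tracking the indent. It is a simplification under stated assumptions: the guillemet form from the <TeXmacs> export is left out, and the result is returned as a list of tuples instead of being stored through <verbatim|chunk_line> and <verbatim|chunk_include>.

```python
import re

INCLUDE = re.compile(
    r"=<\\chunkref\{([^}>]*)\}(\(.*\)|)>"  # fangle style, optional (args)
    r"|<<([a-zA-Z_][-a-zA-Z0-9_]*)>>"      # notangle style
)


def split_chunk_line(text):
    """Split a line into ("text", s) and ("include", name, indent) parts."""
    parts = []
    indent = 0
    while True:
        m = INCLUDE.search(text)
        if not m:
            break
        literal = text[:m.start()]
        indent += len(literal)  # only literal characters count towards indent
        parts.append(("text", literal))
        if m.group(1) is not None:
            name = m.group(1) + m.group(2)  # arguments stay part of the name
        else:
            name = m.group(3)
        parts.append(("include", name, indent))
        text = text[m.end():]
    parts.append(("text", text))  # whatever follows the last include
    parts.append(("text", "\n"))  # the newline is a chunklet of its own
    return parts
```

The indent recorded for the second include on a line counts only the literal text before it, not the width of the first include, which matches the awk above.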
3389
3390   We will also permit a chunk-part number to follow in square brackets, so
3391   that <verbatim|=\<less\>\\chunkref{chunk-name[1]}\<gtr\>> will refer to the
3392   first part only. This can make it easy to include a C function prototype in
3393   a header file, if the first part of the chunk is just the function
3394   prototype without the trailing semi-colon. The header file would include
3395   the prototype with the trailing semi-colon, like this:
3396
3397   <verbatim|=\<less\>\\chunkref{chunk-name[1]}\<gtr\>;>
3398
3399   This is handled in section <reference|sub:Chunk-parts>.
3400
3401   We should perhaps introduce a notion of language specific chunk options; so
3402   that perhaps we could specify:
3403
3404   <verbatim|=\<less\>\\chunkref{chunk-name[function-declaration]}\<gtr\>>
3405
3406   which applies a transform <verbatim|function-declaration> to the chunk ---
3407   which in this case would extract a function prototype from a function.
3408   <todo|Do it>
3409
3410   <chapter|Processing Options>
3411
3412   We first set the default options.
3413
3414   <\nf-chunk|default-options>
3415     <item>debug=0;
3416
3417     <item>linenos=0;
3418
3419     <item>notangle_mode=0;
3420
3421     <item>root="*";
3422
3423     <item>tabs = "";
3424   </nf-chunk||>
3425
3426   Then we use getopt the standard way, and null out ARGV afterwards in the
3427   normal AWK fashion.
3428
3429   <\nf-chunk|read-options>
3430     <item>Optind = 1 \ \ \ # skip ARGV[0]
3431
3432     <item>while(getopt(ARGC, ARGV, "R:LdT:hr")!=-1) {
3433
3434     <item> \ =\<less\>\\chunkref{handle-options}\<gtr\>
3435
3436     <item>}
3437
3438     <item>for (i=1; i\<less\>Optind; i++) { ARGV[i]=""; }
3439   </nf-chunk||>
3440
3441   This is how we handle our options:
3442
3443   <\nf-chunk|handle-options>
3444     <item>if (Optopt == "R") root = Optarg;
3445
3446     <item>else if (Optopt == "r") root="";
3447
3448     <item>else if (Optopt == "L") linenos = 1;
3449
3450     <item>else if (Optopt == "d") debug = 1;
3451
3452     <item>else if (Optopt == "T") tabs = indent_string(Optarg+0);
3453
3454     <item>else if (Optopt == "h") help();
3455
3456     <item>else if (Optopt == "?") help();
3457   </nf-chunk||>
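
For comparison, the same option handling can be sketched with Python's standard <verbatim|getopt> module and the same option string. Two assumptions are labelled in the code: <verbatim|indent_string> is taken to return the given number of spaces, and <verbatim|-h> simply exits with status 1 as <verbatim|help()> does.

```python
import getopt


def read_options(argv):
    """Return (options, remaining file names) for a fangle-like command line."""
    opts = {"debug": 0, "linenos": 0, "notangle_mode": 0, "root": "*", "tabs": ""}
    pairs, files = getopt.getopt(argv, "R:LdT:hr")
    for flag, value in pairs:
        if flag == "-R":
            opts["root"] = value
        elif flag == "-r":
            opts["root"] = ""  # an empty root means: list the chunk names
        elif flag == "-L":
            opts["linenos"] = 1
        elif flag == "-d":
            opts["debug"] = 1
        elif flag == "-T":
            # Assumption: indent_string(n) yields a string of n spaces.
            opts["tabs"] = " " * int(value)
        elif flag == "-h":
            raise SystemExit(1)  # stands in for help()
    return opts, files
```

An unknown option makes <verbatim|getopt.getopt> raise <verbatim|GetoptError>, which plays the role of the question-mark case above.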
3458
3459   We do all of this at the beginning of the program:
3460
3461   <\nf-chunk|begin>
3462     <item>BEGIN {
3463
3464     <item> \ =\<less\>\\chunkref{constants}\<gtr\>
3465
3466     <item> \ =\<less\>\\chunkref{mode-definitions}\<gtr\>
3467
3468     <item> \ =\<less\>\\chunkref{default-options}\<gtr\>
3469
3470     <item>
3471
3472     <item> \ =\<less\>\\chunkref{read-options}\<gtr\>
3473
3474     <item>}
3475   </nf-chunk||>
3476
3477   We also have a simple help function:
3478
3479   <\nf-chunk|help()>
3480     <item>function help() {
3481
3482     <item> \ print "Usage:"
3483
3484     <item> \ print " \ fangle [-L] -R\<less\>rootname\<gtr\> [source.tex
3485     ...]"
3486
3487     <item> \ print " \ fangle -r [source.tex ...]"
3488
3489     <item> \ print " \ If the filename, source.tex is not specified then
3490     stdin is used"
3491
3492     <item> \ print
3493
3494     <item> \ print "-L causes the C statement: #line \<less\>lineno\<gtr\>
3495     \\"filename\\" to be issued"
3496
3497     <item> \ print "-R causes the named root to be written to stdout"
3498
3499     <item> \ print "-r lists all roots in the file (even those used
3500     elsewhere)"
3501
3502     <item> \ exit 1;
3503
3504     <item>}
3505   </nf-chunk||>
3506
3507   <chapter|Generating the Output>
3508
3509   We generate output by calling output_chunk, or listing the chunk names.
3510
3511   <\nf-chunk|generate-output>
3512     <item>if (length(root)) output_chunk(root);
3513
3514     <item>else output_chunk_names();
3515   </nf-chunk||>
3516
3517   We also have some other output debugging:
3518
3519   <\nf-chunk|debug-output>
3520     <item>if (debug) {
3521
3522     <item> \ print "------ chunk names "
3523
3524     <item> \ output_chunk_names();
3525
3526     <item> \ print "====== chunks"
3527
3528     <item> \ output_chunks();
3529
3530     <item> \ print "++++++ debug"
3531
3532     <item> \ for (a in chunks) {
3533
3534     <item> \ \ \ print a "=" chunks[a];
3535
3536     <item> \ }
3537
3538     <item>}
3539   </nf-chunk||>
3540
3541   We do both of these at the end. We also set <verbatim|ORS=""> because each
3542   chunklet is not necessarily a complete line, and we already added
3543   <verbatim|ORS> to each input line in section
3544   <reference|sub:ORS-chunk-text>.
3545
3546   <\nf-chunk|end>
3547     <item>END {
3548
3549     <item> \ =\<less\>\\chunkref{debug-output}\<gtr\>
3550
3551     <item> \ ORS="";
3552
3553     <item> \ =\<less\>\\chunkref{generate-output}\<gtr\>
3554
3555     <item>}
3556   </nf-chunk||>
3557
3558   We write chunk names like this. If we seem to be running in notangle
3559   compatibility mode, then we enclose the name like this
3560   <verbatim|\<less\>\<less\>name\<gtr\>\<gtr\>> the same way notangle does:
3561
3562   <\nf-chunk|output_chunk_names()>
3563     <item>function output_chunk_names( \ \ c, prefix, suffix)\
3564
3565     <item>{
3566
3567     <item> \ if (notangle_mode) {
3568
3569     <item> \ \ \ prefix="\<less\>\<less\>";
3570
3571     <item> \ \ \ suffix="\<gtr\>\<gtr\>";
3572
3573     <item> \ }
3574
3575     <item> \ for (c in chunk_names) {
3576
3577     <item> \ \ \ print prefix c suffix "\\n";
3578
3579     <item> \ }
3580
3581     <item>}
3582   </nf-chunk||>
3583
3584   This function would write out all chunks:
3585
3586   <\nf-chunk|output_chunks()>
3587     <item>function output_chunks( \ a)\
3588
3589     <item>{
3590
3591     <item> \ for (a in chunk_names) {
3592
3593     <item> \ \ \ output_chunk(a);
3594
3595     <item> \ }
3596
3597     <item>}
3598
3599     <item>
3600
3601     <item>function output_chunk(chunk) {
3602
3603     <item> \ newline = 1;
3604
3605     <item> \ lineno_needed = linenos;
3606
3607     <item>
3608
3609     <item> \ write_chunk(chunk);
3610
3611     <item>}
3612
3613     <item>
3614   </nf-chunk||>
3615
3616   <section|Assembling the Chunks>
3617
3618   <verbatim|chunk_path> holds a string consisting of the names of all the
3619   chunks that resulted in this chunk being output. It should probably also
3620   contain the source line numbers at which each inclusion occurred.
3621
3622   We first initialize the mode tracker for this chunk.
3623
3624   <\nf-chunk|write_chunk()>
3625     <item>function write_chunk(chunk_name) {
3626
3627     <item> \ =\<less\>\\chunkref{awk-delete-array}(context)\<gtr\>
3628
3629     <item> \ return write_chunk_r(chunk_name, context);
3630
3631     <item>}
3632
3633     <item>
3634
3635     <item>function write_chunk_r(chunk_name, context, indent, tail,
3636
3637     <item> \ # optional vars
3638
3639     <item> \ <with|font-shape|italic|chunk_path>, chunk_args,\
3640
3641     <item> \ s, r, src, new_src,\
3642
3643     <item> \ # local vars
3644
3645     <item> \ chunk_params, part, max_part, part_line, frag, max_frag, text,\
3646
3647     <item> \ chunklet, only_part, call_chunk_args, new_context)
3648
3649     <item>{
3650
3651     <item> \ if (debug) debug_log("write_chunk_r(" chunk_name ")");
3652   </nf-chunk||>
3653
3654   <subsection|Chunk Parts><label|sub:Chunk-parts>
3655
3656   As mentioned in section <reference|sub:lst-listings-includes>, a chunk name
3657   may contain a part specifier in square brackets, limiting the parts that
3658   should be emitted.
3659
3660   <\nf-chunk|write_chunk()>
3661     <item> \ if (match(chunk_name, "^(.*)\\\\[([0-9]*)\\\\]$",
3662     chunk_name_parts)) {
3663
3664     <item> \ \ \ chunk_name = chunk_name_parts[1];
3665
3666     <item> \ \ \ only_part = chunk_name_parts[2];
3667
3668     <item> \ }
3669   </nf-chunk||>
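
The part specifier and its effect on which parts are written can be sketched in Python like this. The function names are invented for the illustration; the pattern is the one used in the awk above.

```python
import re

PART = re.compile(r"^(.*)\[([0-9]*)\]$")


def split_part(chunk_name):
    """Return (name, only_part); only_part is "" when no part was requested."""
    m = PART.match(chunk_name)
    if m:
        return m.group(1), m.group(2)
    return chunk_name, ""


def wanted_parts(max_part, only_part):
    """List the part numbers to emit: all of them, or just the requested one."""
    return [p for p in range(1, max_part + 1)
            if not only_part or p == int(only_part)]
```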
3670
3671   We then create a mode tracker:
3672
3673   <\nf-chunk|write_chunk()>
3674     <item> =\<less\>\\chunkref{new-mode-tracker}(context, chunks[chunk_name,
3675     "language"], "")\<gtr\>
3676   </nf-chunk||>
3677
3678   We extract into <verbatim|chunk_params> the names of the parameters that
3679   this chunk accepts, whose values were (optionally) passed in
3680   <verbatim|chunk_args>.
3681
3682   <\nf-chunk|write_chunk()>
3683     <item> split(chunks[chunk_name, "params"], chunk_params, " *; *");
3684   </nf-chunk||>
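
  To make the separator pattern concrete, here is a small Python sketch of the same split. It is an illustration only: <verbatim|split_params> is a hypothetical name, and the example parameter names are made up. Spaces around each semicolon are swallowed by the separator, so the names come out trimmed.

```python
import re

def split_params(params):
    """Split a 'params' option such as 'name; type ;value' into a list of names."""
    if params == "":
        return []  # awk's split() yields no elements for an empty string
    return re.split(r" *; *", params)
```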
3685
3686   To assemble a chunk, we write out each part.
3687
3688   <\nf-chunk|write_chunk()>
3689     <item> \ if (! (chunk_name in chunk_names)) {
3690
3691     <item> \ \ \ error(sprintf(_"The root module
3692     \<less\>\<less\>%s\<gtr\>\<gtr\> was not defined.\\nUsed by: %s",\\
3693
3694     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ chunk_name, chunk_path));
3695
3696     <item> \ }
3697
3698     <item>
3699
3700     <item> \ max_part = chunks[chunk_name, "part"];
3701
3702     <item> \ for(part = 1; part \<less\>= max_part; part++) {
3703
3704     <item> \ \ \ if (! only_part \|\| part == only_part) {
3705
3706     <item> \ \ \ \ \ =\<less\>\\chunkref{write-part}\<gtr\>
3707
3708     <item> \ \ \ }
3709
3710     <item> \ }
3711
3712     <item> \ if (! finalize_mode_tracker(context)) {
3713
3714     <item> \ \ \ dump_mode_tracker(context);
3715
3716     <item> \ \ \ error(sprintf(_"Module %s did not close context
3717     properly.\\nUsed by: %s\\n", chunk_name, chunk_path));
3718
3719     <item> \ }
3720
3721     <item>}
3722   </nf-chunk||>
3723
3724   A part can either be a chunklet of lines, or an include of another chunk.
3725
3726   Chunks may also have parameters, specified in LaTeX style with braces after
3727   the chunk name --- looking like this in the document: chunkname{param1,
3728   param2}. Arguments are passed in square brackets:
3729   <verbatim|\\chunkref{chunkname}[arg1, arg2]>.
3730
3731   Before we process each part, we check that the source position hasn't
3732   changed unexpectedly, so that we can know if we need to output a new
3733   file-line directive.
3734
3735   <\nf-chunk|write-part>
3736     <item>=\<less\>\\chunkref{check-source-jump}\<gtr\>
3737
3738     <item>
3739
3740     <item>chunklet = chunks[chunk_name, "part", part];
3741
3742     <item>if (chunks[chunk_name, "part", part, "type"] == part_type_chunk) {
3743
3744     <item> \ =\<less\>\\chunkref{write-included-chunk}\<gtr\>
3745
3746     <item>} else if (chunklet SUBSEP "line" in chunks) {
3747
3748     <item> \ =\<less\>\\chunkref{write-chunklets}\<gtr\>
3749
3750     <item>} else {
3751
3752     <item> \ # empty last chunklet
3753
3754     <item>}
3755   </nf-chunk||>
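
  The overall shape of this recursion is easier to see with the bookkeeping removed. The sketch below is in Python and is deliberately simplified: it assumes each chunk is a plain list of parts, each part being either a piece of text or an include, and it ignores indentation, arguments, mode tracking and file-line directives. The data layout and the names used are assumptions made for this example, not fangle's own.

```python
def write_chunk(chunks, name, path=(), only_part=None):
    """Return the expanded text of a chunk, recursing into included chunks."""
    if name not in chunks:
        raise KeyError("chunk <<%s>> was not defined; used by: %s"
                       % (name, " / ".join(path)))
    out = []
    for number, (kind, value) in enumerate(chunks[name], start=1):
        if only_part is not None and number != only_part:
            continue  # a part specifier limits output to one part
        if kind == "include":
            # value names another chunk; remember who included it
            out.append(write_chunk(chunks, value, path + (name,)))
        else:
            out.append(value)  # a chunklet of literal text
    return "".join(out)
```

  The <verbatim|path> argument plays the same role as <verbatim|chunk_path>: it is only there so that an error can report which chunks led to the missing one.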
3756
3757   To write an included chunk, we must detect any optional chunk arguments in
3758   parentheses. Then we recurse by calling <verbatim|write_chunk_r()>.
3759
3760   <\nf-chunk|write-included-chunk>
3761     <item>if (match(chunklet, "^([^\\\\[\\\\(]*)\\\\((.*)\\\\)$",
3762     chunklet_parts)) {
3763
3764     <item> \ chunklet = chunklet_parts[1];
3765
3766     <item># hack
3767
3768     <item>gsub(sprintf("%c",11), "", chunklet);
3769
3770     <item>gsub(sprintf("%c",11), "", chunklet_parts[2]);
3771
3772     <item> \ parse_chunk_args("c-like", chunklet_parts[2], call_chunk_args,
3773     "(");
3774
3775     <item> \ for (c in call_chunk_args) {
3776
3777     <item> \ \ \ call_chunk_args[c] = expand_chunk_args(call_chunk_args[c],
3778     chunk_params, chunk_args);
3779
3780     <item> \ }
3781
3782     <item>} else {
3783
3784     <item> \ split("", call_chunk_args);
3785
3786     <item>}
3787
3788     <item># update the transforms arrays
3789
3790     <item>new_src = mode_escaper(context, s, r, src);
3791
3792     <item>=\<less\>\\chunkref{awk-delete-array}(new_context)\<gtr\>
3793
3794     <item>write_chunk_r(chunklet, new_context,
3795
3796     <item> \ \ \ \ \ \ \ \ \ \ \ chunks[chunk_name, "part", part, "indent"]
3797     indent,
3798
3799     <item> \ \ \ \ \ \ \ \ \ \ \ chunks[chunk_name, "part", part, "tail"],
3800
3801     <item> \ \ \ \ \ \ \ \ \ \ \ chunk_path "\\n \ \ \ \ \ \ \ \ "
3802     chunk_name,
3803
3804     <item> \ \ \ \ \ \ \ \ \ \ \ call_chunk_args,
3805
3806     <item> \ \ \ \ \ \ \ \ \ \ \ s, r, new_src);
3807   </nf-chunk||>
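
  The first step, separating the chunk name from its parenthesised arguments, can be sketched on its own. This Python fragment mirrors the regular expression used above; <verbatim|split_call> is a hypothetical name, and the argument text is returned unsplit because the real splitting is done by <verbatim|parse_chunk_args>, which is not reproduced here.

```python
import re

def split_call(chunklet):
    """Split 'name(args)' into ('name', 'args'); args is None if absent."""
    m = re.match(r"^([^\[\(]*)\((.*)\)$", chunklet)
    if not m:
        return chunklet, None
    return m.group(1), m.group(2)
```

  Since the name part may not contain an opening parenthesis, nested calls in the arguments stay inside the argument text.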
3808
3809   Before we output a chunklet of lines, we first emit the file and line
3810   number if we have one, and if it is safe to do so.
3811
3812   Chunklets are generally broken up by includes, so the start of a chunklet
3813   is a good place to do this. Then we output each line of the chunklet.
3814
3815   When it is not safe, such as in the middle of a multi-line macro
3816   definition, <verbatim|lineno_suppressed> is set to true, and in such a case
3817   we note that we want to emit the line statement when it is next safe.
3818
3819   <\nf-chunk|write-chunklets>
3820     <item>max_frag = chunks[chunklet, "line"];
3821
3822     <item>for(frag = 1; frag \<less\>= max_frag; frag++) {
3823
3824     <item> \ =\<less\>\\chunkref{write-file-line}\<gtr\>
3825   </nf-chunk||>
3826
3827   We then extract the chunklet text and expand any arguments.
3828
3829   <\nf-chunk|write-chunklets>
3830     <item>
3831
3832     <item> \ text = chunks[chunklet, frag];
3833
3834     <item>\
3835
3836     <item> \ # check params
3837
3838     <item> \ text = expand_chunk_args(text, chunk_params, chunk_args);
3839   </nf-chunk||>
3840
3841   If the text is a single newline (which we keep separate - see
3842   <reference|lone-newline>) then we increment the line number. In the case
3843   where this is the last line of a chunk and it is not a top-level chunk we
3844   replace the newline with an empty string --- because the chunk that
3845   included this chunk will have the newline at the end of the line that
3846   included this chunk.
3847
3848   We also note by <verbatim|newline = 1> that we have started a new line, so
3849   that indentation can be managed with the following piece of text.
3850
3851   <\nf-chunk|write-chunklets>
3852     <item>
3853
3854     <item> if (text == "\\n") {
3855
3856     <item> \ \ \ lineno++;
3857
3858     <item> \ \ \ if (part == max_part && frag == max_frag &&
3859     length(chunk_path)) {
3860
3861     <item> \ \ \ \ \ text = "";
3862
3863     <item> \ \ \ \ \ break;
3864
3865     <item> \ \ \ } else {
3866
3867     <item> \ \ \ \ \ newline = 1;
3868
3869     <item> \ \ \ }
3870   </nf-chunk||>
3871
3872   If this text does not represent a newline, but we see that we are the first
3873   piece of text on a newline, then we prefix our text with the current
3874   indent.\
3875
3876   <\note>
3877     <verbatim|newline> is a global output-state variable, but the
3878     <verbatim|indent> is not.
3879   </note>
3880
3881   <\nf-chunk|write-chunklets>
3882     <item> \ } else if (length(text) \|\| length(tail)) {
3883
3884     <item> \ \ \ if (newline) text = indent text;
3885
3886     <item> \ \ \ newline = 0;
3887
3888     <item> \ }
3889
3890     <item>
3891   </nf-chunk||>
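
  The interplay between lone newlines and indentation can be sketched as follows. This is a simplified Python model, not the fangle code: it treats a chunk as one flat list of fragments (so "last fragment" stands in for the last fragment of the last part), it ignores <verbatim|tail>, and the name <verbatim|render_fragments> is hypothetical.

```python
def render_fragments(fragments, indent, included, newline=True):
    """Join fragments, indenting the first text that follows each newline.

    Returns (text, newline) so that a caller can carry the flag forward.
    """
    out = []
    last = len(fragments) - 1
    for i, text in enumerate(fragments):
        if text == "\n":
            if i == last and included:
                break  # the including line supplies this final newline
            newline = True
        elif text:
            if newline:
                text = indent + text
            newline = False
        out.append(text)
    return "".join(out), newline
```

  An included chunk would normally be rendered with <verbatim|newline> false, because its first line continues the line that contains the include.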
3892
3893   Tail will soon no longer be relevant once mode-detection is in place.
3894
3895   <\nf-chunk|write-chunklets>
3896     <item> \ text = text tail;
3897
3898     <item> \ mode_tracker(context, text);
3899
3900     <item> \ print untab(transform_escape(s, r, text, src));
3901   </nf-chunk||>
3902
3903   If a line ends in a backslash --- suggesting continuation --- then we
3904   suppress outputting file-line as it would probably break the continued
3905   lines.
3906
3907   <\nf-chunk|write-chunklets>
3908     <item> \ if (linenos) {
3909
3910     <item> \ \ \ lineno_suppressed = substr(lastline, length(lastline)) ==
3911     "\\\\";
3912
3913     <item> \ }
3914
3915     <item>}
3916   </nf-chunk||>
3917
3918   Of course there is no point in actually outputting the source filename and
3919   line number (file-line) if they don't say anything new! We only need to
3920   emit them if they aren't what is expected, or if we were not able to emit one
3921   when they had changed.
3922
3923   <\nf-chunk|write-file-line>
3924     <item>if (newline && lineno_needed && ! lineno_suppressed) {
3925
3926     <item> \ filename = a_filename;
3927
3928     <item> \ lineno = a_lineno;
3929
3930     <item> \ print "#line " lineno " \\"" filename "\\"\\n"
3931
3932     <item> \ lineno_needed = 0;
3933
3934     <item>}
3935   </nf-chunk||>
3936
3937   We check if a new file-line is needed by checking if the source line
3938   matches what we (or a compiler) would expect.
3939
3940   <\nf-chunk|check-source-jump>
3941     <item>if (linenos && (chunk_name SUBSEP "part" SUBSEP part SUBSEP
3942     "FILENAME" in chunks)) {
3943
3944     <item> \ a_filename = chunks[chunk_name, "part", part, "FILENAME"];
3945
3946     <item> \ a_lineno = chunks[chunk_name, "part", part, "LINENO"];
3947
3948     <item> \ if (a_filename != filename \|\| a_lineno != lineno) {
3949
3950     <item> \ \ \ lineno_needed++;
3951
3952     <item> \ }
3953
3954     <item>}
3955   </nf-chunk||>
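
  The three flags involved (whether a directive is owed, whether it is currently unsafe to emit one, and where the output is expected to be) can be modelled together. The Python class below is a sketch under simplifying assumptions: it counts whole lines rather than lone-newline fragments, and the class and method names are hypothetical.

```python
class LineTracker:
    """Track the expected source position and emit #line when it is wrong."""

    def __init__(self):
        self.filename = None
        self.lineno = None
        self.pending = (None, None)  # position of the part about to be written
        self.needed = False          # a directive is owed
        self.suppressed = False      # unsafe to emit one right now

    def check_source_jump(self, a_filename, a_lineno):
        """Note whether the next part starts somewhere unexpected."""
        self.pending = (a_filename, a_lineno)
        if self.pending != (self.filename, self.lineno):
            self.needed = True

    def directive(self, at_line_start):
        """Return a '#line' directive if one is owed and safe, else ''."""
        if not (at_line_start and self.needed and not self.suppressed):
            return ""
        self.filename, self.lineno = self.pending
        self.needed = False
        return '#line %d "%s"\n' % (self.lineno, self.filename)

    def wrote_line(self, line):
        """Advance the expected line; a trailing backslash defers directives."""
        if self.lineno is not None:
            self.lineno += 1
        self.suppressed = line.endswith("\\")
```

  A directive that could not be emitted inside a continued line stays owed, and is produced at the next line start after the continuation ends.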
3956
3957   <chapter|Storing Chunks>
3958
3959   Awk has pretty limited data structures, so we will use two main hashes.
3960   Uninterrupted sequences of a chunk will be stored in chunklets and the
3961   chunklets used in a chunk will be stored in <verbatim|chunks>.
3962
3963   <\nf-chunk|constants>
3964     <item>part_type_chunk=1;
3965
3966     <item>SUBSEP=",";
3967   </nf-chunk||>
3968
3969   The params mentioned are not chunk parameters for parameterized chunks, as
3970   mentioned in <reference|Chunk Arguments>, but the lstlistings style
3971   parameters used in the <verbatim|\\Chunk> command<\footnote>
3972     The <verbatim|params> parameter is used to hold the parameters for
3973     parameterized chunks
3974   </footnote>.
3975
3976   <\nf-chunk|chunk-storage-functions>
3977     <item>function new_chunk(chunk_name, opts, args,
3978
3979     <item> \ # local vars
3980
3981     <item> \ p, append )
3982
3983     <item>{
3984
3985     <item> \ # HACK WHILE WE CHANGE TO ( ) for PARAM CHUNKS
3986
3987     <item> \ gsub("\\\\(\\\\)$", "", chunk_name);
3988
3989     <item> \ if (! (chunk_name in chunk_names)) {
3990
3991     <item> \ \ \ if (debug) print "New chunk " chunk_name;
3992
3993     <item> \ \ \ chunk_names[chunk_name];
3994
3995     <item> \ \ \ for (p in opts) {
3996
3997     <item> \ \ \ \ \ chunks[chunk_name, p] = opts[p];
3998
3999     <item> \ \ \ \ \ if (debug) print "chunks[" chunk_name "," p "] = "
4000     opts[p];
4001
4002     <item> \ \ \ }
4003
4004     <item> \ \ \ for (p in args) {
4005
4006     <item> \ \ \ \ \ chunks[chunk_name, "params", p] = args[p];
4007
4008     <item> \ \ \ }
4009
4010     <item> \ \ \ if ("append" in opts) {
4011
4012     <item> \ \ \ \ \ append=opts["append"];
4013
4014     <item> \ \ \ \ \ if (! (append in chunk_names)) {
4015
4016     <item> \ \ \ \ \ \ \ warning("Chunk " chunk_name " is appended to chunk "
4017     append " which is not defined yet");
4018
4019     <item> \ \ \ \ \ \ \ new_chunk(append);
4020
4021     <item> \ \ \ \ \ }
4022
4023     <item> \ \ \ \ \ chunk_include(append, chunk_name);
4024
4025     <item> \ \ \ \ \ chunk_line(append, ORS);
4026
4027     <item> \ \ \ }
4028
4029     <item> \ }
4030
4031     <item> \ active_chunk = chunk_name;
4032
4033     <item> \ prime_chunk(chunk_name);
4034
4035     <item>}
4036   </nf-chunk||>
4037
4038   <\nf-chunk|chunk-storage-functions>
4039     <item>
4040
4041     <item>function prime_chunk(chunk_name)
4042
4043     <item>{
4044
4045     <item> \ chunks[chunk_name, "part", ++chunks[chunk_name, "part"] ] = \\
4046
4047     <item> \ \ \ \ \ \ \ \ chunk_name SUBSEP "chunklet" SUBSEP ""
4048     ++chunks[chunk_name, "chunklet"];
4049
4050     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"],
4051     "FILENAME"] = FILENAME;
4052
4053     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "LINENO"]
4054     = FNR + 1;
4055
4056     <item>}
4057
4058     <item>
4059
4060     <item>function chunk_line(chunk_name, line){
4061
4062     <item> \ chunks[chunk_name, "chunklet", chunks[chunk_name, "chunklet"],
4063
4064     <item> \ \ \ \ \ \ \ \ ++chunks[chunk_name, "chunklet",
4065     chunks[chunk_name, "chunklet"], "line"] \ ] = line;
4066
4067     <item>}
4068
4069     <item>
4070   </nf-chunk||>
4071
4072   Chunk include represents a <em|chunkref> statement, and stores the
4073   requirement to include another chunk. The parameter indent represents the
4074   quantity of literal text characters that preceded this <em|chunkref>
4075   statement and therefore by how much additional lines of the included chunk
4076   should be indented.
4077
4078   <\nf-chunk|chunk-storage-functions>
4079     <item>function chunk_include(chunk_name, chunk_ref, indent, tail)
4080
4081     <item>{
4082
4083     <item> \ chunks[chunk_name, "part", ++chunks[chunk_name, "part"] ] =
4084     chunk_ref;
4085
4086     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "type" ]
4087     = part_type_chunk;
4088
4089     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "indent"
4090     ] = indent_string(indent);
4091
4092     <item> \ chunks[chunk_name, "part", chunks[chunk_name, "part"], "tail" ]
4093     = tail;
4094
4095     <item> \ prime_chunk(chunk_name);
4096
4097     <item>}
4098
4099     <item>
4100   </nf-chunk||>
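
  The layout of the flat <verbatim|chunks> table is easier to follow with a worked example. The Python sketch below uses tuple keys in place of awk's SUBSEP-joined strings and mirrors the three functions above. It is an illustration under assumptions: the helper <verbatim|bump> is hypothetical, the indent is built directly from spaces, and the file name and line number are passed in as arguments because Python has no FILENAME or FNR.

```python
PART_TYPE_CHUNK = 1
chunks = {}  # one flat table; tuple keys stand in for awk's SUBSEP-joined keys

def bump(key):
    """Increment and return the counter stored under key (starting from 0)."""
    chunks[key] = chunks.get(key, 0) + 1
    return chunks[key]

def prime_chunk(name, filename="-", lineno=0):
    """Start a new part that refers to a fresh, still empty chunklet."""
    part = bump((name, "part"))
    chunks[name, "part", part] = (name, "chunklet", bump((name, "chunklet")))
    chunks[name, "part", part, "FILENAME"] = filename
    chunks[name, "part", part, "LINENO"] = lineno + 1

def chunk_line(name, line):
    """Append one line to the chunklet that is currently open."""
    chunklet = (name, "chunklet", chunks[name, "chunklet"])
    chunks[chunklet + (bump(chunklet + ("line",)),)] = line

def chunk_include(name, ref, indent=0, tail=""):
    """Record an include as its own part, then open a new chunklet after it."""
    part = bump((name, "part"))
    chunks[name, "part", part] = ref
    chunks[name, "part", part, "type"] = PART_TYPE_CHUNK
    chunks[name, "part", part, "indent"] = " " * indent
    chunks[name, "part", part, "tail"] = tail
    prime_chunk(name)
```

  Storing one line, one include and then another line therefore produces three parts: a chunklet, the include, and a second chunklet that holds the text following the include.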
4101
4102   The indent is calculated by indent_string, which may in future convert some
4103   spaces into tab characters. This function works by generating a printf
4104   padded format string, like <verbatim|%22s> for an indent of 22, and then
4105   printing an empty string using that format.
4106
4107   <\nf-chunk|chunk-storage-functions>
4108     <item>function indent_string(indent) {
4109
4110     <item> \ return sprintf("%" indent "s", "");
4111
4112     <item>}
4113   </nf-chunk||>
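
  The same padded-format trick works in any language with printf-style formatting. As a quick illustration in Python (not part of fangle), the format string is assembled from the indent and then applied to an empty string:

```python
def indent_string(indent):
    """Return `indent` spaces by formatting an empty string into a padded field."""
    return ("%" + str(indent) + "s") % ""
```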
4114
4115   <chapter|getopt><label|cha:getopt>
4116
4117   I use Arnold Robbins' public domain getopt (1993 revision). This is probably
4118   the same one that is covered in chapter 12 of “Edition 3 of GAWK:
4119   Effective AWK Programming: A User's Guide for GNU Awk” but as that is
4120   licensed under the GNU Free Documentation License, Version 1.3, which
4121   conflicts with the GPL3, I can't use it from there (or its accompanying
4122   explanations), so I do my best to explain how it works here.
4123
4124   The getopt.awk header is:
4125
4126   <\nf-chunk|getopt.awk-header>
4127     <item># getopt.awk --- do C library getopt(3) function in awk
4128
4129     <item>#
4130
4131     <item># Arnold Robbins, arnold@skeeve.com, Public Domain
4132
4133     <item>#
4134
4135     <item># Initial version: March, 1991
4136
4137     <item># Revised: May, 1993
4138
4139     <item>
4140   </nf-chunk||>
4141
4142   The provided explanation is:
4143
4144   <\nf-chunk|getopt.awk-notes>
4145     <item># External variables:
4146
4147     <item># \ \ \ Optind -- index in ARGV of first nonoption argument
4148
4149     <item># \ \ \ Optarg -- string value of argument to current option
4150
4151     <item># \ \ \ Opterr -- if nonzero, print our own diagnostic
4152
4153     <item># \ \ \ Optopt -- current option letter
4154
4155     <item>
4156
4157     <item># Returns:
4158
4159     <item># \ \ \ -1 \ \ \ \ at end of options
4160
4161     <item># \ \ \ ? \ \ \ \ \ for unrecognized option
4162
4163     <item># \ \ \ \<less\>c\<gtr\> \ \ \ a character representing the current
4164     option
4165
4166     <item>
4167
4168     <item># Private Data:
4169
4170     <item># \ \ \ _opti \ -- index in multi-flag option, e.g., -abc
4171
4172     <item>
4173   </nf-chunk||>
4174
4175   The function follows. The final two parameters, <verbatim|thisopt> and
4176   <verbatim|i> are local variables and not parameters --- as indicated by the
4177   multiple spaces preceding them. Awk doesn't care; the multiple spaces are a
4178   convention to help us humans.
4179
4180   <\nf-chunk|getopt.awk-getopt()>
4181     <item>function getopt(argc, argv, options, \ \ \ thisopt, i)
4182
4183     <item>{
4184
4185     <item> \ \ \ if (length(options) == 0) \ \ \ # no options given
4186
4187     <item> \ \ \ \ \ \ \ return -1
4188
4189     <item> \ \ \ if (argv[Optind] == "--") { \ # all done
4190
4191     <item> \ \ \ \ \ \ \ Optind++
4192
4193     <item> \ \ \ \ \ \ \ _opti = 0
4194
4195     <item> \ \ \ \ \ \ \ return -1
4196
4197     <item> \ \ \ } else if (argv[Optind] !~ /^-[^: \\t\\n\\f\\r\\v\\b]/) {
4198
4199     <item> \ \ \ \ \ \ \ _opti = 0
4200
4201     <item> \ \ \ \ \ \ \ return -1
4202
4203     <item> \ \ \ }
4204
4205     <item> \ \ \ if (_opti == 0)
4206
4207     <item> \ \ \ \ \ \ \ _opti = 2
4208
4209     <item> \ \ \ thisopt = substr(argv[Optind], _opti, 1)
4210
4211     <item> \ \ \ Optopt = thisopt
4212
4213     <item> \ \ \ i = index(options, thisopt)
4214
4215     <item> \ \ \ if (i == 0) {
4216
4217     <item> \ \ \ \ \ \ \ if (Opterr)
4218
4219     <item> \ \ \ \ \ \ \ \ \ \ \ printf("%c -- invalid option\\n",
4220
4221     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ thisopt)
4222     \<gtr\> "/dev/stderr"
4223
4224     <item> \ \ \ \ \ \ \ if (_opti \<gtr\>= length(argv[Optind])) {
4225
4226     <item> \ \ \ \ \ \ \ \ \ \ \ Optind++
4227
4228     <item> \ \ \ \ \ \ \ \ \ \ \ _opti = 0
4229
4230     <item> \ \ \ \ \ \ \ } else
4231
4232     <item> \ \ \ \ \ \ \ \ \ \ \ _opti++
4233
4234     <item> \ \ \ \ \ \ \ return "?"
4235
4236     <item> \ \ \ }
4237   </nf-chunk||>
4238
4239   At this point, the option has been found and we need to know if it takes
4240   any arguments.
4241
4242   <\nf-chunk|getopt.awk-getopt()>
4243     <item> \ \ \ if (substr(options, i + 1, 1) == ":") {
4244
4245     <item> \ \ \ \ \ \ \ # get option argument
4246
4247     <item> \ \ \ \ \ \ \ if (length(substr(argv[Optind], _opti + 1)) \<gtr\>
4248     0)
4249
4250     <item> \ \ \ \ \ \ \ \ \ \ \ Optarg = substr(argv[Optind], _opti + 1)
4251
4252     <item> \ \ \ \ \ \ \ else
4253
4254     <item> \ \ \ \ \ \ \ \ \ \ \ Optarg = argv[++Optind]
4255
4256     <item> \ \ \ \ \ \ \ _opti = 0
4257
4258     <item> \ \ \ } else
4259
4260     <item> \ \ \ \ \ \ \ Optarg = ""
4261
4262     <item> \ \ \ if (_opti == 0 \|\| _opti \<gtr\>= length(argv[Optind])) {
4263
4264     <item> \ \ \ \ \ \ \ Optind++
4265
4266     <item> \ \ \ \ \ \ \ _opti = 0
4267
4268     <item> \ \ \ } else
4269
4270     <item> \ \ \ \ \ \ \ _opti++
4271
4272     <item> \ \ \ return thisopt
4273
4274     <item>}
4275   </nf-chunk||>
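
  To see the state machine at work, it can help to step through it outside awk. The following Python class is a sketch that follows the awk function above line by line; it is not a replacement for it, and the class name <verbatim|Getopt> and its attribute names are assumptions made for this example. The only deliberate difference is that running off the end of the argument list is checked explicitly, where awk would simply see an empty string.

```python
import re
import sys

class Getopt:
    """A Python rendering of the awk getopt() state machine (a sketch)."""

    def __init__(self, argv, options, opterr=True):
        self.argv = argv
        self.options = options
        self.opterr = opterr
        self.optind = 1    # skip argv[0]
        self.optarg = ""
        self.optopt = ""
        self._opti = 0     # 1-based position inside a bundle such as -abc

    def _advance(self):
        """Step to the next letter of a bundle, or on to the next argument."""
        if self._opti == 0 or self._opti >= len(self.argv[self.optind]):
            self.optind += 1
            self._opti = 0
        else:
            self._opti += 1

    def next(self):
        """Return an option letter, '?' for an unknown one, or -1 at the end."""
        if not self.options or self.optind >= len(self.argv):
            return -1
        arg = self.argv[self.optind]
        if arg == "--":                       # explicit end of options
            self.optind += 1
            self._opti = 0
            return -1
        if not re.match(r"-[^: \t\n\f\r\v\b]", arg):
            self._opti = 0                    # first non-option argument
            return -1
        if self._opti == 0:
            self._opti = 2                    # first letter after the dash
        opt = self.optopt = arg[self._opti - 1]
        i = self.options.find(opt)
        if i < 0:
            if self.opterr:
                print("%s -- invalid option" % opt, file=sys.stderr)
            self._advance()
            return "?"
        if self.options[i + 1:i + 2] == ":":  # this option takes an argument
            rest = arg[self._opti:]
            if rest:                          # attached, as in -bvalue
                self.optarg = rest
            else:                             # separate, as in -b value
                self.optind += 1
                more = self.optind < len(self.argv)
                self.optarg = self.argv[self.optind] if more else ""
            self._opti = 0
        else:
            self.optarg = ""
        self._advance()
        return opt
```

  Calling <verbatim|next()> repeatedly until it returns -1 leaves <verbatim|optind> pointing at the first non-option argument, just as <verbatim|Optind> does in the awk version.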
4276
4277   A test program is built in, too:
4278
4279   <\nf-chunk|getopt.awk-begin>
4280     <item>BEGIN {
4281
4282     <item> \ \ \ Opterr = 1 \ \ \ # default is to diagnose
4283
4284     <item> \ \ \ Optind = 1 \ \ \ # skip ARGV[0]
4285
4286     <item> \ \ \ # test program
4287
4288     <item> \ \ \ if (_getopt_test) {
4289
4290     <item> \ \ \ \ \ \ \ while ((_go_c = getopt(ARGC, ARGV, "ab:cd")) != -1)
4291
4292     <item> \ \ \ \ \ \ \ \ \ \ \ printf("c = \<less\>%c\<gtr\>, optarg =
4293     \<less\>%s\<gtr\>\\n",
4294
4295     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ _go_c,
4296     Optarg)
4297
4298     <item> \ \ \ \ \ \ \ printf("non-option arguments:\\n")
4299
4300     <item> \ \ \ \ \ \ \ for (; Optind \<less\> ARGC; Optind++)
4301
4302     <item> \ \ \ \ \ \ \ \ \ \ \ printf("\\tARGV[%d] = \<less\>%s\<gtr\>\\n",
4303
4304     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Optind,
4305     ARGV[Optind])
4306
4307     <item> \ \ \ }
4308
4309     <item>}
4310   </nf-chunk||>
4311
4312   The entire getopt.awk is made out of these chunks in order
4313
4314   <\nf-chunk|getopt.awk>
4315     <item>=\<less\>\\chunkref{getopt.awk-header}\<gtr\>
4316
4317     <item>
4318
4319     <item>=\<less\>\\chunkref{getopt.awk-notes}\<gtr\>
4320
4321     <item>=\<less\>\\chunkref{getopt.awk-getopt()}\<gtr\>
4322
4323     <item>=\<less\>\\chunkref{getopt.awk-begin}\<gtr\>
4324   </nf-chunk||>
4325
4326   Although we only want the header and function:
4327
4328   <\nf-chunk|getopt>
4329     <item># try: locate getopt.awk for the full original file
4330
4331     <item># as part of your standard awk installation
4332
4333     <item>=\<less\>\\chunkref{getopt.awk-header}\<gtr\>
4334
4335     <item>
4336
4337     <item>=\<less\>\\chunkref{getopt.awk-getopt()}\<gtr\>
4338   </nf-chunk||>
4339
4340   <chapter|Fangle LaTeX source code><label|latex-source>
4341
4342   <section|fangle module>
4343
4344   Here we define a <LyX> <verbatim|.module> file that makes it convenient to
4345   use <LyX> for writing such literate programs.
4346
4347   This file <verbatim|./fangle.module> can be installed in your personal
4348   <verbatim|.lyx/layouts> folder. You will need to run Tools-\<gtr\>Reconfigure so that
4349   <LyX> notices it. It adds a new format Chunk, which should precede every
4350   listing and contain the chunk name.
4351
4352   <\nf-chunk|./fangle.module>
4353     <item>#\\DeclareLyXModule{Fangle Literate Listings}
4354
4355     <item>#DescriptionBegin
4356
4357     <item># \ Fangle literate listings allow one to write
4358
4359     <item># \ \ literate programs after the fashion of noweb, but without
4360     having
4361
4362     <item># \ \ to use noweave to generate the documentation. Instead the
4363     listings
4364
4365     <item># \ \ package is extended in conjunction with the noweb package to
4366     implement
4367
4368     <item># \ \ the code formatting directly as latex.
4369
4370     <item># \ The fangle awk script
4371
4372     <item>#DescriptionEnd
4373
4374     <item>
4375
4376     <item>=\<less\>\\chunkref{gpl3-copyright.hashed}\<gtr\>
4377
4378     <item>
4379
4380     <item>Format 11
4381
4382     <item>
4383
4384     <item>AddToPreamble
4385
4386     <item>=\<less\>\\chunkref{./fangle.sty}\<gtr\>
4387
4388     <item>EndPreamble
4389
4390     <item>
4391
4392     <item>=\<less\>\\chunkref{chunkstyle}\<gtr\>
4393
4394     <item>
4395
4396     <item>=\<less\>\\chunkref{chunkref}\<gtr\>
4397   </nf-chunk|lyx-module|>
4398
4399   Because <LyX> modules are not yet a language supported by fangle or
4400   lstlistings, we resort to this fake awk chunk below in order to have each
4401   line of the GPL3 license commence with a #
4402
4403   <\nf-chunk|gpl3-copyright.hashed>
4404     <item>#=\<less\>\\chunkref{gpl3-copyright}\<gtr\>
4405
4406     <item>
4407   </nf-chunk|awk|>
4408
4409   <subsection|The Chunk style>
4410
4411   The purpose of the <name|chunk> style is to make it easier for <LyX> users
4412   to provide the name to <verbatim|lstlistings>. Normally this requires
4413   right-clicking on the listing, choosing settings, advanced, and then typing
4414   <verbatim|name=chunk-name>. This has the further disadvantage that the name
4415   (and other options) are not generally visible during document editing.
4416
4417   The chunk style is defined as a <LaTeX> command, so that all text on the
4418   same line is passed to the <verbatim|LaTeX> command <verbatim|Chunk>. This
4419   makes it easy to parse using <verbatim|fangle>, and easy to pass these
4420   options on to the listings package. The first word in a chunk section
4421   should be the chunk name, and will have <verbatim|name=> prepended to it.
4422   Any other words are accepted arguments to <verbatim|lstset>.
4423
4424   We set PassThru to 1 because the user is actually entering raw latex.
4425
4426   <\nf-chunk|chunkstyle>
4427     <item>Style Chunk
4428
4429     <item> \ LatexType \ \ \ \ \ \ \ \ \ \ \ \ Command
4430
4431     <item> \ LatexName \ \ \ \ \ \ \ \ \ \ \ \ Chunk
4432
4433     <item> \ Margin \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ First_Dynamic
4434
4435     <item> \ LeftMargin \ \ \ \ \ \ \ \ \ \ \ Chunk:xxx
4436
4437     <item> \ LabelSep \ \ \ \ \ \ \ \ \ \ \ \ \ xx
4438
4439     <item> \ LabelType \ \ \ \ \ \ \ \ \ \ \ \ Static
4440
4441     <item> \ LabelString \ \ \ \ \ \ \ \ \ \ "Chunk:"
4442
4443     <item> \ Align \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Left
4444
4445     <item> \ PassThru \ \ \ \ \ \ \ \ \ \ \ \ \ 1
4446
4447     <item>
4448   </nf-chunk||>
4449
4450   To make the label very visible we choose a larger font coloured red.
4451
4452   <\nf-chunk|chunkstyle>
4453     <item> \ LabelFont
4454
4455     <item> \ \ \ Family \ \ \ \ \ \ \ \ \ \ \ \ \ Sans
4456
4457     <item> \ \ \ Size \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Large
4458
4459     <item> \ \ \ Series \ \ \ \ \ \ \ \ \ \ \ \ \ Bold
4460
4461     <item> \ \ \ Shape \ \ \ \ \ \ \ \ \ \ \ \ \ \ Italic
4462
4463     <item> \ \ \ Color \ \ \ \ \ \ \ \ \ \ \ \ \ \ red
4464
4465     <item> \ EndFont
4466
4467     <item>End
4468   </nf-chunk||>
4469
4470   <subsection|The chunkref style>
4471
4472   We also define the Chunkref style which can be used to express cross
4473   references to chunks.
4474
4475   <\nf-chunk|chunkref>
4476     <item>InsetLayout Chunkref
4477
4478     <item> \ LyxType \ \ \ \ \ \ \ \ \ \ \ \ \ \ charstyle
4479
4480     <item> \ LatexType \ \ \ \ \ \ \ \ \ \ \ \ Command
4481
4482     <item> \ LatexName \ \ \ \ \ \ \ \ \ \ \ \ chunkref
4483
4484     <item> \ PassThru \ \ \ \ \ \ \ \ \ \ \ \ \ 1
4485
4486     <item> \ LabelFont \ \ \ \ \ \ \ \ \ \ \ \
4487
4488     <item> \ \ \ Shape \ \ \ \ \ \ \ \ \ \ \ \ \ \ Italic
4489
4490     <item> \ \ \ Color \ \ \ \ \ \ \ \ \ \ \ \ \ \ red
4491
4492     <item> \ EndFont
4493
4494     <item>End
4495   </nf-chunk||>
4496
4497   <section|Latex Macros><label|sec:Latex-Macros>
4498
4499   We require the listings, noweb and xargs packages. As noweb defines its
4500   own <verbatim|\\code> environment, we re-define here the one that the <LyX>
4501   logical markup module expects.
4502
4503   <\nf-chunk|./fangle.sty>
4504     <item>\\usepackage{listings}%
4505
4506     <item>\\usepackage{noweb}%
4507
4508     <item>\\usepackage{xargs}%
4509
4510     <item>\\renewcommand{\\code}[1]{\\texttt{#1}}%
4511   </nf-chunk|tex|>
4512
4513   We also define a <verbatim|CChunk> macro, for use as:
4514   <verbatim|\\begin{CChunk}> which will need renaming to
4515   <verbatim|\\begin{Chunk}> when I can do this without clashing with
4516   <verbatim|\\Chunk>.
4517
4518   <\nf-chunk|./fangle.sty>
4519     <item>\\lstnewenvironment{Chunk}{\\relax}{\\relax}%
4520   </nf-chunk||>
4521
4522   We also define a suitable <verbatim|\\lstset> of parameters that suit the
4523   literate programming style after the fashion of <name|noweave>.
4524
4525   <\nf-chunk|./fangle.sty>
4526     <item>\\lstset{numbers=left, stepnumber=5, numbersep=5pt,
4527
4528     <item> \ \ \ \ \ \ \ breaklines=false,basicstyle=\\ttfamily,
4529
4530     <item> \ \ \ \ \ \ \ numberstyle=\\tiny, language=C}%
4531   </nf-chunk||>
4532
4533   We also define a notangle-like mechanism for escaping to <LaTeX> from the
4534   listing, and by which we can refer to other listings. We declare the
4535   <verbatim|=\<less\>...\<gtr\>> sequence to contain <LaTeX> code, and
4536   include another chunk like this: <verbatim|=\<less\>\\chunkref{chunkname}\<gtr\>>.
4537   However, because <verbatim|=\<less\>...\<gtr\>> is already defined to
4538   contain <LaTeX> code for this document --- this is a fangle document after
4539   all --- the code fragment below effectively contains the <LaTeX> code:
4540   <verbatim|}{>. To avoid problems with document generation, I had to declare
4541   an lstlistings property: <verbatim|escapeinside={}> for this listing only;
4542   which in <LyX> was done by right-clicking the listings inset, choosing
4543   settings-\<gtr\>advanced. Therefore <verbatim|=\<less\>> isn't interpreted
4544   literally here, in a listing when the escape sequence is already defined as
4545   shown... we need to somehow escape this representation...
4546
4547   <\nf-chunk|./fangle.sty>
4548     <item>\\lstset{escapeinside={=\<less\>}{\<gtr\>}}%
4549   </nf-chunk||>
4550
4551   Although our macros will contain the <verbatim|@> symbol, they will be
4552   included in a <verbatim|\\makeatletter> section by <LyX>; however we keep
4553   the commented out <verbatim|\\makeatletter> as a reminder. The listings
4554   package likes to centre the titles, but noweb titles are specially
4555   formatted and must be left aligned. The simplest way to do this turned out
4556   to be by removing the definition of <verbatim|\\lst@maketitle>. This may
4557   interact badly if other listings want a regular title or caption. We
4558   remember the old maketitle in case we need it.
4559
4560   <\nf-chunk|./fangle.sty>
4561     <item>%\\makeatletter
4562
4563     <item>%somehow re-defining maketitle gives us a left-aligned title
4564
4565     <item>%which is exactly what our specially formatted title needs!
4566
4567     <item>\\global\\let\\fangle@lst@maketitle\\lst@maketitle%
4568
4569     <item>\\global\\def\\lst@maketitle{}%
4570   </nf-chunk||>
4571
4572   <subsection|The chunk command><label|sub:The-chunk-command>
4573
4574   Our chunk command accepts one argument, and calls <verbatim|\\lstset>.
4575   Although <verbatim|\\lstset> will note the name, this is erased when the
4576   next <verbatim|\\lstlisting> starts, so we make a note of it in
4577   <verbatim|\\lst@chunkname> and restore it in the lstlistings Init hook.
4578
4579   <\nf-chunk|./fangle.sty>
4580     <item>\\def\\Chunk#1{%
4581
4582     <item> \ \\lstset{title={\\fanglecaption},name=#1}%
4583
4584     <item> \ \\global\\edef\\lst@chunkname{\\lst@intname}%
4585
4586     <item>}%
4587
4588     <item>\\def\\lst@chunkname{\\empty}%
4589   </nf-chunk||>
4590
4591   <subsubsection|Chunk parameters>
4592
4593   Fangle permits parameterized chunks, and requires the parameters to be
4594   specified as listings options. The fangle script uses this, and although we
4595   don't do anything with these in the <LaTeX> code right now, we need to stop
4596   the listings package complaining.
4597
4598   <\nf-chunk|./fangle.sty>
4599     <item>\\lst@Key{params}\\relax{\\def\\fangle@chunk@params{#1}}%
4600   </nf-chunk||>
4601
4602   As it is common to define a chunk which then needs appending to another
4603   chunk, and annoying to have to declare a single line chunk to manage the
4604   include, we support an append= option.
4605
4606   <\nf-chunk|./fangle.sty>
4607     <item>\\lst@Key{append}\\relax{\\def\\fangle@chunk@append{#1}}%
4608   </nf-chunk||>
4609
4610   <subsection|The noweb styled caption>
4611
4612   We define a public macro <verbatim|\\fanglecaption> which can be set as a
4613   regular title. By means of <verbatim|\\protect>, it expands to
4614   <verbatim|\\fangle@caption> at the appropriate time when the caption is
4615   emitted.
4616
4617   <nf-chunk|./fangle.sty|\\def\\fanglecaption{\\protect\\fangle@caption}%||>
4618
4619   <\big-figure>
4620     22c <math|\<langle\>>some-chunk 19b<math|\<rangle\>><math|\<equiv\>>+
4621     \ \ <math|\<vartriangleleft\>>22b 24d<math|\<vartriangleright\>>
4622
4623     \;
4624
4625     In this example, the current chunk is 22c, and therefore the third chunk
4626     on page 22.
4627
4628     Its name is some-chunk.\
4629
4630     The first chunk with this name (19b) occurs as the second chunk on page
4631     19.
4632
4633     The previous chunk (22b) with the same name is the second chunk on page
4634     22.
4635
4636     The next chunk (24d) is the fourth chunk on page 24.
4637   </big-figure|Noweb Heading<label|noweb heading>>
4638
4639   The general noweb output format compactly identifies the current chunk, and
4640   references to the first chunk, and the previous and next chunks that have
4641   the same name.
4642
4643   This means that we need to keep a counter for each chunk-name, that we use
4644   to count chunks of the same name.
4645
4646   <subsection|The chunk counter>
4647
4648   It would be natural to have a counter for each chunk name, but TeX would
4649   soon run out of counters<\footnote>
4650     ...soon did run out of counters and so I had to re-write the LaTeX macros
4651     to share a counter as described here.
4652   </footnote>, so we have one counter which we save at the end of a chunk and
4653   restore at the beginning of a chunk.
4654
4655   <\nf-chunk|./fangle.sty>
4656     <item>\\newcounter{fangle@chunkcounter}%
4657   </nf-chunk||>
4658
4659   We construct the name of the variable that stores the counter by prefixing
4660   the text <verbatim|lst-chunk-> onto the chunk's own name, and store it in
4661   <verbatim|\\chunkcount>.\
4662
4663   We save the counter like this:
4664
4665   <nf-chunk|save-counter|\\global\\expandafter\\edef\\csname
4666   \\chunkcount\\endcsname{\\arabic{fangle@chunkcounter}}%||>
4667
4668   and restore the counter like this:
4669
4670   <nf-chunk|restore-counter|\\setcounter{fangle@chunkcounter}{\\csname
4671   \\chunkcount\\endcsname}%||>
4672
4673   If there does not already exist a variable whose name is stored in
4674   <verbatim|\\chunkcount>, then we know we are the first chunk with this
4675   name, and we define that variable with the value 0.\
4676
4677   Although chunks of the same name share a common counter, they must still be
4678   distinguished. We use the internal name of the listing, suffixed by the
4679   counter value. So the first chunk might be <verbatim|something-1> and the
4680   second chunk be <verbatim|something-2>, etc.
4681
4682   We also calculate the name of the previous chunk if we can (before we
4683   increment the chunk counter). If this is the first chunk of that name, then
4684   <verbatim|\\prevchunkname> is set to <verbatim|\\relax> which the noweb
4685   package will interpret as not existing.
4686
4687   <\nf-chunk|./fangle.sty>
4688     <item>\\def\\fangle@caption{%
4689
4690     <item> \ \\edef\\chunkcount{lst-chunk-\\lst@intname}%
4691
4692     <item> \ \\@ifundefined{\\chunkcount}{%
4693
4694     <item> \ \ \ \\expandafter\\gdef\\csname \\chunkcount\\endcsname{0}%
4695
4696     <item> \ \ \ \\setcounter{fangle@chunkcounter}{\\csname
4697     \\chunkcount\\endcsname}%
4698
4699     <item> \ \ \ \\let\\prevchunkname\\relax%
4700
4701     <item> \ }{%
4702
4703     <item> \ \ \ \\setcounter{fangle@chunkcounter}{\\csname
4704     \\chunkcount\\endcsname}%
4705
4706     <item> \ \ \ \\edef\\prevchunkname{\\lst@intname-\\arabic{fangle@chunkcounter}}%
4707
4708     <item> \ }%
4709   </nf-chunk||>
4710
4711   After incrementing the chunk counter, we then define the name of this
4712   chunk, as well as the name of the first chunk.
4713
4714   <\nf-chunk|./fangle.sty>
4715     <item> \ \\addtocounter{fangle@chunkcounter}{1}%
4716
4717     <item> \ \\global\\expandafter\\edef\\csname
4718     \\chunkcount\\endcsname{\\arabic{fangle@chunkcounter}}%
4719
4720     <item> \ \\edef\\chunkname{\\lst@intname-\\arabic{fangle@chunkcounter}}%
4721
4722     <item> \ \\edef\\firstchunkname{\\lst@intname-1}%
4723   </nf-chunk||>
4724
4725   We now need to calculate the name of the next chunk. We do this by
4726   temporarily skipping the counter on by one; however there may not actually
4727   be another chunk with this name! We detect this by also defining a label
4728   for each chunk based on the chunkname. If there is a next chunkname then it
4729   will define a label with that name. As labels are persistent, we can at
4730   least tell the second time <LaTeX> is run. If we don't find such a defined
4731   label then we define <verbatim|\\nextchunkname> to <verbatim|\\relax>.
4732
4733   <\nf-chunk|./fangle.sty>
4734     <item> \ \\addtocounter{fangle@chunkcounter}{1}%
4735
4736     <item> \ \\edef\\nextchunkname{\\lst@intname-\\arabic{fangle@chunkcounter}}%
4737
4738     <item> \ \\@ifundefined{r@label-\\nextchunkname}{\\let\\nextchunkname\\relax}{}%
4739   </nf-chunk||>
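
  The bookkeeping above can be modelled outside TeX. The following shell
  sketch is only an illustration and is not part of fangle.sty; the function
  name chunk_links and its arguments are hypothetical. It takes a chunk name,
  the position of this chunk among the chunks of that name, and the total
  number of such chunks (standing in for the labels left behind by the
  previous LaTeX run), and prints the names that the macros would compute.

```sh
# chunk_links NAME N TOTAL  (hypothetical helper, for illustration only)
# NAME  - internal listing name
# N     - value of the chunk counter after incrementing
# TOTAL - number of chunks with this name found on the last run
chunk_links() {
  name=$1 n=$2 total=$3
  this="$name-$n"
  first="$name-1"
  # The first chunk has no previous chunk (the macros use \relax here).
  if [ "$n" -gt 1 ]; then prev="$name-$((n - 1))"; else prev=none; fi
  # The next name only counts if a chunk with that number really exists.
  if [ "$n" -lt "$total" ]; then next="$name-$((n + 1))"; else next=none; fi
  echo "this=$this first=$first prev=$prev next=$next"
}

chunk_links something 2 3
# prints: this=something-2 first=something-1 prev=something-1 next=something-3
```

  For the only chunk of a name (N and TOTAL both 1) the sketch reports both
  prev and next as none, which corresponds to a heading without navigation
  hints.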
4740
4741   The noweb package requires that we define a <verbatim|\\sublabel> for every
4742   chunk, with a unique name, which is then used to print out its navigation
4743   hints.
4744
4745   We also define a regular label for this chunk, as was mentioned above when
4746   we calculated <verbatim|\\nextchunkname>. This requires <LaTeX> to be run
4747   at least twice after new chunk sections are added --- but noweb required
4748   that anyway.
4749
4750   <\nf-chunk|./fangle.sty>
4751     <item> \ \\sublabel{\\chunkname}%
4752
4753     <item>% define this label for every chunk instance, so we
4754
4755     <item>% can tell when we are the last chunk of this name
4756
4757     <item> \ \\label{label-\\chunkname}%
4758   </nf-chunk||>
4759
4760   We also try to add the chunk to the list of listings, but I'm afraid we
4761   don't do very well. We want each chunk name listed once, with all of its
4762   references.
4763
4764   <\nf-chunk|./fangle.sty>
4765     <item> \ \\addcontentsline{lol}{lstlisting}{\\lst@name~[\\protect\\subpageref{\\chunkname}]}%
4766   </nf-chunk||>
4767
4768   We then call the noweb output macros in the same way that noweave generates
4769   them, except that we don't need to call <verbatim|\\nwstartdeflinemarkup>
4770   or <verbatim|\\nwenddeflinemarkup> <emdash> and if we do, it messes up the
4771   output somewhat.
4772
4773   <\nf-chunk|./fangle.sty>
4774     <item> \ \\nwmargintag{%
4775
4776     <item> \ \ \ {%
4777
4778     <item> \ \ \ \ \ \\nwtagstyle{}%
4779
4780     <item> \ \ \ \ \ \\subpageref{\\chunkname}%
4781
4782     <item> \ \ \ }%
4783
4784     <item> \ }%
4785
4786     <item>%
4787
4788     <item> \ \\moddef{%
4789
4790     <item> \ \ \ {\\lst@name}%
4791
4792     <item> \ \ \ {%
4793
4794     <item> \ \ \ \ \ \\nwtagstyle{}\\/%
4795
4796     <item> \ \ \ \ \ \\@ifundefined{fangle@chunk@params}{}{%
4797
4798     <item> \ \ \ \ \ \ \ (\\fangle@chunk@params)%
4799
4800     <item> \ \ \ \ \ }%
4801
4802     <item> \ \ \ \ \ [\\csname \\chunkcount\\endcsname]~%
4803
4804     <item> \ \ \ \ \ \\subpageref{\\firstchunkname}%
4805
4806     <item> \ \ \ }%
4807
4808     <item> \ \ \ \\@ifundefined{fangle@chunk@append}{}{%
4809
4810     <item> \ \ \ \\ifx{}\\fangle@chunk@append{x}\\else%
4811
4812     <item> \ \ \ \ \ \ \ ,~add~to~\\fangle@chunk@append%
4813
4814     <item> \ \ \ \\fi%
4815
4816     <item> \ \ \ }%
4817
4818     <item>\\global\\def\\fangle@chunk@append{}%
4819
4820     <item>\\lstset{append=x}%
4821
4822     <item> \ }%
4823
4824     <item>%
4825
4826     <item> \ \\ifx\\relax\\prevchunkname\\endmoddef\\else\\plusendmoddef\\fi%
4827
4828     <item>% \ \\nwstartdeflinemarkup%
4829
4830     <item> \ \\nwprevnextdefs{\\prevchunkname}{\\nextchunkname}%
4831
4832     <item>% \ \\nwenddeflinemarkup%
4833
4834     <item>}%
4835   </nf-chunk||>
4836
4837   Originally this was developed as a <verbatim|listings> aspect, in the Init
4838   hook, but it was found easier to affect the title without using a hook
4839   <emdash> <verbatim|\\lst@AddToHookExe{PreSet}> is still required to set the
4840   listings name to the name passed to the <verbatim|\\Chunk> command, though.
4841
4842   <\nf-chunk|./fangle.sty>
4843     <item>%\\lst@BeginAspect{fangle}
4844
4845     <item>%\\lst@Key{fangle}{true}[t]{\\lstKV@SetIf{#1}{true}}
4846
4847     <item>\\lst@AddToHookExe{PreSet}{\\global\\let\\lst@intname\\lst@chunkname}
4848
4849     <item>\\lst@AddToHook{Init}{}%\\fangle@caption}
4850
4851     <item>%\\lst@EndAspect
4852   </nf-chunk||>
4853
4854   <subsection|Cross references>
4855
4856   We define the <verbatim|\\chunkref> command which makes it easy to generate
4857   visual references to different code chunks, e.g.
4858
4859   <block|<tformat|<table|<row|<cell|Macro>|<cell|Appearance>>|<row|<cell|<verbatim|\\chunkref{preamble}>>|<cell|>>|<row|<cell|<verbatim|\\chunkref[3]{preamble}>>|<cell|>>|<row|<cell|<verbatim|\\chunkref{preamble}(arg1,
4860   arg2)>>|<cell|>>>>>
4861
4862   Chunkref can also be used within a code chunk to include another code
4863   chunk. The third optional parameter to chunkref is a comma-separated list
4864   of arguments, which will replace defined parameters in the chunkref.
4865
4866   <\note>
4867     Darn it, if I have: <verbatim|=\<less\>\\chunkref{new-mode-tracker}[{chunks[chunk_name,
4868     "language"]},{mode}]\<gtr\>> the inner braces (inside [ ]) cause _ to
4869     signify subscript even though we have <verbatim|lst@ReplaceIn>
4870   </note>
4871
4872   <\nf-chunk|./fangle.sty>
4873     <item>\\def\\chunkref@args#1,{%
4874
4875     <item> \ \\def\\arg{#1}%
4876
4877     <item> \ \\lst@ReplaceIn\\arg\\lst@filenamerpl%
4878
4879     <item> \ \\arg%
4880
4881     <item> \ \\@ifnextchar){\\relax}{, \\chunkref@args}%
4882
4883     <item>}%
4884
4885     <item>\\newcommand\\chunkref[2][0]{%
4886
4887     <item> \ \\@ifnextchar({\\chunkref@i{#1}{#2}}{\\chunkref@i{#1}{#2}()}%
4888
4889     <item>}%
4890
4891     <item>\\def\\chunkref@i#1#2(#3){%
4892
4893     <item> \ \\def\\zero{0}%
4894
4895     <item> \ \\def\\chunk{#2}%
4896
4897     <item> \ \\def\\chunkno{#1}%
4898
4899     <item> \ \\def\\chunkargs{#3}%
4900
4901     <item> \ \\ifx\\chunkno\\zero%
4902
4903     <item> \ \ \ \\def\\chunkname{#2-1}%
4904
4905     <item> \ \\else%
4906
4907     <item> \ \ \ \\def\\chunkname{#2-\\chunkno}%
4908
4909     <item> \ \\fi%
4910
4911     <item> \ \\let\\lst@arg\\chunk%
4912
4913     <item> \ \\lst@ReplaceIn\\chunk\\lst@filenamerpl%
4914
4915     <item> \ \\LA{%\\moddef{%
4916
4917     <item> \ \ \ {\\chunk}%
4918
4919     <item> \ \ \ {%
4920
4921     <item> \ \ \ \ \ \\nwtagstyle{}\\/%
4922
4923     <item> \ \ \ \ \ \\ifx\\chunkno\\zero%
4924
4925     <item> \ \ \ \ \ \\else%
4926
4927     <item> \ \ \ \ \ [\\chunkno]%
4928
4929     <item> \ \ \ \ \ \\fi%
4930
4931     <item> \ \ \ \ \ \\ifx\\chunkargs\\empty%
4932
4933     <item> \ \ \ \ \ \\else%
4934
4935     <item> \ \ \ \ \ \ \ (\\chunkref@args #3,)%
4936
4937     <item> \ \ \ \ \ \\fi%
4938
4939     <item> \ \ \ \ \ ~\\subpageref{\\chunkname}%
4940
4941     <item> \ \ \ }%
4942
4943     <item> \ }%
4944
4945     <item> \ \\RA%\\endmoddef%
4946
4947     <item>}%
4948   </nf-chunk||>
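
  How <verbatim|\\chunkref> chooses the label it points at can be summarised
  in a few lines. This shell sketch is an illustration only; chunkref_label
  is a hypothetical name. An omitted or zero chunk number refers to the first
  chunk of that name, and any other number refers to the chunk with that
  number.

```sh
# chunkref_label NAME [N]  (hypothetical helper, for illustration only)
chunkref_label() {
  name=$1
  n=${2:-0}            # the optional chunk number defaults to 0
  if [ "$n" -eq 0 ]; then
    echo "$name-1"     # no number given: point at the first chunk
  else
    echo "$name-$n"    # otherwise point at the numbered chunk
  fi
}

chunkref_label preamble      # prints: preamble-1
chunkref_label preamble 3    # prints: preamble-3
```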
4949
4950   <subsection|The end>
4951
4952   <\nf-chunk|./fangle.sty>
4953     <item>%
4954
4955     <item>%\\makeatother
4956   </nf-chunk||>
4957
4958   <chapter|Extracting fangle>
4959
4960   <section|Extracting from Lyx>
4961
4962   To extract from <LyX>, you will need to configure <LyX> as explained in
4963   section <reference|Configuring-the-build>.
4964
4965   <label|lyx-build-script>And this lyx-build scrap will extract fangle for
4966   me.
4967
4968   <\nf-chunk|lyx-build>
4969     <item>#! /bin/sh
4970
4971     <item>set -x
4972
4973     <item>
4974
4975     <item>=\<less\>\\chunkref{lyx-build-helper}\<gtr\>
4976
4977     <item>cd $PROJECT_DIR \|\| exit 1
4978
4979     <item>
4980
4981     <item>/usr/local/bin/fangle -R./fangle $TEX_SRC \<gtr\> ./fangle
4982
4983     <item>/usr/local/bin/fangle -R./fangle.module $TEX_SRC \<gtr\>
4984     ./fangle.module
4985
4986     <item>
4987
4988     <item>=\<less\>\\chunkref{test:helpers}\<gtr\>
4989
4990     <item>export FANGLE=./fangle
4991
4992     <item>export TMP=${TMP:-/tmp}
4993
4994     <item>=\<less\>\\chunkref{test:run-tests}\<gtr\>
4995
4996     <item># Now check that we can extract a fangle that also passes the
4997     tests!
4998
4999     <item>$FANGLE -R./fangle $TEX_SRC \<gtr\> ./new-fangle
5000
5001     <item>export FANGLE=./new-fangle
5002
5003     <item>=\<less\>\\chunkref{test:run-tests}\<gtr\>
5004   </nf-chunk|sh|>
5005
5006   <\nf-chunk|test:run-tests>
5007     <item># run tests
5008
5009     <item>$FANGLE -Rpca-test.awk $TEX_SRC \| awk -f - \|\| exit 1
5010
5011     <item>=\<less\>\\chunkref{test:cromulence}\<gtr\>
5012
5013     <item>=\<less\>\\chunkref{test:escapes}\<gtr\>
5014
5015     <item>=\<less\>\\chunkref{test:chunk-params}\<gtr\>
5016   </nf-chunk|sh|>
5017
5018   The build script above relies on this lyx-build-helper:
5019
5020   <\nf-chunk|lyx-build-helper>
5021     <item>PROJECT_DIR="$LYX_r"
5022
5023     <item>LYX_SRC="$PROJECT_DIR/${LYX_i%.tex}.lyx"
5024
5025     <item>TEX_DIR="$LYX_p"
5026
5027     <item>TEX_SRC="$TEX_DIR/$LYX_i"
5028   </nf-chunk|sh|>
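
  To see what the helper computes, here is a sketch with made-up values for
  the three input variables. The directory and file names are assumptions
  chosen for illustration and will differ on a real system.

```sh
# Hypothetical input values, for illustration only.
LYX_r=/home/user/project
LYX_p=/tmp/lyx_tmpdir
LYX_i=fangle.tex

PROJECT_DIR="$LYX_r"
# ${LYX_i%.tex} strips a trailing ".tex" before ".lyx" is appended.
LYX_SRC="$PROJECT_DIR/${LYX_i%.tex}.lyx"
TEX_DIR="$LYX_p"
TEX_SRC="$TEX_DIR/$LYX_i"

echo "$LYX_SRC"    # prints: /home/user/project/fangle.lyx
echo "$TEX_SRC"    # prints: /tmp/lyx_tmpdir/fangle.tex
```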
5029
5030   <section|Extracting documentation>
5031
5032   <\nf-chunk|./gen-www>
5033     <item>#python -m elyxer --css lyx.css $LYX_SRC \| \\
5034
5035     <item># \ iconv -c -f utf-8 -t ISO-8859-1//TRANSLIT \| \\
5036
5037     <item># \ sed 's/UTF-8"\\(.\\)\<gtr\>/ISO-8859-1"\\1\<gtr\>/' \<gtr\>
5038     www/docs/fangle.html
5039
5040     <item>
5041
5042     <item>python -m elyxer --css lyx.css --iso885915 --html --destdirectory
5043     www/docs/fangle.e \\
5044
5045     <item> \ \ \ \ \ \ fangle.lyx \<gtr\> www/docs/fangle.e/fangle.html
5046
5047     <item>
5048
5049     <item>( mkdir -p www/docs/fangle && cd www/docs/fangle && \\
5050
5051     <item> \ lyx -e latex ../../../fangle.lyx && \\
5052
5053     <item> \ htlatex ../../../fangle.tex "xhtml,fn-in" && \\
5054
5055     <item> \ sed -i -e 's/\<less\>!--l\\. [0-9][0-9]* *--\<gtr\>//g'
5056     fangle.html
5057
5058     <item>)
5059
5060     <item>
5061
5062     <item>( mkdir -p www/docs/literate && cd www/docs/literate && \\
5063
5064     <item> \ lyx -e latex ../../../literate.lyx && \\
5065
5066     <item> \ htlatex ../../../literate.tex "xhtml,fn-in" && \\
5067
5068     <item> \ sed -i -e 's/\<less\>!--l\\. [0-9][0-9]* *--\<gtr\>$//g'
5069     literate.html
5070
5071     <item>)
5072   </nf-chunk||>
5073
5074   <section|Extracting from the command line>
5075
5076   First you will need the tex output; then you can extract:
5077
5078   <\nf-chunk|lyx-build-manual>
5079     <item>lyx -e latex fangle.lyx
5080
5081     <item>fangle -R./fangle fangle.tex \<gtr\> ./fangle
5082
5083     <item>fangle -R./fangle.module fangle.tex \<gtr\> ./fangle.module
5084   </nf-chunk|sh|>
5085
5086   <section|Testing>
5087
5088   <\nf-chunk|test:helpers>
5089     <item>passtest() {
5090
5091     <item> \ if "$@"
5092
5093     <item> \ then echo "Passed"
5094
5095     <item> \ else echo "Failed"
5096
5097     <item> \ \ \ \ \ \ return 1
5098
5099     <item> \ fi
5100
5101     <item>}
5102
5103     <item>
5104
5105     <item>failtest() {
5106
5107     <item> \ if ! "$@"
5108
5109     <item> \ then echo "Passed"
5110
5111     <item> \ else echo "Failed"
5112
5113     <item> \ \ \ \ \ \ return 1
5114
5115     <item> \ fi
5116
5117     <item>}
5118   </nf-chunk||>
5119
5120   <part|Tests>
5121
5122   <chapter|Chunk Parameters>
5123
5124   <\nf-chunk|test:chunk-params:sub>
5125     <item>I see a ${THING},
5126
5127     <item>a ${THING} of colour ${colour},\
5128
5129     <item>and looking closer =\<less\>\\chunkref{test:chunk-params:sub:sub}(${colour})\<gtr\>
5130   </nf-chunk||<tuple|THING|colour>>
5131
5132   <\nf-chunk|test:chunk-params:sub:sub>
5133     <item>a funny shade of ${colour}
5134   </nf-chunk||<tuple|colour>>
5135
5136   <\nf-chunk|test:chunk-params:text>
5137     <item>What do you see? "=\<less\>\\chunkref{test:chunk-params:sub}(joe,
5138     red)\<gtr\>"
5139
5140     <item>Well, fancy!
5141   </nf-chunk||>
5142
5143   This should generate the output:
5144
5145   <\nf-chunk|test:chunk-params:result>
5146     <item>What do you see? "I see a joe,
5147
5148     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ a joe of colour red,\
5149
5150     <item> \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ and looking closer a funny shade
5151     of red"
5152
5153     <item>Well, fancy!
5154   </nf-chunk||>
5155
5156   And this chunk will perform the test:
5157
5158   <\nf-chunk|test:chunk-params>
5159     <item>$FANGLE -Rtest:chunk-params:result $TEX_SRC \<gtr\> $TMP/answer
5160     \|\| exit 1
5161
5162     <item>$FANGLE -Rtest:chunk-params:text $TEX_SRC \<gtr\> $TMP/result \|\|
5163     exit 1
5164
5165     <item>passtest diff $TMP/answer $TMP/result \|\| { echo
5166     test:chunk-params:text failed ; exit 1 ; }
5167   </nf-chunk||>
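
  The substitution that this test exercises can be modelled roughly in shell.
  This is only a sketch and not how fangle itself is implemented;
  expand_params and its NAME=VALUE arguments are hypothetical, and values are
  assumed to contain no characters that are special to sed, such as / or &.
  It does not reproduce the indentation of included chunks shown in the
  expected result.

```sh
# expand_params TEXT NAME=VALUE...  (hypothetical helper, illustration only)
expand_params() {
  text=$1
  shift
  for pair in "$@"; do
    name=${pair%%=*}     # part before the first "="
    value=${pair#*=}     # part after the first "="
    # [$] matches a literal dollar sign, so this replaces ${NAME}.
    text=$(printf '%s\n' "$text" | sed "s/[$]{$name}/$value/g")
  done
  printf '%s\n' "$text"
}

expand_params 'a ${THING} of colour ${colour},' THING=joe colour=red
# prints: a joe of colour red,
```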
5168
5169   <chapter|Compile-log-lyx><label|Compile-log-lyx>
5170
5171   <\nf-chunk|Chunk:./compile-log-lyx>
5172     <item>#! /bin/sh
5173
5174     <item># can't use gtkdialog -i, cos it uses the "source" command which
5175     ubuntu sh doesn't have
5176
5177     <item>
5178
5179     <item>main() {
5180
5181     <item> \ errors="/tmp/compile.log.$$"
5182
5183     <item># \ if grep '^[^ ]*:\\( In \\\|[0-9][0-9]*: [^ ]*:\\)' \<gtr\>
5184     $errors
5185
5186     <item>if grep '^[^ ]*(\\([0-9][0-9]*\\)) *: *\\(error\\\|warning\\)'
5187     \<gtr\> $errors
5188
5189     <item> \ then
5190
5191     <item> \ \ \ sed -i -e 's/^[^ ]*[/\\\\]\\([^/\\\\]*\\)(\\([ 0-9][
5192     0-9]*\\)) *: */\\1:\\2\|\\2\|/' $errors
5193
5194     <item> \ \ \ COMPILE_DIALOG='
5195
5196     <item> \<less\>vbox\<gtr\>
5197
5198     <item> \ \<less\>text\<gtr\>
5199
5200     <item> \ \ \ \<less\>label\<gtr\>Compiler errors:\<less\>/label\<gtr\>
5201
5202     <item> \ \<less\>/text\<gtr\>
5203
5204     <item> \ \<less\>tree exported_column="0"\<gtr\>
5205
5206     <item> \ \ \ \<less\>variable\<gtr\>LINE\<less\>/variable\<gtr\>
5207
5208     <item> \ \ \ \<less\>height\<gtr\>400\<less\>/height\<gtr\>\<less\>width\<gtr\>800\<less\>/width\<gtr\>
5209
5210     <item> \ \ \ \<less\>label\<gtr\>File \| Line \|
5211     Message\<less\>/label\<gtr\>
5212
5213     <item> \ \ \ \<less\>action\<gtr\>'". $SELF ; "'lyxgoto
5214     $LINE\<less\>/action\<gtr\>
5215
5216     <item> \ \ \ \<less\>input\<gtr\>'"cat $errors"'\<less\>/input\<gtr\>
5217
5218     <item> \ \<less\>/tree\<gtr\>
5219
5220     <item> \ \<less\>hbox\<gtr\>
5221
5222     <item> \ \ \<less\>button\<gtr\>\<less\>label\<gtr\>Build\<less\>/label\<gtr\>
5223
5224     <item> \ \ \ \ \<less\>action\<gtr\>lyxclient -c "LYXCMD:build-program"
5225     &\<less\>/action\<gtr\>
5226
5227     <item> \ \ \<less\>/button\<gtr\>
5228
5229     <item> \ \ \<less\>button ok\<gtr\>\<less\>/button\<gtr\>
5230
5231     <item> \ \<less\>/hbox\<gtr\>
5232
5233     <item> \<less\>/vbox\<gtr\>
5234
5235     <item>'
5236
5237     <item> \ \ \ export COMPILE_DIALOG
5238
5239     <item> \ \ \ ( gtkdialog --program=COMPILE_DIALOG ; rm $errors ) &
5240
5241     <item> \ else
5242
5243     <item> \ \ \ rm $errors
5244
5245     <item> \ fi
5246
5247     <item>}
5248
5249     <item>
5250
5251     <item>lyxgoto() {
5252
5253     <item> \ file="${LINE%:*}"
5254
5255     <item> \ line="${LINE##*:}"
5256
5257     <item> \ extraline=`cat $file \| head -n $line \| tac \| sed
5258     '/^\\\\\\\\begin{lstlisting}/q' \| wc -l`
5259
5260     <item> \ extraline=`expr $extraline - 1`
5261
5262     <item> \ lyxclient -c "LYXCMD:command-sequence server-goto-file-row $file
5263     $line ; char-forward ; repeat $extraline paragraph-down ;
5264     paragraph-up-select"
5265
5266     <item>}
5267
5268     <item>
5269
5270     <item>SELF="$0"
5271
5272     <item>if test -z "$COMPILE_DIALOG"
5273
5274     <item>then main "$@"\
5275
5276     <item>fi
5277   </nf-chunk|sh|>
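
  The two parameter expansions in lyxgoto split a value of the form
  file:line into its parts. A small sketch, with a hypothetical function name
  and a made-up input:

```sh
# split_location FILE:LINE  (hypothetical helper, illustration only)
split_location() {
  LINE=$1
  file="${LINE%:*}"     # remove the shortest trailing ":..." part
  line="${LINE##*:}"    # remove everything up to the last ":"
  echo "$file $line"
}

split_location "fangle.c:42"    # prints: fangle.c 42
```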
5278
5279   \;
5280 </body>
5281
5282 <\initial>
5283   <\collection>
5284     <associate|info-flag|short>
5285     <associate|page-medium|paper>
5286     <associate|page-screen-height|982016tmpt>
5287     <associate|page-screen-margin|false>
5288     <associate|page-screen-width|1686528tmpt>
5289     <associate|preamble|false>
5290     <associate|sfactor|5>
5291   </collection>
5292 </initial>
5293
5294 <\references>
5295 </references>
5296
5297 <\auxiliary>
5298 </auxiliary>