FAQ.txt

   1 .. -*- coding: utf-8 -*-
   2
   3
   4 .. NOTE TO MAINTAINERS: Please add new questions to the end of their
   5    sections, so section/question numbers remain stable.
   6
   7
   8 ===========================================
   9  Docutils FAQ (Frequently Asked Questions)
  10 ===========================================
  11
  12 :Date: $Date$
  13 :Revision: $Revision$
  14 :Web site: http://docutils.sourceforge.net/
  15 :Copyright: This document has been placed in the public domain.
  16
  17 .. contents::
  18 .. sectnum::
  19
  20
  21 This is a work in progress.  If you are reading a local copy, the
  22 `master copy`_ might be newer.  This document uses are relative links;
  23 if they don't work, please use the `master copy`_.
  24
  25 Please feel free to ask questions and/or provide answers; send email
  26 to the `Docutils-users`_ mailing list.  Project members should feel
  27 free to edit the source text file directly.
  28
  29 .. _master copy: http://docutils.sourceforge.net/FAQ.html
  30 .. _let us know:
  31 .. _Docutils-users: docs/user/mailing-lists.html#docutils-users
  32
  33
  34
  35 Docutils
  36 ========
  37
  38 What is Docutils?
  39 -----------------
  40
  41 Docutils_ is a system for processing plaintext documentation into
  42 useful formats, such as HTML, XML, and LaTeX.  It supports multiple
  43 types of input, such as standalone files (implemented), inline
  44 documentation from Python modules and packages (under development),
  45 `PEPs (Python Enhancement Proposals)`_ (implemented), and others as
  46 discovered.
  47
  48 The Docutils distribution consists of:
  49
  50 * a library (the "docutils" package), which `can be used by client
  51   code`_;
  52 * several `front-end tools`_ (such as ``rst2html.py``, which converts
  53   reStructuredText input into HTML output);
  54 * a `test suite`_; and
  55 * extensive documentation_.
  56
  57 For an overview of the Docutils project implementation, see `PEP
  58 258`_, "Docutils Design Specification".
  59
  60 Docutils is implemented in Python_.
  61
  62 .. _Docutils: http://docutils.sourceforge.net/
  63 .. _PEPs (Python Enhancement Proposals):
  64    http://www.python.org/peps/pep-0012.html
  65 .. _can be used by client code: docs/api/publisher.html
  66 .. _front-end tools: docs/user/tools.html
  67 .. _test suite: docs/dev/testing.html
  68 .. _documentation: docs/index.html
  69 .. _PEP 258: http://www.python.org/peps/pep-0258.html
  70 .. _Python: http://www.python.org/
  71
  72
  73 Why is it called "Docutils"?
  74 ----------------------------
  75
  76 Docutils is short for "Python Documentation Utilities".  The name
  77 "Docutils" was inspired by "Distutils", the Python Distribution
  78 Utilities architected by Greg Ward, a component of Python's standard
  79 library.
  80
  81 The earliest known use of the term "docutils" in a Python context was
  82 a `fleeting reference`__ in a message by Fred Drake on 1999-12-02 in
  83 the Python Doc-SIG mailing list.  It was suggested `as a project
  84 name`__ on 2000-11-27 on Doc-SIG, again by Fred Drake, in response to
  85 a question from Tony "Tibs" Ibbs: "What do we want to *call* this
  86 thing?".  This was shortly after David Goodger first `announced
  87 reStructuredText`__ on Doc-SIG.
  88
  89 Tibs used the name "Docutils" for `his effort`__ "to document what the
  90 Python docutils package should support, with a particular emphasis on
  91 documentation strings".  Tibs joined the current project (and its
  92 predecessors) and graciously donated the name.
  93
  94 For more history of reStructuredText and the Docutils project, see `An
  95 Introduction to reStructuredText`_.
  96
  97 Please note that the name is "Docutils", not "DocUtils" or "Doc-Utils"
  98 or any other variation.  It is pronounced as in "DOCumentation
  99 UTILitieS", with emphasis on the first syllable.
 100
 101 .. _An Introduction to reStructuredText: docs/ref/rst/introduction.html
 102 __ http://mail.python.org/pipermail/doc-sig/1999-December/000878.html
 103 __ http://mail.python.org/pipermail/doc-sig/2000-November/001252.html
 104 __ http://mail.python.org/pipermail/doc-sig/2000-November/001239.html
 105 __ http://homepage.ntlworld.com/tibsnjoan/docutils/STpy.html
 106
 107
 108 Is there a GUI authoring environment for Docutils?
 109 --------------------------------------------------
 110
 111 DocFactory_ is under development.  It uses wxPython and looks very
 112 promising.
 113
 114 .. _DocFactory:
 115    http://docutils.sf.net/sandbox/gschwant/docfactory/doc/
 116
 117
 118 What is the status of the Docutils project?
 119 -------------------------------------------
 120
 121 Although useful and relatively stable, Docutils is experimental code,
 122 with APIs and architecture subject to change.
 123
 124 Our highest priority is to fix bugs as they are reported.  So the
 125 latest code from the repository_ (or the snapshots_) is almost always
 126 the most stable (bug-free) as well as the most featureful.
 127
 128
 129 What is the Docutils project release policy?
 130 --------------------------------------------
 131
 132 It's "release early & often".  We also have automatically-generated
 133 snapshots_ which always contain the latest code from the repository_.
 134 As the project matures, we may formalize on a
 135 stable/development-branch scheme, but we're not using anything like
 136 that yet.
 137
 138 .. _repository: docs/dev/repository.html
 139 .. _snapshots: http://docutils.sourceforge.net/#download
 140
 141
 142 reStructuredText
 143 ================
 144
 145 What is reStructuredText?
 146 -------------------------
 147
 148 reStructuredText_ is an easy-to-read, what-you-see-is-what-you-get
 149 plaintext markup syntax and parser system.  The reStructuredText
 150 parser is a component of Docutils_.  reStructuredText is a revision
 151 and reinterpretation of the StructuredText_ and Setext_ lightweight
 152 markup systems.
 153
 154 If you are reading this on the web, you can see for yourself.  `The
 155 source for this FAQ <FAQ.txt>`_ is written in reStructuredText; open
 156 it in another window and compare them side by side.
 157
 158 `A ReStructuredText Primer`_ and the `Quick reStructuredText`_ user
 159 reference are a good place to start.  The `reStructuredText Markup
 160 Specification`_ is a detailed technical specification.
 161
 162 .. _A ReStructuredText Primer: docs/user/rst/quickstart.html
 163 .. _Quick reStructuredText: docs/user/rst/quickref.html
 164 .. _reStructuredText Markup Specification:
 165    docs/ref/rst/restructuredtext.html
 166 .. _reStructuredText: http://docutils.sourceforge.net/rst.html
 167 .. _StructuredText:
 168    http://dev.zope.org/Members/jim/StructuredTextWiki/FrontPage/
 169 .. _Setext: http://docutils.sourceforge.net/mirror/setext.html
 170
 171
 172 Why is it called "reStructuredText"?
 173 ------------------------------------
 174
 175 The name came from a combination of "StructuredText", one of
 176 reStructuredText's predecessors, with "re": "revised", "reworked", and
 177 "reinterpreted", and as in the ``re.py`` regular expression module.
 178 For a detailed history of reStructuredText and the Docutils project,
 179 see `An Introduction to reStructuredText`_.
 180
 181 "reStructuredText" is **ONE** word, *not two!*
 182
 183
 184 What's the standard abbreviation for "reStructuredText"?
 185 --------------------------------------------------------
 186
 187 "RST" and "ReST" (or "reST") are both acceptable.  Care should be
 188 taken with capitalization, to avoid confusion with "REST__", an
 189 acronym for "Representational State Transfer".
 190
 191 The abbreviations "reSTX" and "rSTX"/"rstx" should **not** be used;
 192 they overemphasize reStructuredText's precedessor, Zope's
 193 StructuredText.
 194
 195 __ http://en.wikipedia.org/wiki/Representational_State_Transfer
 196
 197
 198 What's the standard filename extension for a reStructuredText file?
 199 -------------------------------------------------------------------
 200
 201 It's ".txt".  Some people would like to use ".rest" or ".rst" or
 202 ".restx", but why bother?  ReStructuredText source files are meant to
 203 be readable as plaintext, and most operating systems already associate
 204 ".txt" with text files.  Using a specialized filename extension would
 205 require that users alter their OS settings, which is something that
 206 many users will not be willing or able to do.
 207
 208 Also see `What's the official MIME type for reStructuredText data?`_
 209
 210
 211 Are there any reStructuredText editor extensions?
 212 -------------------------------------------------
 213
 214 See `Editor Support for reStructuredText`__.
 215
 216 __ tools/editors/README.html
 217
 218
 219 How can I indicate the document title?  Subtitle?
 220 -------------------------------------------------
 221
 222 A uniquely-adorned section title at the beginning of a document is
 223 treated specially, as the document title.  Similarly, a
 224 uniquely-adorned section title immediately after the document title
 225 becomes the document subtitle.  For example::
 226
 227     This is the Document Title
 228     ==========================
 229
 230     This is the Document Subtitle
 231     -----------------------------
 232
 233     Here's an ordinary paragraph.
 234
 235 Counterexample::
 236
 237     Here's an ordinary paragraph.
 238
 239     This is *not* a Document Title
 240     ==============================
 241
 242     The "ordinary paragraph" above the section title
 243     prevents it from becoming the document title.
 244
 245 Another counterexample::
 246
 247     This is not the Document Title,  because...
 248     ===========================================
 249
 250     Here's an ordinary paragraph.
 251
 252     ... the title adornment is not unique
 253     =====================================
 254
 255     Another ordinary paragraph.
 256
 257
 258 How can I represent esoteric characters (e.g. character entities) in a document?
 259 --------------------------------------------------------------------------------
 260
 261 For example, say you want an em-dash (XML character entity &mdash;,
 262 Unicode character U+2014) in your document: use a real em-dash.
 263 Insert concrete characters (e.g. type a *real* em-dash) into your
 264 input file, using whatever encoding suits your application, and tell
 265 Docutils the input encoding.  Docutils uses Unicode internally, so the
 266 em-dash character is a real em-dash internally.
 267
 268 Emacs users should refer to the `Emacs Support for reStructuredText`__
 269 document.  Tips for other editors are welcome.
 270
 271 __ tools/editors/emacs/README.html
 272
 273 ReStructuredText has no character entity subsystem; it doesn't know
 274 anything about XML charents.  To Docutils, "&mdash;" in input text is
 275 7 discrete characters; no interpretation happens.  When writing HTML,
 276 the "&" is converted to "&amp;", so in the raw output you'd see
 277 "&amp;mdash;".  There's no difference in interpretation for text
 278 inside or outside inline literals or literal blocks -- there's no
 279 character entity interpretation in either case.
 280
 281 If you can't use a Unicode-compatible encoding and must rely on 7-bit
 282 ASCII, there is a workaround.  New in Docutils 0.3.10 is a set of
 283 `Standard Substitution Definition Sets`_, which provide equivalents of
 284 XML & HTML character entity sets as substitution definitions.  For
 285 example, the Japanese yen currency symbol can be used as follows::
 286
 287     .. include:: <xhtml1-lat1.txt>
 288
 289     |yen| 600 for a complete meal?  That's cheap!
 290
 291 For earlier versions of Docutils, equivalent files containing
 292 character entity set substitution definitions using the "unicode_"
 293 directive `are available`_.  Please read the `description and
 294 instructions`_ for use.  Thanks to David Priest for the original idea.
 295
 296 If you insist on using XML-style charents, you'll have to implement a
 297 pre-processing system to convert to UTF-8 or something.  That
 298 introduces complications though; you can no longer *write* about
 299 charents naturally; instead of writing "&mdash;" you'd have to write
 300 "&amp;mdash;".
 301
 302 For the common case of long dashes, you might also want to insert the
 303 following substitution definitons into your document (both of them are
 304 using the "unicode_" directive)::
 305
 306     .. |--| unicode:: U+2013   .. en dash
 307     .. |---| unicode:: U+2014  .. em dash, trimming surrounding whitespace
 308        :trim:
 309
 310 .. |--| unicode:: U+2013   .. en dash
 311 .. |---| unicode:: U+2014  .. em dash, trimming surrounding whitespace
 312    :trim:
 313
 314 Now you can write dashes using pure ASCII: "``foo |--| bar; foo |---|
 315 bar``", rendered as "foo |--| bar; foo |---| bar".  (Note that Mozilla
 316 and Firefox may render this incorrectly.)  The ``:trim:`` option for
 317 the em dash is necessary because you cannot write "``foo|---|bar``";
 318 thus you need to add spaces ("``foo |---| bar``") and advise the
 319 reStructuredText parser to trim the spaces.
 320
 321 .. _Standard Substitution Definition Sets: docs/ref/rst/substitutions.html
 322 .. _unicode: docs/ref/rst/directives.html#unicode-character-codes
 323 .. _are available: http://docutils.sourceforge.net/tmp/charents/
 324 .. _tarball: http://docutils.sourceforge.net/tmp/charents.tgz
 325 .. _description and instructions:
 326    http://docutils.sourceforge.net/tmp/charents/README.html
 327 .. _to-do list: docs/dev/todo.html
 328
 329
 330 How can I generate backticks using a Scandinavian keyboard?
 331 -----------------------------------------------------------
 332
 333 The use of backticks in reStructuredText is a bit awkward with
 334 Scandinavian keyboards, where the backtick is a "dead" key.  To get
 335 one ` character one must press SHIFT-` + SPACE.
 336
 337 Unfortunately, with all the variations out there, there's no way to
 338 please everyone.  For Scandinavian programmers and technical writers,
 339 this is not limited to reStructuredText but affects many languages and
 340 environments.
 341
 342 Possible solutions include
 343
 344 * If you have to input a lot of backticks, simply type one in the
 345   normal/awkward way, select it, copy and then paste the rest (CTRL-V
 346   is a lot faster than SHIFT-` + SPACE).
 347
 348 * Use keyboard macros.
 349
 350 * Remap the keyboard.  The Scandinavian keyboard layout is awkward for
 351   other programming/technical characters too; for example, []{}
 352   etc. are a bit awkward compared to US keyboards.
 353
 354   According to Axel Kollmorgen,
 355
 356       Under Windows, you can use the `Microsoft Keyboard Layout Creator
 357       <http://www.microsoft.com/globaldev/tools/msklc.mspx>`__ to easily
 358       map the backtick key to a real backtick (no dead key). took me
 359       five minutes to load my default (german) keyboard layout, untick
 360       "Dead Key?" from the backtick key properties ("in all shift
 361       states"), "build dll and setup package", install the generated
 362       .msi, and add my custom keyboard layout via Control Panel >
 363       Regional and Language Options > Languages > Details > Add
 364       Keyboard layout (and setting it as default "when you start your
 365       computer").
 366
 367 * Use a virtual/screen keyboard or character palette, such as:
 368
 369   - `Web-based keyboards <http://keyboard.lab.co.il/>`__ (IE only
 370     unfortunately).
 371   - Windows: `Click-N-Type <http://www.lakefolks.org/cnt/>`__.
 372   - Mac OS X: the Character Palette can store a set of favorite
 373     characters for easy input.  Open System Preferences,
 374     International, Input Menu tab, enable "Show input menu in menu
 375     bar", and be sure that Character Palette is enabled in the list.
 376
 377 If anyone knows of other/better solutions, please `let us know`_.
 378
 379
 380 Are there any tools for HTML/XML-to-reStructuredText?  (Round-tripping)
 381 -----------------------------------------------------------------------
 382
 383 People have tossed the idea around, and some implementations of
 384 reStructuredText-generating tools can be found in the `Docutils Link
 385 List`_.
 386
 387 There's no reason why reStructuredText should not be round-trippable
 388 to/from XML; any technicalities which prevent round-tripping would be
 389 considered bugs.  Whitespace would not be identical, but paragraphs
 390 shouldn't suffer.  The tricky parts would be the smaller details, like
 391 links and IDs and other bookkeeping.
 392
 393 For HTML, true round-tripping may not be possible.  Even adding lots
 394 of extra "class" attributes may not be enough.  A "simple HTML" to RST
 395 filter is possible -- for some definition of "simple HTML" -- but HTML
 396 is used as dumb formatting so much that such a filter may not be
 397 particularly useful.  An 80/20 approach should work though: build a
 398 tool that does 80% of the work automatically, leaving the other 20%
 399 for manual tweaks.
 400
 401 .. _Docutils Link List: docs/user/links.html
 402
 403
 404 Are there any Wikis that use reStructuredText syntax?
 405 -----------------------------------------------------
 406
 407 There are several, with various degrees of completeness.  With no
 408 implied endorsement or recommendation, and in no particular order:
 409
 410 * `Ian Bicking's experimental code
 411   <http://docutils.sf.net/sandbox/ianb/wiki/Wiki.py>`__
 412
 413 * `MoinMoin <http://moinmoin.wikiwikiweb.de/>`__ has some support;
 414   `here's a sample <http://moinmoin.wikiwikiweb.de/RestSample>`__
 415
 416 * Zope-based `Zwiki <http://zwiki.org/>`__
 417
 418 * Zope3-based Zwiki (in the Zope 3 source tree as
 419   ``zope.products.zwiki``)
 420
 421 * `StikiWiki <http://mithrandr.moria.org/code/stikiwiki/>`__
 422
 423 * `Trac <http://trac.edgewall.com//>`__ `supports using
 424   reStructuredText
 425   <http://trac.edgewall.com//wiki/WikiRestructuredText>`__ as
 426   an alternative to wiki markup. This includes support for `TracLinks
 427   <http://trac.edgewall.com//wiki/TracLinks>`__ from within
 428   RST text via a custom RST reference-directive or, even easier, an
 429   interpreted text role 'trac'
 430
 431 Please `let us know`_ of any other reStructuredText Wikis.
 432
 433 .. dead link
 434 .. The example application for the `Web Framework Shootout
 435 .. <http://colorstudy.com/docs/shootout.html>`__ article is a Wiki using
 436 .. reStructuredText.
 437
 438
 439 Are there any Weblog (Blog) projects that use reStructuredText syntax?
 440 ----------------------------------------------------------------------
 441
 442 With no implied endorsement or recommendation, and in no particular
 443 order:
 444
 445 * `Firedrop <http://www.voidspace.org.uk/python/firedrop2/>`__
 446 * `PyBloxsom <http://pyblosxom.sourceforge.net/>`__
 447 * `Lino WebMan <http://lino.sourceforge.net/webman.html>`__
 448 * `Pelican <http://blog.getpelican.com/>`__
 449   (also  listed `on PyPi <http://pypi.python.org/pypi/pelican>`__)
 450
 451 Please `let us know`_ of any other reStructuredText Blogs.
 452
 453
 454 .. _Can lists be indented without generating block quotes?:
 455
 456 How should I mark up lists?
 457 ---------------------------
 458
 459 Bullet_ & enumerated_ list markup is very intuitive but there are 2
 460 points that must be noted:
 461
 462 .. _bullet: docs/ref/rst/restructuredtext.html#bullet-lists
 463 .. _enumerated: docs/ref/rst/restructuredtext.html#enumerated-lists
 464
 465 1. Lists should **not** be indented.  This is correct::
 466
 467        paragraph
 468
 469        * list item 1
 470
 471          * nested item 1.1
 472          * nested item 1.2
 473
 474        * list item 2
 475
 476    while this is probably incorrect::
 477
 478        paragraph
 479
 480          * list item 1
 481
 482              * nested item 1.1
 483              * nested item 1.2
 484
 485          * list item 2
 486
 487    The extra indentation (of the list containing items 1.1 and 1.2) is
 488    recognized as a block quote.  This is usually not what you mean and
 489    it causes the list in the output to be indented too much.
 490
 491 2. There **must** be blank lines around list items, except between
 492    items of the same level, where blank lines are optional.  The
 493    example above shows this.
 494
 495 Note that formatting of the *output* is independent of the input, and
 496 is decided by the writer and the stylesheet.  For instance, lists
 497 *are* indented in HTML output by default.  See `How are lists
 498 formatted in HTML?`_ for details.
 499
 500
 501 Could lists be indented without generating block quotes?
 502 --------------------------------------------------------
 503
 504 Some people like to write lists with indentation but don't intend a
 505 blockquote context.  There has been a lot of discussion about allowing
 506 this in reStructuredText, but there are some issues that would need to
 507 be resolved before it could be implemented.  There is a summary of the
 508 issues and pointers to the discussions in `the to-do list`__.
 509
 510 __ docs/dev/todo.html#indented-lists
 511
 512
 513 Could the requirement for blank lines around lists be relaxed?
 514 --------------------------------------------------------------
 515
 516 Short answer: no.
 517
 518 In reStructuredText, it would be impossible to unambigously mark up
 519 and parse lists without blank lines before and after.  Deeply nested
 520 lists may look ugly with so many blank lines, but it's a price we pay
 521 for unambiguous markup.  Some other plaintext markup systems do not
 522 require blank lines in nested lists, but they have to compromise
 523 somehow, either accepting ambiguity or requiring extra complexity.
 524 For example, `Epytext <http://epydoc.sf.net/epytext.html#list>`__ does
 525 not require blank lines around lists, but it does require that lists
 526 be indented and that ambiguous cases be escaped.
 527
 528
 529 How can I include mathematical equations in documents?
 530 ------------------------------------------------------
 531
 532 Use the `math directive`_ and `math role`_, available since Docutils 0.8.
 533
 534 .. _math directive: docs/ref/rst/directives.html#math
 535 .. _math role: docs/ref/rst/roles.html#math
 536
 537
 538 Is nested inline markup possible?
 539 ---------------------------------
 540
 541 Not currently, no.  It's on the `to-do list`__ (`details here`__), and
 542 hopefully will be part of the reStructuredText parser soon.  At that
 543 time, markup like this will become possible::
 544
 545     Here is some *emphasized text containing a `hyperlink`_ and
 546     ``inline literals``*.
 547
 548 __ docs/dev/todo.html#nested-inline-markup
 549 __ docs/dev/rst/alternatives.html#nested-inline-markup
 550
 551 There are workarounds, but they are either convoluted or ugly or both.
 552 They are not recommended.
 553
 554 * Inline markup can be combined with hyperlinks using `substitution
 555   definitions`__ and references__ with the `"replace" directive`__.
 556   For example::
 557
 558       Here is an |emphasized hyperlink|_.
 559
 560       .. |emphasized hyperlink| replace:: *emphasized hyperlink*
 561       .. _emphasized hyperlink: http://example.org
 562
 563   It is not possible for just a portion of the replacement text to be
 564   a hyperlink; it's the entire replacement text or nothing.
 565
 566   __ docs/ref/rst/restructuredtext.html#substitution-definitions
 567   __ docs/ref/rst/restructuredtext.html#substitution-references
 568   __ docs/ref/rst/directives.html#replace
 569
 570 * The `"raw" directive`__ can be used to insert raw HTML into HTML
 571   output::
 572
 573       Here is some |stuff|.
 574
 575       .. |stuff| raw:: html
 576
 577          <em>emphasized text containing a
 578          <a href="http://example.org">hyperlink</a> and
 579          <tt>inline literals</tt></em>
 580
 581   Raw LaTeX is supported for LaTeX output, etc.
 582
 583   __ docs/ref/rst/directives.html#raw
 584
 585
 586 How to indicate a line break or a significant newline?
 587 ------------------------------------------------------
 588
 589 `Line blocks`__ are designed for address blocks, verse, and other
 590 cases where line breaks are significant and must be preserved.  Unlike
 591 literal blocks, the typeface is not changed, and inline markup is
 592 recognized.  For example::
 593
 594     | A one, two, a one two three four
 595     |
 596     | Half a bee, philosophically,
 597     |     must, *ipso facto*, half not be.
 598     | But half the bee has got to be,
 599     |     *vis a vis* its entity.  D'you see?
 600     |
 601     | But can a bee be said to be
 602     |     or not to be an entire bee,
 603     |         when half the bee is not a bee,
 604     |             due to some ancient injury?
 605     |
 606     | Singing...
 607
 608 __ docs/ref/rst/restructuredtext.html#line-blocks
 609
 610 Here's a workaround for manually inserting explicit line breaks in
 611 HTML output::
 612
 613     .. |br| raw:: html
 614
 615        <br />
 616
 617     I want to break this line here: |br| this is after the break.
 618
 619     If the extra whitespace bothers you, |br|\ backslash-escape it.
 620
 621
 622 A URL containing asterisks doesn't work.  What to do?
 623 -----------------------------------------------------
 624
 625 Asterisks are valid URL characters (see :RFC:`2396`), sometimes used
 626 in URLs.  For example::
 627
 628     http://cvs.example.org/viewcvs.py/*checkout*/module/file
 629
 630 Unfortunately, the parser thinks the asterisks are indicating
 631 emphasis.  The slashes serve as delineating punctuation, allowing the
 632 asterisks to be recognized as markup.  The example above is separated
 633 by the parser into a truncated URL, an emphasized word, and some
 634 regular text::
 635
 636     http://cvs.example.org/viewcvs.py/
 637     *checkout*
 638     /module/file
 639
 640 To turn off markup recognition, use a backslash to escape at least the
 641 first asterisk, like this::
 642
 643     http://cvs.example.org/viewcvs.py/\*checkout*/module/file
 644
 645 Escaping the second asterisk doesn't hurt, but it isn't necessary.
 646
 647
 648 How can I make a literal block with *some* formatting?
 649 ------------------------------------------------------
 650
 651 Use the `parsed-literal`_ directive.
 652
 653 .. _parsed-literal: docs/ref/rst/directives.html#parsed-literal
 654
 655 Scenario: a document contains some source code, which calls for a
 656 literal block to preserve linebreaks and whitespace.  But part of the
 657 source code should be formatted, for example as emphasis or as a
 658 hyperlink.  This calls for a *parsed* literal block::
 659
 660     .. parsed-literal::
 661
 662        print "Hello world!"  # *tricky* code [1]_
 663
 664 The emphasis (``*tricky*``) and footnote reference (``[1]_``) will be
 665 parsed.
 666
 667
 668 Can reStructuredText be used for web or generic templating?
 669 -----------------------------------------------------------
 670
 671 Docutils and reStructuredText can be used with or as a component of a
 672 templating system, but they do not themselves include templating
 673 functionality.  Templating should simply be left to dedicated
 674 templating systems.  Users can choose a templating system to apply to
 675 their reStructuredText documents as best serves their interests.
 676
 677 There are many good templating systems for Python (ht2html_, YAPTU_,
 678 Quixote_'s PTL, Cheetah_, etc.; see this non-exhaustive list of `some
 679 other templating systems`_), and many more for other languages, each
 680 with different approaches.  We invite you to try several and find one
 681 you like.  If you adapt it to use Docutils/reStructuredText, please
 682 consider contributing the code to Docutils or `let us know`_ and we'll
 683 keep a list here.
 684
 685 One reST-specific web templating system is `rest2web
 686 <http://www.voidspace.org.uk/python/rest2web>`_, a tool for
 687 automatically building websites, or parts of websites.
 688
 689 .. _ht2html: http://ht2html.sourceforge.net/
 690 .. _YAPTU:
 691    http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/52305
 692 .. _Quixote: http://www.mems-exchange.org/software/quixote/
 693 .. _Cheetah: http://www.cheetahtemplate.org/
 694 .. _some other templating systems:
 695    http://webware.sourceforge.net/Papers/Templates/
 696
 697
 698 How can I mark up a FAQ or other list of questions & answers?
 699 -------------------------------------------------------------
 700
 701 There is no specific syntax for FAQs and Q&A lists.  Here are two
 702 options:
 703
 704 1. For a FAQ (Frequently Asked Questions, usually with answers), a
 705    convenient way to mark up the questions is as section titles, with
 706    the answer(s) as section content.  This document is marked up in
 707    this way.
 708
 709    The advantages of using section titles for questions are: sections
 710    can be numbered automatically, and a table of contents can be
 711    generated automatically.  One limitation of this format is that
 712    questions must fit on one line (section titles may not wrap, in the
 713    source text).  For very long questions, the title may be a summary
 714    of the question, with the full question in the section body.
 715
 716 2. Field lists work well as Q&A lists::
 717
 718        :Q: What kind of questions can we
 719            put here?
 720
 721        :A: Any kind we like!
 722
 723    In order to separate questions, lists can be used:
 724
 725        1. :Q: What kind of question can we
 726               put here?
 727           :A: Any kind we like!
 728
 729        2. :Q: How many answers can a question have?
 730           :A: It can have one,
 731           :A: or more.
 732           :A3: Answers can be numbered like this.
 733           :A: 1. Or like this.
 734               2. We're flexible!
 735
 736    If you don't want to number or otherwise mark questions, you can
 737    use an empty comment between individual field lists to separate
 738    them::
 739
 740        :Q: First question?
 741        :A: Answer.
 742
 743        ..
 744
 745        :Q: Second question?
 746        :A: Answer.
 747
 748
 749 .. _bidi:
 750
 751 Can I produce documents in right-to-left languages?
 752 ---------------------------------------------------
 753
 754 Languages written from right to left, such as Arabic and Hebrew, must
 755 be reordered according to the `Unicode Bidi Algorithm`_.  This
 756 requires support from the editor and special markup in the output
 757 format.
 758
 759 The source format of reStructuredText is relatively bidi-friendly:
 760 most constructs are denoted by punctuation without intrusion of
 761 English and when you must write in English, it's usually on a separate
 762 line.  So any editor that auto-detects direction per-line (like gedit
 763 or geresh_) will suffice.
 764
 765 Moreover, it's possible to translate_ all reStructuredText keywords.
 766 This was not yet done for any RTL language, but when it is, it will be
 767 possible to write an RTL document with vitually no English.  This will
 768 allow reasonable use of editors limited to a single base direction for
 769 the whole document (like Notepad, Vim and text boxes in Firefox).
 770
 771 .. _Unicode Bidi Algorithm: http://www.unicode.org/reports/tr9/
 772 .. _geresh: http://www.typo.co.il/~mooffie/geresh/
 773 .. _translate: docs/howto/i18n.html
 774
 775 The second problem is bidi markup of the output.  There is an almost
 776 transparent implicit solution for HTML:
 777
 778 * Grab http://cben-hacks.sourceforge.net/bidi/hibidi.py and
 779   http://cben-hacks.sourceforge.net/bidi/rst2html_hibidi.py.
 780   Put them both in the same directory and make them executable.
 781
 782 * Use ``rst2html_hibidi.py`` instead of ``rst2html.py``.
 783
 784 * It infers dir attributes in the HTML from the text.  It does it
 785   hierachically, giving much better results than usual.  You can still
 786   use LRM/RLM and LRE/RLE/PDF control codes to help it.
 787
 788   * If you want the gory details: See the full theory_, and note the
 789     incomplete practice_ (this is still a partial implementation - but
 790     sufficient for most needs).
 791
 792     .. _theory: http://cben-hacks.sf.net/bidi/hibidi.html
 793     .. _practice: http://cben-hacks.sf.net/bidi/hibidi.html#practice
 794
 795 There is also an explicit way to set directions through CSS and
 796 classes in the HTML:
 797
 798 * Copy ``default.css`` to a new file and add relevant parts of the
 799   following::
 800
 801       /* Use these two if the main document direction is RTL */
 802       body { direction: rtl; }
 803       div.sidebar { float: left !important; }
 804
 805       /* The next 3 rules are very useful in documents containing pieces
 806       of code in english */
 807       /* Use this if you all your literal blocks (::) are LTR */
 808       pre {direction: ltr; unicode-bidi: embed; }
 809       /* Use this if you all your inline literals (``) are LTR */
 810       tt {direction: ltr; unicode-bidi: embed; }
 811       /* Use this if you all your interpretted text (`) is LTR */
 812       cite {direction: ltr; unicode-bidi: embed; }
 813
 814       /* Allow manual direction override by class directive and roles */
 815       .rtl { direction: rtl; }
 816       .ltr { direction: ltr; }
 817
 818 * Select this new stylesheet with ``--stylesheet=<file>`` or the
 819   stylesheet_ setting.
 820
 821 * Now if you need to override the direction of some element (from a
 822   paragraph to a whole section), write::
 823
 824       .. class:: rtl
 825
 826   or::
 827
 828       .. class:: ltr
 829
 830   before it (see the class_ directive for details).
 831
 832 * To change the direction of some inline text fragment, you can use
 833   RLE/LRE/PDF control characters, or write ``:rtl:`RTL text``` /
 834   ``:ltr:`RTL text```.  To use the latter syntax, you must write this
 835   once at the beginning of your document::
 836
 837       .. role:: ltr
 838       .. role:: rtl
 839
 840 .. _stylesheet: docs/user/config.html#stylesheet
 841 .. _class: docs/ref/rst/directives.txt#class
 842
 843 LaTeX is quite hard to implement (it doesn't support the bidi
 844 algorithm, so all direction changes - even numbers in RTL text - must
 845 be explicitly marked).  Other formats are more-or-less easy.
 846
 847 If you have any questions/problems/bugs related to bidi with docutils,
 848 ask `Beni Cherniavsky`__ directly or the `Docutils-users`_ mailing
 849 list.
 850
 851 __ mailto:cben@users.sf.net
 852
 853
 854 What's the official MIME type for reStructuredText data?
 855 --------------------------------------------------------
 856
 857 While there is no registered MIME type for reStructuredText, the
 858 "official unofficial" standard MIME type is "text/x-rst".  This was
 859 invented for the build system for PEPs (Python Enhancement Proposals),
 860 and it's used by the python.org web site build system.
 861
 862 (The "x-" prefix means it's an unregistered MIME type.)
 863
 864 Also see `What's the standard filename extension for a
 865 reStructuredText file?`_
 866
 867
 868 HTML Writer
 869 ===========
 870
 871 What is the status of the HTML Writer?
 872 --------------------------------------
 873
 874 The HTML Writer module, ``docutils/writers/html4css1.py``, is a
 875 proof-of-concept reference implementation.  While it is a complete
 876 implementation, some aspects of the HTML it produces may be incompatible
 877 with older browsers or specialized applications (such as web templating).
 878 The sandbox has some alternative HTML writers, contributions are welcome.
 879
 880
 881 What kind of HTML does it produce?
 882 ----------------------------------
 883
 884 It produces XHTML compatible with the `XHTML 1.0`_ specification.  A
 885 cascading stylesheet is required for proper viewing with a modern
 886 graphical browser.  Correct rendering of the HTML produced depends on
 887 the CSS support of the browser.  A general-purpose stylesheet,
 888 ``html4css1.css`` is provided with the code, and is embedded by
 889 default.  It is installed in the "writers/html4css1/" subdirectory
 890 within the Docutils package.  Use the ``--help`` command-line option
 891 to see the specific location on your machine.
 892
 893 .. _XHTML 1.0: http://www.w3.org/TR/xhtml1/
 894
 895
 896 What browsers are supported?
 897 ----------------------------
 898
 899 No specific browser is targeted; all modern graphical browsers should
 900 work.  Some older browsers, text-only browsers, and browsers without
 901 full CSS support are known to produce inferior results.  Firefox,
 902 Safari, Mozilla (version 1.0 and up), Opera, and MS Internet Explorer
 903 (version 5.0 and up) are known to give good results.  Reports of
 904 experiences with other browsers are welcome.
 905
 906
 907 Unexpected results from tools/rst2html.py: H1, H1 instead of H1, H2.  Why?
 908 --------------------------------------------------------------------------
 909
 910 Here's the question in full:
 911
 912     I have this text::
 913
 914         Heading 1
 915         =========
 916
 917         All my life, I wanted to be H1.
 918
 919         Heading 1.1
 920         -----------
 921
 922         But along came H1, and so shouldn't I be H2?
 923         No!  I'm H1!
 924
 925         Heading 1.1.1
 926         *************
 927
 928         Yeah, imagine me, I'm stuck at H3!  No?!?
 929
 930     When I run it through tools/rst2html.py, I get unexpected results
 931     (below).  I was expecting H1, H2, then H3; instead, I get H1, H1,
 932     H2::
 933
 934         ...
 935         <html lang="en">
 936         <head>
 937         ...
 938         <title>Heading 1</title>
 939         </head>
 940         <body>
 941         <div class="document" id="heading-1">
 942         <h1 class="title">Heading 1</h1>                <-- first H1
 943         <p>All my life, I wanted to be H1.</p>
 944         <div class="section" id="heading-1-1">
 945         <h1><a name="heading-1-1">Heading 1.1</a></h1>        <-- H1
 946         <p>But along came H1, and so now I must be H2.</p>
 947         <div class="section" id="heading-1-1-1">
 948         <h2><a name="heading-1-1-1">Heading 1.1.1</a></h2>
 949         <p>Yeah, imagine me, I'm stuck at H3!</p>
 950         ...
 951
 952     What gives?
 953
 954 Check the "class" attribute on the H1 tags, and you will see a
 955 difference.  The first H1 is actually ``<h1 class="title">``; this is
 956 the document title, and the default stylesheet renders it centered.
 957 There can also be an ``<h2 class="subtitle">`` for the document
 958 subtitle.
 959
 960 If there's only one highest-level section title at the beginning of a
 961 document, it is treated specially, as the document title.  (Similarly, a
 962 lone second-highest-level section title may become the document
 963 subtitle.)  See `How can I indicate the document title?  Subtitle?`_ for
 964 details.  Rather than use a plain H1 for the document title, we use ``<h1
 965 class="title">`` so that we can use H1 again within the document.  Why
 966 do we do this?  HTML only has H1-H6, so by making H1 do double duty, we
 967 effectively reserve these tags to provide 6 levels of heading beyond the
 968 single document title.
 969
 970 HTML is being used for dumb formatting for nothing but final display.
 971 A stylesheet *is required*, and one is provided; see `What kind of
 972 HTML does it produce?`_ above.  Of course, you're welcome to roll your
 973 own.  The default stylesheet provides rules to format ``<h1
 974 class="title">`` and ``<h2 class="subtitle">`` differently from
 975 ordinary ``<h1>`` and ``<h2>``::
 976
 977     h1.title {
 978       text-align: center }
 979
 980     h2.subtitle {
 981       text-align: center }
 982
 983 If you don't want the top section heading to be interpreted as a
 984 title at all, disable the `doctitle_xform`_ setting
 985 (``--no-doc-title`` option).  This will interpret your document
 986 differently from the standard settings, which might not be a good
 987 idea.  If you don't like the reuse of the H1 in the HTML output, you
 988 can tweak the `initial_header_level`_ setting
 989 (``--initial-header-level`` option) -- but unless you match its value
 990 to your specific document, you might end up with bad HTML (e.g. H3
 991 without H2).
 992
 993 .. _doctitle_xform: docs/user/config.html#doctitle-xform
 994 .. _initial_header_level: docs/user/config.html#initial-header-level
 995
 996 (Thanks to Mark McEahern for the question and much of the answer.)
 997
 998
 999 How are lists formatted in HTML?
1000 --------------------------------
1001
1002 If list formatting looks strange, first check that you understand
1003 `list markup`__.
1004
1005 __ `How should I mark up lists?`_
1006
1007 * By default, HTML browsers indent lists relative to their context.
1008   This follows a long tradition in browsers (but isn't so established
1009   in print).  If you don't like it, you should change the stylesheet.
1010
1011   This is different from how lists look in reStructuredText source.
1012   Extra indentation in the source indicates a blockquote, resulting in
1013   too much indentation in the browser.
1014
1015 * A list item can contain multiple paragraphs etc.  In complex cases
1016   list items are separated by vertical space.  By default this spacing
1017   is omitted in "simple" lists.  A list is simple if every item
1018   contains a simple paragraph and/or a "simple" nested list.  For
1019   example:
1020
1021       * text
1022
1023         * simple
1024
1025           * simple
1026           * simple
1027
1028         * simple
1029
1030         text after a nested list
1031
1032       * multiple
1033
1034         paragraphs
1035
1036   In this example the nested lists are simple (and should appear
1037   compacted) but the outer list is not.
1038
1039   If you want all lists to have equal spacing, disable the
1040   `compact_lists`_ setting (``--no-compact-lists`` option).  The
1041   precise spacing can be controlled in the stylesheet.
1042
1043   Note again that this is not exactly WYSIWYG: it partially resembles
1044   the rules about blank lines being optional between list items in
1045   reStructuredText -- but adding/removing optional blank lines does
1046   not affect spacing in the output!  It's a feature, not a bug: you
1047   write it as you like but the output is styled consistently.
1048
1049   .. _compact_lists: docs/user/config.html#compact-lists
1050
1051
1052 Why do enumerated lists only use numbers (no letters or roman numerals)?
1053 ------------------------------------------------------------------------
1054
1055 The rendering of enumerators (the numbers or letters acting as list
1056 markers) is completely governed by the stylesheet, so either the
1057 browser can't find the stylesheet (try enabling the
1058 `embed_stylesheet`_ setting [``--embed-stylesheet`` option]), or the
1059 browser can't understand it (try a recent Firefox, Mozilla, Konqueror,
1060 Opera, Safari, or even MSIE).
1061
1062 .. _embed_stylesheet: docs/user/config.html#embed-stylesheet
1063
1064
1065 There appear to be garbage characters in the HTML.  What's up?
1066 --------------------------------------------------------------
1067
1068 What you're seeing is most probably not garbage, but the result of a
1069 mismatch between the actual encoding of the HTML output and the
1070 encoding your browser is expecting.  Your browser is misinterpreting
1071 the HTML data, which is encoded text.  A discussion of text encodings
1072 is beyond the scope of this FAQ; see one or more of these documents
1073 for more info:
1074
1075 * `UTF-8 and Unicode FAQ for Unix/Linux
1076   <http://www.cl.cam.ac.uk/~mgk25/unicode.html>`_
1077
1078 * Chapters 3 and 4 of `Introduction to i18n [Internationalization]
1079   <http://www.debian.org/doc/manuals/intro-i18n/>`_
1080
1081 * `Python Unicode Tutorial
1082   <http://www.reportlab.com/i18n/python_unicode_tutorial.html>`_
1083
1084 * `Python Unicode Objects: Some Observations on Working With Non-ASCII
1085   Character Sets <http://effbot.org/zone/unicode-objects.htm>`_
1086
1087 The common case is with the default output encoding (UTF-8), when
1088 either numbered sections are used (via the "sectnum_" directive) or
1089 symbol-footnotes.  3 non-breaking spaces are inserted in each numbered
1090 section title, between the generated number and the title text.  Most
1091 footnote symbols are not available in ASCII, nor are non-breaking
1092 spaces.  When encoded with UTF-8 and viewed with ordinary ASCII tools,
1093 these characters will appear to be multi-character garbage.
1094
1095 You may have an decoding problem in your browser (or editor, etc.).
1096 The encoding of the output is set to "utf-8", but your browswer isn't
1097 recognizing that.  You can either try to fix your browser (enable
1098 "UTF-8 character set", sometimes called "Unicode"), or choose a
1099 different encoding for the HTML output.  You can also try
1100 ``--output-encoding=ascii:xmlcharrefreplace`` for HTML or XML, but not
1101 applicable to non-XMLish outputs (if using runtime
1102 settings/configuration files, use ``output_encoding=ascii`` and
1103 ``output_encoding_error_handler=xmlcharrefreplace``).
1104
1105 If you're generating document fragments, the "Content-Type" metadata
1106 (between the HTML ``<head>`` and ``</head>`` tags) must agree with the
1107 encoding of the rest of the document.  For UTF-8, it should be::
1108
1109     <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
1110
1111 Also, Docutils normally generates an XML declaration as the first line
1112 of the output.  It must also match the document encoding.  For UTF-8::
1113
1114     <?xml version="1.0" encoding="utf-8" ?>
1115
1116 .. _sectnum: docs/ref/rst/directives.html#sectnum
1117
1118
1119 How can I retrieve the body of the HTML document?
1120 -------------------------------------------------
1121
1122 (This is usually needed when using Docutils in conjunction with a
1123 templating system.)
1124
1125 You can use the `docutils.core.publish_parts()`_ function, which
1126 returns a dictionary containing an 'html_body_' entry.
1127
1128 .. _docutils.core.publish_parts(): docs/api/publisher.html#publish-parts
1129 .. _html_body: docs/api/publisher.html#html-body
1130
1131
1132 Why is the Docutils XHTML served as "Content-type: text/html"?
1133 --------------------------------------------------------------
1134
1135 Full question:
1136
1137     Docutils' HTML output looks like XHTML and is advertised as such::
1138
1139       <?xml version="1.0" encoding="utf-8" ?>
1140       <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
1141        "http://www.w3.org/TR/xht ml1/DTD/xhtml1-transitional.dtd">
1142
1143     But this is followed by::
1144
1145       <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
1146
1147     Shouldn't this be "application/xhtml+xml" instead of "text/html"?
1148
1149 In a perfect web, the Docutils XHTML output would be 100% strict
1150 XHTML.  But it's not a perfect web, and a major source of imperfection
1151 is Internet Explorer.  Despite it's drawbacks, IE still represents the
1152 majority of web browsers, and cannot be ignored.
1153
1154 Short answer: if we didn't serve XHTML as "text/html" (which is a
1155 perfectly valid thing to do), it couldn't be viewed in Internet
1156 Explorer.
1157
1158 Long answer: see the `"Criticisms of Internet Explorer" Wikipedia
1159 entry <http://en.wikipedia.org/wiki/Criticisms_of_Internet_Explorer#XHTML>`__.
1160
1161 However, there's also `Sending XHTML as text/html Considered
1162 Harmful`__.  What to do, what to do?  We're damned no matter what we
1163 do.  So we'll continue to do the practical instead of the pure:
1164 support the browsers that are actually out there, and not fight for
1165 strict standards compliance.
1166
1167 __ http://hixie.ch/advocacy/xhtml
1168
1169 (Thanks to Martin F. Krafft, Robert Kern, Michael Foord, and Alan
1170 G. Isaac.)
1171
1172
1173 Python Source Reader
1174 ====================
1175
1176 Can I use Docutils for Python auto-documentation?
1177 -------------------------------------------------
1178
1179 Yes, in conjunction with other projects.
1180
1181 The Sphinx_ documentation generator includes an autodoc module.
1182
1183 .. _Sphinx: http://sphinx.pocoo.org/index.html
1184
1185 Version 2.0 of Ed Loper's `Epydoc <http://epydoc.sourceforge.net/>`_
1186 supports reStructuredText-format docstrings for HTML output.  Docutils
1187 0.3 or newer is required.  Development of a Docutils-specific
1188 auto-documentation tool will continue.  Epydoc works by importing
1189 Python modules to be documented, whereas the Docutils-specific tool,
1190 described above, will parse modules without importing them (as with
1191 `HappyDoc <http://happydoc.sourceforge.net/>`_, which doesn't support
1192 reStructuredText).
1193
1194 The advantages of parsing over importing are security and flexibility;
1195 the disadvantage is complexity/difficulty.
1196
1197 * Security: untrusted code that shouldn't be executed can be parsed;
1198   importing a module executes its top-level code.
1199 * Flexibility: comments and unofficial docstrings (those not supported
1200   by Python syntax) can only be processed by parsing.
1201 * Complexity/difficulty: it's a lot harder to parse and analyze a
1202   module than it is to ``import`` and analyze one.
1203
1204 For more details, please see "Docstring Extraction Rules" in `PEP
1205 258`_, item 3 ("How").
1206
1207
1208 Miscellaneous
1209 =============
1210
1211 Is the Docutils document model based on any existing XML models?
1212 ----------------------------------------------------------------
1213
1214 Not directly, no.  It borrows bits from DocBook, HTML, and others.  I
1215 (David Goodger) have designed several document models over the years,
1216 and have my own biases.  The Docutils document model is designed for
1217 simplicity and extensibility, and has been influenced by the needs of
1218 the reStructuredText markup.
1219
1220 \f
1221 ..
1222    Local Variables:
1223    mode: indented-text
1224    indent-tabs-mode: nil
1225    sentence-end-double-space: t
1226    fill-column: 70
1227    End:
1228
1229 .. Here's a code css to make a table colourful::
1230
1231    /* Table: */
1232
1233    th {
1234        background-color: #ede;
1235    }
1236
1237    /* alternating colors in table rows */
1238    table.docutils tr:nth-child(even) {
1239        background-color: #F3F3FF;
1240    }
1241    table.docutils tr:nth-child(odd) {
1242        background-color: #FFFFEE;
1243    }
1244
1245    table.docutils tr {
1246        border-style: solid none solid none;
1247        border-width: 1px 0 1px 0;
1248        border-color: #AAAAAA;
1249    }