docutils/FAQ.txt

   1 .. -*- coding: utf-8 -*-
   2
   3
   4 .. NOTE TO MAINTAINERS: Please add new questions to the end of their
   5    sections, so section/question numbers remain stable.
   6
   7
   8 ===========================================
   9  Docutils FAQ (Frequently Asked Questions)
  10 ===========================================
  11
  12 :Date: $Date$
  13 :Revision: $Revision$
  14 :Web site: http://docutils.sourceforge.net/
  15 :Copyright: This document has been placed in the public domain.
  16
  17 .. contents::
  18 .. sectnum::
  19
  20
  21 This is a work in progress.  If you are reading a local copy, the
  22 `master copy`_ might be newer.  This document uses are relative links;
  23 if they don't work, please use the `master copy`_.
  24
  25 Please feel free to ask questions and/or provide answers; send email
  26 to the `Docutils-users`_ mailing list.  Project members should feel
  27 free to edit the source text file directly.
  28
  29 .. _master copy: http://docutils.sourceforge.net/FAQ.html
  30 .. _let us know:
  31 .. _Docutils-users: docs/user/mailing-lists.html#docutils-users
  32
  33
  34
  35 Docutils
  36 ========
  37
  38 What is Docutils?
  39 -----------------
  40
  41 Docutils_ is a system for processing plaintext documentation into
  42 useful formats, such as HTML, XML, and LaTeX.  It supports multiple
  43 types of input, such as standalone files (implemented), inline
  44 documentation from Python modules and packages (under development),
  45 `PEPs (Python Enhancement Proposals)`_ (implemented), and others as
  46 discovered.
  47
  48 The Docutils distribution consists of:
  49
  50 * a library (the "docutils" package), which `can be used by client
  51   code`_;
  52 * several `front-end tools`_ (such as ``rst2html.py``, which converts
  53   reStructuredText input into HTML output);
  54 * a `test suite`_; and
  55 * extensive documentation_.
  56
  57 For an overview of the Docutils project implementation, see `PEP
  58 258`_, "Docutils Design Specification".
  59
  60 Docutils is implemented in Python_.
  61
  62 .. _Docutils: http://docutils.sourceforge.net/
  63 .. _PEPs (Python Enhancement Proposals):
  64    http://www.python.org/peps/pep-0012.html
  65 .. _can be used by client code: docs/api/publisher.html
  66 .. _front-end tools: docs/user/tools.html
  67 .. _test suite: docs/dev/testing.html
  68 .. _documentation: docs/index.html
  69 .. _PEP 258: http://www.python.org/peps/pep-0258.html
  70 .. _Python: http://www.python.org/
  71
  72
  73 Why is it called "Docutils"?
  74 ----------------------------
  75
  76 Docutils is short for "Python Documentation Utilities".  The name
  77 "Docutils" was inspired by "Distutils", the Python Distribution
  78 Utilities architected by Greg Ward, a component of Python's standard
  79 library.
  80
  81 The earliest known use of the term "docutils" in a Python context was
  82 a `fleeting reference`__ in a message by Fred Drake on 1999-12-02 in
  83 the Python Doc-SIG mailing list.  It was suggested `as a project
  84 name`__ on 2000-11-27 on Doc-SIG, again by Fred Drake, in response to
  85 a question from Tony "Tibs" Ibbs: "What do we want to *call* this
  86 thing?".  This was shortly after David Goodger first `announced
  87 reStructuredText`__ on Doc-SIG.
  88
  89 Tibs used the name "Docutils" for `his effort`__ "to document what the
  90 Python docutils package should support, with a particular emphasis on
  91 documentation strings".  Tibs joined the current project (and its
  92 predecessors) and graciously donated the name.
  93
  94 For more history of reStructuredText and the Docutils project, see `An
  95 Introduction to reStructuredText`_.
  96
  97 Please note that the name is "Docutils", not "DocUtils" or "Doc-Utils"
  98 or any other variation.  It is pronounced as in "DOCumentation
  99 UTILitieS", with emphasis on the first syllable.
 100
 101 .. _An Introduction to reStructuredText: docs/ref/rst/introduction.html
 102 __ http://mail.python.org/pipermail/doc-sig/1999-December/000878.html
 103 __ http://mail.python.org/pipermail/doc-sig/2000-November/001252.html
 104 __ http://mail.python.org/pipermail/doc-sig/2000-November/001239.html
 105 __ http://homepage.ntlworld.com/tibsnjoan/docutils/STpy.html
 106
 107
 108 Is there a GUI authoring environment for Docutils?
 109 --------------------------------------------------
 110
 111 DocFactory_ is under development.  It uses wxPython and looks very
 112 promising.
 113
 114 .. _DocFactory:
 115    http://docutils.sf.net/sandbox/gschwant/docfactory/doc/
 116
 117
 118 What is the status of the Docutils project?
 119 -------------------------------------------
 120
 121 Although useful and relatively stable, Docutils is experimental code,
 122 with APIs and architecture subject to change.
 123
 124 Our highest priority is to fix bugs as they are reported.  So the
 125 latest code from the repository_ (or the snapshots_) is almost always
 126 the most stable (bug-free) as well as the most featureful.
 127
 128
 129 What is the Docutils project release policy?
 130 --------------------------------------------
 131
 132 It's "release early & often".  We also have automatically-generated
 133 snapshots_ which always contain the latest code from the repository_.
 134 As the project matures, we may formalize on a
 135 stable/development-branch scheme, but we're not using anything like
 136 that yet.
 137
 138 .. _repository: docs/dev/repository.html
 139 .. _snapshots: http://docutils.sourceforge.net/#download
 140
 141
 142 reStructuredText
 143 ================
 144
 145 What is reStructuredText?
 146 -------------------------
 147
 148 reStructuredText_ is an easy-to-read, what-you-see-is-what-you-get
 149 plaintext markup syntax and parser system.  The reStructuredText
 150 parser is a component of Docutils_.  reStructuredText is a revision
 151 and reinterpretation of the StructuredText_ and Setext_ lightweight
 152 markup systems.
 153
 154 If you are reading this on the web, you can see for yourself.  `The
 155 source for this FAQ <FAQ.txt>`_ is written in reStructuredText; open
 156 it in another window and compare them side by side.
 157
 158 `A ReStructuredText Primer`_ and the `Quick reStructuredText`_ user
 159 reference are a good place to start.  The `reStructuredText Markup
 160 Specification`_ is a detailed technical specification.
 161
 162 .. _A ReStructuredText Primer: docs/user/rst/quickstart.html
 163 .. _Quick reStructuredText: docs/user/rst/quickref.html
 164 .. _reStructuredText Markup Specification:
 165    docs/ref/rst/restructuredtext.html
 166 .. _reStructuredText: http://docutils.sourceforge.net/rst.html
 167 .. _StructuredText:
 168    http://dev.zope.org/Members/jim/StructuredTextWiki/FrontPage/
 169 .. _Setext: http://docutils.sourceforge.net/mirror/setext.html
 170
 171
 172 Why is it called "reStructuredText"?
 173 ------------------------------------
 174
 175 The name came from a combination of "StructuredText", one of
 176 reStructuredText's predecessors, with "re": "revised", "reworked", and
 177 "reinterpreted", and as in the ``re.py`` regular expression module.
 178 For a detailed history of reStructuredText and the Docutils project,
 179 see `An Introduction to reStructuredText`_.
 180
 181
 182 What's the standard abbreviation for "reStructuredText"?
 183 --------------------------------------------------------
 184
 185 "RST" and "ReST" (or "reST") are both acceptable.  Care should be
 186 taken with capitalization, to avoid confusion with "REST__", an
 187 acronym for "Representational State Transfer".
 188
 189 The abbreviations "reSTX" and "rSTX"/"rstx" should **not** be used;
 190 they overemphasize reStructuredText's precedessor, Zope's
 191 StructuredText.
 192
 193 __ http://en.wikipedia.org/wiki/Representational_State_Transfer
 194
 195
 196 What's the standard filename extension for a reStructuredText file?
 197 -------------------------------------------------------------------
 198
 199 It's ".txt".  Some people would like to use ".rest" or ".rst" or
 200 ".restx", but why bother?  ReStructuredText source files are meant to
 201 be readable as plaintext, and most operating systems already associate
 202 ".txt" with text files.  Using a specialized filename extension would
 203 require that users alter their OS settings, which is something that
 204 many users will not be willing or able to do.
 205
 206 Also see `What's the official MIME type for reStructuredText data?`_
 207
 208
 209 Are there any reStructuredText editor extensions?
 210 -------------------------------------------------
 211
 212 See `Editor Support for reStructuredText`__.
 213
 214 __ tools/editors/README.html
 215
 216
 217 How can I indicate the document title?  Subtitle?
 218 -------------------------------------------------
 219
 220 A uniquely-adorned section title at the beginning of a document is
 221 treated specially, as the document title.  Similarly, a
 222 uniquely-adorned section title immediately after the document title
 223 becomes the document subtitle.  For example::
 224
 225     This is the Document Title
 226     ==========================
 227
 228     This is the Document Subtitle
 229     -----------------------------
 230
 231     Here's an ordinary paragraph.
 232
 233 Counterexample::
 234
 235     Here's an ordinary paragraph.
 236
 237     This is *not* a Document Title
 238     ==============================
 239
 240     The "ordinary paragraph" above the section title
 241     prevents it from becoming the document title.
 242
 243 Another counterexample::
 244
 245     This is not the Document Title,  because...
 246     ===========================================
 247
 248     Here's an ordinary paragraph.
 249
 250     ... the title adornment is not unique
 251     =====================================
 252
 253     Another ordinary paragraph.
 254
 255
 256 How can I represent esoteric characters (e.g. character entities) in a document?
 257 --------------------------------------------------------------------------------
 258
 259 For example, say you want an em-dash (XML character entity &mdash;,
 260 Unicode character U+2014) in your document: use a real em-dash.
 261 Insert concrete characters (e.g. type a *real* em-dash) into your
 262 input file, using whatever encoding suits your application, and tell
 263 Docutils the input encoding.  Docutils uses Unicode internally, so the
 264 em-dash character is a real em-dash internally.
 265
 266 Emacs users should refer to the `Emacs Support for reStructuredText`__
 267 document.  Tips for other editors are welcome.
 268
 269 __ tools/editors/emacs/README.html
 270
 271 ReStructuredText has no character entity subsystem; it doesn't know
 272 anything about XML charents.  To Docutils, "&mdash;" in input text is
 273 7 discrete characters; no interpretation happens.  When writing HTML,
 274 the "&" is converted to "&amp;", so in the raw output you'd see
 275 "&amp;mdash;".  There's no difference in interpretation for text
 276 inside or outside inline literals or literal blocks -- there's no
 277 character entity interpretation in either case.
 278
 279 If you can't use a Unicode-compatible encoding and must rely on 7-bit
 280 ASCII, there is a workaround.  New in Docutils 0.3.10 is a set of
 281 `Standard Substitution Definition Sets`_, which provide equivalents of
 282 XML & HTML character entity sets as substitution definitions.  For
 283 example, the Japanese yen currency symbol can be used as follows::
 284
 285     .. include:: <xhtml1-lat1.txt>
 286
 287     |yen| 600 for a complete meal?  That's cheap!
 288
 289 For earlier versions of Docutils, equivalent files containing
 290 character entity set substitution definitions using the "unicode_"
 291 directive `are available`_.  Please read the `description and
 292 instructions`_ for use.  Thanks to David Priest for the original idea.
 293
 294 If you insist on using XML-style charents, you'll have to implement a
 295 pre-processing system to convert to UTF-8 or something.  That
 296 introduces complications though; you can no longer *write* about
 297 charents naturally; instead of writing "&mdash;" you'd have to write
 298 "&amp;mdash;".
 299
 300 For the common case of long dashes, you might also want to insert the
 301 following substitution definitons into your document (both of them are
 302 using the "unicode_" directive)::
 303
 304     .. |--| unicode:: U+2013   .. en dash
 305     .. |---| unicode:: U+2014  .. em dash, trimming surrounding whitespace
 306        :trim:
 307
 308 .. |--| unicode:: U+2013   .. en dash
 309 .. |---| unicode:: U+2014  .. em dash, trimming surrounding whitespace
 310    :trim:
 311
 312 Now you can write dashes using pure ASCII: "``foo |--| bar; foo |---|
 313 bar``", rendered as "foo |--| bar; foo |---| bar".  (Note that Mozilla
 314 and Firefox may render this incorrectly.)  The ``:trim:`` option for
 315 the em dash is necessary because you cannot write "``foo|---|bar``";
 316 thus you need to add spaces ("``foo |---| bar``") and advise the
 317 reStructuredText parser to trim the spaces.
 318
 319 .. _Standard Substitution Definition Sets: docs/ref/rst/substitutions.html
 320 .. _unicode: docs/ref/rst/directives.html#unicode-character-codes
 321 .. _are available: http://docutils.sourceforge.net/tmp/charents/
 322 .. _tarball: http://docutils.sourceforge.net/tmp/charents.tgz
 323 .. _description and instructions:
 324    http://docutils.sourceforge.net/tmp/charents/README.html
 325 .. _to-do list: docs/dev/todo.html
 326
 327
 328 How can I generate backticks using a Scandinavian keyboard?
 329 -----------------------------------------------------------
 330
 331 The use of backticks in reStructuredText is a bit awkward with
 332 Scandinavian keyboards, where the backtick is a "dead" key.  To get
 333 one ` character one must press SHIFT-` + SPACE.
 334
 335 Unfortunately, with all the variations out there, there's no way to
 336 please everyone.  For Scandinavian programmers and technical writers,
 337 this is not limited to reStructuredText but affects many languages and
 338 environments.
 339
 340 Possible solutions include
 341
 342 * If you have to input a lot of backticks, simply type one in the
 343   normal/awkward way, select it, copy and then paste the rest (CTRL-V
 344   is a lot faster than SHIFT-` + SPACE).
 345
 346 * Use keyboard macros.
 347
 348 * Remap the keyboard.  The Scandinavian keyboard layout is awkward for
 349   other programming/technical characters too; for example, []{}
 350   etc. are a bit awkward compared to US keyboards.
 351
 352   According to Axel Kollmorgen,
 353
 354       Under Windows, you can use the `Microsoft Keyboard Layout Creator
 355       <http://www.microsoft.com/globaldev/tools/msklc.mspx>`__ to easily
 356       map the backtick key to a real backtick (no dead key). took me
 357       five minutes to load my default (german) keyboard layout, untick
 358       "Dead Key?" from the backtick key properties ("in all shift
 359       states"), "build dll and setup package", install the generated
 360       .msi, and add my custom keyboard layout via Control Panel >
 361       Regional and Language Options > Languages > Details > Add
 362       Keyboard layout (and setting it as default "when you start your
 363       computer").
 364
 365 * Use a virtual/screen keyboard or character palette, such as:
 366
 367   - `Web-based keyboards <http://keyboard.lab.co.il/>`__ (IE only
 368     unfortunately).
 369   - Windows: `Click-N-Type <http://www.lakefolks.org/cnt/>`__.
 370   - Mac OS X: the Character Palette can store a set of favorite
 371     characters for easy input.  Open System Preferences,
 372     International, Input Menu tab, enable "Show input menu in menu
 373     bar", and be sure that Character Palette is enabled in the list.
 374
 375 If anyone knows of other/better solutions, please `let us know`_.
 376
 377
 378 Are there any tools for HTML/XML-to-reStructuredText?  (Round-tripping)
 379 -----------------------------------------------------------------------
 380
 381 People have tossed the idea around, and some implementations of
 382 reStructuredText-generating tools can be found in the `Docutils Link
 383 List`_.
 384
 385 There's no reason why reStructuredText should not be round-trippable
 386 to/from XML; any technicalities which prevent round-tripping would be
 387 considered bugs.  Whitespace would not be identical, but paragraphs
 388 shouldn't suffer.  The tricky parts would be the smaller details, like
 389 links and IDs and other bookkeeping.
 390
 391 For HTML, true round-tripping may not be possible.  Even adding lots
 392 of extra "class" attributes may not be enough.  A "simple HTML" to RST
 393 filter is possible -- for some definition of "simple HTML" -- but HTML
 394 is used as dumb formatting so much that such a filter may not be
 395 particularly useful.  An 80/20 approach should work though: build a
 396 tool that does 80% of the work automatically, leaving the other 20%
 397 for manual tweaks.
 398
 399 .. _Docutils Link List: docs/user/links.html
 400
 401
 402 Are there any Wikis that use reStructuredText syntax?
 403 -----------------------------------------------------
 404
 405 There are several, with various degrees of completeness.  With no
 406 implied endorsement or recommendation, and in no particular order:
 407
 408 * `Webware for Python wiki
 409   <http://wiki.webwareforpython.org/thiswiki.html>`__
 410
 411 * `Ian Bicking's experimental code
 412   <http://docutils.sf.net/sandbox/ianb/wiki/Wiki.py>`__
 413
 414 * `MoinMoin <http://moinmoin.wikiwikiweb.de/>`__ has some support;
 415   `here's a sample <http://moinmoin.wikiwikiweb.de/RestSample>`__
 416
 417 * Zope-based `Zwiki <http://zwiki.org/>`__
 418
 419 * Zope3-based Zwiki (in the Zope 3 source tree as
 420   ``zope.products.zwiki``)
 421
 422 * `StikiWiki <http://mithrandr.moria.org/code/stikiwiki/>`__
 423
 424 * `Trac <http://projects.edgewall.com/trac/>`__ `supports using
 425   reStructuredText
 426   <http://projects.edgewall.com/trac/wiki/WikiRestructuredText>`__ as
 427   an alternative to wiki markup. This includes support for `TracLinks
 428   <http://projects.edgewall.com/trac/wiki/TracLinks>`__ from within
 429   RST text via a custom RST reference-directive or, even easier, an
 430   interpreted text role 'trac'
 431
 432 Please `let us know`_ of any other reStructuredText Wikis.
 433
 434 The example application for the `Web Framework Shootout
 435 <http://colorstudy.com/docs/shootout.html>`__ article is a Wiki using
 436 reStructuredText.
 437
 438
 439 Are there any Weblog (Blog) projects that use reStructuredText syntax?
 440 ----------------------------------------------------------------------
 441
 442 With no implied endorsement or recommendation, and in no particular
 443 order:
 444
 445 * `Firedrop <http://www.voidspace.org.uk/python/firedrop2/>`__
 446 * `Python Desktop Server <http://pyds.muensterland.org/>`__
 447 * `PyBloxsom <http://pyblosxom.sourceforge.net/>`__
 448 * `Lino WebMan <http://lino.sourceforge.net/webman.html>`__
 449
 450 Please `let us know`_ of any other reStructuredText Blogs.
 451
 452
 453 .. _Can lists be indented without generating block quotes?:
 454
 455 How should I mark up lists?
 456 ---------------------------
 457
 458 Bullet_ & enumerated_ list markup is very intuitive but there are 2
 459 points that must be noted:
 460
 461 .. _bullet: docs/ref/rst/restructuredtext.html#bullet-lists
 462 .. _enumerated: docs/ref/rst/restructuredtext.html#enumerated-lists
 463
 464 1. Lists should **not** be indented.  This is correct::
 465
 466        paragraph
 467
 468        * list item 1
 469
 470          * nested item 1.1
 471          * nested item 1.2
 472
 473        * list item 2
 474
 475    while this is probably incorrect::
 476
 477        paragraph
 478
 479          * list item 1
 480
 481              * nested item 1.1
 482              * nested item 1.2
 483
 484          * list item 2
 485
 486    The extra indentation (of the list containing items 1.1 and 1.2) is
 487    recognized as a block quote.  This is usually not what you mean and
 488    it causes the list in the output to be indented too much.
 489
 490 2. There **must** be blank lines around list items, except between
 491    items of the same level, where blank lines are optional.  The
 492    example above shows this.
 493
 494 Note that formatting of the *output* is independent of the input, and
 495 is decided by the writer and the stylesheet.  For instance, lists
 496 *are* indented in HTML output by default.  See `How are lists
 497 formatted in HTML?`_ for details.
 498
 499
 500 Could lists be indented without generating block quotes?
 501 --------------------------------------------------------
 502
 503 Some people like to write lists with indentation but don't intend a
 504 blockquote context.  There has been a lot of discussion about allowing
 505 this in reStructuredText, but there are some issues that would need to
 506 be resolved before it could be implemented.  There is a summary of the
 507 issues and pointers to the discussions in `the to-do list`__.
 508
 509 __ docs/dev/todo.html#indented-lists
 510
 511
 512 Could the requirement for blank lines around lists be relaxed?
 513 --------------------------------------------------------------
 514
 515 Short answer: no.
 516
 517 In reStructuredText, it would be impossible to unambigously mark up
 518 and parse lists without blank lines before and after.  Deeply nested
 519 lists may look ugly with so many blank lines, but it's a price we pay
 520 for unambiguous markup.  Some other plaintext markup systems do not
 521 require blank lines in nested lists, but they have to compromise
 522 somehow, either accepting ambiguity or requiring extra complexity.
 523 For example, `Epytext <http://epydoc.sf.net/epytext.html#list>`__ does
 524 not require blank lines around lists, but it does require that lists
 525 be indented and that ambiguous cases be escaped.
 526
 527
 528 How can I include mathematical equations in documents?
 529 ------------------------------------------------------
 530
 531 There is no elegant built-in way, yet.  There are several ideas, but
 532 no obvious winner.  This issue requires a champion to solve the
 533 technical and aesthetic issues and implement a generic solution.
 534 Here's the `to-do list entry`__.
 535
 536 __ docs/dev/todo.html#math-markup
 537
 538 There are several quick & dirty ways to include equations in documents.
 539 They all presently use LaTeX syntax or dialects of it.
 540
 541 * For LaTeX output, nothing beats raw LaTeX math.  A simple way is to
 542   use the `raw directive`_::
 543
 544       .. raw:: latex
 545
 546           \[ x^3 + 3x^2a + 3xa^2 + a^3, \]
 547
 548   For inline math you could use substitutions of the raw directive but
 549   the recently added `raw role`_ is more convenient.  You must define a
 550   custom role based on it once in your document::
 551
 552       .. role:: raw-latex(raw)
 553           :format: latex
 554
 555   and then you can just use the new role in your text::
 556
 557       the binomial expansion of :raw-latex:`$(x+a)^3$` is
 558
 559   .. _raw directive: docs/ref/rst/directives.html#raw-data-pass-through
 560   .. _raw role: docs/ref/rst/roles.html#raw
 561
 562 * Jens Jørgen Mortensen has implemented a "latex-math" role and
 563   directive, available from `his sandbox`__.
 564
 565   __ http://docutils.sourceforge.net/sandbox/jensj/latex_math/
 566
 567 * For HTML the "Right" w3c-standard way to include math is MathML_.
 568   Unfortunately its rendering is still quite broken (or absent) on many
 569   browsers but it's getting better.  Another bad problem is that typing
 570   or reading raw MathML by humans is *really* painful, so embedding it
 571   in a reST document with the raw directive would defy the goals of
 572   readability and editability of reST (see an `example of raw MathML
 573   <http://sf.net/mailarchive/message.php?msg_id=2177102>`__).
 574
 575   A much less painful way to generate HTML+MathML is to use itex2mml_ to
 576   convert a dialect of LaTeX syntax to presentation MathML.  Here is an
 577   example of potential `itex math markup
 578   <http://article.gmane.org/gmane.text.docutils.user/118>`__.  The
 579   simplest way to use it is to add ``html`` to the format lists for the
 580   raw directive/role and postprocess the resulting document with
 581   itex2mml.  This way you can *generate LaTeX and HTML+MathML from the
 582   same source*, but you must limit yourself to the intersection of LaTeX
 583   and itex markups for this to work.  Alan G. Isaac wrote a detailed
 584   HOWTO_ for this approach.
 585
 586   .. _MathML: http://www.w3.org/Math/
 587   .. _itex2mml: http://pear.math.pitt.edu/mathzilla/itex2mml.html
 588   .. _HOWTO: http://www.american.edu/econ/itex2mml/mathhack.rst
 589
 590 * The other way to add math to HTML is to use images of the equations,
 591   typically generated by TeX.  This is inferior to MathML in the long
 592   term but is perhaps more accessible nowdays.
 593
 594   Of course, doing it by hand is too much work.  Beni Cherniavsky has
 595   written some `preprocessing scripts`__ for converting the
 596   ``texmath`` role/directive into images for HTML output and raw
 597   directives/subsitution for LaTeX output.  This way you can *generate
 598   LaTeX and HTML+images from the same source*.  `Instructions here`__.
 599
 600   __ http://docutils.sourceforge.net/sandbox/cben/rolehack/
 601   __ http://docutils.sourceforge.net/sandbox/cben/rolehack/README.html
 602
 603
 604 Is nested inline markup possible?
 605 ---------------------------------
 606
 607 Not currently, no.  It's on the `to-do list`__ (`details here`__), and
 608 hopefully will be part of the reStructuredText parser soon.  At that
 609 time, markup like this will become possible::
 610
 611     Here is some *emphasized text containing a `hyperlink`_ and
 612     ``inline literals``*.
 613
 614 __ docs/dev/todo.html#nested-inline-markup
 615 __ docs/dev/rst/alternatives.html#nested-inline-markup
 616
 617 There are workarounds, but they are either convoluted or ugly or both.
 618 They are not recommended.
 619
 620 * Inline markup can be combined with hyperlinks using `substitution
 621   definitions`__ and references__ with the `"replace" directive`__.
 622   For example::
 623
 624       Here is an |emphasized hyperlink|_.
 625
 626       .. |emphasized hyperlink| replace:: *emphasized hyperlink*
 627       .. _emphasized hyperlink: http://example.org
 628
 629   It is not possible for just a portion of the replacement text to be
 630   a hyperlink; it's the entire replacement text or nothing.
 631
 632   __ docs/ref/rst/restructuredtext.html#substitution-definitions
 633   __ docs/ref/rst/restructuredtext.html#substitution-references
 634   __ docs/ref/rst/directives.html#replace
 635
 636 * The `"raw" directive`__ can be used to insert raw HTML into HTML
 637   output::
 638
 639       Here is some |stuff|.
 640
 641       .. |stuff| raw:: html
 642
 643          <em>emphasized text containing a
 644          <a href="http://example.org">hyperlink</a> and
 645          <tt>inline literals</tt></em>
 646
 647   Raw LaTeX is supported for LaTeX output, etc.
 648
 649   __ docs/ref/rst/directives.html#raw
 650
 651
 652 How to indicate a line break or a significant newline?
 653 ------------------------------------------------------
 654
 655 `Line blocks`__ are designed for address blocks, verse, and other
 656 cases where line breaks are significant and must be preserved.  Unlike
 657 literal blocks, the typeface is not changed, and inline markup is
 658 recognized.  For example::
 659
 660     | A one, two, a one two three four
 661     |
 662     | Half a bee, philosophically,
 663     |     must, *ipso facto*, half not be.
 664     | But half the bee has got to be,
 665     |     *vis a vis* its entity.  D'you see?
 666     |
 667     | But can a bee be said to be
 668     |     or not to be an entire bee,
 669     |         when half the bee is not a bee,
 670     |             due to some ancient injury?
 671     |
 672     | Singing...
 673
 674 __ docs/ref/rst/restructuredtext.html#line-blocks
 675
 676 Here's a workaround for manually inserting explicit line breaks in
 677 HTML output::
 678
 679     .. |br| raw:: html
 680
 681        <br />
 682
 683     I want to break this line here: |br| this is after the break.
 684
 685     If the extra whitespace bothers you, |br|\ backslash-escape it.
 686
 687
 688 A URL containing asterisks doesn't work.  What to do?
 689 -----------------------------------------------------
 690
 691 Asterisks are valid URL characters (see :RFC:`2396`), sometimes used
 692 in URLs.  For example::
 693
 694     http://cvs.example.org/viewcvs.py/*checkout*/module/file
 695
 696 Unfortunately, the parser thinks the asterisks are indicating
 697 emphasis.  The slashes serve as delineating punctuation, allowing the
 698 asterisks to be recognized as markup.  The example above is separated
 699 by the parser into a truncated URL, an emphasized word, and some
 700 regular text::
 701
 702     http://cvs.example.org/viewcvs.py/
 703     *checkout*
 704     /module/file
 705
 706 To turn off markup recognition, use a backslash to escape at least the
 707 first asterisk, like this::
 708
 709     http://cvs.example.org/viewcvs.py/\*checkout*/module/file
 710
 711 Escaping the second asterisk doesn't hurt, but it isn't necessary.
 712
 713
 714 How can I make a literal block with *some* formatting?
 715 ------------------------------------------------------
 716
 717 Use the `parsed-literal`_ directive.
 718
 719 .. _parsed-literal: docs/ref/rst/directives.html#parsed-literal
 720
 721 Scenario: a document contains some source code, which calls for a
 722 literal block to preserve linebreaks and whitespace.  But part of the
 723 source code should be formatted, for example as emphasis or as a
 724 hyperlink.  This calls for a *parsed* literal block::
 725
 726     .. parsed-literal::
 727
 728        print "Hello world!"  # *tricky* code [1]_
 729
 730 The emphasis (``*tricky*``) and footnote reference (``[1]_``) will be
 731 parsed.
 732
 733
 734 Can reStructuredText be used for web or generic templating?
 735 -----------------------------------------------------------
 736
 737 Docutils and reStructuredText can be used with or as a component of a
 738 templating system, but they do not themselves include templating
 739 functionality.  Templating should simply be left to dedicated
 740 templating systems.  Users can choose a templating system to apply to
 741 their reStructuredText documents as best serves their interests.
 742
 743 There are many good templating systems for Python (ht2html_, YAPTU_,
 744 Quixote_'s PTL, Cheetah_, etc.; see this non-exhaustive list of `some
 745 other templating systems`_), and many more for other languages, each
 746 with different approaches.  We invite you to try several and find one
 747 you like.  If you adapt it to use Docutils/reStructuredText, please
 748 consider contributing the code to Docutils or `let us know`_ and we'll
 749 keep a list here.
 750
 751 One reST-specific web templating system is `rest2web
 752 <http://www.voidspace.org.uk/python/rest2web>`_, a tool for
 753 automatically building websites, or parts of websites.
 754
 755 .. _ht2html: http://ht2html.sourceforge.net/
 756 .. _YAPTU:
 757    http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/52305
 758 .. _Quixote: http://www.mems-exchange.org/software/quixote/
 759 .. _Cheetah: http://www.cheetahtemplate.org/
 760 .. _some other templating systems:
 761    http://webware.sourceforge.net/Papers/Templates/
 762
 763
 764 How can I mark up a FAQ or other list of questions & answers?
 765 -------------------------------------------------------------
 766
 767 There is no specific syntax for FAQs and Q&A lists.  Here are two
 768 options:
 769
 770 1. For a FAQ (Frequently Asked Questions, usually with answers), a
 771    convenient way to mark up the questions is as section titles, with
 772    the answer(s) as section content.  This document is marked up in
 773    this way.
 774
 775    The advantages of using section titles for questions are: sections
 776    can be numbered automatically, and a table of contents can be
 777    generated automatically.  One limitation of this format is that
 778    questions must fit on one line (section titles may not wrap, in the
 779    source text).  For very long questions, the title may be a summary
 780    of the question, with the full question in the section body.
 781
 782 2. Field lists work well as Q&A lists::
 783
 784        :Q: What kind of questions can we
 785            put here?
 786
 787        :A: Any kind we like!
 788
 789    In order to separate questions, lists can be used:
 790
 791        1. :Q: What kind of question can we
 792               put here?
 793           :A: Any kind we like!
 794
 795        2. :Q: How many answers can a question have?
 796           :A: It can have one,
 797           :A: or more.
 798           :A3: Answers can be numbered like this.
 799           :A: 1. Or like this.
 800               2. We're flexible!
 801
 802    If you don't want to number or otherwise mark questions, you can
 803    use an empty comment between individual field lists to separate
 804    them::
 805
 806        :Q: First question?
 807        :A: Answer.
 808
 809        ..
 810
 811        :Q: Second question?
 812        :A: Answer.
 813
 814
 815 .. _bidi:
 816
 817 Can I produce documents in right-to-left languages?
 818 ---------------------------------------------------
 819
 820 Languages written from right to left, such as Arabic and Hebrew, must
 821 be reordered according to the `Unicode Bidi Algorithm`_.  This
 822 requires support from the editor and special markup in the output
 823 format.
 824
 825 The source format of reStructuredText is relatively bidi-friendly:
 826 most constructs are denoted by punctuation without intrusion of
 827 English and when you must write in English, it's usually on a separate
 828 line.  So any editor that auto-detects direction per-line (like gedit
 829 or geresh_) will suffice.
 830
 831 Moreover, it's possible to translate_ all reStructuredText keywords.
 832 This was not yet done for any RTL language, but when it is, it will be
 833 possible to write an RTL document with vitually no English.  This will
 834 allow reasonable use of editors limited to a single base direction for
 835 the whole document (like Notepad, Vim and text boxes in Firefox).
 836
 837 .. _Unicode Bidi Algorithm: http://www.unicode.org/reports/tr9/
 838 .. _geresh: http://www.typo.co.il/~mooffie/geresh/
 839 .. _translate: docs/howto/i18n.html
 840
 841 The second problem is bidi markup of the output.  There is an almost
 842 transparent implicit solution for HTML:
 843
 844 * Grab http://cben-hacks.sourceforge.net/bidi/hibidi.py and
 845   http://cben-hacks.sourceforge.net/bidi/rst2html_hibidi.py.
 846   Put them both in the same directory and make them executable.
 847
 848 * Use ``rst2html_hibidi.py`` instead of ``rst2html.py``.
 849
 850 * It infers dir attributes in the HTML from the text.  It does it
 851   hierachically, giving much better results than usual.  You can still
 852   use LRM/RLM and LRE/RLE/PDF control codes to help it.
 853
 854   * If you want the gory details: See the full theory_, and note the
 855     incomplete practice_ (this is still a partial implementation - but
 856     sufficient for most needs).
 857
 858     .. _theory: http://cben-hacks.sf.net/bidi/hibidi.html
 859     .. _practice: http://cben-hacks.sf.net/bidi/hibidi.html#practice
 860
 861 There is also an explicit way to set directions through CSS and
 862 classes in the HTML:
 863
 864 * Copy ``default.css`` to a new file and add relevant parts of the
 865   following::
 866
 867       /* Use these two if the main document direction is RTL */
 868       body { direction: rtl; }
 869       div.sidebar { float: left !important; }
 870
 871       /* The next 3 rules are very useful in documents containing pieces
 872       of code in english */
 873       /* Use this if you all your literal blocks (::) are LTR */
 874       pre {direction: ltr; unicode-bidi: embed; }
 875       /* Use this if you all your inline literals (``) are LTR */
 876       tt {direction: ltr; unicode-bidi: embed; }
 877       /* Use this if you all your interpretted text (`) is LTR */
 878       cite {direction: ltr; unicode-bidi: embed; }
 879
 880       /* Allow manual direction override by class directive and roles */
 881       .rtl { direction: rtl; }
 882       .ltr { direction: ltr; }
 883
 884 * Select this new stylesheet with ``--stylesheet=<file>`` or the
 885   stylesheet_ setting.
 886
 887 * Now if you need to override the direction of some element (from a
 888   paragraph to a whole section), write::
 889
 890       .. class:: rtl
 891
 892   or::
 893
 894       .. class:: ltr
 895
 896   before it (see the class_ directive for details).
 897
 898 * To change the direction of some inline text fragment, you can use
 899   RLE/LRE/PDF control characters, or write ``:rtl:`RTL text``` /
 900   ``:ltr:`RTL text```.  To use the latter syntax, you must write this
 901   once at the beginning of your document::
 902
 903       .. role:: ltr
 904       .. role:: rtl
 905
 906 .. _stylesheet: docs/user/config.html#stylesheet
 907 .. _class: docs/ref/rst/directives.txt#class
 908
 909 LaTeX is quite hard to implement (it doesn't support the bidi
 910 algorithm, so all direction changes - even numbers in RTL text - must
 911 be explicitly marked).  Other formats are more-or-less easy.
 912
 913 If you have any questions/problems/bugs related to bidi with docutils,
 914 ask `Beni Cherniavsky`__ directly or the `Docutils-users`_ mailing
 915 list.
 916
 917 __ mailto:cben@users.sf.net
 918
 919
 920 What's the official MIME type for reStructuredText data?
 921 --------------------------------------------------------
 922
 923 While there is no registered MIME type for reStructuredText, the
 924 "official unofficial" standard MIME type is "text/x-rst".  This was
 925 invented for the build system for PEPs (Python Enhancement Proposals),
 926 and it's used by the python.org web site build system.
 927
 928 (The "x-" prefix means it's an unregistered MIME type.)
 929
 930 Also see `What's the standard filename extension for a
 931 reStructuredText file?`_
 932
 933
 934 HTML Writer
 935 ===========
 936
 937 What is the status of the HTML Writer?
 938 --------------------------------------
 939
 940 The HTML Writer module, ``docutils/writers/html4css1.py``, is a
 941 proof-of-concept reference implementation.  While it is a complete
 942 implementation, some aspects of the HTML it produces may be
 943 incompatible with older browsers or specialized applications (such as
 944 web templating).  Alternate implementations are welcome.
 945
 946
 947 What kind of HTML does it produce?
 948 ----------------------------------
 949
 950 It produces XHTML compatible with the `XHTML 1.0`_ specification.  A
 951 cascading stylesheet is required for proper viewing with a modern
 952 graphical browser.  Correct rendering of the HTML produced depends on
 953 the CSS support of the browser.  A general-purpose stylesheet,
 954 ``html4css1.css`` is provided with the code, and is embedded by
 955 default.  It is installed in the "writers/html4css1/" subdirectory
 956 within the Docutils package.  Use the ``--help`` command-line option
 957 to see the specific location on your machine.
 958
 959 .. _XHTML 1.0: http://www.w3.org/TR/xhtml1/
 960
 961
 962 What browsers are supported?
 963 ----------------------------
 964
 965 No specific browser is targeted; all modern graphical browsers should
 966 work.  Some older browsers, text-only browsers, and browsers without
 967 full CSS support are known to produce inferior results.  Firefox,
 968 Safari, Mozilla (version 1.0 and up), and MS Internet Explorer
 969 (version 5.0 and up) are known to give good results.  Reports of
 970 experiences with other browsers are welcome.
 971
 972
 973 Unexpected results from tools/rst2html.py: H1, H1 instead of H1, H2.  Why?
 974 --------------------------------------------------------------------------
 975
 976 Here's the question in full:
 977
 978     I have this text::
 979
 980         Heading 1
 981         =========
 982
 983         All my life, I wanted to be H1.
 984
 985         Heading 1.1
 986         -----------
 987
 988         But along came H1, and so shouldn't I be H2?
 989         No!  I'm H1!
 990
 991         Heading 1.1.1
 992         *************
 993
 994         Yeah, imagine me, I'm stuck at H3!  No?!?
 995
 996     When I run it through tools/rst2html.py, I get unexpected results
 997     (below).  I was expecting H1, H2, then H3; instead, I get H1, H1,
 998     H2::
 999
1000         ...
1001         <html lang="en">
1002         <head>
1003         ...
1004         <title>Heading 1</title>
1005         </head>
1006         <body>
1007         <div class="document" id="heading-1">
1008         <h1 class="title">Heading 1</h1>                <-- first H1
1009         <p>All my life, I wanted to be H1.</p>
1010         <div class="section" id="heading-1-1">
1011         <h1><a name="heading-1-1">Heading 1.1</a></h1>        <-- H1
1012         <p>But along came H1, and so now I must be H2.</p>
1013         <div class="section" id="heading-1-1-1">
1014         <h2><a name="heading-1-1-1">Heading 1.1.1</a></h2>
1015         <p>Yeah, imagine me, I'm stuck at H3!</p>
1016         ...
1017
1018     What gives?
1019
1020 Check the "class" attribute on the H1 tags, and you will see a
1021 difference.  The first H1 is actually ``<h1 class="title">``; this is
1022 the document title, and the default stylesheet renders it centered.
1023 There can also be an ``<h2 class="subtitle">`` for the document
1024 subtitle.
1025
1026 If there's only one highest-level section title at the beginning of a
1027 document, it is treated specially, as the document title.  (Similarly, a
1028 lone second-highest-level section title may become the document
1029 subtitle.)  See `How can I indicate the document title?  Subtitle?`_ for
1030 details.  Rather than use a plain H1 for the document title, we use ``<h1
1031 class="title">`` so that we can use H1 again within the document.  Why
1032 do we do this?  HTML only has H1-H6, so by making H1 do double duty, we
1033 effectively reserve these tags to provide 6 levels of heading beyond the
1034 single document title.
1035
1036 HTML is being used for dumb formatting for nothing but final display.
1037 A stylesheet *is required*, and one is provided; see `What kind of
1038 HTML does it produce?`_ above.  Of course, you're welcome to roll your
1039 own.  The default stylesheet provides rules to format ``<h1
1040 class="title">`` and ``<h2 class="subtitle">`` differently from
1041 ordinary ``<h1>`` and ``<h2>``::
1042
1043     h1.title {
1044       text-align: center }
1045
1046     h2.subtitle {
1047       text-align: center }
1048
1049 If you don't want the top section heading to be interpreted as a
1050 title at all, disable the `doctitle_xform`_ setting
1051 (``--no-doc-title`` option).  This will interpret your document
1052 differently from the standard settings, which might not be a good
1053 idea.  If you don't like the reuse of the H1 in the HTML output, you
1054 can tweak the `initial_header_level`_ setting
1055 (``--initial-header-level`` option) -- but unless you match its value
1056 to your specific document, you might end up with bad HTML (e.g. H3
1057 without H2).
1058
1059 .. _doctitle_xform: docs/user/config.html#doctitle-xform
1060 .. _initial_header_level: docs/user/config.html#initial-header-level
1061
1062 (Thanks to Mark McEahern for the question and much of the answer.)
1063
1064
1065 How are lists formatted in HTML?
1066 --------------------------------
1067
1068 If list formatting looks strange, first check that you understand
1069 `list markup`__.
1070
1071 __ `How should I mark up lists?`_
1072
1073 * By default, HTML browsers indent lists relative to their context.
1074   This follows a long tradition in browsers (but isn't so established
1075   in print).  If you don't like it, you should change the stylesheet.
1076
1077   This is different from how lists look in reStructuredText source.
1078   Extra indentation in the source indicates a blockquote, resulting in
1079   too much indentation in the browser.
1080
1081 * A list item can contain multiple paragraphs etc.  In complex cases
1082   list items are separated by vertical space.  By default this spacing
1083   is omitted in "simple" lists.  A list is simple if every item
1084   contains a simple paragraph and/or a "simple" nested list.  For
1085   example:
1086
1087       * text
1088
1089         * simple
1090
1091           * simple
1092           * simple
1093
1094         * simple
1095
1096         text after a nested list
1097
1098       * multiple
1099
1100         paragraphs
1101
1102   In this example the nested lists are simple (and should appear
1103   compacted) but the outer list is not.
1104
1105   If you want all lists to have equal spacing, disable the
1106   `compact_lists`_ setting (``--no-compact-lists`` option).  The
1107   precise spacing can be controlled in the stylesheet.
1108
1109   Note again that this is not exactly WYSIWYG: it partially resembles
1110   the rules about blank lines being optional between list items in
1111   reStructuredText -- but adding/removing optional blank lines does
1112   not affect spacing in the output!  It's a feature, not a bug: you
1113   write it as you like but the output is styled consistently.
1114
1115   .. _compact_lists: docs/user/config.html#compact-lists
1116
1117
1118 Why do enumerated lists only use numbers (no letters or roman numerals)?
1119 ------------------------------------------------------------------------
1120
1121 The rendering of enumerators (the numbers or letters acting as list
1122 markers) is completely governed by the stylesheet, so either the
1123 browser can't find the stylesheet (try enabling the
1124 `embed_stylesheet`_ setting [``--embed-stylesheet`` option]), or the
1125 browser can't understand it (try a recent Firefox, Mozilla, Konqueror,
1126 Opera, Safari, or even MSIE).
1127
1128 .. _embed_stylesheet: docs/user/config.html#embed-stylesheet
1129
1130
1131 There appear to be garbage characters in the HTML.  What's up?
1132 --------------------------------------------------------------
1133
1134 What you're seeing is most probably not garbage, but the result of a
1135 mismatch between the actual encoding of the HTML output and the
1136 encoding your browser is expecting.  Your browser is misinterpreting
1137 the HTML data, which is encoded text.  A discussion of text encodings
1138 is beyond the scope of this FAQ; see one or more of these documents
1139 for more info:
1140
1141 * `UTF-8 and Unicode FAQ for Unix/Linux
1142   <http://www.cl.cam.ac.uk/~mgk25/unicode.html>`_
1143
1144 * Chapters 3 and 4 of `Introduction to i18n [Internationalization]
1145   <http://www.debian.org/doc/manuals/intro-i18n/>`_
1146
1147 * `Python Unicode Tutorial
1148   <http://www.reportlab.com/i18n/python_unicode_tutorial.html>`_
1149
1150 * `Python Unicode Objects: Some Observations on Working With Non-ASCII
1151   Character Sets <http://effbot.org/zone/unicode-objects.htm>`_
1152
1153 The common case is with the default output encoding (UTF-8), when
1154 either numbered sections are used (via the "sectnum_" directive) or
1155 symbol-footnotes.  3 non-breaking spaces are inserted in each numbered
1156 section title, between the generated number and the title text.  Most
1157 footnote symbols are not available in ASCII, nor are non-breaking
1158 spaces.  When encoded with UTF-8 and viewed with ordinary ASCII tools,
1159 these characters will appear to be multi-character garbage.
1160
1161 You may have an decoding problem in your browser (or editor, etc.).
1162 The encoding of the output is set to "utf-8", but your browswer isn't
1163 recognizing that.  You can either try to fix your browser (enable
1164 "UTF-8 character set", sometimes called "Unicode"), or choose a
1165 different encoding for the HTML output.  You can also try
1166 ``--output-encoding=ascii:xmlcharrefreplace`` for HTML or XML, but not
1167 applicable to non-XMLish outputs (if using runtime
1168 settings/configuration files, use ``output_encoding=ascii`` and
1169 ``output_encoding_error_handler=xmlcharrefreplace``).
1170
1171 If you're generating document fragments, the "Content-Type" metadata
1172 (between the HTML ``<head>`` and ``</head>`` tags) must agree with the
1173 encoding of the rest of the document.  For UTF-8, it should be::
1174
1175     <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
1176
1177 Also, Docutils normally generates an XML declaration as the first line
1178 of the output.  It must also match the document encoding.  For UTF-8::
1179
1180     <?xml version="1.0" encoding="utf-8" ?>
1181
1182 .. _sectnum: docs/ref/rst/directives.html#sectnum
1183
1184
1185 How can I retrieve the body of the HTML document?
1186 -------------------------------------------------
1187
1188 (This is usually needed when using Docutils in conjunction with a
1189 templating system.)
1190
1191 You can use the `docutils.core.publish_parts()`_ function, which
1192 returns a dictionary containing an 'html_body_' entry.
1193
1194 .. _docutils.core.publish_parts(): docs/api/publisher.html#publish-parts
1195 .. _html_body: docs/api/publisher.html#html-body
1196
1197
1198 Why is the Docutils XHTML served as "Content-type: text/html"?
1199 --------------------------------------------------------------
1200
1201 Full question:
1202
1203     Docutils' HTML output looks like XHTML and is advertised as such::
1204
1205       <?xml version="1.0" encoding="utf-8" ?>
1206       <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
1207        "http://www.w3.org/TR/xht ml1/DTD/xhtml1-transitional.dtd">
1208
1209     But this is followed by::
1210
1211       <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
1212
1213     Shouldn't this be "application/xhtml+xml" instead of "text/html"?
1214
1215 In a perfect web, the Docutils XHTML output would be 100% strict
1216 XHTML.  But it's not a perfect web, and a major source of imperfection
1217 is Internet Explorer.  Despite it's drawbacks, IE still represents the
1218 majority of web browsers, and cannot be ignored.
1219
1220 Short answer: if we didn't serve XHTML as "text/html" (which is a
1221 perfectly valid thing to do), it couldn't be viewed in Internet
1222 Explorer.
1223
1224 Long answer: see the `"Criticisms of Internet Explorer" Wikipedia
1225 entry <http://en.wikipedia.org/wiki/Criticisms_of_Internet_Explorer#XHTML>`__.
1226
1227 However, there's also `Sending XHTML as text/html Considered
1228 Harmful`__.  What to do, what to do?  We're damned no matter what we
1229 do.  So we'll continue to do the practical instead of the pure:
1230 support the browsers that are actually out there, and not fight for
1231 strict standards compliance.
1232
1233 __ http://hixie.ch/advocacy/xhtml
1234
1235 (Thanks to Martin F. Krafft, Robert Kern, Michael Foord, and Alan
1236 G. Isaac.)
1237
1238
1239 Python Source Reader
1240 ====================
1241
1242 Can I use Docutils for Python auto-documentation?
1243 -------------------------------------------------
1244
1245 Yes, in conjunction with other projects.
1246
1247 Docstring extraction functionality from within Docutils is still under
1248 development.  There is most of a source code parsing module in
1249 docutils/readers/python/moduleparser.py.  We do plan to finish it
1250 eventually.  Ian Bicking wrote an initial front end for the
1251 moduleparser.py module, in sandbox/ianb/extractor/extractor.py.  Ian
1252 also did some work on the Python Source Reader
1253 (docutils.readers.python) component at PyCon DC 2004.
1254
1255 Version 2.0 of Ed Loper's `Epydoc <http://epydoc.sourceforge.net/>`_
1256 supports reStructuredText-format docstrings for HTML output.  Docutils
1257 0.3 or newer is required.  Development of a Docutils-specific
1258 auto-documentation tool will continue.  Epydoc works by importing
1259 Python modules to be documented, whereas the Docutils-specific tool,
1260 described above, will parse modules without importing them (as with
1261 `HappyDoc <http://happydoc.sourceforge.net/>`_, which doesn't support
1262 reStructuredText).
1263
1264 The advantages of parsing over importing are security and flexibility;
1265 the disadvantage is complexity/difficulty.
1266
1267 * Security: untrusted code that shouldn't be executed can be parsed;
1268   importing a module executes its top-level code.
1269 * Flexibility: comments and unofficial docstrings (those not supported
1270   by Python syntax) can only be processed by parsing.
1271 * Complexity/difficulty: it's a lot harder to parse and analyze a
1272   module than it is to ``import`` and analyze one.
1273
1274 For more details, please see "Docstring Extraction Rules" in `PEP
1275 258`_, item 3 ("How").
1276
1277
1278 Miscellaneous
1279 =============
1280
1281 Is the Docutils document model based on any existing XML models?
1282 ----------------------------------------------------------------
1283
1284 Not directly, no.  It borrows bits from DocBook, HTML, and others.  I
1285 (David Goodger) have designed several document models over the years,
1286 and have my own biases.  The Docutils document model is designed for
1287 simplicity and extensibility, and has been influenced by the needs of
1288 the reStructuredText markup.
1289
1290 \f
1291 ..
1292    Local Variables:
1293    mode: indented-text
1294    indent-tabs-mode: nil
1295    sentence-end-double-space: t
1296    fill-column: 70
1297    End: