5 - Lots of profiling, make it faster!
6 - Plugins for major CMSes (very tricky issue)
9 - Lossy alternate character encoding support (characters not in the encoding
10 will get silently dropped).
11 - Revise (HTML|CSS)Definition and Config relationship (groundwork for 2.0)
14 - Directive documentation generation
15 - Rewrite table's child definition to be faster, smart, and regexp free
16 - Allow HTML 4.01 output (cosmetic changes to the generator)
19 - Additional support for poorly written HTML
20 - Implement all non-essential attribute transforms
21 - Microsoft Word HTML cleaning (i.e. MsoNormal)
24 - Make URI validation routines tighter (especially mailto)
25 - More extensive URI filtering schemes
26 - Allow for background-image and list-style-image (see above)
27 - Distinguish between different types of URIs, for instance, a mailto URI
28 in IMG SRC is nonsensical
31 - Add various "levels" of cleaning
32 - Related: Allow strict (X)HTML
35 - Extended HTML capabilities based on namespacing and tag transforms
36 - Hooks for adding custom processors to custom namespaced tags and
37 attributes, offer default implementation
38 - Auto-paragraphing (be sure to leverage fact that we know when things
39 shouldn't be paragraphed, such as lists and tables).
40 - Lots of documentation and samples
42 Unknown release (on a scratch-an-itch basis)
43 - Silently drop content inbetween SCRIPT tags (can be generalized to allow
44 specification of elements that, when detected as foreign, trigger removal
45 of children, although unbalanced tags could wreck havoc (or at least delete
46 the rest of the document)).
47 - Fixes for Firefox's inability to handle COL alignment props (Bug 915)
48 - Automatically add non-breaking spaces to empty table cells when
49 empty-cells:show is applied to have compatibility with Internet Explorer
50 - Pretty-printing HTML (adds dependency of Generator to HTMLDefinition)
51 - Non-lossy dumb alternate character encoding transformations, achieved by
52 numerically encoding all non-ASCII characters
55 - Non-lossy smart alternate character encoding transformations