5 - Lots of profiling, make it faster!
6 - Plugins for major CMSes (very tricky issue)
9 - Additional support for poorly written HTML
10 - Implement all non-essential attribute transforms
11 - Microsoft Word HTML cleaning (i.e. MsoNormal)
14 - Formatters for plaintext
15 - Auto-paragraphing (be sure to leverage fact that we know when things
16 shouldn't be paragraphed, such as lists and tables).
17 - Make URI validation routines tighter (especially mailto)
18 - More extensive URI filtering schemes
19 - Allow for background-image and list-style-image (see above)
20 - Distinguish between different types of URIs, for instance, a mailto URI
21 in IMG SRC is nonsensical
24 - Add various "levels" of cleaning
25 - Related: Allow strict (X)HTML
28 - Extended HTML capabilities based on namespacing and tag transforms
29 - Hooks for adding custom processors to custom namespaced tags and
30 attributes, offer default implementation
31 - Lots of documentation and samples
33 Unknown release (on a scratch-an-itch basis)
34 - Silently drop content inbetween SCRIPT tags (can be generalized to allow
35 specification of elements that, when detected as foreign, trigger removal
36 of children, although unbalanced tags could wreck havoc (or at least delete
37 the rest of the document)).
38 - Fixes for Firefox's inability to handle COL alignment props (Bug 915)
39 - Automatically add non-breaking spaces to empty table cells when
40 empty-cells:show is applied to have compatibility with Internet Explorer
41 - Pretty-printing HTML (adds dependency of Generator to HTMLDefinition)
42 - Non-lossy dumb alternate character encoding transformations, achieved by
43 numerically encoding all non-ASCII characters
46 - Non-lossy smart alternate character encoding transformations