[
Lists Home |
Date Index |
Thread Index
]
- From: John Cowan <cowan@locke.ccil.org>
- To: xml-dev@ic.ac.uk
- Date: Mon, 27 Dec 1999 22:17:02 -0500 (EST)
Simon St.Laurent scripsit:
> While I like Tidy a lot, I'd love to have a parser that tidies up the HTML
> structure and then spits it out as SAX events or a DOM tree, rather than
> the kind of document-to-document work that Tidy does. Seems like that
> shouldn't be much more difficult than the work Tidy does.
It isn't, and in fact the Java version of Tidy (linked from Dave Raggett's
page) provides a mini-DOM. WIth my DOMParser, you can generate SAX events
from the mini-DOM as well.
--
John Cowan cowan@ccil.org
I am a member of a civilization. --David Brin
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo@ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
|