OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] generating DOM from ill-formed HTML docs

[ Lists Home | Date Index | Thread Index ]

* Robert Mena wrote:
>Thanks.  I've tested both tools and so far neko
>outperforms tidy in all aspects besides performance.
>Some "popular" sites such as amazon were rejected by
>tidy but went ok with neko.

You may use the --force-output boolean switch to get your
output even if Tidy finds errors (in which Tidy does not
output a pretty-printed version). Rejection happens because
Tidy cannot be sure in order how to fix a certain error.

Note that there is a Library branch in the Tidy CVS repository
that allows to use Tidy more easily from within your program,
but also note that this is somewhat of a pre-alpha version of
the code.



News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS