OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: Basic Question

[ Lists Home | Date Index | Thread Index ]
  • From: "James Tauber" <jtauber@jtauber.com>
  • To: "Dan Rudman" <rudman@idetix.com>,"'XML-DEV'" <xml-dev@ic.ac.uk>
  • Date: Sat, 13 Mar 1999 00:38:01 +0800

-----Original Message-----
From: Dan Rudman <rudman@idetix.com>
>With the wealth of XML libraries available, I am more and more inclined to
>make use of these libraries to help me create, parse, and utilize my own
>markup language to be embedded within an HTML document.  My understanding
>XML at this point is that it must be well-formed or a fatal error occurs.

Yes, this is correct.

>If this is the case, how can I deal with the fact that most HTML documents
>are NOT well-formed and that most HTML design tools do not enforce,
>or even sometimes support, well-formedness in a document?

You might try Tidy as the initial step. Tidy can take bad HTML and spit out
XML that could then be parsed by any XML parser.

See http://www.w3.org/People/Raggett/tidy/

Hope this helps.

James Tauber / jtauber@jtauber.com / www.jtauber.com
Associate Researcher, Electronic Commerce Network
Curtin University of Technology, Perth, Western Australia

Full-day XML Tutorial @ WWW8 : http://www8.org/

Maintainer of : www.xmlinfo.com,  www.xmlsoftware.com and www.schema.net

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS