[
Lists Home |
Date Index |
Thread Index
]
- From: "James Tauber" <jtauber@jtauber.com>
- To: "Dan Rudman" <rudman@idetix.com>,"'XML-DEV'" <xml-dev@ic.ac.uk>
- Date: Sat, 13 Mar 1999 00:38:01 +0800
-----Original Message-----
From: Dan Rudman <rudman@idetix.com>
>With the wealth of XML libraries available, I am more and more inclined to
>make use of these libraries to help me create, parse, and utilize my own
tag
>markup language to be embedded within an HTML document. My understanding
of
>XML at this point is that it must be well-formed or a fatal error occurs.
Yes, this is correct.
>If this is the case, how can I deal with the fact that most HTML documents
>are NOT well-formed and that most HTML design tools do not enforce,
require,
>or even sometimes support, well-formedness in a document?
You might try Tidy as the initial step. Tidy can take bad HTML and spit out
XML that could then be parsed by any XML parser.
See http://www.w3.org/People/Raggett/tidy/
Hope this helps.
James
--
James Tauber / jtauber@jtauber.com / www.jtauber.com
Associate Researcher, Electronic Commerce Network
Curtin University of Technology, Perth, Western Australia
Full-day XML Tutorial @ WWW8 : http://www8.org/
Maintainer of : www.xmlinfo.com, www.xmlsoftware.com and www.schema.net
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
|