OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] XML's Scylla and Charybdis - parse and regexp

[ Lists Home | Date Index | Thread Index ]

jonathan.robie@datadirect-technologies.com (Jonathan Robie) writes:
>I wasn't so much asking whether faithful lexical round-tripping is an
>important goal for me, I think I know the answer to that question. I
>was more interested in asking why this is important to you, and which
>problems you are solving for which such distinctions are significant.

Most of my day-job use of XML is heavily dependent on entity processing,
both for chapters and for special characters.  Preserving the original
without normalizing the document is useful in such cases, to say the

CDATA section handling/ignoring has become an issue for me in the last
week, thanks to a fairly broken set of assumptions made by a database in
its exporting XML.  That gets especially interesting when combined with
cases where CDATA sections are used intentionally and where you want to
avoid processing entities.

Whitespace in tags, quote styling on attributes, and other similar
features are less of a priority to me, though they can be very useful
for marking content that's to be changed using text-based tools without
affecting consuming applications.  I don't need them as much personally,
but as long as I'm going to the effort, I might as well preserve them
and save someone else the effort later.

Simon St.Laurent
Ring around the content, a pocket full of brackets
Errors, errors, all fall down!
http://simonstl.com -- http://monasticxml.org


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS