[
Lists Home |
Date Index |
Thread Index
]
jonathan.robie@datadirect-technologies.com (Jonathan Robie) writes:
>I wasn't so much asking whether faithful lexical round-tripping is an
>important goal for me, I think I know the answer to that question. I
>was more interested in asking why this is important to you, and which
>problems you are solving for which such distinctions are significant.
Most of my day-job use of XML is heavily dependent on entity processing,
both for chapters and for special characters. Preserving the original
without normalizing the document is useful in such cases, to say the
least.
CDATA section handling/ignoring has become an issue for me in the last
week, thanks to a fairly broken set of assumptions made by a database in
its exporting XML. That gets especially interesting when combined with
cases where CDATA sections are used intentionally and where you want to
avoid processing entities.
Whitespace in tags, quote styling on attributes, and other similar
features are less of a priority to me, though they can be very useful
for marking content that's to be changed using text-based tools without
affecting consuming applications. I don't need them as much personally,
but as long as I'm going to the effort, I might as well preserve them
and save someone else the effort later.
--
Simon St.Laurent
Ring around the content, a pocket full of brackets
Errors, errors, all fall down!
http://simonstl.com -- http://monasticxml.org
|