Lists Home |
Date Index |
- From: "James Tauber" <email@example.com>
- To: "XML-Dev Mailing list" <firstname.lastname@example.org>
- Date: Mon, 12 Apr 1999 23:22:15 +0800
> I'm curious as to how this is handled with entities such as those
> used in mathematical equations, or accented characters, or
> other special characters that aren't strictly 'plain text'?
Well strictly they *are* plain text. That's the whole point of XML
characters being Unicode characters. Accented Latin haracters, Japanese,
symbols are just as much plain text as a capital A.
> I'm writing an XML processing application which reads in an
> XML document, performs some processing (based on another
> XML 'rules' document) and then produces a final XML document.
> Ideally I'd like the entities retained from start to finish, so
> that I can be sure that they survive the transformation unchanged.
> But I'm unclear how I can ensure this? Will I have to wrap all
> entity references in CDATA sections before parsing?
A CDATA wrapper wouldn't work because *after* your processing they'd still
be in a CDATA section or would be things like &eacute;
If you absolutely want to have entity references at the end of the day, your
safest bet would be to post process the character data and replace any
characters you don't want literally with an equivalent. Character references
might be an even better solution and certainly this would make the post
processing easier. Just run over the text replacing (say) any character >
128 with &#...;
James Tauber / email@example.com / www.jtauber.com
XML Standards and Product Coordinator
HarvestRoad Communications / www.harvestroad.com.au
Full-day XML Tutorial @ WWW8 : http://www8.org/
Maintainer of : www.xmlinfo.com, www.xmlsoftware.com and www.schema.net
xml-dev: A list for W3C XML Developers. To post, mailto:firstname.lastname@example.org
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:email@example.com the following message;
To subscribe to the digests, mailto:firstname.lastname@example.org the following message;
List coordinator, Henry Rzepa (mailto:email@example.com)