XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
RE: [xml-dev] Storing illegal XML 1.0 characters in the UnicodePrivate Use Area

Liam, hello,

> Another is to use an escaping mechanism - 
> what I call "UCODE", in which an upper-case 
> "U" is followed by hexadecimal and a trailing 
> X to mark the end

To be sure I understand, this text (2 denotes hex two, 3 denotes hex three):

    2Hello World3

is converted to this XML:

   <text>U2XHello WorldU3X</text>

Is that the approach Liam? 

Have you documented this approach anywhere?

/Roger

-----Original Message-----
From: Liam R E Quin [mailto:liam@w3.org] 
Sent: Friday, November 02, 2012 11:19 AM
To: Costello, Roger L.
Cc: xml-dev@lists.xml.org
Subject: Re: [xml-dev] Storing illegal XML 1.0 characters in the Unicode Private Use Area

On Wed, 2012-10-31 at 18:04 +0000, Costello, Roger L. wrote:
[...]
> One approach is to move any illegal characters into the Private Use Area: 
Another is to use an escaping mechanism - e.g. what I call "UCODE", in
which an upper-case "U" is followed by hexadecimal and a trailing X to
mark the end; this can safely be used in XML element names for example.

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org freenode/#xml
Co-author, 5th edition of "Beginning XML" - Wrox, 2012



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS