Re: [xml-dev] Proposed requirements on solutions that convert XML-illegal characters into XML


On Fri, Apr 28, 2017 at 3:18 AM, Mukul Gandhi <gandhi.mukul@gmail.com> wrote:

> I've just checked the Char production of XML, and it allows the existence of Unicode code point for NUL character (i.e "\u0000").

Actually, the production says:

Char   ::=   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */

The comment is inaccurate: #x0 is definitely not included, and neither are most of the other C0 control characters. Note that "\u0000" works in XML content because "\" has no special meaning in XML, so that is just six legal characters, not an escape for NUL. XML 1.1 allows a larger range of characters than XML 1.0, but still definitely excludes #x0.
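
To make that concrete, here is a minimal Python sketch that transcribes the Char production above into a range check. The names is_xml10_char, is_xml11_char and parses are made up for illustration. The XML 1.1 ranges are my transcription of that spec's Char production, and the parser check assumes Python's standard xml.etree.ElementTree module, so treat it as a sketch rather than a conformance test.

```python
import xml.etree.ElementTree as ET

# XML 1.0 Char production, transcribed from the grammar quoted above.
XML10_CHAR_RANGES = (
    (0x9, 0x9), (0xA, 0xA), (0xD, 0xD),
    (0x20, 0xD7FF), (0xE000, 0xFFFD), (0x10000, 0x10FFFF),
)

# XML 1.1 Char production: [#x1-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
XML11_CHAR_RANGES = (
    (0x1, 0xD7FF), (0xE000, 0xFFFD), (0x10000, 0x10FFFF),
)


def _in_ranges(code_point, ranges):
    """Return True if code_point falls inside any inclusive (lo, hi) range."""
    return any(lo <= code_point <= hi for lo, hi in ranges)


def is_xml10_char(code_point):
    """True if the code point matches the XML 1.0 Char production."""
    return _in_ranges(code_point, XML10_CHAR_RANGES)


def is_xml11_char(code_point):
    """True if the code point matches the XML 1.1 Char production."""
    return _in_ranges(code_point, XML11_CHAR_RANGES)


def parses(document):
    """Hypothetical helper: True if ElementTree accepts the document."""
    try:
        ET.fromstring(document)
        return True
    except ET.ParseError:
        return False


if __name__ == "__main__":
    # NUL is excluded by both versions of the production.
    print("U+0000:", is_xml10_char(0x0), is_xml11_char(0x0))
    # U+0001 shows where XML 1.1 widens the range.
    print("U+0001:", is_xml10_char(0x1), is_xml11_char(0x1))

    # A raw string keeps the backslash, so the element content is the six
    # ordinary characters \ u 0 0 0 0, every one of which is a legal Char.
    literal = r"<a>\u0000</a>"
    print(len(ET.fromstring(literal).text))  # 6

    # An actual NUL character in content is not well-formed XML.
    print("literal NUL parses:", parses("<a>\x00</a>"))
```

The ranges are kept as plain data so they can be compared line by line against the production; a compiled regular-expression character class would work equally well.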

-- 
John Cowan          http://vrici.lojban.org/~cowan        cowan@ccil.org
Where the wombat has walked, it will inevitably walk again.
   (even through brick walls!)


