XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] What to escape when serializing XML

In article <00e301c72f37$7dcad4e0$6401a8c0@turtle> you write:

>I think there's a reasonable expectation (and certainly the QT serialization
>spec assumes it) that if there's a DTD or schema that causes whitespace to
>be condensed in this way, that's because there's no semantic difference
>between different whitespace sequences, and that therefore it's OK for the
>serializer to assume that any whitespace sequence will do.

XML parsing ensures that an attribute declared with tokenised type
will be normalised to a sequence of tokens separated by single space
characters, whether the attribute is valid or not.  If it's not valid,
the tokens may not be NMTOKENs (they may even contain tabs, linefeeds
etc), but you can rely on them being separated by single spaces.

So on the one hand there's nothing you can do in serialisation to
maintain spaces within the tokens, but on the other hand if your data
came from parsing an XML document the situation is never going to
arise.

-- Richard



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS