OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] RE: XML versus Unicode ... here are the facts abouttheir differences

> Roger C: Fact: XML parsing is done on codepoints, but XPath does NOT do its string matching operations based on codepoints. XPath uses a byte-for-byte comparison.
> ------------
> David L: I believe this is false.
David L is correct.

When comparing names of elements or attributes, XPath uses codepoint 

When comparing strings in user data, XPath uses a default collation, 
which may be established contextually in some implementation-dependent 
way. For example, it might use the collation appropriate to the current 
user's locale. The default collation might or might not do Unicode 
normalization before comparison.

(In Saxon, the default collation if you don't ask for anything different 
is codepoint collation, because this is adequate for many applications 
and is much faster than locale-sensitive collation. But you can set a 
different collation through the API if required.)

Michael Kay

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS