OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] Best Practice for designing XML vocabulariescontaining accented characters -- allow both composed and decomposed forms

On Sat, 2013-02-02 at 19:03 +0000, Costello, Roger L. wrote:
> Hi Folks,
> I propose the following as Best Practice:
> 	For elements and attributes that have accents,
> 	allow users to express them in either composed
> 	normalized form (NFC) or decomposed normalized
> 	form (NFD).

Best for whom? In what circumstance?

I _think_ you mean, for elements or attributes whose names contain
diacritical marks.

And the right answer is that good practice, where such names are used,
is generally to encode them all the same way.

Note also that if you get into Greek, for example, a single base
character can have two or even three diacritical marks, and the order in
which these are supplied, although generally creating the same
appearance, is significant in XML.


Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org freenode/#xml

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS