OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] Supporting Unicode (was Some comments on the 1.1 draft)

[ Lists Home | Date Index | Thread Index ]

John Cowan <cowan@mercury.ccil.org> wrote:
> Rick Jelliffe scripsit:
> > That makes it clear that control characters are unlike other characters,
> > for which Unicode provides "semantics". The only C0 or C1 characters for
> > which Unicode provides "semantics" are TAB, CR, LF and NEL.
> XML already, however, allows the use of undefined codepoints, which have
> far less semantics than the C0 controls.  And a good thing too, or
> Ethiopic and Thaana and Canadian Aboriginal Syllabics would be totally
> locked out of XML (they are post-Unicode-2.0) instead of merely
> banned in XML names.

Undefined codepoints have the semantic of "potential site for a future
Unicode character codepoint".  It seems to me unlikely that Unicode will
assign any additional character semantics to the C0 and C1 blocks, making
the allowance for C0 controls in XML of dubious value as a "future-proofing"

-Peter S. Housel-   housel@acm.org   http://members.home.com/housel/


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS