OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] UTF-8+names

[ Lists Home | Date Index | Thread Index ]

"Simon St.Laurent" <simonstl@simonstl.com> wrote at Sun, 19 Oct 2003 10:47:06 -0400:
> mc@xegesis.org (Mike Champion) writes:
> >exit :-) Tim's approach is taking a real, widespread problem and
> >offering a clean, layered solution -- essentially a character encoding
> >preprocessor -- rather than changing XML itself.
> This makes me laugh hysterically, as the original notion of Unicode was
> to develop a big enough space that all the characters could live in
> happy co-existence without need for layering in the character space.

Actually, that's more the original ISO/IEC 10646 approach that didn't
pass its draft ballot in the period before the Unicode Consortium and
the ISO/IEC WG merged their repertoires.

(It also resembles recent fanciful unofficial proposals on the Unicode
list, but those threads petered out.)

> Unicode itself ran out of room and put in surrogates.  Now it seems

Yes.  I don't doubt that 65,000 characters seemed like enough back in
1988, or that a (fixed) character size larger than 16 bits would have
been an even tougher sell back when Unicode was getting established.

> that we've run out of patience and added yet another layer of
> processing in the middle.

Yet the proposal under discussion doesn't attempt naming either 65,000
characters or 1,000,000+, so I don't see why surrogates have anything
to do with it.


Tony Graham
XML Technology Center - Dublin
Sun Microsystems Ireland Ltd                       Phone: +353 1 8199708
Hamilton House, East Point Business Park, Dublin 3            x(70)19708


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS