OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] Specifying a Unicode subset

[ Lists Home | Date Index | Thread Index ]

tblanchard@mac.com scripsit:

> Lets move on.  UTF-8 is your transfer encoding, use UCS-2 in memory 
> (unless planning to process ancient Sumerian or something - then use 
> UCS-4) and lets all move on to something remotely interesting.

In CJK environments, using UTF-16 for transfer makes sense, because UTF-8
imposes a 50% growth in the size of native-language characters.
That's basically why XML requires both UTF-8 and UTF-16 support of all
conforming parsers.

Not to perambulate              || John Cowan <jcowan@reutershealth.com>
    the corridors               || http://www.reutershealth.com
during the hours of repose      || http://www.ccil.org/~cowan
    in the boots of ascension.  \\ Sign in Austrian ski-resort hotel


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS