tblanchard@mac.com scripsit:
> Let's move on. UTF-8 is your transfer encoding, use UCS-2 in memory
> (unless planning to process ancient Sumerian or something - then use
> UCS-4) and let's all move on to something remotely interesting.
In CJK environments, using UTF-16 for transfer makes sense, because UTF-8
encodes most native-language characters in three bytes where UTF-16 needs
two, a 50% growth in size.
That's basically why XML requires all conforming parsers to support both
UTF-8 and UTF-16.
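
A rough sketch of the size arithmetic, in Python. The function name and
the sample strings are my own illustration, not anything from the thread,
and the UTF-16 count assumes no byte order mark is sent.

```python
def encoded_sizes(text):
    """Return (utf8_bytes, utf16_bytes) for text; UTF-16 counted without a BOM."""
    utf8 = len(text.encode("utf-8"))
    # "utf-16-be" is used so that no byte order mark is added to the count.
    utf16 = len(text.encode("utf-16-be"))
    return utf8, utf16


# Hypothetical sample: three CJK ideographs from the Basic Multilingual Plane.
cjk_utf8, cjk_utf16 = encoded_sizes("日本語")
growth = (cjk_utf8 - cjk_utf16) / cjk_utf16

# ASCII text shows the opposite trade-off: UTF-16 doubles its size.
ascii_utf8, ascii_utf16 = encoded_sizes("XML")

print(cjk_utf8, cjk_utf16, growth)
print(ascii_utf8, ascii_utf16)
```

For markup-heavy documents the ASCII tags pull the other way, so the real
ratio depends on how much of the file is native-language text.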
--
Not to perambulate || John Cowan <jcowan@reutershealth.com>
the corridors || http://www.reutershealth.com
during the hours of repose || http://www.ccil.org/~cowan
in the boots of ascension. \\ Sign in Austrian ski-resort hotel