OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] Microsoft FUD on binary XML...

[ Lists Home | Date Index | Thread Index ]

On Nov 22, 2003, at 3:37 PM, Alaric B Snell wrote:

> Good point, actually... I suppose that, in general, any language which 
> uses more than 256 code points in general use is actually quite likely 
> to be a language that uses one code point per word.

No, actually.  I don't know much about Chinese, but the average number 
of characters/word in Japanese is two point something; you have to 
learn 1700 or so characters to get out of Japanese high school, and 
literate people pick up quite a few more.  Korean Hangul are syllabics 
and thus there are naturally several per word.  -Tim


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS