OASIS Mailing List Archives

 



RE: Language declaration question



At 05:09 31-08-2001, Hewko, Doug wrote:
>When I looked at a chart that has the encoding values available (ie. "UTF-8:
>Compressed Unicode", "ISO-8859-2: Latin-2; Eastern European", "EUC-JP:
>Japanese, Unix", etc), they all imply some language.

"Compressed Unicode" is not a language... and in any case, UTF-8 is 
sometimes compressed and sometimes expanded relative to 16-bit Unicode, 
depending on the characters involved.
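
To make the size point concrete, here is a minimal sketch in Python (the language and the sample strings are illustrative choices, not taken from the chart in question). It counts the bytes the same text needs in UTF-8 and in a 16-bit encoding:

```python
# Compare how many bytes the same text needs in UTF-8 versus UTF-16.
# The sample strings are illustrative assumptions only.
samples = {
    "ascii": "hello",
    "latin": "\u00e9t\u00e9",        # "ete" with acute accents on both e's
    "cjk": "\u65e5\u672c\u8a9e",      # three Japanese (kanji) characters
}

def byte_sizes(text):
    """Return (utf8_bytes, utf16_bytes) for text, without a byte-order mark."""
    return len(text.encode("utf-8")), len(text.encode("utf-16-be"))

sizes = {name: byte_sizes(text) for name, text in samples.items()}

for name, (utf8_len, utf16_len) in sizes.items():
    print(name, "UTF-8:", utf8_len, "bytes;", "UTF-16:", utf16_len, "bytes")
```

The ASCII sample is half the size in UTF-8, while the kanji sample is half again as large, so "compressed" describes only some texts.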

>UTF-8 is primarily the
>English characters.)

Where did you get this chart?  UTF-8 can represent all of Unicode.
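
As a rough check of that claim, the following Python sketch round-trips every code point through UTF-8. It assumes the code space as currently defined (U+0000 through U+10FFFF) and skips the surrogate range, which is reserved for UTF-16 and does not stand for characters. The function names are made up for this illustration.

```python
# Round-trip every Unicode code point (except surrogates) through UTF-8.
# Function names are hypothetical, chosen for this example only.

def utf8_round_trips(code_point):
    """True if the character survives encoding to UTF-8 and decoding back."""
    ch = chr(code_point)
    return ch.encode("utf-8").decode("utf-8") == ch

def all_round_trip():
    """Check the whole code space, skipping surrogates U+D800..U+DFFF."""
    return all(
        utf8_round_trips(cp)
        for cp in range(0x110000)
        if not 0xD800 <= cp <= 0xDFFF
    )

def utf8_length(code_point):
    """Number of bytes UTF-8 uses for a single code point."""
    return len(chr(code_point).encode("utf-8"))

if __name__ == "__main__":
    print("all code points round-trip:", all_round_trip())
    for cp in (0x41, 0xE9, 0x65E5, 0x10FFFF):
        print(f"U+{cp:04X} takes {utf8_length(cp)} byte(s)")
```

English letters are simply the ones that fit in a single byte; everything else takes two to four.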

>I thought they would be synonymous with "encoding" just
>being the language that the document was typed in. That is why I got
>confused.
>
>Just to make sure I understand, all encoding does is translate the
>machine-coded values using a table into a standard "master" language that
>the processor can understand? Do all processors use Unicode? (ie. Would a
>Chinese version of MS IE 5.5 use the same Unicode that I would use?)

Unicode is Unicode is Unicode.  (Well, Unicode 2.0 is Unicode 2.0; Unicode 
3.0 is Unicode 3.0... newer versions add characters, and only ever 
deprecate, not remove, old ones.)

See the Unicode Web site, <URL: http://www.unicode.org/ > for some 
clarifications.

-Chris
-- 
Christopher R. Maden, Principal Consultant, HMM Consulting Int'l, Inc.
DTDs/schemas - conversion - ebooks - publishing - Web - B2B - training
<URL: http://www.hmmci.com/ > <URL: http://crism.maden.org/consulting/ >
PGP Fingerprint: BBA6 4085 DED0 E176 D6D4  5DFC AC52 F825 AFEC 58DA