OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] [Summary] UTF-8 Question: e with acute accent should require two bytes, right?

In article <004101c802c8$34fd7640$654d410a@aldebaran> you write:

>I don't see what "UTF-8 character" could mean other than a "(Unicode)
>character encoded in UTF-8".

It can also mean a character in the UTF-8 repertoire.  Hoever, the
UTF-8 repertoire is the same as that of Unicode, so it may well be
that people will usually use "Unicode character" for that meaning and
reserve the "UTF-8 character" for "character encoded in UTF-8".

That doesn't apply to ASCII - it's not just an encoding of some other
character set - and I would guess that "ASCII character" more often
means one in the ASCII repertoire.

There is not a universally agreed precise terminology for these
things.  But using cumbersome phrases to try and achieve precision in
a document for beginners (which is what I take it we were discussing)
is likely to instead just be confusing.

-- Richard
"Consideration shall be given to the need for as many as 32 characters
in some alphabets" - X3.4, 1963.

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS