xml-dev - Supporting Unicode (was Some comments on the 1.1 draft)

Supporting Unicode (was Some comments on the 1.1 draft)

[ Lists Home | Date Index | Thread Index ]

To: <xml-dev@lists.xml.org>
Subject: Supporting Unicode (was Some comments on the 1.1 draft)
From: "Rick Jelliffe" <ricko@allette.com.au>
Date: Thu, 20 Dec 2001 16:22:45 +1100
References: <5C39F806F9939046B4B1AFE652500A3A251914@RED-MSG-10.redmond.corp.microsoft.com> <023101c18873$6ec9a590$4bc8a8c0@AlletteSystems.com> <20011219212436.O9114@io.mds.rmit.edu.au> <E16GkIg-0004P4-00@server2000.ebizhostingsolutions.com> <3C20DB74.5000804@reutershealth.com>

From John Cowen:

> However, the control characters are *characters*, not really very
> different from other control characters in the Unicode space
> which are already allowed: not only the ISO C1 controls, but
> also such things as: the Mongolian variant controls (and the
> Unicode 3.2 generic variant controls); the bidi marks, overrides,
> etc; and the music symbol begins/ends.

The Unicode recommendations w.r.t. control characters are in
 http://www.unicode.org/unicode/uni2book/ch13.pdf

That makes it clear that control characters are unlike other characters,
for which Unicode provides "semantics". The only C0 or C1 characters for
which Unicode provides "semantics" are TAB, CR, LF and NEL.

Unicode completely defers the use and semantics of the other control
characters to whatever makes sense for the application in question.
There is no justification for saying "we need to support the C0 and C1 
characters in order to support Unicode" because Unicode does not
require any such thing.   

But what if we do decide to support these control characters: what does
it mean?  It means that we recognize their semantics, according to which
it is inappropriate to embed most of them (e.g. EOF, BS, BELL, flow control,
etc) in a text file for transmission anyway.   

Cheers
Rick Jelliffe

Follow-Ups:
- Re: [xml-dev] Supporting Unicode (was Some comments on the 1.1 draft)
  - From: John Cowan <cowan@mercury.ccil.org>

References:
- RE: [xml-dev] Some comments on the 1.1 draft
  - From: "Michael Rys" <mrys@microsoft.com>
- Re: [xml-dev] Some comments on the 1.1 draft
  - From: "Rick Jelliffe" <ricko@allette.com.au>
- Re: [xml-dev] Some comments on the 1.1 draft
  - From: Alan Kent <ajk@mds.rmit.edu.au>
- Re: [xml-dev] Some comments on the 1.1 draft
  - From: Gavin Thomas Nicol <gtn@rbii.com>
- Re: [xml-dev] Some comments on the 1.1 draft
  - From: John Cowan <jcowan@reutershealth.com>

Prev by Date: Re: [xml-dev] Why would MS want to make XML break on UNIX, Perl, Python, etc ?
Next by Date: Re: [xml-dev] Why would MS want to make XML break on UNIX, Perl, Python, etc ?
Previous by thread: Re: [xml-dev] Some comments on the 1.1 draft
Next by thread: Re: [xml-dev] Supporting Unicode (was Some comments on the 1.1 draft)
Index(es):
- Date
- Thread