xml-dev - Re: [xml-dev] Specifying a Unicode subset

Re: [xml-dev] Specifying a Unicode subset

[ Lists Home | Date Index | Thread Index ]

To: gustaf.liljegren@xml.se (Gustaf Liljegren)
Subject: Re: [xml-dev] Specifying a Unicode subset
From: John Cowan <jcowan@reutershealth.com>
Date: Mon, 21 Oct 2002 12:24:41 -0400 (EDT)
Cc: xml-dev@lists.xml.org
In-reply-to: <3.0.6.32.20021021180358.0098f730@m1.858.telia.com> from "Gustaf Liljegren" at Oct 21, 2002 06:03:58 PM

Gustaf Liljegren scripsit:

> With XML 1.1 (here's my point), there's a proposal to include more
> characters from Unicode in XML. 

In fact, XML 1.1 allows *fewer* characters than XML 1.0, but not ones that
we expect anyone to have used: the characters #x7F-#x9F, with the exception
of #x85.  

> However, some want more characters in XML, while others don't want them.
> Perhaps we can allow for both by letting documents declare their own subset
> of Unicode?

Unicode is rather resistant to the idea of declared subsets.  The conformance
requirement is essentially "Don't corrupt what you don't understand";
explicit transformations are fine, but in general if a particular process
cannot handle a character, it should pass it through unchanged.  (Rendering
is obviously an exception.)

-- 
Business before pleasure, if not too bloomering long before.
        --Nicholas van Rijn
                John Cowan <jcowan@reutershealth.com>
                        http://www.ccil.org/~cowan  http://www.reutershealth.com

References:
- Specifying a Unicode subset
  - From: Gustaf Liljegren <gustaf.liljegren@xml.se>

Prev by Date: Re: [xml-dev] Specifying a Unicode subset
Next by Date: Re: [xml-dev] Specifying a Unicode subset
Previous by thread: Re: [xml-dev] Specifying a Unicode subset
Next by thread: Re: [xml-dev] Specifying a Unicode subset
Index(es):
- Date
- Thread