Rick Jelliffe scripsit:
> That makes it clear that control characters are unlike other characters,
> for which Unicode provides "semantics". The only C0 or C1 characters for
> which Unicode provides "semantics" are TAB, CR, LF and NEL.
XML already, however, allows the use of undefined codepoints, which have
far less semantics than the C0 controls. And a good thing too, or
Ethiopic and Thaana and Canadian Aboriginal Syllabics would be totally
locked out of XML (they are post-Unicode-2.0) instead of merely
banned in XML names.
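
To make the distinction concrete, here is a minimal sketch of the XML 1.0 Char production as a range check on code points. The function name is_xml10_char and the sample list are illustrative assumptions, not part of any library. It checks only whether a character may appear in a document at all, not whether it may appear in a name.

```python
def is_xml10_char(cp: int) -> bool:
    """Return True if code point cp matches the XML 1.0 Char production.

    Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD]
             | [#x10000-#x10FFFF]
    The check is purely by range: whether Unicode has assigned the
    code point plays no part in it.
    """
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


# Hypothetical sample set, chosen to mirror the scripts and controls
# mentioned in this thread.
samples = {
    "ETHIOPIC SYLLABLE HA": 0x1200,
    "THAANA LETTER HAA": 0x0780,
    "CANADIAN SYLLABICS E": 0x1401,
    "TAB (C0 control)": 0x0009,
    "SOH (C0 control)": 0x0001,
    "NEL (C1 control)": 0x0085,
}

for name, cp in samples.items():
    verdict = "allowed" if is_xml10_char(cp) else "not allowed"
    print(f"U+{cp:04X} {name}: {verdict}")
```

Because the test is by range alone, the post-Unicode-2.0 scripts pass as document characters, while most C0 controls (SOH here) fall outside every range.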
John Cowan http://www.ccil.org/~cowan email@example.com
Please leave your values        |       Check your assumptions.  In fact,
   at the front desk.           |          check your assumptions at the door.
     --sign in Paris hotel      |            --Miles Vorkosigan