[
Lists Home |
Date Index |
Thread Index
]
- From: "Paul W. Abrahams" <abrahams@valinet.com>
- To: XMLDev list <xml-dev@ic.ac.uk>
- Date: Wed, 25 Aug 1999 23:08:04 -0400
Steve Dahl wrote:
> My guess is that 0xB7 and 0x387 were included either by accident, or else in Unicode 1.0, they
> were incorrectly classified as Lm characters. I can't find my old Unicode 1.0 chart, so I
> don't know.
>
> My guess is that Extender character means a character classified as Lm. As far as I can tell,
> these are characters that make no sense unless they follow another letter. Think of them as
> being equivalent to diacritical marks, except that they take up space next to the modified
> character, whereas normal diacritical marks fit in the same visual character cell as the
> character they modify. The Lm character must always be displayed next to its preceding
> character, and it's illegal to break the line between an Lm character and its preceding
> character.
The trouble is that there are lots of other Lm characters besides those classified as
extenders. For example:
>
> 02B2;MODIFIER LETTER SMALL J;Lm;0;L;<super> 006A;;;;N;;;;;
> 02B3;MODIFIER LETTER SMALL R;Lm;0;L;<super> 0072;;;;N;;;;;
Why aren't those also considered to be extenders? There seems to be some history and motivation
here that needs explaining.
Paul
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
|