Lists Home |
Date Index |
- From: John Cowan <email@example.com>
- To: XML Dev <firstname.lastname@example.org>
- Date: Mon, 11 Jan 1999 14:07:50 -0500
Elliotte Rusty Harold wrote:
> In XML ["extender"]
> characters can be used anywhere a base character or ideographic
> character can be used.
This is not quite true, because extenders are not name-start characters
in either XML or Unicode.
> However I have been unable to find in the Unicode book or Web site any
> definition of what makes a character an extender. Can anyone clue me in on
> why some Unicode characters have the extender property while others don't?
> What's the logic behind this grouping of characters across languages?
Roughly (and unofficially) speaking, an extender is something that isn't
a letter or combining mark but often appears embedded in words.
For example, one may use L plus MIDDLE DOT as a compatibility equivalent
of L WITH MIDDLE DOT in writing Catalan, and we do not want a
Catalan name to break into two names at the MIDDLE DOT.
(The dot is used to distinguish two successive Ls, written with
a dot, from the unitary Catalan letter "ll", written without a dot.)
Extenders are enumerated (but not explained) in Section 5.14 of
the Unicode Standard.
John Cowan http://www.ccil.org/~cowan email@example.com
You tollerday donsk? N. You tolkatiff scowegian? Nn.
You spigotty anglease? Nnn. You phonio saxo? Nnnn.
Clear all so! 'Tis a Jute.... (Finnegans Wake 16.5)
xml-dev: A list for W3C XML Developers. To post, mailto:firstname.lastname@example.org
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:email@example.com the following message;
To subscribe to the digests, mailto:firstname.lastname@example.org the following message;
List coordinator, Henry Rzepa (mailto:email@example.com)