Interesting. Those are compelling use cases, but they complicate things
significantly. In particular, automatically using entities on output
becomes much harder: instead of a simple hash table that maps character
codes to entities, you need a trie keyed on character sequences. I also
see a slippery slope opening up here:
1. single character
2. base character + combining character(s)/other Unicode modifier (MathML)
3. arbitrary sequence of characters (why stop at 2, if you don't want to
check character types?)
4. arbitrary well-formed content (3 allows arbitrary text, and for I18N
arbitrary text needs elements, e.g. for BIDI and ruby)
It is not clear where the right place to draw the line is.
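
To make the hash-table-versus-trie point concrete, here is a minimal sketch of longest-match entity substitution on output. The function names (build_trie, encode) and the small ENTITIES table are hypothetical and purely illustrative; the nlE entry simply follows the mapping suggested in the quoted message and is not an authoritative entity table.

```python
def build_trie(entities):
    """Build a nested-dict trie from {character sequence: entity name}."""
    root = {}
    for seq, name in entities.items():
        node = root
        for ch in seq:
            node = node.setdefault(ch, {})
        node[None] = name  # the None key marks the end of a complete sequence
    return root


def encode(text, trie):
    """Replace the longest matching sequence at each position with &name;."""
    out = []
    i = 0
    while i < len(text):
        node = trie
        match_name, match_end = None, i
        j = i
        # Walk the trie as far as the input allows, remembering the last
        # position at which a complete entity sequence ended.
        while j < len(text) and text[j] in node:
            node = node[text[j]]
            j += 1
            if None in node:
                match_name, match_end = node[None], j
        if match_name is None:
            out.append(text[i])  # no entity starts here; copy the character
            i += 1
        else:
            out.append("&" + match_name + ";")
            i = match_end
    return "".join(out)


# Illustrative table only (assumed names and mappings).
ENTITIES = {
    "\u00e9": "eacute",
    "\u2a7d": "les",
    "\u2a7d\u0338": "nlE",
}
TRIE = build_trie(ENTITIES)
```

With single-character entities only, the inner loop never goes deeper than one level and this degenerates to the plain hash lookup; the extra bookkeeping is only needed once a key can be a prefix of a longer key, as with U+2A7D and U+2A7D U+0338 here.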
--On 06 February 2002 15:31 +0000 David Carlisle <davidc@nag.co.uk> wrote:
>
> <character name="eacute" code="xE9"/>
>
> A large number of the MathML entities do not map to a single Unicode
> code point but to two (or in a few cases three). In the common cases the
> second character is a combining character (e.g. the negation / ) or the
> new math variant character VS1 (to be) introduced in Unicode 3.2.
> E.g. the ISOAMSN nlE (not less than) would seem to map to
> U02A7D U00338
> where U02A7D is LESS-THAN OR SLANTED EQUAL TO (Unicode 3.2)
> and U00338 is COMBINING LONG SOLIDUS OVERLAY
>
> so the code attribute would probably need to take a sequence of decimal
> or hex values, not just a single one.
>
> David
>
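
As a rough illustration of the quoted suggestion, a code attribute holding a sequence of values could be read along these lines. The syntax is an assumption, not something either message specifies: values are whitespace-separated, a leading "x" marks hexadecimal, and anything else is decimal. The function name parse_codes is hypothetical.

```python
def parse_codes(value):
    """Turn a whitespace-separated list of code points into a string.

    Assumed syntax: tokens starting with 'x' or 'X' are hexadecimal,
    all other tokens are decimal.
    """
    chars = []
    for token in value.split():
        if token[0] in "xX":
            chars.append(chr(int(token[1:], 16)))
        else:
            chars.append(chr(int(token)))
    return "".join(chars)
```

Under that assumed syntax, parse_codes("x2A7D x338") yields the two-character sequence discussed above, and a single value such as "xE9" still works unchanged.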