[
Lists Home |
Date Index |
Thread Index
]
Daniel Veillard scripsit:
> Just to put some emphasis to what John Cowan already said, I'm afraid
> of the cost of normalizing on-the-fly, the algorithms I could found
> in the Unicode annexes were just scary (in term of complexity and memory
> requirement) maybe there is simpler lean and cheap normalization
> algorithms (I would like pointers ;-) but definitely that cost is better
> done once at generation time. Apparently normalization checking is
> slightly lighter and as said that check is optional c.f. 2.13 wording.
ICU is, as always, the gold standard for this kind of thing. It has
both normalizing and normalization-checking algorithms.
--
John Cowan <jcowan@reutershealth.com>
http://www.ccil.org/~cowan http://www.reutershealth.com
Unified Gaelic in Cyrillic script!
http://groups.yahoo.com/group/Celticonlang
|