Lists Home |
Date Index |
Daniel Veillard scripsit:
> Just to put some emphasis to what John Cowan already said, I'm afraid
> of the cost of normalizing on-the-fly, the algorithms I could found
> in the Unicode annexes were just scary (in term of complexity and memory
> requirement) maybe there is simpler lean and cheap normalization
> algorithms (I would like pointers ;-) but definitely that cost is better
> done once at generation time. Apparently normalization checking is
> slightly lighter and as said that check is optional c.f. 2.13 wording.
ICU is, as always, the gold standard for this kind of thing. It has
both normalizing and normalization-checking algorithms.
John Cowan <email@example.com>
Unified Gaelic in Cyrillic script!