Lists Home |
Date Index |
I'm looking around for Unicode character normalization tools, preferably
with a command-line interface.
So far I have:
Charlint (Perl, UTF-8 only)
Normalizer (part of ICU, don't see a command line)
This seems to be the primary description of what's involved:
Ring around the content, a pocket full of brackets
Errors, errors, all fall down!
http://simonstl.com -- http://monasticxml.org