[
Lists Home |
Date Index |
Thread Index
]
I'm looking around for Unicode character normalization tools, preferably
with a command-line interface.
So far I have:
Normalization Demo
http://www.unicode.org/reports/tr15/Normalizer.html
Charlint (Perl, UTF-8 only)
http://www.w3.org/International/charlint/
Normalizer (part of ICU, don't see a command line)
http://oss.software.ibm.com/icu/userguide/normalization.html
Any others?
This seems to be the primary description of what's involved:
http://www.unicode.org/unicode/reports/tr15/
--
Simon St.Laurent
Ring around the content, a pocket full of brackets
Errors, errors, all fall down!
http://simonstl.com -- http://monasticxml.org
|