[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: Java/Unicode brain damage
- From: Michael Brennan <Michael_Brennan@allegis.com>
- To: xml-dev@lists.xml.org
- Date: Thu, 26 Jul 2001 16:30:08 -0700
I think I'm answering my own question, here. I just noticed the "UCharacter"
class in this library, which is "designed to be a generic code point
information source that handles surrogate pairs". The docs says it supports
Unicode 3.0.
I think folks looking for an immediate Java-based solution should check this
out. It's open source and uses the X open-source license.
> From: Michael Brennan [mailto:Michael_Brennan@allegis.com]
> Sent: Thursday, July 26, 2001 2:34 PM
> To: xml-dev@lists.xml.org
> Subject: RE: Java/Unicode brain damage
>
>
> I don't fully understand the issues, here, (I guess I have
> some studying to
> do) but I'd be interested in hearing from the experts on this
> regarding
> IBM's ICU4J (http://oss.software.ibm.com/icu4j/). Does this
> deal better with
> these issues then the standard Java classes? Does the UTF16
> class help with
> these issues? I notice references to "surrogates" in the API,
> so it seems
> like it has support for surrogate pairs, but I'm not saavy enough with
> Unicode issues to make a judgement, here.