[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: Java/Unicode brain damage

From: Michael Brennan <Michael_Brennan@allegis.com>
To: xml-dev@lists.xml.org
Date: Thu, 26 Jul 2001 16:30:08 -0700

I think I'm answering my own question, here. I just noticed the "UCharacter"
class in this library, which is "designed to be a generic code point
information source that handles surrogate pairs". The docs says it supports
Unicode 3.0.

I think folks looking for an immediate Java-based solution should check this
out. It's open source and uses the X open-source license. 

> From: Michael Brennan [mailto:Michael_Brennan@allegis.com]
> Sent: Thursday, July 26, 2001 2:34 PM
> To: xml-dev@lists.xml.org
> Subject: RE: Java/Unicode brain damage
> 
> 
> I don't fully understand the issues, here, (I guess I have 
> some studying to
> do) but I'd be interested in hearing from the experts on this 
> regarding
> IBM's ICU4J (http://oss.software.ibm.com/icu4j/). Does this 
> deal better with
> these issues then the standard Java classes? Does the UTF16 
> class help with
> these issues? I notice references to "surrogates" in the API, 
> so it seems
> like it has support for surrogate pairs, but I'm not saavy enough with
> Unicode issues to make a judgement, here.

Prev by Date: RE: Java/Unicode brain damage
Next by Date: Re: UTF-8 BOM is now official
Previous by thread: Re: Java/Unicode brain damage
Next by thread: defining keys in Schemas
Index(es):
- Date
- Thread