OASIS Mailing List Archives





   Re: UCS-2, UTF-16, and Java (was RE: U+xxxx, U-xxxxxx, and the basics

  • From: John Cowan <jcowan@reutershealth.com>
  • To: "xml-dev@xml.org" <xml-dev@xml.org>
  • Date: Mon, 06 Mar 2000 17:08:50 -0500

Mike Brown wrote:

> I have a question, though. I have seen a reference somewhere saying that
> Java characters and strings are UCS-2 encoded, and I saw a reference
> somewhere else saying they are UTF-16 encoded. Which is it?

Java pretends surrogates don't exist, basically, and will spit out
bad UTF-8 for a surrogate pair.  So you can call it UCS-2, if you
want, but it's really just a broken implementation.
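
To make the "bad UTF-8" point concrete, here is a minimal sketch in Java.
It assumes a modern JDK (7 or later), whose built-in encoder combines a
surrogate pair into one code point. The class name SurrogateUtf8Demo and
the helper encodePerCodeUnit are hypothetical: the helper is a
reconstruction of a UCS-2-style encoder that treats every 16-bit char as
a character of its own, not the actual JDK code of the time.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class SurrogateUtf8Demo {

    // Hypothetical helper: encodes each 16-bit code unit separately,
    // as an encoder that ignores surrogates would. A surrogate pair
    // comes out as two 3-byte sequences, which is not valid UTF-8.
    public static byte[] encodePerCodeUnit(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < s.length(); i++) {
            int c = s.charAt(i);
            if (c < 0x80) {
                out.write(c);
            } else if (c < 0x800) {
                out.write(0xC0 | (c >> 6));
                out.write(0x80 | (c & 0x3F));
            } else {
                out.write(0xE0 | (c >> 12));
                out.write(0x80 | ((c >> 6) & 0x3F));
                out.write(0x80 | (c & 0x3F));
            }
        }
        return out.toByteArray();
    }

    private static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02X ", b & 0xFF));
        }
        return sb.toString().trim();
    }

    public static void main(String[] args) {
        // U+10000, written as its UTF-16 surrogate pair.
        String s = "\uD800\uDC00";

        // Two chars (code units), but only one code point.
        System.out.println("chars:       " + s.length());
        System.out.println("code points: " + s.codePointCount(0, s.length()));

        // Pair-aware encoding: one 4-byte sequence.
        System.out.println("pair-aware:    " + hex(s.getBytes(StandardCharsets.UTF_8)));

        // Per-code-unit encoding: two 3-byte sequences (6 bytes).
        System.out.println("per code unit: " + hex(encodePerCodeUnit(s)));
    }
}
```

The pair-aware form is the single sequence F0 90 80 80. The per-code-unit
form is ED A0 80 ED B0 80, which a conforming UTF-8 decoder should reject.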


Schlingt dreifach einen Kreis vom dies! || John Cowan <jcowan@reutershealth.com>
Schliesst euer Aug vor heiliger Schau,  || http://www.reutershealth.com
Denn er genoss vom Honig-Tau,           || http://www.ccil.org/~cowan
Und trank die Milch vom Paradies.            -- Coleridge (tr. Politzer)

This is xml-dev, the mailing list for XML developers.
To unsubscribe, mailto:majordomo@xml.org&BODY=unsubscribe%20xml-dev
List archives are available at http://xml.org/archives/xml-dev/



Copyright 2001 XML.org. This site is hosted by OASIS