   RE: Contracts & Acceptance Testing. Re: IE5 and UTF-8

  • From: Eric Bohlman <ebohlman@netcom.com>
  • To: Lucio Piccoli <Lucio.Piccoli@one2one.co.uk>
  • Date: Fri, 7 Jul 2000 04:36:14 -0700 (PDT)

On Fri, 7 Jul 2000, Lucio Piccoli wrote:

> thanks for your response Rick,
> > If you need to be able to pin down the specific encoding problem, some
> > extra info would be helpful:
> >  - Can you tell us what the particular UTF-8 encoding error is? 
> org.xml.sax.SAXParseException: Character conversion error: "Unconvertible
> UTF-8 character beginning with 0x96" (line number may be too low).

It appears they're sending out bytes in the 128-159 (0x80-0x9F) range,
which are printable characters only in the Windows CP-1252 charset; there,
0x96 is an en-dash.  In UTF-8, 0x96 is a continuation byte and can never
begin a character, which is exactly what the parser is complaining about.
I suspect you'll also find codes in the range 0x91-0x94, which is where
Microsoft tools put typographically balanced curly quotes.  If you're
really unlucky, you'll find curly quotes being used as attribute value
delimiters, which isn't even well-formed XML.  In short, it looks like
their documents are actually encoded in Windows-1252 rather than UTF-8.
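
If you want to confirm the diagnosis before going back to the sender,
something along these lines might help.  It is only a rough Python sketch,
not anything from your toolchain: the function names and the sample bytes
are made up for illustration, and it assumes the whole document really is
Windows-1252 rather than a mix of encodings.

```python
def find_invalid_utf8(data: bytes):
    """Return (offset, byte value) of the first byte that breaks UTF-8
    decoding, or None if the data is valid UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start, data[err.start]
    return None


def transcode_cp1252_to_utf8(data: bytes) -> bytes:
    """Reinterpret the bytes as Windows-1252 and re-encode them as UTF-8.

    Assumption: the input is entirely CP-1252. Python's cp1252 codec raises
    UnicodeDecodeError on the five bytes that CP-1252 leaves undefined
    (0x81, 0x8D, 0x8F, 0x90, 0x9D).
    """
    return data.decode("cp1252").encode("utf-8")


# Hypothetical sample: an en-dash (0x96) and curly double quotes (0x93, 0x94)
# as a Windows tool would write them.
sample = b"<p>pages 10\x9612 \x93quoted\x94</p>"

problem = find_invalid_utf8(sample)
if problem is not None:
    offset, value = problem
    print(f"invalid UTF-8 at offset {offset}: 0x{value:02X}")

fixed = transcode_cp1252_to_utf8(sample)
print(fixed.decode("utf-8"))
```

Transcoding on your side is a workaround.  The cleaner fix is for the
sender either to emit real UTF-8 or to declare the encoding they are
actually using in the XML declaration.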

This is xml-dev, the mailing list for XML developers.

