[
Lists Home |
Date Index |
Thread Index
]
Your data is not UTF-8. It is probably the Windows Latin 1 code page, a.k.a "ANSI" a.k.a CP-1252.
The SAX parser is correct to complain. Correct the encoding declaration to "WINDOWS-1252"
which is the preferred name on the Internet.
Cheers
Rick Jelliffe
----- Original Message -----
From: "Malligeswari N" <malliga@datumamerica.com>
To: <xml-dev@lists.xml.org>
Sent: Tuesday, June 03, 2003 5:02 PM
Subject: [xml-dev] Urgent help in XML parser
Hi All,
I'm using SAX parser. My xml document has encoding style : 'UTF-8'.
My inputdata looks like this -
<DATA_DESCRIPTION><![CDATA[ TODAY'S
DATE ]]></DATA_DESCRIPTION>
My parser throws a errors while parsing this particular character " '
" - apos.
" java.io.UTFDataFormatException: invalid byte 1 of 1-byte
UTF-8 sequence (0x92)
void
org.apache.xerces.parsers.StandardParserConfiguration.parse(org.apache.xerce
s.xni.parser.XMLInputSource)
void
org.apache.xerces.parsers.XMLParser.parse(org.apache.xerces.xni.parser.XMLIn
putSource)
void
org.apache.xerces.parsers.AbstractSAXParser.parse(org.xml.sax.InputSource)
..."
Pl. let me know how to solve this...
Thanks and Regards,
Malligen.
|