OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Re: [xml-dev] Urgent help in XML parser

[ Lists Home | Date Index | Thread Index ]

Your data is not UTF-8.   It is probably the Windows Latin 1 code page, a.k.a "ANSI" a.k.a CP-1252.  

The SAX parser is correct to complain. Correct the encoding declaration to "WINDOWS-1252"
which is the preferred name on the Internet.  


Cheers
Rick Jelliffe


----- Original Message ----- 
From: "Malligeswari N" <malliga@datumamerica.com>
To: <xml-dev@lists.xml.org>
Sent: Tuesday, June 03, 2003 5:02 PM
Subject: [xml-dev] Urgent help in XML parser


Hi All,
     I'm using SAX parser. My xml document has encoding style : 'UTF-8'.

     My inputdata looks like this -
                    <DATA_DESCRIPTION><![CDATA[ TODAY'S
DATE ]]></DATA_DESCRIPTION>

    My parser throws a errors while parsing this particular character " '
" - apos.
                " java.io.UTFDataFormatException: invalid byte 1 of 1-byte
UTF-8 sequence (0x92)
             void
org.apache.xerces.parsers.StandardParserConfiguration.parse(org.apache.xerce
s.xni.parser.XMLInputSource)
             void
org.apache.xerces.parsers.XMLParser.parse(org.apache.xerces.xni.parser.XMLIn
putSource)

             void
org.apache.xerces.parsers.AbstractSAXParser.parse(org.xml.sax.InputSource)
..."

   Pl. let me know how to solve this...

Thanks and Regards,

Malligen.








 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS