XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] Problems parsing a weird entity

On 12/20/06, James Carr <james.r.carr@gmail.com> wrote:
> Hi All,
>
> Recently we've been tasked to process several thousand xml files that
> we can't modify, and in the process of parsing them with sax it seems
> it has an invalid entity &#1; . It seems we keep getting an exception
> from this in all files.
>
> Is there anyway to get around this without modifying the files? We are
> using java 1.4.2's stanadard SAX api to parse the files.

You might be able to get around it by somehow parsing it as XML 1.1
without modifying the prolog, but you'll most likely have to just
preprocess the file to replace/remove the character reference.

If the prolog states its version 1.0 and the file contains #1 then its
not really XML, so maybe try and correct it at source...


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS