xml-dev - Re: [xml-dev] How to escape &dt=... (org.dom4j.DocumentException)

Re: [xml-dev] How to escape &dt=... (org.dom4j.DocumentException)

[ Lists Home | Date Index | Thread Index ]

To: xml-dev@lists.xml.org
Subject: Re: [xml-dev] How to escape &dt=... (org.dom4j.DocumentException)
From: lukas.oesterreicher@inode.at
Date: Fri, 5 May 2006 19:09:26 +0200
Reply-to: lukas.oesterreicher@inode.at

>>
>> One of my xml nodes contains an url that contains the text &amp;dt=...
>> 
>> When I try to parse this via DocumentHelper.parseText I get 
>> the following exception:
>> org.dom4j.DocumentException: Error on line 21 of document  : 
>> The reference to entity "dt" must end with the ';' delimiter. 

> Either the text contains & rather than &amp; - or you have put the text
> through the parser twice.

> Michael Kay

I found out what the problem is, but not yet a nice way to solve it:

I get an org.w3c.dom.Element from an external source, this contains
an XML document what I have to handle.

However this XML data has to be send remotely to our server and the
main processing is done there.

So what I do is:
- I convert the org.w3c.dom.Element to a String of the whole XML
document and send it to the remote server.
The code that recieves the Element, converts it to String and
sends it has to be as simple as possible (no import of other jar
files unless absolutely necessary) and has to be fast.

I tried some javax code to convert the Element to a string but
it is extremely slow and takes about the same time as it takes to
send the xml to the server, process it and return a result.

So I constructed the XML String manually which is really fast.
However when I do this and retrieve the node content with
Node.getNodeValue() it returns the value unencoded.

On the server side the XML is parsed to a Document again and
handled there. This parsing fails because the URL mentioned is sent
unencoded, thus i get the mentioned Exception.

What I need to do now is encode the text I get from getNodeValue().
I tested it with replaceAll("&", "&amp;") and that solved the problem,
however this is a very incomplete way of handling it.

Is there a method built into java that encodes the node value
properly?

Thanx in advance,
Lukas

Prev by Date: RE: [xml-dev] Java NVDL implementation
Next by Date: Re: [xml-dev] Request for book recommendations
Previous by thread: RE: [xml-dev] Request for book recommendations
Next by thread: [Watchers of the Web] The evolving form of information on the Web?
Index(es):
- Date
- Thread