[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: load DTD in memory.
- From: Nicolas LEHUEN <nicolas.lehuen@ubicco.com>
- To: 'Giuseppe Sarno' <gsarno@nortelnetworks.com>,"'xml-dev@lists.xml.org'" <xml-dev@lists.xml.org>
- Date: Fri, 07 Sep 2001 14:42:07 +0200
Another possibility that could lead to *no* performance improvement would be
that you used a new cache instance for each document.
You *obviously* HAVE TO reuse the same DTDMemoryCache instance for all
parsing activity, or else you won't notice any improvement...
Regards,
Nicolas
-----Message d'origine-----
De : Nicolas LEHUEN [mailto:nicolas.lehuen@ubicco.com]
Envoyé : vendredi 7 septembre 2001 14:09
À : 'Giuseppe Sarno'; 'xml-dev@lists.xml.org'
Objet : RE: load DTD in memory.
Well then I cannot help you any further. Try the XML entity resolver from
Sun, but if you don't see any improvement with this simple, low-level code I
wonder how you could see some with a higher level framework.
Maybe your performance problem is not related to the DTD loading itself but
to the DTD validation, or maybe even on the document processing that takes
place after parsing. But I cannot go further without any more details on
your application or your benchmarking methodology.
Best regards,
Nicolas
-----Message d'origine-----
De : Giuseppe Sarno [mailto:gsarno@nortelnetworks.com]
Envoyé : vendredi 7 septembre 2001 13:29
À : Nicolas LEHUEN; 'xml-dev@lists.xml.org'
Objet : RE: load DTD in memory.
Well I'm actually using the same DTD but
still no improvements.
cheers.
-----Original Message-----
From: Nicolas LEHUEN [mailto:nicolas.lehuen@ubicco.com]
Sent: 07 September 2001 11:36
To: Sarno, Giuseppe [MAIFP:GM12:EXCH]; 'xml-dev@lists.xml.org'
Subject: RE: load DTD in memory.
Well if you are loading many documents with the same DTD, you'll have
improvements. If your documents DTDs are all differents, then you should
consider having a persistent DTD cache, i.e. storing DTD as files and
associating these files to their PUBLIC id. This is a kind of DTD catalog.
-----Message d'origine-----
De : Giuseppe Sarno [mailto:gsarno@nortelnetworks.com]
Envoyé : vendredi 7 septembre 2001 12:31
À : Nicolas LEHUEN; 'xml-dev@lists.xml.org'
Objet : RE: load DTD in memory.
Hi I implemented this,
but apparently I didn't get any improvements.
I was expecting to be very quick.
Is it possible ?
cheers.
-----Original Message-----
From: Nicolas LEHUEN [mailto:nicolas.lehuen@ubicco.com]
Sent: 06 September 2001 16:10
To: Sarno, Giuseppe [MAIFP:GM12:EXCH]; 'xml-dev@lists.xml.org'
Subject: RE: load DTD in memory.
Oops sorry I wrote the code directly in the mail, without compiling it :).
You should have read (and corrected) :
InputStream isP=urlP.openStream();
Do not hesitate to have a look at the Javadoc to solve such problems. That's
a must if you want to achieve fast development in Java : always have your
Javadoc at hand.
Regards,
Nicolas
-----Message d'origine-----
De : Giuseppe Sarno [mailto:gsarno@nortelnetworks.com]
Envoyé : jeudi 6 septembre 2001 11:33
À : Nicolas LEHUEN; 'xml-dev@lists.xml.org'
Objet : RE: load DTD in memory.
hi I'm getting error on
InputStream isP=urlP.openConnection();
Incompatible type for declaration. Can't convert java.net.URLConnection to
java.io.InputStream.
InputStream isP=urlP.openConnection();
^
1 error
what should i do for that ?
cheers.
-----Original Message-----
From: Nicolas LEHUEN [mailto:nicolas.lehuen@ubicco.com]
Sent: 05 September 2001 14:09
To: Sarno, Giuseppe [MAIFP:GM12:EXCH]; 'xml-dev@lists.xml.org'
Subject: RE: load DTD in memory.
Well, here is a sample class for a in-memory entity cache (it will therefore
cache DTDs). It is a dumb cache, because if the original DTD changes, the
cache is not refreshed, but for many cases it is sufficient. We have
implemented a much more flexible caching framework, but here it's out of
scope.
import java.net.*;
import java.util.*;
import org.xml.sax.*;
public class DTDMemoryCache implements EntityResolver {
private Map cache=new HashMap();
// Remember to handle synchronization issues !!
// A dumb but easy implementation would synchronize the whole
// resolveEntity method
public synchronized InputSource resolveEntity(String publicIdP,String
systemIdP) throws SAXException {
// The PUBLIC id is the key to our cache
byte[] resultP=cache.get(publicIdP);
if(resultP==null) {
try {
// The SYSTEM id is the URL used to fetch the entity
URL urlP=new URL(systemIdP);
InputStream isP=urlP.openConnection();
ByteArrayOutputStream baosP=new
ByteArrayOutputStream();
// We copy the input stream into the output stream
// Fast buffer implementation
// I could have used BufferInputStream and
OutputStream
// But it's much slower
int readP;
byte[] bufferP=new byte[1024];
while((readP=isP.read(bufferP))>-1) {
baosP.write(bufferP,0,readP);
}
resultP=baosP.toByteArray();
// We store the result in the cache.
cache.put(publicIdP,resultP);
}
catch(Exception eP) {
throw new SAXException(eP);
}
}
return new InputSource(new ByteArrayInputStream(resultP));
}
}
Now, how to install this EntityResolver ? Both the org.xml.sax.Parser and
the javax.xml.parsers.DocumentBuilder have a setEntityResolver() method that
enables you to install your own EntotyResolver before parsing your
documents.
Regards,
Nicolas
-----Message d'origine-----
De : Giuseppe Sarno [mailto:gsarno@nortelnetworks.com]
Envoyé : mercredi 5 septembre 2001 14:47
À : Nicolas LEHUEN; 'xml-dev@lists.xml.org'
Objet : RE: load DTD in memory.
hi,
thanks for your answer,
but could you please tell me more on how to do it since the API doc doesn't
explain it very well.
cheers.
-----Original Message-----
From: Nicolas LEHUEN [mailto:nicolas.lehuen@ubicco.com]
Sent: 05 September 2001 13:26
To: Sarno, Giuseppe [MAIFP:GM12:EXCH]; 'xml-dev@lists.xml.org'
Subject: RE: load DTD in memory.
To cache DTDs the easiest thing to do with SAX is to implement your own
EntityResolver. This way you'll be able to cache locally or in memory all
entities, amongst them the DTDs.
Regards,
Nicolas
-----Message d'origine-----
De : Giuseppe Sarno [mailto:gsarno@nortelnetworks.com]
Envoyé : mercredi 5 septembre 2001 13:56
À : xml-dev@lists.xml.org
Objet : load DTD in memory.
Hi ,
I was wondering ,
in case is needed to parse a lot of XML docs , is it possible to
load the DTD or Schema one in memory and always reuse it instead of
reloading it for each Doc validation ?
cheers.
--------------InterScan_NT_MIME_Boundary--
--------------InterScan_NT_MIME_Boundary--
--------------InterScan_NT_MIME_Boundary--
--------------InterScan_NT_MIME_Boundary--
--------------InterScan_NT_MIME_Boundary--
--------------InterScan_NT_MIME_Boundary--