xml-dev - Re: [xml-dev] Compiled XML

Re: [xml-dev] Compiled XML

[ Lists Home | Date Index | Thread Index ]

To: Alaric Snell <alaric@alaric-snell.com>
Subject: Re: [xml-dev] Compiled XML
From: Dennis Sosnoski <dms@sosnoski.com>
Date: Sun, 31 Mar 2002 00:46:12 -0800
Cc: Niels Peter Strandberg <nielspeter@npstrandberg.com>, xml-dev@lists.xml.org
References: <1202CB3E-4187-11D6-B182-000502CB905D@npstrandberg.com> <20020327140449.419CB91095@love.warhead.org.uk>
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:0.9.8) Gecko/20020205

Glad to see such an, err, "enthusiastic" response. As the web page says, 
I'd intended to update this long ago. I've been sidetracked but will try 
to get back to it later this month, when I want to compare document size 
and processing speed for collections of documents using a common schema. 
I'll also try to find the fastest available SAX2 parser to use as an 
input-only comparison.

I'd suggest you don't waste time trying Java serialized versions of DOM 
- the results are horrible. You can see some at the bottom of my 
document models benchmarks page, at 
http://www.sosnoski.com/opensrc/xmlbench/results.html. The main problem 
is that all the document representations (DOM, JDOM, dom4j, etc.) are 
tree structures of generally small objects, while Java serialization is 
optimized for graph structures. It uses (fairly large) handles for each 
object, and actually includes the handles in the encoding (as opposed to 
just making the values sequential and implicit). This adds a lot of 
bloat - Java serialized Xerces DOM ran about twice the size of the text 
documents in the tests I've run.

  - Dennis

Alaric Snell wrote:

>http://www.sosnoski.com/opensrc/xmls/results.html
>
> - uuugghh, I just ejaculated (sorry, ladies)!
>
>That's the kind of experiment I was planning to perform this weekend, and the 
>kinds of results I imagined getting.
>
>The only difference is that I'd introduce gzipped versions of the text, 
>serialised DOM tree, and XMLS data, including the time taken to deflate and 
>inflate the data. Just since people keep raising gzipped text.
>
>I'll try and do that this weekend...
>
>ABS
>

References:
- Re: [xml-dev] Compiled XML
  - From: Niels Peter Strandberg <nielspeter@npstrandberg.com>
- Re: [xml-dev] Compiled XML
  - From: Alaric Snell <alaric@alaric-snell.com>

Prev by Date: Potential problem with ASP method TransformNode ( This is an XML ? )
Next by Date: XML Standards Library 1.0 : Updated 1 April 2002
Previous by thread: Re: [xml-dev] Compiled XML
Next by thread: RE: [xml-dev] RELAX NG Marketing
Index(es):
- Date
- Thread