xml-dev - RE: [xml-dev] RE: Incremental transformations with Xalan and performance

To: "'Daniela Florescu'" <dflorescu@mac.com>,<andrzej@chaeron.com>
Subject: RE: [xml-dev] RE: Incremental transformations with Xalan and performance issues?
From: "Michael Kay" <mike@saxonica.com>
Date: Mon, 6 Dec 2004 15:44:27 -0000
Cc: <xml-dev@lists.xml.org>
In-reply-to: <0AB7A870-4692-11D9-85AB-000393DC762C@mac.com>

• Daniela Florescu, Chris Hillery, Donald Kossmann, Paul Lucas, Fabio Riccardi, Till Westmann, Michael J. Carey, Arvind Sundararajan:
The BEA streaming XQuery processor.
VLDB Journal Volume 13, Number 3, September 2004

http://www.vldb.org/conf/2003/papers/S30P01.pdf

An interesting paper.

(Funny how everyone uses Xalan-J as their baseline for performance comparisons - I wonder why?)

In the context of streaming, most of the techniques described are not very different from those used in the better XSLT processors. One big difference is that XSLT 1.0 is typically implemented using a dual push/pull model: XSLT instructions use a push pipeline to write a tree, while XPath expressions use a pull streaming model to read data from trees; whereas this paper describes a model that uses pull iterators uniformly. If you extend this all the way to using a pull parser to read the incoming XML data in the first place (and a pull-based streaming validator), then you do indeed get a system that avoids the need to construct the input document in memory, in the special (and probably rather unusual) case where all operations in the query have a fully streamed implementation.
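To make the uniform pull model concrete, here is a minimal sketch in Python. It is not the BEA engine's design nor any XSLT processor's; it only illustrates the general idea of a pull parser feeding a chain of pull iterators. The function names (`pull_events`, `item_texts`, `upper`) and the sample document are assumptions made for illustration. Note also that `XMLPullParser` still keeps a skeleton of the tree internally, so this shows the control flow rather than a truly constant-memory implementation.

```python
import io
from xml.etree.ElementTree import XMLPullParser

def pull_events(stream, chunk_size=16):
    """Yield (event, element) pairs on demand, reading the input in small chunks."""
    parser = XMLPullParser(events=("start", "end"))
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        parser.feed(chunk)
        yield from parser.read_events()
    parser.close()
    # Events still queued when the parser is closed can be read afterwards.
    yield from parser.read_events()

def item_texts(events, tag):
    """Pull iterator: yield the text of each completed <tag> element."""
    for event, elem in events:
        if event == "end" and elem.tag == tag:
            yield elem.text or ""
            # Free the element's content; it stays attached to its parent,
            # so a real streaming engine would have to do more than this.
            elem.clear()

def upper(texts):
    """Pull iterator: transform each item as the consumer asks for it."""
    for text in texts:
        yield text.upper()

doc = io.BytesIO(b"<list><item>a</item><item>b</item><item>c</item></list>")
pipeline = upper(item_texts(pull_events(doc), "item"))

first = next(pipeline)   # only as much input is read as this one item needs
rest = list(pipeline)    # the consumer drives the remainder of the parse
```

Every stage does work only when the stage downstream asks for the next item, which is why the model streams only if every operation in the chain can be written this way.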

(Note, however, that the push approach avoids the need to build the *result* document in memory, and in classic stylesheet applications, the result document is generally larger than the source document.)
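The push side can be sketched the same way. In this illustration (again an assumption-laden sketch, not how Xalan or Saxon is implemented), the hypothetical `transform` function pushes each result event straight to a serializer, here Python's `XMLGenerator`, so no result tree is ever built. The table-shaped output and the sample rows are invented for the example.

```python
import io
from xml.sax.saxutils import XMLGenerator

def transform(rows, out):
    """Push each result event to the serializer as soon as it is produced."""
    sink = XMLGenerator(out, encoding="utf-8")
    sink.startDocument()
    sink.startElement("table", {})
    for row in rows:
        sink.startElement("tr", {})
        for cell in row:
            sink.startElement("td", {})
            sink.characters(cell)  # special characters are escaped on output
            sink.endElement("td")
        sink.endElement("tr")
    sink.endElement("table")
    sink.endDocument()

buf = io.StringIO()
transform([["a", "b"], ["c & d"]], buf)
result = buf.getvalue()
```

With a file or socket in place of the `StringIO` buffer, the memory needed for the output is independent of the size of the result.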

The conclusion of the paper is less than impressive: "The running times can be improved... 3.8 MB is much larger that what the implementation of the engine was tuned for..." I'm seeing users doing XSLT transformations on documents of up to 200 MB, despite the limitation that the source document has to fit in memory! Nevertheless, the architecture looks very solid, and congratulations to BEA for publishing it, unlike vendors of "high-performance" XSLT engines who make marketing claims but give us no technical information to enable an informed assessment or comparison.

Michael Kay

http://www.saxonica.com/

Follow-Ups:
- Re: [xml-dev] RE: Incremental transformations with Xalan and performance issues?
  - From: Kevin Jones <kjouk@yahoo.co.uk>
- Re: [xml-dev] RE: Incremental transformations with Xalan and performance issues?
  - From: Daniela Florescu <dflorescu@mac.com>
- Re: [xml-dev] RE: Incremental transformations with Xalan and performance issues?
  - From: Wolfgang Hoschek <whoschek@lbl.gov>

References:
- Re: [xml-dev] RE: Incremental transformations with Xalan and performance issues?
  - From: Daniela Florescu <dflorescu@mac.com>
