Lists Home |
Date Index |
> Michael Kay wrote:
> > I would be very surprised if XML parsing contributes anything noticeable to
> > the cost of a database load (in shredding mode).
These benchmarks were run on different testbeds so this isn't an
This parser performance test of Expat (C, SAX) reported a best time of 0.05 sec
to parse an 884K document with 32K nodes. For 2500 documents, that would be
approx. 125 seconds.
This benchmark compared two SQL APIs. It was written in C and executed in a
client-server mode, so there was network latency. It used an SQL INSERT, not a
The average time to INSERT 2500 rows with ODBC was 23.53 seconds. The minimum
execution time for an SQL SELECT query to return 2500 rows was 0.05 sec.
This single-cpu Java benchmark parsed simpler documents than the Expat test and
the data was closer to the SQL API test. It took about 2 seconds to parse 10,000
records using SAX, or about .5 seconds to parse 2500 records.
This Python benchmark took between 2.32 and 3.97 seconds for a 3.3 MB document.
My guess is if these benchmarks were all run on the same testbed under the same
conditions, we'd see the same pattern:
The parsing overhead for loading a database is negligible if we're processing
simple documents, but becomes more significant as document size grows.