OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   RE: [xml-dev] Parsing Performance

[ Lists Home | Date Index | Thread Index ]

Martin Soukup writes:

 > > Is it some kind of indexing engine, or are you reading live datafeeds?
 > Live data feeds. Yes, I think you might be right, but what option
 > do I have other than SAX?

First you should determine how big a hit you're actually taking from
the exceptions in the context of your full application.  This is far
from perfect, but it might give you a rough idea:

1. Take one of your typical-length XML documents.

2. Create a trimmed-down version of the same document, with half of
   the content missing.

3. In your current system, time how long it takes to process the
   first document (say) 10,000 times, terminating halfway through each
   time by throwing an exception from a SAX handler (at the point
   where the trimmed-down document ends).

4. Next, time how long it takes to process the second document 10,000
   times without premature termination.

This test is slightly misleading, because step #4 might include a
little extra overhead at the end of the XML document (checking for
well-formedness), but it should give you a fairly reasonable idea.  My
guess is that the exception overhead easily gets lost in the noise --
XML parsers do a *lot* of work for every event that they report.  1000
times practically nothing is still practically nothing.

All the best,


David Megginson, david@megginson.com, http://www.megginson.com/


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS