Re: [xml-dev] Validation vs performance - was Re: [xml-dev] Fast text output from SAX?

To: David Megginson <dmeggin@attglobal.net>
Subject: Re: [xml-dev] Validation vs performance - was Re: [xml-dev] Fast text output from SAX?
From: "Stephen D. Williams" <sdw@lig.net>
Date: Mon, 19 Apr 2004 22:45:11 -0400
Cc: XML Developers List <xml-dev@lists.xml.org>
In-reply-to: <40846081.8090708@attglobal.net>
References: <15725CF6AFE2F34DB8A5B4770B7334EE03F9F659@hq1.pcmail.ingr.com> <5F1BB722-920D-11D8-A3E3-000A95CCC59E@xegesis.org> <4083E7A0.90807@attglobal.net> <40844422.5010801@lig.net> <40846081.8090708@attglobal.net>
User-agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.6a) Gecko/20031030

David Megginson wrote:

> Stephen D. Williams wrote:
>
>> Processing overhead, including the major components of parsing / 
>> object creation / data copies / serialization, is not a 'future 
>> problem'.  It has always been a problem.
>
> We don't know how much and what kind of a problem XML will be until we've
> had time to gain experience -- if we try to optimize too early, we'll end up
> optimizing the wrong thing.

I suppose "early" and "time to gain experience" are relative.

> For example, I set up a test for a customer a while back to see how fast
> Expat could parse documents.  On my 900 MHz Dell notebook, with 256MB RAM
> and Gnome, Mozilla, and XEmacs competing for memory and CPU, Expat could
> parse about 3,000 1K XML documents per second (if memory does not fail me).
> If I had tried to, say, build DOM trees from that, I expect that the number
> would have fallen into the double digits (in C++) or worse.  In this case,
> obviously, there would be far more to be gained from optimizing the code on
> the other side of the parser (say, by implementing a reusable object pool or
> lazy tree building) than there would be from replacing XML with something
> that parsed faster.

Why assume that "optimizing the code on the other side of the parser" is
the first or only step?  I posit that this is not the best way to proceed
and that it artificially narrows the possible solutions.  The steps needed
to parse XML, such as processing Expat events, impose a minimum amount of
work.  Once the data has been parsed, it must be put into a usable form,
and data in a usable form must be serialized again at some point.  The
format itself, and the gap between it and in-memory representations, sets
a lower bound on the work required.  Other data formats have lower bounds.
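
A rough way to see the two positions side by side is to time the same
small record three ways: an event-only Expat parse, a full DOM build, and
a fixed-layout binary decode.  The sketch below assumes Python's standard
library (xml.parsers.expat, xml.dom.minidom, struct); the record layout,
the element names and the helper names are all hypothetical, and the
numbers it prints depend entirely on the machine, so none are quoted here.

```python
import struct
import time
import xml.dom.minidom
import xml.parsers.expat

# Hypothetical record: 90 four-digit integers, roughly 1 KB as XML.
VALUES = list(range(1000, 1090))
XML_DOC = (
    "<readings>" + "".join(f"<r>{v}</r>" for v in VALUES) + "</readings>"
).encode("utf-8")
# The same data as little-endian 32-bit integers (360 bytes).
BINARY_DOC = struct.pack(f"<{len(VALUES)}i", *VALUES)


def parse_events(doc):
    """Event-only parse: collect the integers without building a tree."""
    out = []
    buf = []

    def start(name, attrs):
        buf.clear()

    def chars(data):
        buf.append(data)

    def end(name):
        if name == "r":
            out.append(int("".join(buf)))

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start
    parser.CharacterDataHandler = chars
    parser.EndElementHandler = end
    parser.Parse(doc, True)
    return out


def parse_dom(doc):
    """Build a full DOM tree, then walk it to extract the integers."""
    dom = xml.dom.minidom.parseString(doc)
    return [int(node.firstChild.data) for node in dom.getElementsByTagName("r")]


def parse_binary(doc):
    """Decode the fixed-layout binary form directly into integers."""
    return list(struct.unpack(f"<{len(doc) // 4}i", doc))


def docs_per_second(fn, doc, repeat=2000):
    """Crude throughput estimate; results vary by machine and load."""
    begin = time.perf_counter()
    for _ in range(repeat):
        fn(doc)
    return repeat / (time.perf_counter() - begin)


if __name__ == "__main__":
    for label, fn, doc in [
        ("expat events", parse_events, XML_DOC),
        ("minidom tree", parse_dom, XML_DOC),
        ("binary struct", parse_binary, BINARY_DOC),
    ]:
        print(f"{label:14s} {docs_per_second(fn, doc):10.0f} docs/sec")
```

All three functions return the same list, so the comparison isolates
the cost of getting from bytes to usable values.  The gap between the
first two rows speaks to the tree-building cost; the gap between the first
and third speaks to the floor set by the format.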

> ...
>
>> The scarce resource is time.  Anything that eats time is bad.  This could
>> be bandwidth usage, CPU, memory, or suboptimal communication and semantic
>> models.
>
> I have some experience with high-volume, high-speed systems as well.  They
> tend to be so finely hand-tuned that they couldn't use *any* off-the-shelf
> format or protocol, much less XML or SOAP -- even HTTP (or in some cases,
> TCP) is out of the question.  These are the kinds of people who will use
> deltas to avoid wasting four bytes on every number.

Of course ;-).
I'm just trying to spread the efficiency to something standard.
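
For readers unfamiliar with the delta trick mentioned above, here is a
minimal sketch, assuming a non-empty series of integers whose successive
differences fit in one signed byte.  The function names and the wire
layout (one 32-bit first value followed by 8-bit deltas) are hypothetical
and chosen only for illustration; real hand-tuned systems use whatever
layout suits their data.

```python
import struct


def delta_encode(values):
    """Pack the first value as int32, then each difference as int8."""
    deltas = [b - a for a, b in zip(values, values[1:])]
    if any(not -128 <= d <= 127 for d in deltas):
        raise ValueError("delta does not fit in one signed byte")
    return struct.pack(f"<i{len(deltas)}b", values[0], *deltas)


def delta_decode(blob):
    """Reverse delta_encode by accumulating the differences."""
    count = len(blob) - 4  # bytes remaining after the 4-byte first value
    first, *deltas = struct.unpack(f"<i{count}b", blob)
    out = [first]
    for d in deltas:
        out.append(out[-1] + d)
    return out


if __name__ == "__main__":
    series = [100000, 100003, 100001, 100010]
    blob = delta_encode(series)
    # 7 bytes here, against 16 bytes for four plain 32-bit integers.
    print(len(blob), delta_decode(blob) == series)
```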

> All the best,
>
> David


sdw

-- 
swilliams@hpti.com http://www.hpti.com Per: sdw@lig.net http://sdw.st
Stephen D. Williams 703-724-0118W 703-995-0407Fax 20147-4622 AIM: sdw