xml-dev - RE: [xml-dev] Fast text output from SAX?

RE: [xml-dev] Fast text output from SAX?

[ Lists Home | Date Index | Thread Index ]

To: "Michael Kay" <michael.h.kay@ntlworld.com>
Subject: RE: [xml-dev] Fast text output from SAX?
From: Elliotte Rusty Harold <elharo@metalab.unc.edu>
Date: Thu, 15 Apr 2004 12:54:30 -0400
Cc: "'XML DEV'" <xml-dev@lists.xml.org>
In-reply-to: <20040415154759.QNWQ1202.mta04-svc.ntlworld.com@Turtle>
References: <20040415154759.QNWQ1202.mta04-svc.ntlworld.com@Turtle>

At 4:48 PM +0100 4/15/04, Michael Kay wrote:

>I personally don't think that XML parsing is that time-critical to most
>applications, but I can imagine that there are applications where avoiding
>the cost of repeatedly checking (for example) that every name is made up of
>valid characters would be beneficial.

A lot of parser use naive algorithms for this task that are far from 
the most efficient possible. For one thing, the cost of checking a 
single character can be made O(1) with a very low coefficient, at the 
cost of about 32K of memory. Furthermore, you can cache previously 
verified names and not recheck them when they're encountered 1000s of 
times apiece.

I'm not convinced even the best parsers have hit fundamental limits 
yet. It's way too early in the game to say we a binary format for 
speed purposes. Time would more productively be spent by exploring 
better algorithms for standard parsers.
-- 

   Elliotte Rusty Harold
   elharo@metalab.unc.edu
   Effective XML (Addison-Wesley, 2003)
   http://www.cafeconleche.org/books/effectivexml
   http://www.amazon.com/exec/obidos/ISBN%3D0321150406/ref%3Dnosim/cafeaulaitA

References:
- RE: [xml-dev] Fast text output from SAX?
  - From: "Michael Kay" <michael.h.kay@ntlworld.com>

Prev by Date: RE: [xml-dev] Fast text output from SAX?
Next by Date: Re: [xml-dev] RE: Why a "general" solution?
Previous by thread: RE: [xml-dev] Fast text output from SAX?
Next by thread: Re: [xml-dev] Fast text output from SAX?
Index(es):
- Date
- Thread