OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Re: [xml-dev] Fast text output from SAX?

[ Lists Home | Date Index | Thread Index ]

At 6:58 AM -0700 4/15/04, Jeff Rafter wrote:

>Agreed, well-formednes must be checked. But surely it doesn't need to be
>checked every time a document is parsed (if it is unchanged). I think there
>is definitely room for a parser that can emit SAX events (or something)
>without bogging down in Wellformedness checks. For example, if a document is
>received from a partner, wellformedness is checked, and then pushed into a
>database text field for later processing, there is no need to recheck
>wellformedness.

How do you know no other process or person has modified the field in 
the mean time?

>  In fact, when it is parsed again later, there is no need to
>do a lot of things-- like check end tags-- if wellformedness is known a
>priori the start tag can be pushed onto a stack, and popped when </ is read.
>Then the "parser" can simply skip the length of the end tag. There are lots
>of savings like this-- duplicate attribute checking for example can be
>costly-- but would be unnecessary if wfness was assumed.

But well-formedness can't be assumed. That's the core of XML.
-- 

   Elliotte Rusty Harold
   elharo@metalab.unc.edu
   Effective XML (Addison-Wesley, 2003)
   http://www.cafeconleche.org/books/effectivexml
   http://www.amazon.com/exec/obidos/ISBN%3D0321150406/ref%3Dnosim/cafeaulaitA




 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS