xml-dev - Re: [xml-dev] SAX and ignorableWhitespace

To: "Jeff Rafter" <lists@jeffrafter.com>
Subject: Re: [xml-dev] SAX and ignorableWhitespace
From: Elliotte Rusty Harold <elharo@metalab.unc.edu>
Date: Mon, 5 Jan 2004 21:35:23 -0500
Cc: <xml-dev@lists.xml.org>
In-reply-to: <021501c3d3f8$0b5e5eb0$6403a8c0@ARIMATHEA>
References: <r02000200-1028-3B006A8A3F3311D8B1470003937A08C2@[192.168.124.11]><Pine.GSO.4.58.0401052012180.18798@ic-unix.ic.utoronto.ca><021501c3d3f8$0b5e5eb0$6403a8c0@ARIMATHEA>

At 5:54 PM -0800 1/5/04, Jeff Rafter wrote:
>This is one of those questions that are more for curiosity than anything,
>but does anyone have any information on why ignorableWhitespace was included
>in ContentHandler as opposed to LexicalHandler? Based on my understanding of
>the guidelines used in determining what belongs in the default interfaces
>and what belongs in the extension interfaces it seems to fall under the
>latter. It is non-imperative lexical information associated with the parse.
>Comments?

That is incorrect. XML parsers must report all content, ignorable or 
otherwise. It is not optional to report this content, unlike, for 
example, CDATA section boundaries. The word "ignorable" is an 
unfortunate choice here. It means the application receiving the data 
may choose to ignore it. However, the parser cannot ignore this 
content. It must provide it.

It's also the case that a lot of white space many people think is
ignorable really isn't. White space is only really ignorable if
there's a DTD that declares element-only content for the enclosing
element, and even then you may choose not to ignore it. I prefer the
less loaded term "boundary white space", which identifies all
white-space-only text nodes, not just those that are ignorable.
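
To make this concrete, here is a minimal sketch in Java. It parses the
same small document twice, once with an internal DTD subset that
declares element-only content for the root and once with no DTD, and
counts how much white space arrives through each ContentHandler
callback. The class name WhitespaceDemo, the helper count(), and the
sample document are made up for illustration. Which callback a given
parser uses when a DTD is present can vary, so treat this as an
experiment to run rather than a statement of what every parser does.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class; the names here are illustrative only.
class WhitespaceDemo {
    // The same instance with and without a DTD. The white space around
    // <child> is "boundary white space" in both cases.
    static final String BODY = "<root>\n  <child>x</child>\n</root>";
    static final String WITH_DTD =
        "<!DOCTYPE root [<!ELEMENT root (child)><!ELEMENT child (#PCDATA)>]>" + BODY;

    // Returns { white space chars seen in characters(),
    //           chars seen in ignorableWhitespace() }.
    static int[] count(String xml) throws Exception {
        final int[] counts = new int[2];
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                for (int i = start; i < start + length; i++) {
                    if (Character.isWhitespace(ch[i])) {
                        counts[0]++;
                    }
                }
            }

            @Override
            public void ignorableWhitespace(char[] ch, int start, int length) {
                counts[1] += length;
            }
        };
        // Default JAXP parser, non-validating unless configured otherwise.
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(xml)));
        return counts;
    }

    public static void main(String[] args) throws Exception {
        int[] noDtd = count(BODY);
        int[] withDtd = count(WITH_DTD);
        System.out.println("no DTD:   characters=" + noDtd[0] + " ignorable=" + noDtd[1]);
        System.out.println("with DTD: characters=" + withDtd[0] + " ignorable=" + withDtd[1]);
    }
}
```

Without a DTD the ignorable count is always 0, since the parser has no
way to know which white space sits in element content. With the DTD,
a parser may move those four characters to ignorableWhitespace()
instead. Either way the total stays at 4: the white space is always
delivered, and only the application decides whether to drop it.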
-- 

   Elliotte Rusty Harold
   elharo@metalab.unc.edu
   Effective XML (Addison-Wesley, 2003)
   http://www.cafeconleche.org/books/effectivexml
   http://www.amazon.com/exec/obidos/ISBN%3D0321150406/ref%3Dnosim/cafeaulaitA
