OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: SAX2 RFD: LexicalHandler draft v.1.1

[ Lists Home | Date Index | Thread Index ]
  • From: David Megginson <david@megginson.com>
  • To: Lars Marius Garshol <larsga@ifi.uio.no>
  • Date: Thu, 25 Mar 1999 07:01:28 -0500 (EST)

Lars Marius Garshol writes:

 >   http://xml.org/sax/features/normalize-text
 >     Ensure that all consecutive text is returned in a single callback to
 >     DocumentHandler.characters or DocumentHandler.ignorableWhitespace
 >     (true) or explicitly do not require it (false).
 > This is potentially problematic, since it's unspecified what the
 > parser should do about CDATA sections in this case. (I suspect we will
 > see more problems of this kind when we start using really using and
 > stacking filters.) Should they be normalized, or should they be
 > reported separately? (Ie: what is consecutive text, exactly?) The same
 > problem appears with entity boundaries and character references.

Thanks, Lars -- this is an excellent point.  I think that the
specification belongs, not with the normalize-text feature, but with
the LexicalHandler (since people may define other types of handlers
that we cannot predict).

 > Possible solutions:
 >  - reject normalize-text true if a LexicalHandler has been registered,
 >  and reject LexicalHandler registration if normalize-text has been set
 >  to true
 >  - make normalize-text have a logical interpretation by default, and
 >  switch to lexical if a LexicalHandler has been registered
 >  - make normalize-text always have a lexical interpretation
 >  - have separate normalize-text-logical and normalize-text-lexical
 >  events, with reject-behaviour for the first

The DOM's text-normalisation feature does *not* normalise CDATA
sections, but I think that SAX's should.

All the best,


David Megginson                 david@megginson.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS