OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] JSR 206 and SAX

[ Lists Home | Date Index | Thread Index ]

At 2:55 PM -0500 2/19/04, Norman Walsh wrote:

>- Michael's message raised an issue that we had, as far as I can tell,
>   simply failed to consider: Unicode normalization checking. The XML
>   1.1 spec says "XML processors SHOULD provide a user option to verify
>   that the document being processed is in fully normalized form". We
>   propose that a feature be added to the XMLReaderFactory, perhaps
>   http://xml.org/sax/features/unicode-normalization-checking. If this
>   feature is enabled, the factory returns readers that perform Unicode
>   normalization checking. The EG felt that this feature was most
>   appropriate at the factory level.

Seems reasonable. This feature should be false by default, a true 
value should be optional, and if a problem is encountered the error() 
message in the ErrorHandler should be invoked.

I could be wrong about that last point if anyone wants to argue this 
should be a fatal error or a warning instead. However, please do 
let's specify which of those three methods is called. I've had a 
devil of a time in the past with problems one parser considers fatal, 
the next considers only an error, and the third relegates to warning.

Also, we might want to standardize either en exception subtype or an 
exception message for normalization issues. I think we decided last 
year it was too late for most preexisting error messages, but we 
should be able to get out in front of this one.

   Elliotte Rusty Harold
   Effective XML (Addison-Wesley, 2003)


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS