XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
RE: [xml-dev] Tradeoffs of XML encoding by enclosing all content in CDATA blocks

Title: Tradeoffs of XML encoding by enclosing all content in CDATA blocks
The best argument is that people who adopt this approach usually fail to check for the presence of "]]>", which isn't allowed in CDATA sections. Once you start checking for that and dealing with it properly, it turns out to be easier to check for & and < and escape them as &_amp; and &_lt; respectively. (Underscores inserted to prevent misformatting).
 
Also, the code for escaping & and < works for both elements and attributes (though attributes also need some attention to look for quotes), whereas the CDATA approach only works for elements.
 
Michael Kay
http://www.saxonica.com/


From: Karr, David [mailto:david.karr@wamu.net]
Sent: 29 September 2008 16:03
To: xml-dev@lists.xml.org
Subject: [xml-dev] Tradeoffs of XML encoding by enclosing all content in CDATA blocks

I pointed out to a client that they're seeing failures parsing XML because some of the element content that they're producing contains characters illegal in XML content, like "&" (unencoded).  They acknowledged that should be fixed, but they also said they could instead enclose all content with CDATA blocks.  That seems bizarre to me, but I'm not sure I can immediately come up with all the cogent arguments against that.  Can someone summarize specifically why you should NOT do that?



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS