- To: "Ramin Firoozye" <ramin@wizen.com>,<xml-dev@lists.xml.org>
- Subject: RE: [xml-dev] CDATA strangeness
- From: "Dare Obasanjo" <dareo@microsoft.com>
- Date: Tue, 6 Aug 2002 09:50:47 -0700
- Thread-index: AcI9aPKN5r5/lJPnSYKs/xvsMDa9DwAAFA8P
- Thread-topic: [xml-dev] CDATA strangeness
Instead of tweaking Tidy or Xerces-C, why not just perform a simple search and replace, by hand or programmatically (*cough* Perl *cough*)?
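A rough sketch of what such a programmatic search and replace might look like, written in Python rather than Perl. The helper name `strip_downlevel_revealed` and both regular expressions are assumptions made up for illustration, not anything from Tidy or Xerces-C. It drops only the bare `<![if ...]>` and `<![endif]>` markers that sit outside a comment, keeps the markup between them, and leaves ordinary `<!--[if IE]>...<![endif]-->` comments untouched. It does not try to handle comment-like text inside script bodies.

```python
import re

# Hypothetical patterns, for illustration only.
# Bare conditional markers such as "<![if !IE]>" and "<![endif]>".
_MARKER = re.compile(r"<!\[(?:if\b[^\]]*|endif)\]>", re.IGNORECASE)
# Ordinary comments, which are copied through unchanged.
_COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)


def strip_downlevel_revealed(html: str) -> str:
    """Remove bare <![if ...]> / <![endif]> markers found outside comments."""
    out = []
    pos = 0
    for m in _COMMENT.finditer(html):
        # Clean the text before this comment, then keep the comment as-is.
        out.append(_MARKER.sub("", html[pos:m.start()]))
        out.append(m.group(0))
        pos = m.end()
    # Clean whatever follows the last comment.
    out.append(_MARKER.sub("", html[pos:]))
    return "".join(out)


snippet = (
    "<!--[if IE]><script language=javascript>ie5=1;</script><![endif]-->\n"
    "<![if !IE]><script language=javascript>ie5=0;</script><![endif]>\n"
)
cleaned = strip_downlevel_revealed(snippet)
print(cleaned)
```

Running the cleaned text through Tidy afterwards would presumably still be needed, since things like the unquoted `language=javascript` attribute are not well-formed XML either.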
-----Original Message-----
From: Ramin Firoozye [mailto:ramin@wizen.com]
Sent: Tue 8/6/2002 9:44 AM
To: xml-dev@lists.xml.org
Cc:
Subject: RE: [xml-dev] CDATA strangeness
This comes from an HTML-only site. They don't claim it's XML, but we're
trying to load it in as an XML tree. The problem is when you pass the
snippet through Tidy and Xerces-C, it barfs royally, saying there's a
missing CDATA where it sees the '<![if' string.
Very strange. I thought I'd pose it out here to see if anyone else had seen
similar things, or if I had misread the spec. So far, I've heard 3-4
conflicting opinions on whether this is legal XML or not.
S'ok. Looks like we're going to tweak tidy and/or Xalan-C. We can't really
have breakage on stuff like this and asking the site to change is like
asking an elephant to quit hogging the dance floor.
Others using the Tidy/Xalan-C combo might want to keep an eye out for it.
Best,
Ramin
>
> At 2:45 PM -0700 8/5/02, Ramin Firoozye wrote:
>
> >There's a fairly well-known web-site that returns:
> >
> ><!--[if IE]><script language=javascript>ie5=1;</script><![endif]-->
> ><![if !IE]><script language=javascript>ie5=0;</script><![endif]>
> >...
>
> This is not well-formed XML, not even close. The problem is the <!
> outside the comment. If the site claims this is XML, then the site is
> wrong. However, possibly the site is only serving HTML, not XML or
> XHTML.
> --
>
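The point about the `<!` outside the comment can be checked with any conforming XML parser. In XML content, `<![` can only begin a CDATA section (`<![CDATA[`), which would explain why the parser complains about a missing CDATA. Below is a minimal sketch using Python's standard-library parser; the `<root>` wrapper, the `<b/>` placeholder element, and the `is_well_formed` helper are assumptions for illustration, used so that only the conditional markers are being tested.

```python
import xml.etree.ElementTree as ET


def is_well_formed(fragment: str) -> bool:
    """Wrap the fragment in a hypothetical <root> element and try to parse it."""
    try:
        ET.fromstring(f"<root>{fragment}</root>")
        return True
    except ET.ParseError:
        return False


# The markers are inside a comment, so the parser skips them.
inside_comment = "<!--[if IE]><b/><![endif]-->"
# The markers are bare markup, and "<![" is not followed by "CDATA[".
outside_comment = "<![if !IE]><b/><![endif]>"

print(is_well_formed(inside_comment))
print(is_well_formed(outside_comment))
```

The first fragment parses and the second is rejected, which matches the diagnosis quoted above.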
-----------------------------------------------------------------
The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
initiative of OASIS <http://www.oasis-open.org>
The list archives are at http://lists.xml.org/archives/xml-dev/
To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl>