xml-dev - XML's Scylla and Charybdis

XML's Scylla and Charybdis - parse and regexp


To: xml-dev@lists.xml.org
Subject: XML's Scylla and Charybdis - parse and regexp
From: Sean McGrath <sean.mcgrath@propylon.com>
Date: Tue, 01 Apr 2003 09:48:59 +0100

Here is the conundrum that is at the heart of the border guard thread:

Option 1: Using regexps to process my XML
1.1 I will not be able to say for sure that it works for all WF inputs - 
false negatives/positives possible
1.2 I will be able to say that my processing will leave things like entity 
references, whitespace, attribute delimiters etc. unharmed.

Option 2: Using a parser to process my XML
2.1 I will be able to say for sure that it works for all WF inputs - no 
false positives/negatives possible
2.2 I will not be able to say that my processing will leave things like 
entity references, whitespace, attribute delimiters etc. unharmed.

Correctness or input fidelity - pick one - you cannot have both.
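
To make the trade-off concrete, here is a rough sketch in Python. The 
sample document, the function names with_regexp and with_parser, and the 
choice of the standard library's ElementTree are all my own assumptions 
for illustration, not anything from the border guard thread. It uses a 
numeric character reference as a stand-in for the entity reference case. 
The task is to change the price from 1 to 2.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical input: a comment that looks like markup, single-quoted
# attribute delimiters, doubled whitespace inside a tag, and a numeric
# character reference.
SRC = (
    "<doc><!-- <price>1</price> -->"
    "<price  cur='EUR'>1</price>"
    "<note>caf&#233;</note></doc>"
)

def with_regexp(text):
    # Option 1: a purely textual edit. Everything outside the match is
    # copied through untouched, but the pattern cannot tell a comment
    # from real markup, so it also rewrites the commented-out element.
    return re.sub(r"(<price\b[^>]*>)1(</price>)", r"\g<1>2\g<2>", text)

def with_parser(text):
    # Option 2: edit the parsed tree. Only real price elements are
    # touched, but the output is a re-serialization of the tree, not
    # the original characters.
    root = ET.fromstring(text)
    for el in root.iter("price"):
        if el.text == "1":
            el.text = "2"
    return ET.tostring(root, encoding="unicode")

if __name__ == "__main__":
    print(with_regexp(SRC))
    print(with_parser(SRC))
```

With this input the regexp version keeps the single quotes, the doubled 
space and the character reference, but it also edits the price inside 
the comment. The parser version edits only the real element, but with 
ElementTree's defaults the comment is dropped, the attribute comes back 
in double quotes with a single space, and the character reference comes 
back as a literal character. Other parsers and serializers will lose 
different things.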

This is at the core of why I've always argued that we *do* need a data 
model for XML and we *do* need something like common XML, because I want 
my processing to be both correct *and* non-lossy (high input fidelity).

Is that too much to ask?

Am I the only one who wants both?

Sean



http://seanmcgrath.blogspot.com

Follow-Ups:
- Re: [xml-dev] XML's Scylla and Charybdis - parse and regexp
  - From: "Simon St.Laurent" <simonstl@simonstl.com>
- Re: [xml-dev] XML's Scylla and Charybdis - parse and regexp
  - From: Mike Champion <mc@xegesis.org>
