OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: Why the Infoset?

[ Lists Home | Date Index | Thread Index ]
  • From: John Cowan <jcowan@reutershealth.com>
  • To: Jonathan Borden <jborden@mediaone.net>, "xml-dev@xml.org" <xml-dev@xml.org>
  • Date: Thu, 03 Aug 2000 15:47:20 -0400

Jonathan Borden wrote:

> What we mean by "high fidelity" is simply this: the *parsed* model,  what is
> also called a grove, has the capability of saving the document *exactly*,
> character for character, from what it was parsed.

Okay, this is the one that cares about the 74 occurrences of S, which Simon
insists on calling a red herring?

> But it is not hard to describe, that is my point. The XML 1.0 production
> rules create a parse tree which exactly describes the source document, down
> to the byte. This is what I would call the XML property set, and from this
> one can subset to one's desire.

Okay, I understand now.  Actually the production-rule level is *not* the
bottom parsing level, because for the most part PE-references have already
been removed.  Internal PEs are not really structural in XML;
it is a mere validity constraint that requires DTD constructs to begin
and end in the same PE, and

	<!ENTITY % element "<!ELEMENT">
	%element; FOO EMPTY>

is a well-formed though not valid external subset.

> Doesn't it make the most sense to subset from the full description, rather
> than both add to and subtract from a partial description?

IMHO no.  The correct level to aim at, when doing the job for the first
time for the benefit of the SGML-naive, is the middle useful level, what
you call "structure sensitive".  The question then remains, just what is
useful structure and what is not?  I have done my best to answer that


Schlingt dreifach einen Kreis um dies! || John Cowan <jcowan@reutershealth.com>
Schliesst euer Aug vor heiliger Schau,  || http://www.reutershealth.com
Denn er genoss vom Honig-Tau,           || http://www.ccil.org/~cowan
Und trank die Milch vom Paradies.            -- Coleridge (tr. Politzer)


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS