OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Re: XML and embedded compound data types (was "Why XML data typin is har

[ Lists Home | Date Index | Thread Index ]
  • From: John Cowan <cowan@locke.ccil.org>
  • To: XML Dev <xml-dev@ic.ac.uk>
  • Date: Mon, 30 Nov 1998 13:18:14 -0500

Gabe Beged-Dov wrote:

> I have been reading the latest postings on XML data typing and want to throw
> out a different viewpoint. several caveats are that I haven't perused all of
> the archives to see if I'm tramping down well-worn paths.

Not well-worn, but not utterly novel either.

> The perspective is the possibly naive one that XML can/should be used to
> capture ALL of the structural aspects of a complex tree of data. Wherever a
> compound type occurs in the source data, it should be mapped to a
> corresponding compound type in XML.  A key heuristic to see if this approach
> is being adhered to is to see if any attribute  values or character data
> requires additional parsing by the application in order to deserialize
> additional structure.

A compromise which I proposed --- to resounding denunciation from all
(then existing) quarters --- was a facility overlaid on XML which would
remap specific local syntax characters, when found within the character
content of specified elements, to empty element tags.  Thus, given the
following XML:

	<?map map1 ":" colon?>
	<?map map1 "T" time?>
	<?map map1 "-" dash?>
	<?map map1 "." dot?>
	<?apply-map map1 date?>
	...
	<date>1998-04-05T15:22:23.24</date>

the application would receive the SAX or DOM equivalent of:

	<date>1998<dash/>04<dash/>05<time/>
		15<colon/>22<colon/>23<dot/>24</date>

This is level 1 of my "Local Markup" proposal; level 2 provides
for nested as well as empty tags generated by characters,
by allowing for prioritization of local markup characters.
Only single characters are supported, but since there are
65000 of them, that should be sufficient.

SGML had something like this, but much more powerful/flexible/
hard to parse.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)





 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS