Re: [xml-dev] Discover data patterns or Create data patterns?

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

From: Rick Jelliffe <rjelliffe@allette.com.au>
To: xml-dev@lists.xml.org
Date: Sun, 21 Sep 2008 00:50:16 +1000

Michael Kay wrote:
> Rather than "uncover patterns", I would say "formulate abstractions". But it
> amounts to the same thing.
>  
> Either way, data analysis is essentially an exercise in grouping objects
> into types, and the people who do it best are those who are best at finding
> useful abstractions. 
I am not sure I agree with the second para (and not sure my disagreement 
isn't quibbling and off-topic)

"Type" as people use the term in software languages, has an idea of 
mutual exclusivity between primitive
types: so that a string is not an integer for example. (No flames, I 
understand that LISP has print forms
for everything and that dynamic languages may have automatic type 
coercion and so on. But even in
LISP the debates about whether to have mixin inheritance is, I think, an 
example of the same problem
with typing.)

But there are many relationships which don't display this. These are the 
messy parts of XML Schemas,
for example: the KEY/KEYREF/ID mechanisms which are outside the type 
derivation system; and
the strangeness of derivation by union, not to mention the impossibility 
(=fragility) of moving from untyped to
typed by derivation (whaddayamean this text is not a String?)  or 
between primitive types (whaddayamean
this Date is not a String?)

So "pattern" is a much better word than type, I think, because it is 
more general:
an element may "have" a type (or be an instance of that type, or 
participate in that typedness) but
the element "is part of" a pattern. That was why I chose that word for 
Schematron, specifically to
say that we weren't doing typing (though in the mooted <property> 
elements for the updated ISO
Schematron there would indeed be slots for type information on parts of 
a pattern.)  XSD allows
you to ask "have" questions but only limited "is part of' questions.

The thing is that people will ask the questions their tools and 
standards support, and they will ask questions
their tools and standards have trained them to ask. And when the tools 
and standards don't allow those
questions, people just avoid asking them. For example, languages like 
SVG, XSLT and RDF really
are not well served by "type" notions, at least as the term means in 
concrete technology (i.e. XSD.)
(Which is not to say that type abstractions are not being improved.)

Cheers
Rick Jelliffe

Follow-Ups:
- RE: [xml-dev] Discover data patterns or Create data patterns?
  - From: "Michael Kay" <mike@saxonica.com>

References:
- Discover data patterns or Create data patterns?
  - From: "Costello, Roger L." <costello@mitre.org>
- RE: [xml-dev] Discover data patterns or Create data patterns?
  - From: "Michael Kay" <mike@saxonica.com>

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]