[
Lists Home |
Date Index |
Thread Index
]
Right. As the precision of a semantic definition increases,
its frequency decreases. A box is a box is a box is a box.
A part number is not. Also related to Boltzman entropy.
This relates directly to Roger's thread and population
and evolution. The billion dollar question is: so but
so what? Do paragraph tags evolve? Can they? What
about semantic tags?
A <p> can become a member of almost any language.
A <partno> can't. What does that say about a
namespace partitioning of <p>? Is it gratuitous?
Were the arch forms guys right?
Did the <p> evolve/speciate into the <div>?
Thanks Sean. That's useful. If you want to
have more fun, plot it against users and
semantic implementation (the three dimensional
view). The results should be totally unsurprising.
len
From: Sean McGrath [mailto:sean.mcgrath@propylon.com]
Document-centric XML:
XML in which corpora conforming to schema X, exhibit power law
distributions of the element types in X.
Data-centric XML:
XML in which corpora conforming to schema X, exhibit uniform
distributions
of the element types in X.
Not perfect but useful nonetheless I think. Mixed content is missing for a
start.
Anyway, please take a look at the graphs at:
http://seanmcgrath.blogspot.com/2004_05_23_seanmcgrath_archive.html#10857620
2776583412
I'd be very interested in seeing other peoples graphs of the tag-share of
their XML corpora.
|