xml-dev - RE: [xml-dev] Normalizing XML [was: XML information modeling best pract

RE: [xml-dev] Normalizing XML [was: XML information modeling best pract

[ Lists Home | Date Index | Thread Index ]

To: 'Ronald Bourret' <rpbourret@rpbourret.com>, xml-dev@lists.xml.org
Subject: RE: [xml-dev] Normalizing XML [was: XML information modeling best practices]
From: Jeff Lowery <jlowery@scenicsoft.com>
Date: Thu, 2 May 2002 11:25:53 -0700

> > But all this presupposes that we are designing XML 
> documents for storage and
> > query. Most XML documents are designed for messaging of 
> some kind (between
> > humans or between software components). Within the context 
> of a message,
> > duplication is far less of a problem, for example it 
> doesn't matter if I
> > hold product code, description, and price as part of each 
> order-line in an
> > order. Many XML databases are actually archives of such messages, so
> > duplication of data is a fact of life; and since it's an 
> archive, the update
> > problem doesn't arise.
> 
> This is the conclusion I came to.

With all due respect, I couldn't disagree more. In the case of
roundtripping, there is the classic problem of update anomalies. Since the
recieving application may not know what is duplicate (certainly a schema
won't give a clue), there's the risk that some duplicates will be changed by
the receiving application but not others. That means that there has to be
some logic embedded somewhere that (probably in the originating applicaiton)
that performs a validation check that all duplicate values have been
modified; otherwise, you have to pick and choose which values to ignore.
That's not easy, because one of the values may have gotten changed back to
it's original state: was that intended as the 'final' value? Or was it just
a sloppy partial update of other duplicates?

I think if you exchanging data documents, especially between third-party
applications, duplicates should be avoided. You can't denote them in a
schema (unless you key/keyref them all), so the result is that you have an
implicit (or narrative) understanding of what is duplicate. Bad, bad, bad.

For views, though, I agree: normalization isn't necessary.

Prev by Date: RE: [xml-dev] SOAP and the Web
Next by Date: Re: [xml-dev] SOAP and the Web
Previous by thread: XML Conference & Exposition 2002; Call for Presentations, Tutorials, Exhibits, and Sponsors
Next by thread: Out of topic or out of interest?
Index(es):
- Date
- Thread