OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
RE: [xml-dev] Schema based XML compare

Hi David,


As someone who has been working with large libraries of schemas for many years, this tool would be *wonderful*.  I tried down this path myself but there weren’t enough hours in the night.  “Shouldn’t be too hard” must mean you have smaller ambitions that I had wanted.  I knew it would be hard, or at least time consuming maybe is more accurate.

A data model diff would be very useful for doing things like evaluating backward compatibility of libraries of schemas as well as provide some educational material to implementers who need to see how the model has changed since the last release. 


If you find this animal, please do let me know.  If you want to talk offline, I’d be happy to compare notes.


-- Paul



W. Paul Kiel


Consulting Content Engineer

LexisNexis, Global Content Architecture





From: David Lee [mailto:dlee@calldei.com]
Sent: Thursday, December 23, 2010 7:29 AM
To: xml-dev@lists.xml.org
Subject: [xml-dev] Schema based XML compare


I've run into an age-old issue but I don’t see any off-the-shelf solutions for.


Suppose I have 2 XML documents I want to compare (not diff, just give me yes/no are they equivalent).

This is pretty simple to do even with things like ignoring whitespace options etc.  Many tools out there, including one I wrote

( http://www.xmlsh.org/CommandXcmp)


Now here's the twist …


Suppose I want to compare for XSD  data model equivalence, not XDM  equivalence ?








Without type annotation these are different.
But if I declare the type for number to  be xs:double

they should compare equal.


Thus a compare tool should be able to be given a schema and do a comparison and report that these 2 documents are equivalent at the XSD data model level.


Has anyone seen anything like this ?

Would anyone have a use for it ? (I may end up writing it for my own uses).


Not sure how far one can take this before entering murky waters …

Even in the numeric cases there are edge cases where comparisons are not well defined (rounding/precision issues on floating point numbers).

Then add in things like date/times …

But suppose I'm willing to avoid the murky edges and just stick to the obvious cases … shouldn’t be too hard right ?
In fact I suspect its so obvious its been done but I can't find one anywhere.










David A. Lee




[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS