XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] Misuse of Schematron

A range restriction is not a conflict IMHO. A derived type would not be said to conflict with its base type merely because it restricted the base.

What you have is a case where you need a two layer schema.

The second layer may be schematron or it may be a derived type whether by restriction or extension, or even just two parallel dtds.

If the first schema had the appropriate @block then that would be an assertion that you didnt want XSD derived types with a subrange. But that of itself would not imply that a second layer using Schematron should not restrict the range. It just says "dont do it in XSD"

Perhap @final might indicate that byt it should be accompanied by explanatory text otherwise the interpretation of the XSD is up for grabs.

Rick

On 13/04/2015 1:19 AM, "Costello, Roger L." <costello@mitre.org> wrote:

Hi Folks,

So you've got this table which shows the data items that are applicable to each type of publication:

 

Title

Name

Author

Date

ISBN

Publisher

Volume

Lead Story

Book

 

 

 

 

 

 

 

 

Magazine

 

 

 

 

 

 

 

 

Newspaper

 

 

 

 

 

 

 

 

 

In your XML Schema you declare a complexType, Publication, containing all the data items listed along the top row. You declare each of them optional. You include a "Type" element to indicate whether the publication is a Book, Magazine, or Newspaper. Here's what you produce:

    <xs:complexType name="Publication">
       
<xs:sequence>
           
<xs:element name="Type">
               
<xs:simpleType>
                   
<xs:restriction base="xs:string">
                       
<xs:enumeration value="Book" />
                       
<xs:enumeration value="Magazine" />
                       
<xs:enumeration value="Newspaper" />
                    
</xs:restriction>
               
</xs:simpleType>
           
</xs:element>
           
<xs:element name="Title" type="xs:string" minOccurs="0" />
           
<xs:element name="Name" type="xs:string" minOccurs="0" />
           
<xs:element name="Author" type="xs:string" minOccurs="0" />
           
<xs:element name="Date" type="xs:date" minOccurs="0" />
           
<xs:element name="ISBN" type="xs:string" minOccurs="0" />
           
<xs:element name="Publisher" type="xs:string" minOccurs="0" />
            
<xs:element name="Volume" type="xs:unsignedByte" minOccurs="0" />
           
<xs:element name="Lead_Story" type="xs:string" minOccurs="0" />
       
</xs:sequence>
   
</xs:complexType>

Hmm, that's not right. If Type=Book then only a subset of the elements are applicable. Ditto for Type=Magazine and Type=Newspaper. To compensate for this incorrectness you create a Schematron schema to specify the actual allowable elements:

    <sch:pattern id="Publication">
       
        
<sch:rule context="Publication[Type='Book']">
           
<sch:assert test="Title and Author and Date and ISBN and Publisher and
                                                empty(* except (Title[1], Author[1], Date[1], ISBN[1], Publisher[1]))"
>
                A Book contains only this data: Title, Author, Date, ISBN, and Publisher
           
</sch:assert>
       
</sch:rule>
       
        
<sch:rule context="Publication[Type='Magazine']">
           
<sch:assert test="Name and Date and Publisher and Volume and
                                                empty(* except (Name[1], Date[1], Publisher[1], Volume[1]))"
>
                A Magazine contains only this data: Name, Date, Publisher, and Volume
           
</sch:assert>
       
</sch:rule>
       
        
<sch:rule context="Publication[Type='Newspaper']">
           
<sch:assert test="Name and Date and Publisher and Lead_Story and
                                                empty(* except (Name[1], Date[1], Publisher[1], Lead_Story[1]))"
>
                A Newspaper contains only this data: Name, Date, Publisher, and Lead_Story
           
</sch:assert>
       
</sch:rule>
       
    
</sch:pattern>

Let's recap: the XML Schema asserts that, for Type=Book, the Title element is optional and the Schematron schema asserts that it's required … the XML Schema asserts that the Lead_Story element is optional and the Schematron schema asserts that it's disallowed.

The XML Schema and the Schematron schema are making conflicting assertions. Surely that is a sign of poor design.

The problem boils down to this: The above table was viewed as specifying business rules: If the type is Book, then there must be Title, Author, Date, and so forth. But that's not correct. The table defines components, not business rules. That is, the table is defining the data requirements for a Book component, a Magazine component, and a Newspaper component.

The table is a definition of components,
not a definition of business rules.

Here's the right way to implement the table. Define an empty Publication complexType:

    <xs:complexType name="Publication">
       
<xs:sequence />
   
</xs:complexType>

Then define three components – Book, Magazine, Newspaper – that extend Publication:

    <xs:complexType name="Book">
       
<xs:complexContent>
           
<xs:extension base="Publication">
               
<xs:sequence>
                   
<xs:element ref="Title" />
                   
<xs:element ref="Author" />
                   
<xs:element ref="Date" />
                   
<xs:element ref="ISBN" />
                   
<xs:element ref="Publisher" />
               
</xs:sequence>
           
</xs:extension>
       
</xs:complexContent>
   
</xs:complexType>
   
    
<xs:complexType name="Magazine">
       
<xs:complexContent>
           
<xs:extension base="Publication">
               
<xs:sequence>
                   
<xs:element ref="Name" />
                   
<xs:element ref="Date" />
                   
<xs:element ref="Publisher" />
                   
<xs:element ref="Volume" />
               
</xs:sequence>
           
</xs:extension>
       
</xs:complexContent>
   
</xs:complexType>
   
    
<xs:complexType name="Newspaper">
       
<xs:complexContent>
           
<xs:extension base="Publication">
               
<xs:sequence>
                   
<xs:element ref="Name" />
                   
<xs:element ref="Date" />
                    
<xs:element ref="Publisher" />
                   
<xs:element ref="Lead_Story" />
               
</xs:sequence>
           
</xs:extension>
       
</xs:complexContent>
   
</xs:complexType>

Each component defines exactly what elements belong to it, so there is no need for Schematron.

Schematron is an awesome tool. But use it for the purpose it was designed for: to express business rules.

Comments welcome.

/Roger



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS