xml-dev - Re: [xml-dev] is rng compact, compact enough?

Re: [xml-dev] is rng compact, compact enough?

[ Lists Home | Date Index | Thread Index ]

To: Bill de hÓra <bill@dehora.net>
Subject: Re: [xml-dev] is rng compact, compact enough?
From: James Clark <jjc@jclark.com>
Date: Sun, 29 Jun 2003 08:38:50 +0700
Cc: XML Dev <xml-dev@lists.xml.org>
In-reply-to: <3EFD85ED.8030709@dehora.net>
References: <3EFD85ED.8030709@dehora.net>
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2.1) Gecko/20030225

Bill de hÓra wrote:

> Playing about with RNG compact syntax, I've come to the conclusion that 
> it's to harder to read than the XML syntax.

There are ways to write it that are readable and ways to write it that 
are not so readable. The style that's most readable in one syntax isn't 
necessarily most readable in the other syntax. In my experience, real 
world schemas tend to make very heavy use of definitions/references, 
partly for modularity and partly because with RELAX NG, as with DTDs, 
redefinition is the main customization mechanism.  If you look at 
Docbook for example, the number of <ref> elements is much greater than 
the number of <element> elements. For this reason, RELAX NG compact 
syntax was designed to optimize the readability of schemas that use 
definitions/references heavily.  The examples in the compact syntax 
tutorial were automatically translated (by trang) from the examples in 
the XML syntax tutorial and haven't been optimized for readability in 
the compact syntax (this should probably be fixed).

> Consider the + quantifier below:
> 
> element html {
>   element head {
>     element title { text }
>   },
>   element body {
>     element table {
>       attribute class { "addressBook" },
>       element tr {
>         attribute class { "card" },
>         element td {
>           attribute class { "name" },
>           mixed {
>             element span {
>               attribute class { "givenName" },
>               text
>             }?,
>             element span {
>               attribute class { "familyName" },
>               text
>             }?
>           }
>         },
>         element td {
>           attribute class { "email" },
>           text
>         }
>       }+
>     }
>   }
> }
> 
> when you could have this:
> 
> 
> element html {
>   element head {
>     element title { text }
>   },
>   element body {
>     element table {
>       attribute class { "addressBook" },
>       element+ tr {
>         attribute class { "card" },
>         element td {
>           attribute class { "name" },
>           mixed {
>             element? span {
>               attribute class { "givenName" },
>               text
>             },
>             element? span {
>               attribute class { "familyName" },
>               text
>             }
>           }
>         },
>         element td {
>           attribute class { "email" },
>           text
>         }
>       }
>     }
>   }
> }

The quantifier after the keyword is an interesting idea, but it weakens 
the similarity to normal regular expressions.  And the example still 
isn't all that readable. A better solution is to use a style more 
suitable for the compact syntax.  I would probably write it something 
like this:

start = html
html = element html { head, body }
head = element head { title }
title = element title { text }
body = element body { addressBook }
addressBook = element table { addressBook.class, card+ }
card = element tr { card.class, name, email }
name = element td { name.class, mixed { givenName?, familyName? } }
givenName = element span { givenName.class, text }
familyName = element span { familyName.class, text }
email = element td { email.class, text }
addressBook.class = attribute class { "addressBook" }
card.class = attribute class { "card" }
name.class = attribute class { "name" }
givenName.class = attribute class { "givenName" }
familyName.class = attribute class { "familyName" }
email.class = attribute class { "email" }

> Also, attributes don't need to be bracketed; by nature they don't have 
> any structure on the rhs. 

Although an attribute cannot contain attributes or elements, the pattern 
for the value of an attribute can be quite complex:

attribute value {
   list { xsd:int { minInclusive = "0" maxInclusive = "100" }+ }
   | "#none"
}

James

References:
- is rng compact, compact enough?
  - From: Bill de hÓra <bill@dehora.net>

Prev by Date: [ANN] Robin: Remote XUL Desktop Prototype Goes Live
Next by Date: Data Islands populating a table
Previous by thread: is rng compact, compact enough?
Next by thread: [ANN] Robin: Remote XUL Desktop Prototype Goes Live
Index(es):
- Date
- Thread