Re: [xml-dev] Interoperability [long]

Eddie Robertsson wrote:

> <snip/>
>>I guess the modern schema datatypes kind of allow you
>>to do this via the regexp tools?
> Yes, If I understand the datatype spec correctly you can do either:
> <xs:simpleType name="ISO8859_1">
>   <xs:restriction base="xs:string">
>     <!-- one character class, so characters from both blocks can be mixed in one value -->
>     <xs:pattern value="[\p{IsBasicLatin}\p{IsLatin-1Supplement}]+"/>
>   </xs:restriction>
> </xs:simpleType>

This is, IMO, the cleanest way to restrict the content of a document to
use only ISO Latin-1 (ISO-8859-1) characters.

Note, though, that the constraint applies to what W3C XML Schema calls
the "lexical space", i.e. the character strings as they are after
parsing. You can therefore use any encoding in your instance document
to express these ISO-8859-1 characters, as long as your parser supports
that encoding (this is not a criticism, just a way to make sure
everyone understands what is restricted).
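
To make the point about encodings concrete, here is a rough sketch in Python. It is not a schema validator: the regular expression is only a stand-in for the pattern facet (code points U+0000 to U+00FF), and the helper names `parsed_text` and `is_latin1_only` as well as the sample documents are made up for this illustration. The same characters are serialized in three different ways, and the check is applied to the string obtained after parsing.

```python
import re
import xml.etree.ElementTree as ET

# Stand-in for the schema pattern (an assumption, not the XSD regex itself):
# Basic Latin plus Latin-1 Supplement, i.e. code points U+0000..U+00FF.
LATIN1_ONLY = re.compile(r"[\x00-\xFF]+")


def parsed_text(document: bytes) -> str:
    """Parse an XML document given as bytes and return the root element's text."""
    return ET.fromstring(document).text or ""


def is_latin1_only(value: str) -> bool:
    """True if the whole parsed value consists of characters in U+0000..U+00FF."""
    return LATIN1_ONLY.fullmatch(value) is not None


body = "<name>caf\u00e9</name>"

# The same characters, written with different encodings on disk.
as_utf8 = ('<?xml version="1.0" encoding="UTF-8"?>' + body).encode("utf-8")
as_latin1 = ('<?xml version="1.0" encoding="ISO-8859-1"?>' + body).encode("iso-8859-1")
as_char_ref = b'<?xml version="1.0" encoding="US-ASCII"?><name>caf&#233;</name>'

# A value containing a character outside ISO-8859-1 (the euro sign, U+20AC).
with_euro = '<?xml version="1.0" encoding="UTF-8"?><name>5 \u20ac</name>'.encode("utf-8")

for doc in (as_utf8, as_latin1, as_char_ref, with_euro):
    text = parsed_text(doc)
    print(repr(text), is_latin1_only(text))
```

The first three documents differ byte for byte but give the same string after parsing, so they are accepted or rejected together; only the fourth contains a character outside the allowed range.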

(Also, in some cases you may want to use "*" instead of "+" to allow
empty strings.)
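
A small sketch of the difference between the two quantifiers, again in Python and again with a character range standing in for the schema pattern (an assumption for illustration; the names `NON_EMPTY`, `MAY_BE_EMPTY` and `accepts` are hypothetical). A schema pattern has to match the whole value, so `fullmatch` is the closest analogue.

```python
import re

# Character range U+0000..U+00FF used as a stand-in for the schema pattern.
NON_EMPTY = re.compile(r"[\x00-\xFF]+")     # "+" requires at least one character
MAY_BE_EMPTY = re.compile(r"[\x00-\xFF]*")  # "*" also accepts the empty string


def accepts(pattern: re.Pattern, value: str) -> bool:
    """True if the pattern matches the whole value, as a schema pattern must."""
    return pattern.fullmatch(value) is not None


for value in ("", "abc"):
    print(repr(value), accepts(NON_EMPTY, value), accepts(MAY_BE_EMPTY, value))
```

With "+", an empty element or attribute value is rejected; with "*", it is accepted.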

My 0,02 Euros,

See you in Paris for the Forum XML.
Eric van der Vlist       http://xmlfr.org            http://dyomedea.com
http://xsltunit.org      http://4xt.org           http://examplotron.org