Re: [xml-dev] Interoperability [long]
Eddie Robertsson wrote:
> <snip/>
>
>
>>I guess the modern schema datatypes kind of allow you
>>to do this via the regexp tools?
>>
>
> Yes, If I understand the datatype spec correctly you can do either:
>
> <xs:simpleType name="ISO8859_1">
>   <xs:restriction base="xs:string">
>     <xs:pattern value="[\p{IsBasicLatin}\p{IsLatin-1Supplement}]+"/>
>   </xs:restriction>
> </xs:simpleType>

This is, IMO, the cleanest way to restrict the content of a document to
ISO Latin-1 characters only.

Note, though, that the constraint applies to what W3C XML Schema calls
the "lexical space", i.e. the character strings as they stand after
parsing. The instance document may therefore use any encoding supported
by your parser to express these ISO-8859-1 characters (this is not a
criticism, just a way to make sure everyone understands what is
restricted).

(Also, in some cases you may want to use "*" instead of "+" to allow
empty strings.)
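
To make the "lexical space" point concrete, here is a minimal sketch in
Python rather than in a schema processor. It assumes that a plain code
point range, U+0000 to U+00FF, is an acceptable stand-in for the two
Unicode blocks, since Python's re module has no block escapes. The name
is_latin1 and the allow_empty flag are hypothetical, chosen only for
this illustration.

```python
import re

# Assumption: U+0000-U+00FF stands in for Basic Latin plus Latin-1
# Supplement, because Python's re module has no Unicode block escapes.
LATIN1_NONEMPTY = re.compile(r"[\u0000-\u00FF]+")  # mirrors "+"
LATIN1_OR_EMPTY = re.compile(r"[\u0000-\u00FF]*")  # mirrors "*"


def is_latin1(text: str, allow_empty: bool = False) -> bool:
    """Check decoded characters only; the byte encoding is never seen here."""
    pattern = LATIN1_OR_EMPTY if allow_empty else LATIN1_NONEMPTY
    return pattern.fullmatch(text) is not None


# The same four characters ("caf" + e-acute) in two different encodings.
as_utf8 = "caf\u00e9".encode("utf-8")
as_latin1 = "caf\u00e9".encode("iso-8859-1")

# After decoding (the analogue of parsing) both give the same string,
# so the check cannot tell which encoding the document used.
decoded_from_utf8 = as_utf8.decode("utf-8")
decoded_from_latin1 = as_latin1.decode("iso-8859-1")

# A character outside Latin-1 (the euro sign, U+20AC) is rejected
# whatever encoding carried it.
euro_ok = is_latin1("\u20ac")
```

The bytes differ between the two encodings, but the check sees only the
decoded characters, which is the sense in which the restriction is
independent of the document encoding. Passing allow_empty=True
corresponds to writing "*" instead of "+".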
My 0,02 Euros,
Eric
--
See you in Paris for the Forum XML.
http://www.technoforum.fr/Pages/forumXML01/index.html
------------------------------------------------------------------------
Eric van der Vlist http://xmlfr.org http://dyomedea.com
http://xsltunit.org http://4xt.org http://examplotron.org
------------------------------------------------------------------------