
   Re: [xml-dev] regex for element or attr name


(Actually, I don't know why #xHHHH is used here: it would be better to
use U+HHHH to emphasize that these are characters, not codes in the
external encoding.)

Cheers
Rick Jelliffe




Ramkumar Menon wrote:
Hi,

Could anybody let me know which tokens are permitted within a fully qualified XML element name or attribute name?
A regex would help.

For a simple regex, adapt these productions, which are based on the XML 1.1 rules.

The main things the name check needs to do are to prevent digits and symbols
(other than ":" and "_") from starting the name, and to prevent spaces and
silliness inside the name.
The XML 1.0 rules are much more complicated but are a restriction of these.

[4]    NameStartChar    ::=    ":" | [A-Z] | "_" | [a-z] | [#xC0-#xD6] | [#xD8-#xF6] | [#xF8-#x2FF] | [#x370-#x37D] | [#x37F-#x1FFF] | [#x200C-#x200D] | [#x2070-#x218F] | [#x2C00-#x2FEF] | [#x3001-#xD7FF] | [#xF900-#xFDCF] | [#xFDF0-#xFFFD]
[4a]    NameChar    ::=    NameStartChar | "-" | "." | [0-9] | #xB7 | [#x0300-#x036F] | [#x203F-#x2040]
[5]    Name    ::=    NameStartChar (NameChar)*
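
As a rough illustration, the productions above can be transcribed fairly directly into a regex. The sketch below uses Python's re module; the names NAME_START, NAME_EXTRA, NAME_RE and is_xml_name are made up for this example and do not come from any library. It follows the ranges exactly as listed here, so it does not cover the supplementary range [#x10000-#xEFFFF] that the XML 1.1 Recommendation also allows in NameStartChar, and it does not check namespace rules (for instance, it accepts any number of colons).

```python
import re

# Character ranges for production [4] NameStartChar, as listed above.
# Assumption: the supplementary range #x10000-#xEFFFF is left out,
# matching the listing in this message rather than the full spec.
NAME_START = (
    r":A-Z_a-z"
    r"\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u02FF"
    r"\u0370-\u037D\u037F-\u1FFF\u200C-\u200D"
    r"\u2070-\u218F\u2C00-\u2FEF\u3001-\uD7FF"
    r"\uF900-\uFDCF\uFDF0-\uFFFD"
)

# Extra characters that production [4a] NameChar adds to NameStartChar.
NAME_EXTRA = r"\-.0-9\u00B7\u0300-\u036F\u203F-\u2040"

# Production [5]: Name ::= NameStartChar (NameChar)*
NAME_RE = re.compile(f"[{NAME_START}][{NAME_START}{NAME_EXTRA}]*")


def is_xml_name(text):
    """Return True if the whole string matches the Name production."""
    return NAME_RE.fullmatch(text) is not None


if __name__ == "__main__":
    for candidate in ["para", "xsl:template", "_id-1.a", "1abc", "-a", "a b"]:
        print(repr(candidate), is_xml_name(candidate))
```

Using fullmatch rather than match with a trailing "$" avoids accidentally accepting a name followed by a newline. Regex dialects without \uHHHH escapes would need the ranges written in whatever notation they support.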


 


Copyright 2001 XML.org. This site is hosted by OASIS