xml-dev - Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering the origina

Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering the origina

[ Lists Home | Date Index | Thread Index ]

To: Tim Bray <tbray@textuality.com>
Subject: Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering the original XML vision)
From: Daniel Veillard <veillard@redhat.com>
Date: Sun, 16 Feb 2003 16:58:07 -0500
Cc: Mike Champion <mc@xegesis.org>, xml-dev@lists.xml.org
In-reply-to: <3E4FD058.3060806@textuality.com>; from tbray@textuality.com on Sun, Feb 16, 2003 at 09:54:32AM -0800
References: <7c.35a8563b.2b7ff77d@aol.com> <06cb01c2d5b0$d0073990$4bc8a8c0@AlletteSystems.com> <3E4FC824.4040208@textuality.com> <oprko65ubyezizxn@smtp.comcast.net> <3E4FD058.3060806@textuality.com>
Reply-to: veillard@redhat.com
User-agent: Mutt/1.2.5.1i

On Sun, Feb 16, 2003 at 09:54:32AM -0800, Tim Bray wrote:
> Well, XML1.1 is moving in that direction.  Even given that, I think that 
> XML 1.0's approach, with a big table right in the spec saying "here are 
> the legal characters", was probably correct; I (and I'm sure many other 
> programmers) ran a perl script over the spec to extract the char parsing 
> tables.   -Tim

 I used vi regexps directly, and recorded those in the C source file :-) !

 :1,$ s/\[#x\([0-9A-Z]*\)-#x\([0-9A-Z]*\)\]/     (((c) >= 0x\1) \&\& ((c) <= 0x\2)) ||/
 and
 :1,$ s/#x\([0-9A-Z]*\)/     ((c) == 0x\1) ||/

 of course the result was later modified a bit to speed up the test.
In order to try to turn a useless post into an useful one, did someone
tried to implement the character normalization checking of XML-1.1 ?
   http://www.w3.org/TR/xml11/#sec2.13
 I looked at the ICU sample code a few months ago and this simply scared
me mostly due to my perception of that code complexity and runtime cost.

Daniel

-- 
Daniel Veillard      | Red Hat Network https://rhn.redhat.com/
veillard@redhat.com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/

Follow-Ups:
- Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering the original XML vision)
  - From: "Rick Jelliffe" <ricko@allette.com.au>
- Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering theoriginal XML vision)
  - From: Tim Bray <tbray@textuality.com>

References:
- Re: [xml-dev] Remembering the original XML vision
  - From: AndrewWatt2000@aol.com
- Re: [xml-dev] Remembering the original XML vision
  - From: "Rick Jelliffe" <ricko@allette.com.au>
- Re: [xml-dev] Remembering the original XML vision
  - From: Tim Bray <tbray@textuality.com>
- Unicode and XML (was Re: [xml-dev] Remembering the original XML vision)
  - From: Mike Champion <mc@xegesis.org>
- Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering theoriginal XML vision)
  - From: Tim Bray <tbray@textuality.com>

Prev by Date: Re: [xml-dev] Remembering the original XML vision
Next by Date: Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering theoriginal XML vision)
Previous by thread: Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering theoriginal XML vision)
Next by thread: Re: [xml-dev] Unicode and XML (was Re: [xml-dev] Remembering theoriginal XML vision)
Index(es):
- Date
- Thread