xml-dev - Re: [xml-dev] [off-topic] xtext -- encoding declarations for text

Re: [xml-dev] [off-topic] xtext -- encoding declarations for text

[ Lists Home | Date Index | Thread Index ]

To: xml-dev@lists.xml.org
Subject: Re: [xml-dev] [off-topic] xtext -- encoding declarations for text
From: Richard Tobin <richard@cogsci.ed.ac.uk>
Date: Tue, 20 May 2003 11:21:19 +0100 (BST)
Cc:
In-reply-to: <019401c31e99$3041bcf0$4bc8a8c0@AlletteSystems.com>
Organization: HCRC, University of Edinburgh

>> Also, the fact that the first few characters depend on the application
>> will make it hard to write general transcoders.

>I had thought that too, but on thinking about it more I cannot see any 
>additional complexity compared with the XML Appendix F.  The details 
>of the algorithm are different but still it is the same three steps

[...]

> * else look for EBCDIC/ASCII signature (use string "[^a-zA-Z01-9]{1-4}xtext\b" 
>    rather than "<?xml\b"

For XML, it's only necessary to look at the first four bytes to cover
Unicode encodings, ascii supersets and ebcdic.  In the xtext case, you
will have to compare a string at several different positions or apply
a regular expression.  Certainly doable, but certainly more complex too!

-- Richard

Follow-Ups:
- Re: [xml-dev] [off-topic] xtext -- encoding declarations for text
  - From: "Bob Foster" <bob@objfac.com>

References:
- Re: [xml-dev] [off-topic] xtext -- encoding declarations for text
  - From: "Rick Jelliffe" <ricko@allette.com.au>

Prev by Date: Re: [xml-dev] Parsing the structure of a form as XML content
Next by Date: Re: [xml-dev] Question about XPath 2.0
Previous by thread: Re: [xml-dev] [off-topic] xtext -- encoding declarations for text
Next by thread: Re: [xml-dev] [off-topic] xtext -- encoding declarations for text
Index(es):
- Date
- Thread