OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Re: Is CDATA "structure"?

[ Lists Home | Date Index | Thread Index ]
  • From: "Nik O" <niko@cmsplatform.com>
  • To: "- XML-Dev" <xml-dev@ic.ac.uk>
  • Date: Mon, 19 Jul 1999 15:46:57 -0600

Yes, indeed, the MacOS uses CR-delimited text files.  We do quite a bit of
out markup production on Macs, with content management on MS-Windows PCs
(though that is moving to Linux/GNU machines).  So we deal with all three
common flavors of text file delimiters.

Expat implements the XML 1.0 spec correctly (i discovered this the hard way,
when Expat reported offsets that were different that i expected in my
CRLF-delimited files).

John Cowan got to this earlier today, but to amplify his point...

The following excerpt is from Tim Bray's truly excellent
annotated XML 1.0 Specification (http://www.xml.com/axml/testaxml.htm or
http://www.xml.com/axml/target.html to omit the explanation frame).

======= Begin spec excerpt =======
2.11 End-of-Line Handling
XML parsed entities are often stored in computer files which, for editing
convenience, are organized into lines. These lines are typically separated
by some combination of the characters carriage-return (#xD) and line-feed
(#xA).

To simplify the tasks of applications, wherever an external parsed entity or
the literal entity value of an internal parsed entity contains either the
literal two-character sequence "#xD#xA" or a standalone literal #xD, an XML
processor must pass to the application the single character #xA. (This
behavior can conveniently be produced by normalizing all line breaks to #xA
on input, before parsing.)
======= End spec excerpt =======

<rant original_post_to="Z3950IW">
  <excerpt>
..much of the internet is still constrained by Unix's feeble 7-bit character
TTY legacy.  This latter issue is echoed in Java's lack of _unsigned_ bytes
(!), and XML's auto-conversion of CRLF-delimited text records to
LF-delimited records (yet another legacy/bias from Unix).
  </excerpt>
  <question>
Is there historical basis to the above statement?  It was a deduction based
upon the old Xenix "text mode" I/O and the probability that most of the
developers of the XML standard were based in the Unix world.
  </question>
</rant>

Regards,
-Nik O, Content Mgmt Solutions, Jackson, Wyo.



xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)






 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS