[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Closing Blueberry

From: Rob Lugt <roblugt@elcel.com>
To: John Cowan <cowan@mercury.ccil.org>, Tim Bray <tbray@textuality.com>
Date: Thu, 19 Jul 2001 13:59:46 +0100

John Cowan wrote
>
> Repeating: the NEL proposal does *not* change the grammar
> of XML: what it does is add NEL and LS (U+2028) to the
> list of things that are accepted externally and mapped
> to LF first thing by the parser.

I must have missed this the first time you wrote it, because I don't think
it's true.

The xml rec currently states (in 2.11)  "To simplify the tasks of
applications, the characters passed to an application by the XML processor
must be as if the XML processor normalized all line breaks in external
parsed entities (including the document entity) on input, before parsing, by
translating both the two-character sequence #xD #xA and any #xD that is not
followed by #xA to a single #xA character."

I think the important point here is that the characters passed to the
application must be *as if* they were normalised.  The spec does not state
that the normalisation must occur before parsing (it could happen before
passing the characters to the application but after input tokenization).  If
it did then the white space production: s ::== (#x20 | #x9 | #xD | #xA)
wouldn't make sense because there would be no #xDs in the input stream.

Regards
~Rob

--
Rob Lugt
ElCel Technology
http://www.elcel.com/

Follow-Ups:
- Re: Closing Blueberry
  - From: Richard Tobin <richard@cogsci.ed.ac.uk>

References:
- Re: Closing Blueberry
  - From: John Cowan <cowan@mercury.ccil.org>

Prev by Date: Re: Closing Blueberry
Next by Date: RE: Historical I18n Note
Previous by thread: Re: Closing Blueberry
Next by thread: Re: Closing Blueberry
Index(es):
- Date
- Thread