[
Lists Home |
Date Index |
Thread Index
]
This may be an XML 1.0 issue, but XML 1.1 brought it back up for me. I
haven't been able to track it down in archives.
The XML 1.0 spec states:
--------------------------
2.11 End-of-Line Handling
XML parsed entities are often stored in computer files which, for
editing convenience, are organized into lines. These lines are typically
separated by some combination of the characters carriage-return (#xD)
and line-feed (#xA).
To simplify the tasks of applications, the characters passed to an
application by the XML processor must be as if the XML processor
normalized all line breaks in external parsed entities (including the
document entity) on input, before parsing, by translating both the
two-character sequence #xD #xA and any #xD that is not followed by #xA
to a single #xA character.
--------------------------
Does the character-reference version of #xD #xA, 
 , get
normalized to a single #xA?
(I'm writing a brute-force whitespace tool.)
-------------
Simon St.Laurent - SSL is my TLA
http://simonstl.com may be my URI
http://monasticxml.org may be my ascetic URI
urn:oid:1.3.6.1.4.1.6320 is another possibility altogether
|