[
Lists Home |
Date Index |
Thread Index
]
> > You could do it without changing the definition of
> well-formedness by
> > saying that the set of characters considered to be whitespace, and
> > normalized as such, is a property of the encoding.
>
> Fine and dandy for EBCDIC, but not so good for Latin-1 as
> used on mainframes, where 0x85 = NEL.
Then you use encoding="iso-8859-1+NEL".
Most parsers will say they don't handle this, which is fine, others will
accept it, which is also fine.
Michael Kay
Software AG
home: Michael.H.Kay@ntlworld.com
work: Michael.Kay@softwareag.com
|