OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] XML and mainframes, yet again (was RE: [xml-dev] So mecom

[ Lists Home | Date Index | Thread Index ]

Nicolas LEHUEN scripsit:

> Pardon my naive question, but how comes that Unicode, which can handle
> different character representations depending on the encoding used, does not
> have a SINGLE newline codepoint that would map onto 0x0D0A (CRLF) on some
> platform, 0x0D (CR) or 0x0A (LF) on others, 0x85 (NEL) on mainframes, etc. ?

It does: that was why U+2028 LINE SEPARATOR was designed into Unicode.
It was intended to solve the problem once and for all by being Unicode's
only line separator.

In practice, though, people who implemented Unicode retained the platform-
specific line terminator; as a consequence, U+2028 makes the problem

John Cowan           http://www.ccil.org/~cowan              cowan@ccil.org
Please leave your values        |       Check your assumptions.  In fact,
   at the front desk.           |          check your assumptions at the door.
     --sign in Paris hotel      |            --Miles Vorkosigan


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS