Re: XML Blueberry

K. Ari Krupnikov wrote:

> CR, LF and NEL are not the only space characters in Unicode.

We are not discussing space characters, but newline characters.

> But there are cases where this algorithm is non
> deterministic, and so special characters were introduced in Unicode --
> right-to-left space and left-to-right space.

These are not spaces, but invisible characters.  They don't need
to be in the definition of S.

It might have been better if XML string matching ignored these
so-called format characters, but it doesn't and that's that.
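
A minimal sketch in Python of the distinction drawn above. It assumes the
four-character S production of XML 1.0 (#x20, #x9, #xD, #xA) and takes the
marks in question to be U+200E LEFT-TO-RIGHT MARK and U+200F RIGHT-TO-LEFT
MARK. The helper names (is_xml_s, is_format_char, strip_format_chars) are
hypothetical and exist only for illustration; they are not part of any XML
API.

```python
import unicodedata

# XML 1.0 production S: (#x20 | #x9 | #xD | #xA)+
XML_S_CHARS = {"\u0020", "\u0009", "\u000D", "\u000A"}

LRM = "\u200E"  # LEFT-TO-RIGHT MARK, an invisible format character
RLM = "\u200F"  # RIGHT-TO-LEFT MARK, an invisible format character
NEL = "\u0085"  # NEXT LINE, the newline character under discussion


def is_xml_s(ch):
    """True if ch is one of the characters allowed by production S."""
    return ch in XML_S_CHARS


def is_format_char(ch):
    """True if Unicode classifies ch as a format character (category Cf)."""
    return unicodedata.category(ch) == "Cf"


def strip_format_chars(text):
    """Drop format characters; XML itself does NOT do this when matching."""
    return "".join(c for c in text if not is_format_char(c))


plain = "name"
marked = "na" + LRM + "me"

# Plain code-point comparison, which is what XML string matching uses,
# treats the two strings as different.
print(plain == marked)                      # False
# Only an application-level step like this one makes them compare equal.
print(plain == strip_format_chars(marked))  # True
```

The directional marks fall into category Cf rather than any space category,
which is why they have no place in S, and why two names differing only by
such a mark do not match unless the application strips them itself.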

There is / one art             || John Cowan <jcowan@reutershealth.com>
no more / no less              || http://www.reutershealth.com
to do / all things             || http://www.ccil.org/~cowan
with art- / lessness           \\ -- Piet Hein