[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: XML Blueberry
- From: John Cowan <jcowan@reutershealth.com>
- To: "K. Ari Krupnikov" <ari@cogsci.ed.ac.uk>
- Date: Fri, 22 Jun 2001 12:41:42 -0400
K. Ari Krupnikov wrote:
> CR, LF and NEL are not the only space characters in Unicode.
We are not discussing space characters, but newline characters.
> But there are cases where this algorithm is non
> deterministic, and so special characters were introduced in Unicode --
> right-to-left space and left-to-right space.
These are not spaces, but invisible characters. They don't need
to be in the definition of S.
It might have been better if XML string matching ignored these
so-called format characters, but it doesn't and that's that.
--
There is / one art || John Cowan <jcowan@reutershealth.com>
no more / no less || http://www.reutershealth.com
to do / all things || http://www.ccil.org/~cowan
with art- / lessness \\ -- Piet Hein