[
Lists Home |
Date Index |
Thread Index
]
On Saturday 12 January 2002 04:12 pm, David Brownell wrote:
> > >And also, do surrogate pairs really introduce any issues that
> > >are not already present in combining character sequences?
> >
> > Yes, I think they do. In particular for this thread, XML 1.0 names
> > (and probably XML 1.1 names) can be checked for well-formedness
> > and validity without worrying about combining characters.
This is more an issue with processing UTF-16. If you always work at
the scalar for validation the issue goes away. This is obvious, but
it's important to blur the character vs. encoding issue.
As characters, surrogate pairs have fewer issues than combining
sequences for display (or certainly no extra issues).
|