Lists Home |
Date Index |
I believe the current Unicode character range, and the one that was
effective for the XML 1.0 standard, is 0x20-0x10000 (note 17 bits) plus
the control characters, '\t' and '\n' and minus the surrogate pair range
and 0xFFFF and 0xFFFE. The fact that Java doesn't have much support for
the surrogate pairs, which are the only way to express character values
greater than 0xFFFF, doesn't mean they won't appear in XML documents.
So the answer is, no there's no single 16-bit maximum character value.
The test requires access to at least the next character and a little code.
Alan Gutierrez wrote:
> I'm implementing B-Tree to index XML documents. I'd like a
> to use maximum character value as a boundry, or failing that a
> minimum character value.
> I'm working in Java.
> Is there a maximum character value guarunteed be greater than
> any other character regardless of the locale? Is that character
> Or do I have to read a book on Unicode? Which one?
> Alan Gutierrez - email@example.com
> - http://engrm.com/blogometer/index.html
> - http://engrm.com/blogometer/rss.2.0.xml