Lists Home |
Date Index |
At 1:49 AM -0700 4/25/04, Jeff Rafter wrote:
>Because I am currently not doing a transformation as described in the note I
>decided to simply discard additional "x" characters. I still have a question
>about the hexadecimal number though. Because the language I am working in
>has a char limit of U+FFFF, I decided to limit my the input to the same
>number. Again, I have the potential of multiple leading "0" characters, but
>I again discard them. Does this sound reasonable?
I'm certainly no Relax expert, but on the face of it is does *NOT*
sound reasonable. In general XML and Unicode processing, one *MUST*
handle characters with code points beyond U+FFFF. They are not
optional. This is true even if your programming language (Java
perhaps?) has inadequate support for them.
Just possibly, if this only applies to characters in XML names, not
content or attribute values, and only XML 1.0 is supported (Not XML
1.1) then this may be reasonable. Otherwise it's clearly not.
Elliotte Rusty Harold
Effective XML (Addison-Wesley, 2003)