[
Lists Home |
Date Index |
Thread Index
]
Bullard, Claude L (Len) wrote:
> An XML server side process is getting a text that has been pasted in.
> Within the text are non-printing characters which
> are apparently handled as UTF-16 and the XML processor
> on the server replaces with character entities. On
> the client side, the XML processor says these are not
> legal characters and is apparently treating them
> as UTF-8. What is the best way to handle these?
Well, if you know the encoding of the pasted-in text, you're OK, because
you can either declare in the XML declaration, or convert it to
something else, or just turn 'em all into numeric char refs. There's
the potential problem of illegal non-XML characters, but you're going to
have to be prepared to deal with that.
Of course, if you *don't* know the encoding of the pasted-in characters,
you're basically hosed. -Tim
|