OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] Handling Non-printing Characters

[ Lists Home | Date Index | Thread Index ]

Bullard, Claude L (Len) wrote:

> An XML server side process is getting a text that has been pasted in. 
> Within the text are non-printing characters which 
> are apparently handled as UTF-16 and the XML processor 
> on the server replaces with character entities. On 
> the client side, the XML processor says these are not 
> legal characters and is apparently treating them 
> as UTF-8.  What is the best way to handle these?

Well, if you know the encoding of the pasted-in text, you're OK, because 
you can either declare in the XML declaration, or convert it to 
something else, or just turn 'em all into numeric char refs.  There's 
the potential problem of illegal non-XML characters, but you're going to 
have to be prepared to deal with that.

Of course, if you *don't* know the encoding of the pasted-in characters, 
you're basically hosed.  -Tim


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS