   Re: [xml-dev] Detection of non-Unicode characters


 From: "Ann Navarro" <ann@webgeek.com>

> I just ran into this myself, with a styled apostrophe character -- which 
> was only reported as a problem by XML Spy 4.4 upon opening the 1.2MB XML 
> file (character was: Â (0xC2), ' (0x92)).

On thinking about this more: if you have one non-ASCII character (a styled
apostrophe) and it is represented by two non-ASCII bytes, that is normally a
sign that the file is actually encoded in UTF-8.
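
A minimal Python sketch of that heuristic, using the byte pair from the quoted report. The function name `looks_like_utf8` and the sample string are illustrative assumptions, not anything from the original file, and a clean UTF-8 decode is only strong evidence, not proof.

```python
def looks_like_utf8(data: bytes) -> bool:
    """Heuristic: non-ASCII bytes are present and the data decodes cleanly as UTF-8."""
    if all(b < 0x80 for b in data):
        return False  # pure ASCII gives no evidence either way
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# Hypothetical sample containing the reported byte pair 0xC2 0x92.
sample = b"it\xc2\x92s"

as_utf8 = sample.decode("utf-8")         # the pair becomes one code point (U+0092)
as_latin1 = sample.decode("iso-8859-1")  # the pair becomes two characters (U+00C2, U+0092)

print(looks_like_utf8(sample), len(as_utf8), len(as_latin1))
```

Read as UTF-8 the pair is a single character; read as ISO-8859-1 it is two, which matches the stray "Â" seen in the report.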

Check whether that entity has an encoding declaration saying "ISO-8859-1" by
mistake, and try removing it if it does (with no declaration, the parser will
assume UTF-8).
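
One way to run that check, sketched in Python. It assumes the entity is in an ASCII-compatible encoding with no byte order mark, and the helper names `declared_encoding` and `strip_encoding` are hypothetical; a real XML parser is stricter about the declaration syntax than this regular expression.

```python
import re

# Matches the XML declaration up to and including its encoding pseudo-attribute.
# Assumption: the declaration is at byte 0 and is written in ASCII-compatible bytes.
_DECL_ENCODING = re.compile(
    rb'^(<\?xml[^>]*?)\s+encoding\s*=\s*(["\'])([A-Za-z][A-Za-z0-9._-]*)\2'
)


def declared_encoding(data: bytes):
    """Return the encoding named in the XML declaration, or None if absent."""
    m = _DECL_ENCODING.match(data)
    return m.group(3).decode("ascii") if m else None


def strip_encoding(data: bytes) -> bytes:
    """Remove the encoding pseudo-attribute, leaving the rest of the declaration."""
    return _DECL_ENCODING.sub(rb"\1", data, count=1)


# Hypothetical document: declared as ISO-8859-1 but containing UTF-8 bytes.
doc = b'<?xml version="1.0" encoding="ISO-8859-1"?><a>it\xc2\x92s</a>'

print(declared_encoding(doc))
print(strip_encoding(doc))
```

Work on a copy of the file, since removing the declaration from an entity that really is ISO-8859-1 would make it fail to parse.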

Cheers
Rick Jelliffe





Copyright 2001 XML.org. This site is hosted by OASIS