Consider this document, encoded in UTF-8 with no BOM:
Is there a safe way for a non-XML-aware text editor to find out that this
file is using UTF-8?
There are still a lot of people over here who like to use ISO 8859-1,
because they have the misconception that 'ä' is written 'Ã¤' in UTF-8. I was
about to tell one that it's just his editor that's broken, but then I came
to think about this: maybe there isn't a good way for a general text editor
to know about the UTF-8 encoding? Maybe the EF BB BF signature should have
been made mandatory?
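
To make the question concrete, here is a rough sketch of the kind of guess I imagine an editor could make without knowing anything about XML. It is only an illustration: the function name guess_encoding is made up, and the fallback to ISO 8859-1 is my own assumption, not something any standard prescribes. It checks for the EF BB BF signature first and otherwise tests whether the bytes are well-formed UTF-8.

```python
import codecs


def guess_encoding(data: bytes) -> str:
    """Hypothetical helper: guess between UTF-8 and ISO 8859-1."""
    # An explicit EF BB BF signature settles the matter.
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    # Without a signature, check whether the bytes are well-formed UTF-8.
    try:
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        # Assumed fallback: ISO 8859-1 accepts any byte sequence.
        return "iso-8859-1"
    return "utf-8"


# 'ä' (U+00E4) is the two bytes C3 A4 in UTF-8.
sample = "\u00e4".encode("utf-8")
assert guess_encoding(sample) == "utf-8"

# Reading those same bytes as ISO 8859-1 yields 'Ã¤'.
assert sample.decode("iso-8859-1") == "\u00c3\u00a4"

# The single byte E4 (ISO 8859-1 'ä') is not valid UTF-8 on its own.
assert guess_encoding(b"\xe4") == "iso-8859-1"
```

This is still a guess, not a guarantee: a file containing only ASCII looks the same in both encodings, and some ISO 8859-1 byte sequences happen to be well-formed UTF-8 too.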
I guess there's something I'm overlooking. Can someone explain?