[
Lists Home |
Date Index |
Thread Index
]
Consider this document, encoded in UTF-8 with no BOM:
<?xml version="1.0"?>
<ä/>
Is there a safe way for a non-XML-aware text editor to find out that this
file is using UTF-8?
There are still a lot of people over here who likes to use ISO 8859-1,
because they have the conception that 'ä' is written 'ä' in UTF-8. I was
about to tell one that it's just his editor that's broken, but then I came
to think about this: maybe there isn't a good way for a general text editor
to know about the UTF-8 encoding? Maybe the EF BB BF signature should have
been made mandatory?
I guess there's something I overlook. Can someone explain?
Gustaf
WordPad.lnk
|