OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Question about UTF-8

[ Lists Home | Date Index | Thread Index ]

Consider this document, encoded in UTF-8 with no BOM:

<?xml version="1.0"?>

Is there a safe way for a non-XML-aware text editor to find out that this
file is using UTF-8?

There are still a lot of people over here who likes to use ISO 8859-1,
because they have the conception that '' is written 'ä' in UTF-8. I was
about to tell one that it's just his editor that's broken, but then I came
to think about this: maybe there isn't a good way for a general text editor
to know about the UTF-8 encoding? Maybe the EF BB BF signature should have
been made mandatory?

I guess there's something I overlook. Can someone explain?




News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS