What I meant was, they chose a representation of the document consisting entirely of ASCII characters, to allow it to be stored with an ASCII encoding.
Of course, if all the characters in a document are ASCII, then the ASCII and UTF-8 encodings of the document are identical.
Michael Kay Saxonica
Hi Mike,
My guess would be that Microsoft chose an ASCII encoding for this file rather than a UTF-8 encoding because, at the time, CVS repositories could be very temperamental about file encodings.
For the XML document, that I cited (it seems to be, contributed by Microsoft) from w3c xml schema test suite, as following,
<doc value="؀؁ ......
Why do you say, its encoded with ASCII (is this what you're saying) and not UTF-8?
--
|