OASIS Mailing List Archives


BOM and encodings questions




If an XML document starts with the FF FE BOM (UTF-16, little endian) but the encoding is declared as “UTF-8” in the prolog, what is the expected behavior of the parser?

I think that the parser should respect the BOM, read the prolog assuming it is encoded in UTF-16 little endian, and then process the remainder of the XML document in UTF-8, as the prolog says.

Is this correct?
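
To make the scenario concrete, here is a minimal Python sketch that builds such a document and reports the mismatch. It only makes the conflict visible; it does not settle what a conforming parser is required to do. The helper names (`sniff_bom`, `declared_encoding`, `check`) are hypothetical and not part of any XML library, and the prefix comparison used for "consistent" is a simplifying assumption.

```python
import codecs
import re

# (BOM bytes, codec used to decode, encoding family the BOM implies).
# UTF-32 is listed first because its LE BOM begins with the UTF-16 LE BOM.
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le", "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32-be", "utf-32"),
    (codecs.BOM_UTF8, "utf-8", "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le", "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16-be", "utf-16"),
]

# Matches an XML declaration and captures the value of its encoding attribute.
DECL = re.compile(
    r'<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def sniff_bom(data: bytes):
    """Return (codec, family, bom_length), or (None, None, 0) without a BOM."""
    for bom, codec, family in BOMS:
        if data.startswith(bom):
            return codec, family, len(bom)
    return None, None, 0


def declared_encoding(data: bytes, codec: str, skip: int):
    """Decode the start of the document with the BOM's codec and return the
    encoding named in the XML declaration, or None if none is found."""
    head = data[skip:skip + 400].decode(codec, errors="replace")
    match = DECL.match(head)
    return match.group(1) if match else None


def check(data: bytes):
    """Return (bom_codec, declared, consistent) for a byte string."""
    codec, family, skip = sniff_bom(data)
    if codec is None:
        return None, None, True  # no BOM, so nothing to compare against
    declared = declared_encoding(data, codec, skip)
    # Assumption: "consistent" means the declared name starts with the
    # family implied by the BOM (e.g. "UTF-16" or "UTF-16LE" for FF FE).
    consistent = declared is None or declared.lower().startswith(family)
    return codec, declared, consistent


# The document from the question: FF FE BOM, UTF-16LE bytes, "UTF-8" declared.
sample = codecs.BOM_UTF16_LE + (
    '<?xml version="1.0" encoding="UTF-8"?><doc/>'.encode("utf-16-le")
)
print(check(sample))  # ('utf-16-le', 'UTF-8', False)
```

Note that the declaration can only be read here by decoding it as UTF-16LE, which is the step the question assumes the parser performs before switching encodings.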




Is an XML parser expected to process a document that switches between encodings? That is, is there a way to signal to the parser that, from a certain point on, the encoding changes to some other encoding? If so, how?



Is there a way to express the expected encoding of the XML document in the XML Schema? If so, how?






Copyright 1993-2007 XML.org. This site is hosted by OASIS