XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
"Interoperability is getting better" ... What does that mean?

Hi Folks,

Several people have stated:

    Interoperability, on the whole,
    does seem to be getting better.

In the context of our discussion on character encoding, what does that mean?

I will take a stab at defining what it means:

Interoperability means that you and I interpret (decode) the bytes in the XML file in the same way.

Example: I create an XML file and I encode all the characters in it using UTF-8. Here is a graphical depiction (i.e., glyphs) of the bytes that I send to you:

    <Name>Lˇpez</Name>

You receive my XML document but you interpret the bytes as iso-8859-1. 

Oops! 

Now the trouble begins.

In UTF-8 the ˇ is a graphical depiction of the LATIN SMALL LETTER O WITH ACUTE character and it is encoded inside the computer using these two bytes: C3 B3

But in iso-8859-1 the two bytes C3 B3 is the encoding of two characters:

     C3 is the encoding of the ├ character
     B3 is the encoding of the │ character

Thus, you interpret the XML as:

    <Name>L├│pez</Name>

We are interpreting the same XML document (i.e., the same set of bytes) differently.

Interoperability has failed.

So, when we say: 

    Interoperability is getting better.

we mean that the number of occurrences of senders and receivers interpreting the bytes in an XML document is decreasing.  

Is that correct?

/Roger



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS