XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
[xml-dev] Quiz: How do you put a Euro sign in your data if your XMLuses windows-1252 encoding and you use a numeric character reference?

Ouch.  I didn't realize the problem was globally this huge.  Thanks
for the insights.  At least now I know to watch out for it.

--Tim

On Fri, Mar 1, 2013 at 3:08 PM, Michael Kay <mike@saxonica.com> wrote:
>
>
> I've been advising people how to solve character encoding issues for about
> 100 years, but our own internal system for handling Saxon license requestsh
> still gets it wrong. It ain't easy.
>
>> For what it's worth, 1: Joel Spolsky's article on "The Absolute Minimum
>> Every Software Developer Absolutely, Positively Must Know About Unicode and
>> Character Sets (No Excuses!)"
>> <http://www.joelonsoftware.com/articles/Unicode.html> is quite good, I
>> think.
>
>
> The thread seems to be pointing to two conclusions:
>
> (a) there are people who don't understand the theory, and need to be
> educated (I don't know if Roger's insight about &x80 really was a new
> discovery for him, if so I am rather shocked).
>
> (b) but even if you do understand the theory, it's still hard to get it
> right in practice, because our systems are complex and built from
> heterogeneous components, many of which are outside our control, cannot be
> easily changed, and are poorly documented; the more complex they become, the
> more opportunities there are for data to be corrupted across the component
> boundaries.
>
> The underlying problem is that components throw bytes at each other without
> first agreeing what they mean, and because it works most of the time (i.e.
> when you speak English) people live with the problem rather than fixing it;
> and because they don't fix it, it gets worse.
>
> Michael Kay
> Saxonica
>
>
>
>
> _______________________________________________________________________
>
> XML-DEV is a publicly archived, unmoderated list hosted by OASIS
> to support XML implementation and development. To minimize
> spam in the archives, you must subscribe before posting.
>
> [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
> Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
> subscribe: xml-dev-subscribe@lists.xml.org
> List archive: http://lists.xml.org/archives/xml-dev/
> List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
>



--
============================================
Timothy Cook, MSc           +55 21 94711995
MLHIM http://www.mlhim.org
Like Us on FB: https://www.facebook.com/mlhim2
Circle us on G+: http://goo.gl/44EV5
Google Scholar: http://goo.gl/MMZ1o
LinkedIn Profile:http://www.linkedin.com/in/timothywaynecook


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS