OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] Data streams

[ Lists Home | Date Index | Thread Index ]

I don't think so.c Did you look at the sample he posted?

 > Every one of the XML data elements had this tagging structure:
 > <Row>
 >    <Cell><Data ss:Type="Number">1</Data></Cell>
 > </Row>
 > In contrast, the CSV had this structure: 1,

Since the "more information" in the XML, precious little that it adds, 
is identical for every data value, the XML format has approximately the 
same entropy as the CSV file. This looks more like a failure of the 
compression algorithm.

Bob Foster

Rick Marshall wrote:
> all you've done is shown that the entropy of the xml file is 
> significantly lower than the csv file. that would mean it carries 
> significantly more information and as others have pointed out, when 
> inspecting the xml, this is indeed the case.
> put another way the correct interpretation of your experiment is that 
> the ratio of the compressed file sizes points to a significant 
> difference in information content. the csv file and the xml file aren't 
> the same stuff.
> rick


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS