Lists Home |
Date Index |
I don't understand the attempt to compare the Excel XML export (which BTW
Excel calls "XML Spreadsheet") and CSV. The purposes of the two formats are
The XML Spreadsheet contains all the necessary information to reconstruct an
Excel spreadsheet, cell formatting, formulas, fonts, etc. You can open this
XML file in Excel and it will display exactly the same
formatting/formulas/etc. as it would if you had saved the spreadsheet in the
native .xls format.
CSV export outputs numbers. No formulas, no formatting, no cell
From: Stephen E. Beller [mailto:email@example.com]
Sent: Tuesday, December 07, 2004 8:20 AM
To: 'Rick Marshall'; 'Bob Foster'
Subject: RE: [xml-dev] Data streams
The compression algorithm is the standard WinZip9.0 application.
And that's the actual structure.
From: Rick Marshall [mailto:firstname.lastname@example.org]
Sent: Monday, December 06, 2004 8:04 PM
To: Bob Foster
Cc: Stephen E. Beller; email@example.com
Subject: Re: [xml-dev] Data streams
Bob Foster wrote:
> I don't think so.c Did you look at the sample he posted?
> > Every one of the XML data elements had this tagging structure: <Row>
> > <Cell><Data ss:Type="Number">1</Data></Cell>
> > </Row>
> > In contrast, the CSV had this structure: 1,
> Since the "more information" in the XML, precious little that it adds,
> is identical for every data value, the XML format has approximately
> the same entropy as the CSV file. This looks more like a failure of
> the compression algorithm.
whoops. do we know what the compression algorithm was? and was that
really the structure or just a summary of the structure?
> Bob Foster
> Rick Marshall wrote:
>> all you've done is shown that the entropy of the xml file is
>> significantly lower than the csv file. that would mean it carries
>> significantly more information and as others have pointed out, when
>> inspecting the xml, this is indeed the case.
>> put another way the correct interpretation of your experiment is that
>> the ratio of the compressed file sizes points to a significant
>> difference in information content. the csv file and the xml file
>> aren't the same stuff.
The xml-dev list is sponsored by XML.org <http://www.xml.org>, an initiative
of OASIS <http://www.oasis-open.org>
The list archives are at http://lists.xml.org/archives/xml-dev/
To subscribe or unsubscribe from this list use the subscription