xml-dev - RE: [xml-dev] Data streams

RE: [xml-dev] Data streams

[ Lists Home | Date Index | Thread Index ]

To: 'Rick Marshall' <rjm@zenucom.com>, 'Bob Foster' <bob@objfac.com>
Subject: RE: [xml-dev] Data streams
From: "Stephen E. Beller" <sbeller@nhds.com>
Date: Tue, 07 Dec 2004 08:19:45 -0500
Cc: xml-dev@lists.xml.org
Importance: Normal
In-reply-to: <41B5016F.1010707@zenucom.com>
Organization: NHDS, Inc.

The compression algorithm is the standard WinZip9.0 application.

And that's the actual structure.

Steve

-----Original Message-----
From: Rick Marshall [mailto:rjm@zenucom.com] 
Sent: Monday, December 06, 2004 8:04 PM
To: Bob Foster
Cc: Stephen E. Beller; xml-dev@lists.xml.org
Subject: Re: [xml-dev] Data streams

Bob Foster wrote:

> I don't think so.c Did you look at the sample he posted?
>
> > Every one of the XML data elements had this tagging structure:
> > <Row>
> >    <Cell><Data ss:Type="Number">1</Data></Cell>
> > </Row>
> >
> > In contrast, the CSV had this structure: 1,
>
> Since the "more information" in the XML, precious little that it adds, 
> is identical for every data value, the XML format has approximately 
> the same entropy as the CSV file. This looks more like a failure of 
> the compression algorithm.

whoops. do we know what the compression algorithm was? and was that 
really the structure or just a summary of the structure?

rick

>
> Bob Foster
>
> Rick Marshall wrote:
>
>> all you've done is shown that the entropy of the xml file is 
>> significantly lower than the csv file. that would mean it carries 
>> significantly more information and as others have pointed out, when 
>> inspecting the xml, this is indeed the case.
>>
>> put another way the correct interpretation of your experiment is that 
>> the ratio of the compressed file sizes points to a significant 
>> difference in information content. the csv file and the xml file 
>> aren't the same stuff.
>>
>> rick
>
>
>

References:
- Re: [xml-dev] Data streams
  - From: Rick Marshall <rjm@zenucom.com>

Prev by Date: RE: [xml-dev] Data streams
Next by Date: RE: [xml-dev] Data streams
Previous by thread: Re: [xml-dev] Data streams
Next by thread: Re: [xml-dev] Data streams
Index(es):
- Date
- Thread