xml-dev - Re: [xml-dev] Xqueeze: Compact XML Alternative


To: Alaric Snell <alaric@alaric-snell.com>
Subject: Re: [xml-dev] Xqueeze: Compact XML Alternative
From: Robin Berjon <robin.berjon@expway.fr>
Date: Fri, 07 Feb 2003 10:24:14 +0100
Cc: xml-dev@lists.xml.org
In-reply-to: <20030206215139.162E35542@calm.warhead.org.uk>
Organization: Expway
References: <p04330103ba65690e85d6@[192.168.254.4]> <E18gipa-0005MH-00@calvin.frontwire.com> <3E424DCF.6050207@sosnoski.com> <20030206215139.162E35542@calm.warhead.org.uk>
Reply-to: robin.berjon@expway.fr
User-agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.2) Gecko/20021126

Alaric Snell wrote:
> On Thursday 06 February 2003 11:58 am, you wrote:
>>It doesn't necessarily (or even generally) work that way - compact
>>binary formats don't generally compress down as well as text, so you end
>>up with size(text) > size(binary) > size(compressed-binary) >
>>size(compressed-text).
> 
> I've always found that compressed binary is smaller than compressed text, as 
> Tahir found. That makes sense logically too; both the binary and text formats 
> have the same CDATA in but the binary format has more compact representations 
> of the elements and so on.

That's not necessarily the case; it very much depends on the binarisation
process. The two need not carry the same CDATA, especially if said CDATA is
information available from a schema.

> Of course, one could design binary formats which compress badly, but I've 
> never found that they do by default.

I certainly hope that future improvements to our binary format will in fact make
it compress badly :) That should happen by making it more compact than it
currently is, while keeping similar speed (which is why compression is not
always an option).

It's true, however, that binary infosets do tend to compress further. In yet
another benchmark I read yesterday, the smallest results were bin-xml+gz and
bin-xml+bz2 (well, excluding the same ones with SVG quantize codecs; lossy
compression of XML documents still scares me ;).
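
For anyone who wants to check the ordering on their own data, here is a
minimal sketch of such a comparison. The sample document and the toy
tokenising encoder are assumptions made up for illustration; they are not
our format, nor Xqueeze, and the numbers will vary with the input.

```python
import bz2
import zlib


def make_xml(n):
    """Build a hypothetical sample document with n repeated records."""
    rows = "".join(
        f"<item><id>{i}</id><name>item{i}</name></item>" for i in range(n)
    )
    return f"<items>{rows}</items>".encode("utf-8")


def toy_binarise(n):
    """Encode the same records with a toy scheme (illustrative assumption):
    one byte per tag, length-prefixed text, one byte to close a container."""
    ITEMS, ITEM, ID, NAME, END = 1, 2, 3, 4, 0
    out = bytearray([ITEMS])
    for i in range(n):
        out.append(ITEM)
        for token, text in ((ID, str(i)), (NAME, f"item{i}")):
            data = text.encode("utf-8")
            out.append(token)
            out.append(len(data))  # single length byte: text must be < 256 bytes
            out += data
        out.append(END)  # closes <item>
    out.append(END)  # closes <items>
    return bytes(out)


def sizes(n=1000):
    """Return byte counts for the raw and compressed forms of both encodings."""
    text = make_xml(n)
    binary = toy_binarise(n)
    return {
        "text": len(text),
        "binary": len(binary),
        "text+zlib": len(zlib.compress(text, 9)),
        "binary+zlib": len(zlib.compress(binary, 9)),
        "text+bz2": len(bz2.compress(text)),
        "binary+bz2": len(bz2.compress(binary)),
    }


if __name__ == "__main__":
    for name, size in sizes().items():
        print(f"{name:12} {size}")
```

Which of the compressed forms comes out smallest depends on the document
and on how much of it the binarisation leaves as plain text, which is
rather the point of this thread.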

-- 
Robin Berjon <robin.berjon@expway.fr>
Research Engineer, Expway        http://expway.fr/
7FC0 6F5F D864 EFB8 08CE  8E74 58E6 D5DB 4889 2488
