xml-dev - Re: [xml-dev] XML / HTML Transport size

Re: [xml-dev] XML / HTML Transport size

[ Lists Home | Date Index | Thread Index ]

To: <xml-dev@lists.xml.org>
Subject: Re: [xml-dev] XML / HTML Transport size
From: "Karl Waclawek" <karl@waclawek.net>
Date: Mon, 18 Nov 2002 10:43:26 -0500
References: <Pine.LNX.4.44.0211180717280.8859-100000@high-mountain.nihongo.org>

> So, the _optimized_ XML compressor did more poorly than a default general
> purpose compressor by a few percentage points (at least for this data).  
> Their description sounds remarkably similiar to a block sort compressor -
> which is _precisely_ what bzip2 is (minus the 'patent rights' language
> from AT&T ;) ). Which is probably why the end sizes are fairly close for
> both.
> 
> Its _DAMNED HARD_ to improve on a modern general purpose compression
> program for textual data.

There is also another aspect to XML compression, and that is
the potential ability to query or parse the compressed document directly,
without decompressing it first. This does not seem to apply to XMill,
but WBXML or rather Millau encoding do provide this:

http://www10.org/cdrom/papers/542/ and  http://www9.org/w9cdrom/154/154.html

Karl

References:
- Re: [xml-dev] XML / HTML Transport size
  - From: Benjamin Franz <snowhare@nihongo.org>

Prev by Date: Re: [xml-dev] XML / HTML Transport size
Next by Date: Re: [xml-dev] RDF for unstructured databases, RDF for axiomatic
Previous by thread: Re: [xml-dev] XML / HTML Transport size
Next by thread: Re: [xml-dev] XML / HTML Transport size
Index(es):
- Date
- Thread