[
Lists Home |
Date Index |
Thread Index
]
> So, the _optimized_ XML compressor did more poorly than a default general
> purpose compressor by a few percentage points (at least for this data).
> Their description sounds remarkably similiar to a block sort compressor -
> which is _precisely_ what bzip2 is (minus the 'patent rights' language
> from AT&T ;) ). Which is probably why the end sizes are fairly close for
> both.
>
> Its _DAMNED HARD_ to improve on a modern general purpose compression
> program for textual data.
There is also another aspect to XML compression, and that is
the potential ability to query or parse the compressed document directly,
without decompressing it first. This does not seem to apply to XMill,
but WBXML or rather Millau encoding do provide this:
http://www10.org/cdrom/papers/542/ and http://www9.org/w9cdrom/154/154.html
Karl
|