xml-dev - Re: [xml-dev] Indexing solution for native XML database

Re: [xml-dev] Indexing solution for native XML database

[ Lists Home | Date Index | Thread Index ]

To: Michael Kay <mike@saxonica.com>
Subject: Re: [xml-dev] Indexing solution for native XML database
From: Peter Hunsberger <peter.hunsberger@gmail.com>
Date: Tue, 29 Nov 2005 14:40:09 -0600
Cc: Timo Hildén <castaway@daug.net>, xml-dev@lists.xml.org
Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=rENk9/Nxi4b0PVcAOoUucYbOPQC4FJbHl8B1aGOaAIZeCipTFiSDkriKWUSeadssmMpv092jkbbVkipuFJz8VjKCG/ipH9Jv8nt9A7vttOetVdm01XUPI1ZG+V6nQYWOBL7wYWQcj6eAgyzVij/TpAoltIDyxJOzZwzZdHCrzdI=
In-reply-to: <20051129193108.869AF2F4D21@mailhost3.dircon.co.uk>
References: <Pine.LNX.4.44.0511280929070.7603-100000@shell.daug.net> <20051129193108.869AF2F4D21@mailhost3.dircon.co.uk>

On 11/29/05, Michael Kay <mike@saxonica.com> wrote:
> > I'm searching for an indexing solution for my native XML
> > database project,
> > which I'm writing as a learning project.
> >
> > I use C++ as development language and a relational database
> > as backend.
>
> Why?
>
> I simply wouldn't start from here. Relational databases are bad at storing
> hierarchic data, they are bad at storing data whose order is significant,
> and they are bad at storing data whose structure is irregular. Many of the
> XPath axis traversals will map to recursive queries, which cannot be
> expressed in first-order predicate calculus. Even the operation of
> determining namespace context will require either a recursive query, or
> highly-redundant data storage.
>
> You'd be better off starting with an object database.

Gee Michael, care to over generalise just a tad?

If your data has low update frequency then a set/subset in-order data
model will have flat query structures but provide direct transforms to
hierarchical structures.  Update and insert operations can be
expensive in the general case for such a model, but for specific cases
it may not be an issue.  An ordered index in many relational databases
can have as good or better performance than in an object database.

Sure, a relational database might not be the best fit for a given
hierarchical structure. But for other cases the fit may be as good or
better than any other database....  Given the rather low level of
hierarchy expressed in the example I can see several ways of
addressing performance, but I'll address direct those directly to the
author of the question.

--
Peter Hunsberger

Follow-Ups:
- RE: [xml-dev] Indexing solution for native XML database
  - From: "Michael Kay" <mike@saxonica.com>

References:
- Indexing solution for native XML database
  - From: Timo Hildén <castaway@daug.net>
- RE: [xml-dev] Indexing solution for native XML database
  - From: "Michael Kay" <mike@saxonica.com>

Prev by Date: Re: [xml-dev] Common Word Processing Format
Next by Date: RE: [xml-dev] Common Word Processing Format
Previous by thread: RE: [xml-dev] Indexing solution for native XML database
Next by thread: RE: [xml-dev] Indexing solution for native XML database
Index(es):
- Date
- Thread