xml-dev - Re: xml parser

Re: xml parser

[ Lists Home | Date Index | Thread Index ]

From: Tim Bray <tbray@textuality.com>
To: "Michael Kay" <M.H.Kay@eng.icl.co.uk>, "Phani Adabala" <phani@www.hsc.wvu.edu>, <xml-dev@ic.ac.uk>
Date: Wed, 04 Nov 1998 08:22:37 -0800

At 10:55 AM 11/4/98 -0000, Michael Kay wrote:
>My immediate answer to this is yes, all the information you need for a
>search engine is available via the SAX or DOM interface offered by many
>parsers.

I disagree.  Few parsers track byte offsets or other locational info in
the file, and I think you need that to do basic things like proximity
and phrase search.

>Of course you don't need to build your own search engine either, all you
>need to do is write an XML filter for an existing search engine. I'm
>surprised no-one seems to have done this yet.

I think you do need to build your own engine.  Reason is, most existing
search engines have an atomic-document view of the world, and break
down completely when asked to model a general recursive hierarchical
structure like XML. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

Follow-Ups:
- SAX, DOM, and Search Engines (was Re: xml parser)
  - From: <david@megginson.com>
- Re: xml parser
  - From: Avi Rappoport <avirr@LanMinds.Com>

Prev by Date: Re: xml parser
Next by Date: Re: xml parser
Previous by thread: Re: xml parser
Next by thread: Re: xml parser
Index(es):
- Date
- Thread