OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: xml search engine?

[ Lists Home | Date Index | Thread Index ]
  • From: Dongwook Shin <dwshin@nlm.nih.gov>
  • To: Martin Bryan <mtbryan@sgml.u-net.com>
  • Date: Tue, 04 Apr 2000 10:06:16 -0400


Martin Bryan wrote:

> The advantage of XML queries over other forms of query is that you can use context to identify the subset of information within a document that you need to search to find a meaningful result. Instead of having to look at all indexed occurrences of the term you only need to look at that subset that are "associated" with a given context. This should, hopefully, reduce the level of information overload we all suffer from at present.

That is right. But even with that advantage, you will be surprised if you see
how the DOM based search engine is slow for a large collection.
Test the site:
(Please understand that I have no intention to blaim the site.
 I just want to show an example that DOM based search engine is not scalable.)

My original claim for the necessity of indexing is for the huge XML files.
Even if you take the full advantage of the context in a big XML file, you often
get into the situation you have to be patient very long to get the answer.


Dongwook Shin
Visiting Scholar
Lister Hill National Center for Biomedical Communications
National Library of Medicine,
8600 Rockville Pike Bethesda 20894, MD
E-mail: dwshin@nlm.nih.gov
Tel: (301) 435-3257
FAX: (301) 480-3035
URL: http://dlb2.nlm.nih.gov/~dwshin

This is xml-dev, the mailing list for XML developers.
To unsubscribe, mailto:majordomo@xml.org&BODY=unsubscribe%20xml-dev
List archives are available at http://xml.org/archives/xml-dev/


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS