OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: xml search engine?

[ Lists Home | Date Index | Thread Index ]
  • From: "KenNorth" <KenNorth@email.msn.com>
  • To: "Dongwook Shin" <dwshin@nlm.nih.gov>
  • Date: Mon, 3 Apr 2000 13:19:39 -0700

> I read the article you wrote and found it very interesting.
> But the way WHIRL search engine is doing seems exactly what
> I mean by "vocabulary matching" or "heuristics".

Right. I've not looked at the code but the engine undoubtedly uses stemming,
pattern matching, and other  heuristics to build and update its indexes.

The demos have been taken down but the WHIRL site used to demonstrate a
query that searched for kid's games. One of the games was by a company that
was acquired by Microsoft and WHIRL detected the similarity between the old
name and the new, so it find Web pages that matched either.

William Cohen said he saw no problems with combining WHIRL with a domain
vocabaulary and /or indexing XML documents.

================== Ken North =============================
See you at AD2000 (www.apllicationdevelopment.com)
XML DevCon 2000 (www.xmldevcon2000.com)

This is xml-dev, the mailing list for XML developers.
To unsubscribe, mailto:majordomo@xml.org&BODY=unsubscribe%20xml-dev
List archives are available at http://xml.org/archives/xml-dev/


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS