   Re: Searching XML

  • From: Avi Rappoport <xml@searchtools.com>
  • To: Warren Hedley <w.hedley@auckland.ac.nz>, xml-dev@ic.ac.uk
  • Date: Tue, 31 Aug 1999 11:26:43 -0700

At 3:04 PM +1200 8/31/1999, Warren Hedley wrote:
>Hey team
>I have a number of HTML and XML files that are used to generate
>our website. We want to add search functionality to this site,
>so that we can look for keywords and text.
>It has proven too slow to search through all of the files, so
>the method I suspect we would use, would be to generate an
>additional database containing all of our main data (perhaps
>all words longer than 4 letters), that we could quickly look
>through to generate search results.
>Does anyone know of an implementation of search functionality
>along these lines (Perl modules would be nice)? Or can anyone
>suggest a better plan of attack?

If you want to look for simple keywords and text, without recognizing 
any fields other than <title>, you could modify any of the free Perl 
scripts that create index files, such as Matt's Simple Search, Selena 
Sol's, Xavatoria, etc. (see listings on my site at 
<http://www.searchtools.com/tools/tools-perl.html>).  For larger 
sites, Ultraseek also recognizes XML: 
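
To make the index-file idea concrete, here is a minimal sketch in Python rather than Perl. It is not how any of the scripts named above work; the file names, the sample markup, and the function names are all made up for illustration. It strips tags with a crude regular expression, keeps only words longer than 4 letters (as proposed in the question), and maps each word to the set of files containing it, so a query becomes a dictionary lookup instead of a scan through every file.

```python
import re
from collections import defaultdict

TAG_RE = re.compile(r"<[^>]+>")    # crude tag stripper, not a real HTML/XML parser
WORD_RE = re.compile(r"[A-Za-z]+")
MIN_LEN = 5                        # "all words longer than 4 letters"


def extract_words(markup):
    """Return the set of lowercased words of at least MIN_LEN letters."""
    text = TAG_RE.sub(" ", markup)
    return {w.lower() for w in WORD_RE.findall(text) if len(w) >= MIN_LEN}


def build_index(docs):
    """Map each word to the set of document names that contain it."""
    index = defaultdict(set)
    for name, markup in docs.items():
        for word in extract_words(markup):
            index[word].add(name)
    return index


def search(index, query):
    """Return the documents containing every indexable word in the query."""
    terms = [t.lower() for t in WORD_RE.findall(query) if len(t) >= MIN_LEN]
    if not terms:
        return set()
    results = set(index.get(terms[0], set()))
    for term in terms[1:]:
        results &= index.get(term, set())
    return results


# Hypothetical sample documents standing in for the site's files.
docs = {
    "about.html": "<html><title>About</title><body>Protein models and simulation</body></html>",
    "data.xml": "<record><name>Protein database</name></record>",
}
index = build_index(docs)
print(sorted(search(index, "protein")))
```

In practice the index would be built once from the files on disk and saved (for example as JSON or a DBM file), then reloaded by the search script. Note that queries made up only of short words return nothing here, which is a known cost of the length cutoff.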

Best of luck,

Guide to Local Site, Intranet, and Portal Search Engines: 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
