
   Re: xml search engine?

  • From: "Steve Muench" <smuench@us.oracle.com>
  • To: "Walter Underwood" <wunder@infoseek.com>
  • Date: Fri, 31 Mar 2000 16:46:48 -0800

Oracle8i is another to consider, with its built-in
interMedia Text capability.

Oracle8i Release 2 (8.1.6) supports a blend of
theme and word searching of XML documents, and can
use the XML structure of the document to make
queries more precise. It also now indexes XML tags
automatically; in 8.1.5 that process was more manual.

For example,

<!-- claim77804.xml -->
<Claim>
  <ClaimId>77804</ClaimId>
  <Payment>1000</Payment>
  <DamageReport>
    The insured's <Vehicle Make="Volks">Beetle</Vehicle>
    broke through the guard rail and plummeted into a ravine.
    The cause was determined to be <Cause>faulty brakes</Cause>.
    Amazingly there were no casualties.
  </DamageReport>
</Claim>

With the above XML document stored in an "xmldoc" column,
you could find the claimid of insurance claims that
mention 'brakes' within the <Cause> tag of the
<DamageReport> of the <Claim> using a query like this
(whitespace added for clarity):

SELECT claimid FROM insurance_claim_table
 WHERE CONTAINS( xmldoc, 'brakes
                         WITHIN 
                        Cause 
                       WITHIN 
                      DamageReport 
                     WITHIN 
                    Claim') > 0
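
The CONTAINS query above only works once a text index exists
on the xmldoc column. A minimal sketch of creating one with
automatic tag sectioning might look like the following. The
index name claim_text_idx is made up for illustration, and
the use of the predefined CTXSYS.AUTO_SECTION_GROUP section
group is an assumption about how the 8.1.6 automatic tag
indexing would be switched on, so check it against the
interMedia Text documentation for your release:

```sql
-- Sketch only: the index name claim_text_idx is hypothetical.
-- Assumes the predefined CTXSYS.AUTO_SECTION_GROUP section group,
-- which is expected to create a section for each XML tag it finds,
-- so that Cause, DamageReport and Claim can be used with WITHIN.
CREATE INDEX claim_text_idx
    ON insurance_claim_table (xmldoc)
    INDEXTYPE IS ctxsys.context
    PARAMETERS ('SECTION GROUP ctxsys.auto_section_group');
```

With an index like this in place, the section names used after
WITHIN come straight from the tag names in the document.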

Have fun.

______________________________________________________________
Steve Muench, Lead XML Evangelist & Consulting Product Manager
Business Components for Java Dev't Team, Oracle Corporation

----- Original Message ----- 
From: "Walter Underwood" <wunder@infoseek.com>
To: <xml-dev@xml.org>
Sent: Friday, March 31, 2000 3:39 PM
Subject: Re: xml search engine?


| At 01:00 PM 3/30/00 -0500, David Megginson wrote:
| >Tim Bray <tbray@textuality.com> writes:
| > > I think what you're talking about would normally be called an Information-
| > > Retrieval (IR) system.  Such a system is distinguished from traditional
| > > search engines in the general case in that nobody has ever successfully
| > > built one that, in the general case, works.
| > > 
| > > (putting on my asbestos underwear) -Tim
| >
| >I think that anyone who lived through the excitement, hope, and
| >disappointment of the AI craze in the 1980's (academic) and early
| >1990's (commercial) would have to be very foolish to make any
| >different claim unless they could back it up with running,
| >production-grade software.
| 
| Ultraseek Server is definitely production-grade. We added XML
| as a supported document type in September 1998. It's a modern,
| high-quality IR engine, that scales to millions of documents
| and millions of queries per day. And you can buy it now.
| 
| XML documents are treated like HTML documents, Microsoft Word
| documents, or PDF documents. Extract the text and meta data
| (helped by a DTD-specific map), and index them. All are
| searchable with a common model, but without complex XQL-like
| queries.
| 
| And try the query suggested earlier on www.go.com (same basic
| internal algorithms as Ultraseek Server):
| 
|   have you got anything about the causes of hyperinflation in 
|     inter-war Germany?
| 
| Turns out that the first hit is a good description of the causes
| of hyperinflation in inter-war Germany:
| 
|    http://william-king.www.drexel.edu/top/prin/txt/probs/infl7.html
| 
| wunder
| --
| Walter R. Underwood
| Senior Staff Engineer
| Infoseek Software
| GO Network, part of The Walt Disney Company
| wunder@infoseek.com
| http://software.infoseek.com/cce/ (my product)
| http://www.best.com/~wunder/
| 1-408-543-6946
| 
| 
| ***************************************************************************
| This is xml-dev, the mailing list for XML developers.
| To unsubscribe, mailto:majordomo@xml.org&BODY=unsubscribe%20xml-dev
| List archives are available at http://xml.org/archives/xml-dev/
| ***************************************************************************
| 






 


Copyright 2001 XML.org. This site is hosted by OASIS