[
Lists Home |
Date Index |
Thread Index
]
This is getting off-topic from what I originally hoped for :)
I was trying to start a conversation about using an XML indexing system
in a particular problem domain - not something as grand as what you and
Alan are are talking about.
To me using something like Lucene provides for a much faster and much
less memory consuming process to search large collections of XML
documents or relational databases (whether they are story XML or not).
best,
-Rob
Bullard, Claude L (Len) wrote:
> IF it is really a full-text indexing system,
> it scans and infers topics the same way a human
> scans and tags. It would require a rule base
> perhaps similar to a Schematron assertion engine.
> Past attempts made several passes over content
> to create a series of tagged documents that are
> successively refined. However, as in memory-based
> patterning systems, the more abstract the links,
> the more opinionated the system. Such systems
> can become very superstitious in exactly the same
> way people do. How was it phrased: "A schema is
> an opinion about a document..."
>
> http://www.answers.com/main/ntquery?method=4&dsid=2222&dekey=Memory-predicti
> on+framework&gwp=11&curtab=2222_1&linktext=memory-prediction%20framework
>
> len
>
> From: Robert Koberg [mailto:rob@koberg.com]
>
>
>> Which is why I'd propose defining a full-text schema language,
>> so XML content can be described to a full-text search engine.
>
>
>
> It does sound very interesting. How would it work? What would it look
> like? I have tried doing this with XML Schema but gave up. I had tried
> to use annotations to give weight to different things, then I tried to
> make a type system. For me, it was just easier to write java to handle
> it. Now I write org.xml.sax.ext.DefaultHandler2's that suit my needs. I
> know, not very scalable or user friendly.
>
|