[
Lists Home |
Date Index |
Thread Index
]
On torsdag, januari 3, 2002, at 03:16 , Dare Obasanjo wrote:
> When last I looked, Xindice wasn't up to handling large files (> 5
> megabytes).
Yes! But you have to read what the faq suggests that you should do:
http://www.dbxml.org/docs/FAQ
--- quote from the dbXML faq ----
10. My 5 megabyte file is crashing the command line, help?
See FAQ #2. dbXML wasn't designed for monster documents,
rather, it was designed for collections of small to medium
sized documents. The best thing to do in this case would be to
look at your 5 megabyte file, and determine whether or not it's
a good candidate for being sliced into a set of small documents.
If so, you'll want to extract the separate documents and add
them to a dbXML collection individually. A good example of
this, would be a massive document of this form:
<cars>
<car>
<make>Boyoda</make>
<model>Cordova</model>
<year>1989</year>
</car>
<car>
<make>Frod</make>
<model>Tortorus</model>
<year>1990</year>
</car>
...
</cars>
In this case, it makes more sense to split the file into
several <car> documents instead of a single <cars> document.
2. What is dbXML not?
dbXML is not a persistent DOM implementation. It was not designed
to store and manage single monster sized documents, where one
document is treated as a set of mini documents. It was
specifically designed for managing many small to medium sized
documents.
--- quote end -----
As they say, break it up in small pieces. You can use XSLT to break the
document into smaller pieces. Both Xalan and Saxon help you to save the
pieces in separate files. Then you can import those files into dbXML,
useing the command-line tool.
As I wrote in my last posting:
"The answer might be a simple, but it is hard to give any advice without
knowing how you are using the document, how it is structured, and what
kind of changes to the document you frequently do, etc."
Niels Peter
|