OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Re: [xml-dev] XML Parsing using DOM parser

[ Lists Home | Date Index | Thread Index ]


On torsdag, januari 3, 2002, at 03:16 , Dare Obasanjo wrote:

> When last I looked, Xindice wasn't up to handling large files (> 5 
> megabytes).

Yes!  But you have to read what the faq suggests that you should do: 
http://www.dbxml.org/docs/FAQ

--- quote from the dbXML faq ----

10. My 5 megabyte file is crashing the command line, help?

See FAQ #2.  dbXML wasn't designed for monster documents,
rather, it was designed for collections of small to medium
sized documents.  The best thing to do in this case would be to
look at your 5 megabyte file, and determine whether or not it's
a good candidate for being sliced into a set of small documents.
If so, you'll want to extract the separate documents and add
them to a dbXML collection individually.  A good example of
this, would be a massive document of this form:

    <cars>
       <car>
          <make>Boyoda</make>
          <model>Cordova</model>
          <year>1989</year>
       </car>
       <car>
          <make>Frod</make>
          <model>Tortorus</model>
          <year>1990</year>
       </car>
       ...
    </cars>

In this case, it makes more sense to split the file into
several <car> documents instead of a single <cars> document.


2. What is dbXML not?

dbXML is not a persistent DOM implementation.  It was not designed
to store and manage single monster sized documents, where one
document is treated as a set of mini documents.  It was
specifically designed for managing many small to medium sized
documents.

--- quote end -----

As they say, break it up in small pieces. You can use XSLT to break the 
document into smaller pieces. Both Xalan and Saxon help you to save the 
pieces in separate files. Then you can import those files into dbXML, 
useing the command-line tool.


As I wrote in my last posting:

"The answer might be a simple, but it is hard to give any advice without
knowing how you are using the document, how it is structured, and what
kind of changes to the document you frequently do, etc."

Niels Peter





 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS