OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   RE: DOM -> SQL -> DOM: Serialization...

[ Lists Home | Date Index | Thread Index ]
  • From: "Fabio Arciniegas A." <l-arcini@uniandes.edu.co>
  • To: <xml-dev@ic.ac.uk>, <cocoon@list.working-dogs.com>
  • Date: Thu, 11 Nov 1999 16:41:55 -0500

Kevin, I agree 100% with Prakash about the time restriction. During the
development of Samxa and Munin(two of my xml projects) i was faced with a
similar problem and definetly the cost of  deserializing/serializing DOM
docs was excesive to my needs (which were lighter than yours!)

The kind of solution you need depends largely on what kind of analysis you
need to run on the docs... for me, things were solved watching for regular
expresions on the startElement and characterData callbacks of a sax
parser... then again, i somehow suspect you need much more than that.
anyway,if you want, email me to tell me more about your kind of analysis and
maybe i can help with a bit of samxa code.

Fabio



- -----Original Message-----
- From: owner-xml-dev@ic.ac.uk [mailto:owner-xml-dev@ic.ac.uk]On Behalf Of
- Prakash Mandgi
- Sent: Jueves, 11 de Noviembre de 1999 12:17 a.m.
- To: Kevin A. Burton; xml-dev@ic.ac.uk; cocoon@list.working-dogs.com
- Subject: Re: DOM -> SQL -> DOM: Serialization...
-
-
- Hi Kevin
-
- with sql server / oracle i doubt you can retrieve that data so fast
- especially over
- the network let alone parse it.
- You may want to consider using Object databases
- 1) Persist DOM documents as is in an object database and then run XQL /
- XPath etc
- 2) Use entity java beans to store the DOM document in memory
-
- You can pretty much forget about serializing / deserializing DOM documents
- of that size in less than a second.
-
- Prakash
-
- ----- Original Message -----
- From: Kevin A. Burton <burton@relativity.yi.org>
- To: <xml-dev@ic.ac.uk>; <cocoon@list.working-dogs.com>
- Sent: Thursday, November 11, 1999 4:56 AM
- Subject: DOM -> SQL -> DOM: Serialization...
-
-
- > I need to take a ton of XML documents... about say 100M and run analysis
- > on them in realtime.  (less than 1 second)
- >
- > I am planning on using a parser to return a DOM, then take that DOM and
- > Serialize it to SQL with maybe ENTITY|ATTRIBUTE|CDATA tables, etc.  And
- > then write an interface (maybe XMLQL) to get the data.
- >
- > Has anyone seen an Open Source project to do this?
- >
- > Kevin
- >
- > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
- > Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
- CD-ROM/ISBN 981-02-3594-1
- > To unsubscribe, mailto:majordomo@ic.ac.uk the following message;
- > unsubscribe xml-dev
- > To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
- message;
- > subscribe xml-dev-digest
- > List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
- >
- >
-
- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
- Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
- CD-ROM/ISBN 981-02-3594-1
- To unsubscribe, mailto:majordomo@ic.ac.uk the following message;
- unsubscribe xml-dev
- To subscribe to the digests, mailto:majordomo@ic.ac.uk the
- following message;
- subscribe xml-dev-digest
- List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
-
-
-


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo@ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)






 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS