OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Re: [xml-dev] Efficient Parsing of large documents of size 1GB

[ Lists Home | Date Index | Thread Index ]

I would suggest SAX parser which would do mmap() for chunks of your XML
document.
libxml2 seems to be a good choice for that from performance point of view.
Parsers which do regular file read() system calls will not be that fast.

--Mikhail Grushinskiy

----- Original Message ----- 
From: "Purushothama Ranganatha" <Purushothama_R@mindtree.com>
To: <xml-dev@lists.xml.org>
Sent: Wednesday, May 14, 2003 12:30 AM
Subject: [xml-dev] Efficient Parsing of large documents of size 1GB


I get XML messages/documents :Size varies from 1KBk to 1GB.
My application need to parse these XML messages.
How do I go about this. Should I use DOM / SAX / PULL parser.

thanks/regards
Purush

Purushothama Ranganatha
+91-80-6711777. extn:2238.
Have faith in yourself, you can do anything in the Universe.



DISCLAIMER:
This message (including attachment if any) is confidential and may be
privileged. Before opening attachments please check them for viruses and
defects. MindTree Consulting Private Limited (MindTree) will not be
responsible for any viruses or defects or any forwarded attachments
emanating either from within MindTree or outside. If you have received this
message by mistake please notify the sender by return  e-mail and delete
this message from your system. Any unauthorized use or dissemination of this
message in whole or in part is strictly prohibited.  Please note that
e-mails are susceptible to change and MindTree shall not be liable for any
improper, untimely or incomplete transmission.

-----------------------------------------------------------------
The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
initiative of OASIS <http://www.oasis-open.org>

The list archives are at http://lists.xml.org/archives/xml-dev/

To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl>



---
Outgoing mail is certified Virus Free.
Checked by AVG anti-virus system (http://www.grisoft.com).
Version: 6.0.481 / Virus Database: 277 - Release Date: 5/13/2003





 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS