[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]
RE: [xml-dev] Validating Bulk XML Data
- From: "Ravindra Rao" <rrao@globeop.com>
- To: "Michael Kay" <mike@saxonica.com>,<xml-dev@lists.xml.org>
- Date: Wed, 15 Aug 2007 15:43:36 +0100
No its not slower it would be faster.. just that if one of the small documents is invalid the large doc would be invalid too and stop further processing, i would want in an ideal situation an error message saying docs no. 10 200 and 800 are invalid (and allow the processing of the remaining 997) , in a large doc it would just tell me this is invalid .
-----Original Message-----
From: Michael Kay [mailto:mike@saxonica.com]
Sent: 15 August 2007 13:53
To: Ravindra Rao; xml-dev@lists.xml.org
Subject: RE: [xml-dev] Validating Bulk XML Data
> Broadly we can have two approaches either change
> the schema a bit so that the xsd grammar sees each of the
> files as one giant xml document with many nodes or parse each
> of the 1000. The disadvantage with the first approach seems
> to be if one of the original gomls is invalid the whole
> aggregated goml gets invalidated. The second approach is
> potentially slow (we are using this and multi threading it
> and tweaking bits of it).
I can't see any intrinsic reason why validating 1000 small documents should
take longer than validating one document formed by concatenating the
content, provided the schema itself is only prepared once.
Michael Kay
http://www.saxonica.com
_______________________________________________________________________
XML-DEV is a publicly archived, unmoderated list hosted by OASIS
to support XML implementation and development. To minimize
spam in the archives, you must subscribe before posting.
[Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
subscribe: xml-dev-subscribe@lists.xml.org
List archive: http://lists.xml.org/archives/xml-dev/
List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]