My suspicion is that if such large files are mostly flat, it might be
possible/desirable to store them in the database as smaller chunksŠ
T
Oxford University Press (UK) Disclaimer
On 05/03/2015 16:04, "Michael Kay" <mike@saxonica.com> wrote:
>
>> As always Michael, you are very wise. And I certainly appreciate your
>>graciousness.
>>
>> I have some figures regarding size and complexity:
>>
>> There are 50 million XML files, each 50MB in size.
>> The files are mostly flat (not deeply nested).
>> We need to perform queries across the 50 million files.
>>
>> What's your recommendation for storing and querying this huge amount of
>>XML files?
>>
>
>Depends on project timescales, budget and risk, and possibly on
>availability of expertise and experience in particular technologies.
>Probably one of the following alternatives:
>
>(A) Start a prototyping project to assess whether MarkLogic is capable of
>meeting the project requirements.
>
>(B) Choose three native XML databases that look promising and assess each
>of the three to compare how well they handle the project requirements.
>
>Michael Kay
>Saxonica
>
>
>_______________________________________________________________________
>
>XML-DEV is a publicly archived, unmoderated list hosted by OASIS
>to support XML implementation and development. To minimize
>spam in the archives, you must subscribe before posting.
>
>[Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
>Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
>subscribe: xml-dev-subscribe@lists.xml.org
>List archive: http://lists.xml.org/archives/xml-dev/
>List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
This message is confidential. You should not copy it or disclose its contents to anyone. You may use and apply the information for the intended purpose only. OUP does not accept legal responsibility for the contents of this message. Any views or opinions presented are those of the author only and not of OUP. If this email has come to you in error, please delete it, along with any attachments. Please note that OUP may intercept incoming and outgoing email communications.
_______________________________________________________________________
XML-DEV is a publicly archived, unmoderated list hosted by OASIS
to support XML implementation and development. To minimize
spam in the archives, you must subscribe before posting.
[Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
subscribe: xml-dev-subscribe@lists.xml.org
List archive: http://lists.xml.org/archives/xml-dev/
List Guidelines: http://www.oasis-open.org/maillists/guidelines.php