OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] combining XMLEvent lists

On 09/28/2010 04:33 PM, Michael Kay wrote:
> Sounds fascinating, and I wish I had time to get involved. It would
> certainly be elegant if you could have both the productivity of writing
> this declaratively in XSLT and the performance of running it on Hadoop
> MapReduce. Intrinsically, the two seem to fit together hand in glove,
> but I suspect some engineering effort is needed to make it work.

Hello Michael,

I think it would be too complicated to achieve the desired grouping with
Java. Do you think it makes sense to first serialize the results and
then use Saxon's XSLT 2.0 processor to achieve the results? Or do you
have any direct input from a List of XMLEvents to Saxon's XSLT
processor? I assume it reads XML-data from an InputSource or some kind
of a stream.

It's a special case, where two or more revisions of one article are made
at the same time (in the same second). I would have to look through the
XML file with BaseX or Saxon, but I'm pretty sure such cases exist
somewhere in the hugh file (as of now I've only extracted a small subset
of articles and replaced WikiText inside text-elements with XML).

The whole task is to sort the revisions to shredder it into our XML
datastorage system (the deltas of the revisions), which has the
capability to store and retrieve revisions compactly and efficiently. In
parallel I'm currently writing the import of a sorted XML file.

My main task (master project and thesis) is or will be the visualization
of temporal tree structured data to gain further insights into the
evolution of the data, which are otherwise very difficult to realize.


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS