Lists Home |
Date Index |
Sounds a nice idea independent of any speed benefits. I toyed with the idea
of parsing XSLT match pattterns backwards at one stage, but didn't pursue
it. My brain is wired for left-to-right reading...
As a matter of interest, is there any problem with decoding UTF-8 when
Shame that the spec doesn't require ">" to be escaped, I imagine this causes
a fair problem with backtracking, for example how do you cope with
...... <a><b/><c/></a> -->
which might or might not be part of a comment?
IIRC we were able to read files backwards on ICL VME. That's history, but
many of its features have reappeared in Windows 25 years later ...
> -----Original Message-----
> From: Sean McGrath [mailto:firstname.lastname@example.org]
> Sent: 01 April 2006 08:22
> To: email@example.com
> Subject: [xml-dev] Exploiting multi-core CPUs during XML parsing
> I have sketched out an algorithm for fast XML WF parsing
> utilising two
> threads that each start at opposite ends of the octet-stream
> and meet in
> the middle. The algorithm hinges on the fact that start- and end-tags
> are balanced. i.e. as one thread reads forward looking for foo
> start-tag, the other thread is reading backwards looking for
> foo end-tag.
> This also has the nice side effect of giving you accurate
> error messages
> quickly. i.e. as soon as a mismatched tag is found, it can be
> This is particularly useful with recursive element types.
> Modern machines (even desktop boxes) will increasingly have
> CPUs as standard and I think it behooves us to find ways of
> that in the XML world.
> But to get the most out of the algorithm, you really need a
> that supports reading byte-streams backwards.
> Does anyone know of such a filesystem?
> The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
> initiative of OASIS <http://www.oasis-open.org>
> The list archives are at http://lists.xml.org/archives/xml-dev/
> To subscribe or unsubscribe from this list use the subscription
> manager: <http://www.oasis-open.org/mlmanage/index.php>