[Dare Obasanjo]
>Tim is basically asking for pull-based XML parsers, implementations of
>which exist in .NET Framework and the Java world.
>[...]
>
>while (<STDIN>) {
> next if (X<meta>X);
> if (X<h1>|<h2>|<h3>|<h4>X)
> { $divert = 'head'; }
> elsif (X<img src="/^(.*\.jpg)$/i>X)
> { &proc_jpeg($1); }
> # and so on...
>}
The xmln and xmlv utilities
(http://pyxie.sourceforge.net/pyxie_download.html) allow you to have the
convenience of pull without forsaking XML parsing.
The patterned code would look quite similar to Tim's except that you would
hook "(meta" for meta start-tags etc.
It has the significant advantage that all the syntactic cruft in XML has
already been dealt with, so you don't have to worry about the regexps
missing something in the markup.
It's also <rant>not another freaking API</rant> :-). PYX is *text*.
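To make the "hook the start-tag line" idea concrete, here is a rough Python sketch of Tim's loop run over PYX lines instead of raw markup. It assumes the usual PYX line prefixes ("(" start-tag, ")" end-tag, "A" attribute, "-" character data, "?" processing instruction). The names scan_pyx, HEADINGS and JPEG are made up for illustration and are not part of Pyxie.

```python
import re

# Hypothetical names for illustration; none of these come from Pyxie.
HEADINGS = {"h1", "h2", "h3", "h4"}
JPEG = re.compile(r"^(.*\.jpg)$", re.IGNORECASE)

def scan_pyx(lines):
    """Walk PYX lines and mirror the logic of the quoted loop.

    Returns (divert, jpegs): divert is 'head' once any h1..h4 start-tag
    has been seen, jpegs lists every img src value ending in .jpg.
    """
    divert = None
    jpegs = []
    current = None  # element whose attribute lines we are currently reading
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        kind, rest = line[0], line[1:]
        if kind == "(":
            # Start-tag line, e.g. "(meta" or "(h1". Tags we do not care
            # about (meta included) simply fall through untouched.
            current = rest
            if rest in HEADINGS:
                divert = "head"
        elif kind == "A" and current == "img":
            # Attribute line: "A" + name + space + value.
            name, _, value = rest.partition(" ")
            match = JPEG.match(value)
            if name == "src" and match:
                jpegs.append(match.group(1))
        elif kind in ")-?":
            # Attribute lines only follow directly after a start-tag.
            current = None
    return divert, jpegs

sample = [
    "(html", "(meta", "Aname generator", ")meta",
    "(h1", "-Title", ")h1",
    "(img", "Asrc pic.JPG", ")img",
    ")html",
]
result = scan_pyx(sample)
```

In practice the lines would come from the output of xmln or xmlv rather than a hand-written list; passing sys.stdin to scan_pyx should work the same way, since it only needs an iterable of lines.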
See using PYX (http://www.xml.com/pub/a/2000/03/15/feature/)
Also, I used PYX to create a drop-dead simple approach to record-centric
XML processing using PULL:
http://www.xml.com/pub/a/2000/04/26/rax/
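As a loose illustration of what record-centric pull over PYX can look like (this is not the RAX API from the article, just a sketch under my own assumptions), the generator below yields one dict per record element. The name records and the flat "record contains simple text fields" shape are hypothetical. Nested structure inside a field is not handled.

```python
def records(pyx_lines, record_tag):
    """Yield one dict per <record_tag> element, mapping child name to text.

    Illustrative only: assumes flat records whose children hold plain text.
    """
    record = None  # dict being filled, or None when outside a record
    field = None   # name of the child element currently open
    for line in pyx_lines:
        line = line.rstrip("\n")
        if not line:
            continue
        kind, rest = line[0], line[1:]
        if kind == "(":
            if rest == record_tag:
                record = {}
            elif record is not None:
                field = rest
                record.setdefault(field, "")
        elif kind == "-" and record is not None and field is not None:
            # Character data belonging to the open field.
            record[field] += rest
        elif kind == ")":
            if rest == record_tag and record is not None:
                yield record
                record = None
            field = None

people = [
    "(people",
    "(person", "(name", "-Ann", ")name", "(age", "-30", ")age", ")person",
    "(person", "(name", "-Bob", ")name", ")person",
    ")people",
]
rows = list(records(people, "person"))
```

The caller pulls records one at a time with a plain for loop, which is the appeal: no callbacks, no tree, just the next record when you ask for it.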
Sean
http://seanmcgrath.blogspot.com