OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   RE: [xml-dev] rss regularis(z)ation

[ Lists Home | Date Index | Thread Index ]

Unfortunately the "Mutual Termination for Patent Action" makes tagsoup a
time-bomb.
There's no way I could possibly use this.  

-----Original Message-----
From: John Cowan [mailto:cowan@mercury.ccil.org] 
Sent: Wednesday, July 23, 2003 4:41 AM
To: Elliotte Rusty Harold
Cc: xml-dev@lists.xml.org
Subject: Re: [xml-dev] rss regularis(z)ation


Elliotte Rusty Harold scripsit:

> > Feed the element content into
> >a tag-soup parser, infer start- and end- tags to turn it into
> >a tree, and strip out all the elements you don't want showing up
> >in the aggregator output.  Took me about two hours to code this up
> >(to be fair, I did use an off-the shelf lexer for the first step).
> 
> If you need to write your own tag soup parser, it ain't XML. That's 
> too much work for a job that shouldn't be necessary in the first 
> place.

Fortunately, Java programmers don't need to write their own tag soup
parsers;
I did that.

http://www.ccil.org/~cowan/XML/tagsoup

-- 
It was impossible to inveigle           John Cowan
<jcowan@reutershealth.com>
Georg Wilhelm Friedrich Hegel           http://www.ccil.org/~cowan
Into offering the slightest apology     http://www.reutershealth.com
For his Phenomenology.                      --W. H. Auden, from "People"
(1953)

-----------------------------------------------------------------
The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
initiative of OASIS <http://www.oasis-open.org>

The list archives are at http://lists.xml.org/archives/xml-dev/

To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl>




 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS