OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Parsing HTML in Perl



On Wed, 9 May 2001, Frank Boumphrey wrote:

> Perhaps of interest to you perl programmers out there.
> 
> I asked one of our programmers (Gabe Schaeffer) to write a function to
> parse a malformed HTML file, prior to converting it to XHTML. Here is
> what he produced!
> 
> I've never seen an HTML file parsed with a single line of Perl RegEx
> before!

Dude, you can do this *so* much easier...

open(my $fh, "pyxhtml file.html | pyxw |") || die "Cannot pyx: $!";

then parse $fh.

-- 
<Matt/>

    /||    ** Founder and CTO  **  **   http://axkit.com/     **
   //||    **  AxKit.com Ltd   **  ** XML Application Serving **
  // ||    ** http://axkit.org **  ** XSLT, XPathScript, XSP  **
 // \\| // ** mod_perl news and resources: http://take23.org  **
     \\//
     //\\
    //  \\