[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: HTML --> XHTML --> XML Conversion (HTML Scraping)
- From: Michael Fitzgerald <email@example.com>
- To: Gul Imran <firstname.lastname@example.org>, xml-dev <email@example.com>
- Date: Fri, 23 Mar 2001 09:11:01 -0800
What about HTML Tidy?
tidy -asxml file.html > file.xml
From: Gul Imran [mailto:firstname.lastname@example.org]
Sent: Friday, March 23, 2001 8:12 AM
Subject: HTML --> XHTML --> XML Conversion (HTML Scraping)
I'm wondering if anyone has any info on HTML scraping and how to extract
from HTML to get XML data. My idea is to convert HTML to XHTML first and
then go from there. I have found couple of software vendors that claim to
provide HTML -> XML + XSL conversion or direct HTML to WAP/VoiceXML
conversions. One is www.percussion.com and other is www.spyglassmobile.com.
Are there any other tools out there that can perform HTML scraping for XML
data generation? Or what logic is best for data extraction after I have
XHTML Dom Model?
Sr. Software Engineer, R & D
e-Business Customer Interaction Solutions
"What the tech is going on?"