6/26/02 9:30:41 AM, Yaneric Roussel <email@example.com> wrote:
>I would like to extract information from non-XML text files and put it
>in XML format.
>Instead of writing a program for each type of text file I have, I would
>like to use a "schema-like" file specifying the
>structure of my non-XML text file, to retrieve from it the information
>that I will put in XML.
>Have you heard of projects, technologies, etc. that do this?
You might want to look into the Data Extraction Language, published as a W3C Note
(http://www.w3.org/TR/2001/NOTE-data-extraction-20011031). I don't know what the status of
implementation is, but it looks like it was designed for problems like yours.
>Can I achieve my goal by using XSLT or other xml related technology?
Not directly with XSLT, since it operates on a tree representation of a well-formed XML document.
However, if your XSLT processor can build such a tree from a stream of SAX events, you
could write SAX drivers for your input formats and feed their events into the transformation. SAX
drivers for non-XML formats are quite popular in the Perl community, among other places.
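
To make the SAX-driver idea concrete, here is a minimal sketch in Python using only the standard library. It assumes a hypothetical input format of "key: value" lines with blank lines between records, and it assumes every key is already a valid XML element name. The names text_to_sax and convert are made up for illustration. The handler here is the standard XMLGenerator, which simply serializes the events as XML; a SAX-aware XSLT processor could take the same events instead.

```python
import io
from xml.sax.saxutils import XMLGenerator
from xml.sax.xmlreader import AttributesImpl


def text_to_sax(lines, handler):
    """Hypothetical SAX driver: turn 'key: value' lines into SAX events.

    Assumes blank lines separate records and that each key is a valid
    XML element name (no validation is done here).
    """
    no_attrs = AttributesImpl({})
    handler.startDocument()
    handler.startElement("records", no_attrs)
    in_record = False
    for raw in lines:
        line = raw.strip()
        if not line:
            # A blank line closes the current record, if one is open.
            if in_record:
                handler.endElement("record")
                in_record = False
            continue
        key, _, value = line.partition(":")
        key = key.strip()
        if not in_record:
            handler.startElement("record", no_attrs)
            in_record = True
        handler.startElement(key, no_attrs)
        handler.characters(value.strip())
        handler.endElement(key)
    if in_record:
        handler.endElement("record")
    handler.endElement("records")
    handler.endDocument()


def convert(text):
    """Run the driver against XMLGenerator and return the XML as a string."""
    out = io.StringIO()
    text_to_sax(text.splitlines(), XMLGenerator(out, encoding="utf-8"))
    return out.getvalue()
```

Calling convert("name: Ada\nlang: Python\n\nname: Bob\n") yields a records element containing two record elements. Each new input format would need only its own driver function; everything downstream of the SAX events stays the same.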