OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   Non-XML documents to XML Converter?

[ Lists Home | Date Index | Thread Index ]
  • From: David Megginson <david@megginson.com>
  • To: "XML Developers' List" <xml-dev@ic.ac.uk>
  • Date: Mon, 17 May 1999 15:12:11 -0400 (EDT)

Roger L. Costello writes:

 > Anyone have a tool that converts a document that is formatted in a
 > non-XML syntax into XML?

Perl -- it's hideously ugly, but most of the world uses it and it runs
pretty fast (especially when pattern matching).

Here's a ten-line Perl program that will convert most non-XML text
files to XML (as long as they don't contain control characters):

  print "<?xml version=\"1.0\"?>\n<junk>";
  while (<>) {
    if (/[&<>]/) {
      s/&/\&amp\;/g;
      s/</\&lt\;/g;
      s/>/\&gt\;/g;
    }
    print;
  }
  print "</junk>"; 


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)





 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS