OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: parsing XML using regular expressions

[ Lists Home | Date Index | Thread Index ]
  • From: Simon Carter <simon_carter@splwg.com>
  • To: xml-dev@lists.xml.org
  • Date: Thu, 10 Aug 2000 14:32:53 +1100

> We have occasional battles here about the wisdom of using
> non-XML-parser-based tools to process XML, and regular expressions always
> seem to come up.
> I've got a reader question that sort of ties into a (non-regex-based) Java
> project I'm working on, about the viability of regex and other text-based
> processing for XML work.
> Has anyone written a generic XML parser, even a somewhat broken one,
> built on regular expressions?  I remember hearing of something a long
> ago, but I can't find it.
> I'm not concerned with the efficiency/viability/profitability/wisdom of
> such a solution, just whether or not it's been done - especially if it's
> available open source.

Hi Simon,

My application TextPipe (http://www.crystalsoftware.com.au) performs shallow
parsing on XML documents up to 2GB
in size. Once a regexp match has been found, TextPipe allows the matching text
to be matched by further regexps
or be transformed in other ways, such as edit-distance or sounds-like
search/replaces, user-written code etc.

It can also query databases for data in XML, fixed width and CSV formats which
can be further processed, and even sent
on to other databases.


Simon Carter
End Hours Wasted Manually Editing Data / Text with TextPipe

Other products: LFNit!, DirSize, BabyShield, DirDate, ZeroIn, Arc Menu,
ClipSize, Clean 'n' Go


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS