OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: Regular Fragmentations

Simon St.Laurent wrote:
> Back in April I suggested that regular expressions might be a useful
> tool for fragmenting XML 'molecule' content into smaller pieces which
> could then be processed as 'atoms':
> http://www.xml.com/pub/a/2001/04/25/deviant.html
> I've finally found the time to put together an implementation of this
> approach, building a SAX2 filter which uses an XML configuration file
> and the regular expression support built into the Xerces parser.  As
> content passes through the filter, elements identified by the
> configuration file are processed and broken down into smaller elements
> using rules built on regular expressions.

This is terrific. I will use this.

> This filter is written in Java (1.3) and requires the Xerces parser.
> Comments, suggestions, and contributions are welcome, either privately
> or to the xml-dev mailing list.

I agree that it would be nice if this were to be parser independent.