OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: ANN: Converting Lexacy Data to XML using a Lexer/Parser Generator

[ Lists Home | Date Index | Thread Index ]

> I find the use of a lexer/parser generator as an XML tool builder to be
> quite interesting.  It enables you to seamlessly integrate the
> definition of a grammar with a general purpose programming language. 
> [Kind of like having DTDs seamlessly integrated with a general purpose
> programming language.]  I find this to be very powerful.  

What you are doing is essentially building a mini XML parser. But your
solution of using a general lexer/parser generator (such as JavaCC, CUP,
or ANTLR) has an inherent problem.

Namely, you are re-inventing a wheel. And it's a big one. Why build
another XML parser, when you have a lot of free good quality parsers out

Your mini parser isn't even XML parser; it can't handle different
encodings, can't handle entities, Unicode character references,
namespaces ... , the list goes on forever.

You'd better off using a SAX parser as your lexer.

And have a look at RelaxNGCC (http://relaxngcc.sourceforge.net/) I
guess this is close to what you want to do.

Kohsuke Kawaguchi
E-Mail: kk@kohsuke.org


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS