xml-dev - Re: [xml-dev] SAX/Java Proposed Changes

To: <sax-devel@lists.sourceforge.net>
Subject: Re: [xml-dev] SAX/Java Proposed Changes
From: Elliotte Rusty Harold <elharo@metalab.unc.edu>
Date: Mon, 8 Mar 2004 10:07:57 -0500
Cc: <xml-dev@lists.xml.org>
In-reply-to: <006c01c4051c$db4a7690$9e539696@citkwaclaww2k>
References: <200403071358418.SM05492@evanslt><404B7BF3.7030606@sosnoski.com> <002801c404ca$cfb86690$0207a8c0@karl><404C22B7.9020500@sosnoski.com><003701c40517$237d2fd0$9e539696@citkwaclaww2k><p06010207bc72317626a7@[192.168.254.4]><006c01c4051c$db4a7690$9e539696@citkwaclaww2k>

At 9:51 AM -0500 3/8/04, Karl Waclawek wrote:

>>  Being able to rely on
>>  startDocument()/endDocument() in the ContentHandler allows all the
>>  initialization  and tear-down code to easily go in the same class as
>>  the code that fills the data structure. It's all neatly unified.
>
>Why could it not go in the same class in the other case?

If the ContentHandler doesn't have any initialization or cleanup
methods (or at least any reliably invoked ones) then it can't do the
initialization or cleanup. Something else has to do it. You could add
custom initialization or cleanup methods and then have the something
else call these:

handler.initialize();
parser.parse();
handler.cleanup();

But that's still ugly and less than ideal. As I teach my students, if 
certain public methods must be invoked in a certain order, then 
something is wrong. They should be made private and combined into one 
public method. Each public method call should be atomic and 
independent of other public methods. Having the ContentHandler do its 
own initialization and cleanup makes the code clean and robust. 
Relying on others to do it makes the code ugly and brittle. It's 
analogous to the difference between programming in a language like C
with explicit memory allocation and deallocation and a language like
Java with automatic memory management. Both will get the job done, but
one's a heck of a lot cleaner and less bug-prone.
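
To make that concrete, here is a minimal sketch of a handler that does its own setup in startDocument(), assuming the parser reliably invokes it. The class name ElementNameCollector and its collect() method are made up for illustration; the SAX and JAXP calls are the standard ones.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: records element names and manages its own state.
class ElementNameCollector extends DefaultHandler {
    private List<String> names = new ArrayList<>();

    @Override
    public void startDocument() {
        // Initialization lives here, so a reused handler starts clean
        // without the caller having to remember an extra call.
        names = new ArrayList<>();
    }

    @Override
    public void startElement(String uri, String localName,
                             String qName, Attributes atts) {
        names.add(qName);
    }

    // The single public entry point: no required call order for the caller.
    public List<String> collect(String xml) {
        try {
            XMLReader reader =
                SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            reader.setContentHandler(this);
            reader.parse(new InputSource(new StringReader(xml)));
            return names;
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("parse failed", e);
        }
    }
}
```

Because the reset happens in startDocument(), the same handler instance can be used for a second parse and will not carry names over from the first one.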

Oh, it just hit me why startDocument() is not an adequate replacement 
for endDocument(). There's often work you want to do at the end of a 
parse irrespective of whether there's a next document or not. For 
instance, you might want to store the results in a database 
somewhere, or update some other variable. The purpose of 
endDocument() is not solely to clean up any data structures that were 
used. We need both startDocument() and endDocument(), not just one. 
Yes, they may not be named precisely correctly, but we do need them, 
and not being able to rely on them is a major hassle.
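
A rough sketch of that kind of end-of-parse work, again assuming endDocument() is reliably called. ElementCounter and countInto() are hypothetical names, and the IntConsumer is only a stand-in for whatever would really write to a database:

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.function.IntConsumer;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: counts elements and hands off the total at the end.
class ElementCounter extends DefaultHandler {
    private final IntConsumer resultSink; // stand-in for a database write
    private int count;

    ElementCounter(IntConsumer resultSink) {
        this.resultSink = resultSink;
    }

    @Override
    public void startDocument() {
        count = 0; // per-document initialization
    }

    @Override
    public void startElement(String uri, String localName,
                             String qName, Attributes atts) {
        count++;
    }

    @Override
    public void endDocument() {
        // Not cleanup: this publishes the result of the parse, and it has
        // to happen whether or not another document ever follows.
        resultSink.accept(count);
    }

    // Convenience wrapper so callers need only one call.
    static void countInto(String xml, IntConsumer sink) {
        try {
            XMLReader reader =
                SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            reader.setContentHandler(new ElementCounter(sink));
            reader.parse(new InputSource(new StringReader(xml)));
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("parse failed", e);
        }
    }
}
```

If the result were only handed off from the next startDocument(), the total for the last document parsed would never reach the sink.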

-- 

   Elliotte Rusty Harold
   elharo@metalab.unc.edu
   Effective XML (Addison-Wesley, 2003)
   http://www.cafeconleche.org/books/effectivexml
   http://www.amazon.com/exec/obidos/ISBN%3D0321150406/ref%3Dnosim/cafeaulaitA
