RE: [xml-dev] Auto-generate a DTD from multiple XML documents?
- From: "Michael Kay" <mike@saxonica.com>
- To: "'Jon Noring'" <jon@noring.name>,<xml-dev@lists.xml.org>
- Date: Sun, 7 Oct 2007 04:39:27 +0100
The Saxon DTDGenerator only handles one input document, but I've used it for
this scenario in the past, simply by concatenating all the documents within
a dummy root element, generating the DTD, and then discarding the dummy root
element definition. (It doesn't need to be a physical concatenation; you can
set up a document that brings all the others in as external entities.)
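As a rough illustration of the external-entity approach, the wrapper document could be generated with a short script along these lines. This is only a sketch: the function name build_wrapper, the root name dummy-root, and the entity names doc0, doc1, ... are made up for the example and are not part of the DTDGenerator. It also assumes the input files are usable as external parsed entities, which in particular means they must not carry their own DOCTYPE declaration.

```python
from pathlib import Path


def build_wrapper(paths, root="dummy-root"):
    """Return the text of a wrapper document that pulls in each file
    as an external entity inside a single dummy root element.

    The root name and the entity names (doc0, doc1, ...) are
    hypothetical choices made for this sketch.
    """
    decls = []
    refs = []
    for i, p in enumerate(paths):
        name = f"doc{i}"
        # as_uri() needs an absolute path and percent-encodes
        # characters such as quotes, so the URI is safe to quote.
        uri = Path(p).resolve().as_uri()
        decls.append(f'  <!ENTITY {name} SYSTEM "{uri}">')
        refs.append(f"  &{name};")
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        f"<!DOCTYPE {root} [",
        *decls,
        "]>",
        f"<{root}>",
        *refs,
        f"</{root}>",
        "",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    import sys

    # Usage: python build_wrapper.py a.xml b.xml > wrapper.xml
    sys.stdout.write(build_wrapper(sys.argv[1:]))
```

The resulting wrapper.xml is then the single input document; the declaration generated for the dummy root element is the one to discard afterwards.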
I produced a new version of the DTDGenerator a couple of weeks ago, as it
happens - the first release for 6 years, fixing a bug that caused it to
slowly eat memory in proportion to the number of distinct attribute values.
After fixing that, I was able to process 20GB of input in about 25 minutes.
It's available as a free-standing download from the Saxon project on
SourceForge.
Michael Kay
http://www.saxonica.com/
> -----Original Message-----
> From: Jon Noring [mailto:jnoring@gmail.com] On Behalf Of Jon Noring
> Sent: 06 October 2007 21:10
> To: xml-dev@lists.xml.org
> Subject: [xml-dev] Auto-generate a DTD from multiple XML documents?
>
> Everyone,
>
> Several of us involved with Distributed Proofreaders and
> Project Gutenberg are analyzing a number of TEI documents
> representing PG etexts.
>
> As part of the analysis, we'd like to find an application
> which will autogenerate a minimal DTD (or schema) for
> validating a given set of documents sharing the same root element.
>
> I have seen several applications which will autogenerate a
> DTD for a single XML document, but have not yet found one
> which will generate a DTD representing a set of XML documents.
>
> Of course, we prefer a "free" application for this one-time
> job, but if there's a good commercial package, hopefully we
> can find someone with a copy who'd be happy to volunteer
> running the job for us.
>
> Thanks in advance.
>
> Jon Noring