   Re: Gutenberg Project <longish>

  • From: "Liam R. E. Quin" <liamquin@interlog.com>
  • To: xml-dev@xml.org
  • Date: Tue, 7 Mar 2000 22:12:43 -0500 (EST)

I think that if you are starting with texts from Project Gutenberg,
the DTD isn't too important.  Do read the TEI DTD and use whatever
you can from it, but remember also that the PG texts are far from
scholarly in the first place.  Michael Hart has checked provenance
only so far as is necessary to ensure copyright status.  The texts
there are not always the "most familiar" versions[1], and markup such
as italics or other emphasis is either lost or rendered as ALL CAPS.

If you use the same structures as the TEI where possible, but with
names that seem clearer for what you're doing, then an automatic
translation between the two would be possible.
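
As a rough illustration of that kind of translation, here is a minimal
Python sketch that renames elements one-for-one using the standard
library. The local names (book, chapter, title, para, emphasis) are
made up for the example and the mapping is only an assumption; text,
div, head, p and hi are TEI element names. A real mapping would also
have to deal with attributes and with any structures that don't line
up exactly.

```python
import xml.etree.ElementTree as ET

# Hypothetical project-specific element names mapped to TEI names.
# The left-hand names are invented for this example.
LOCAL_TO_TEI = {
    "book": "text",
    "chapter": "div",
    "title": "head",
    "para": "p",
    "emphasis": "hi",
}

def to_tei(xml_text, mapping=LOCAL_TO_TEI):
    """Rename elements according to mapping; the structure is left untouched."""
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        # Names missing from the mapping pass through unchanged.
        elem.tag = mapping.get(elem.tag, elem.tag)
    return ET.tostring(root, encoding="unicode")

sample = (
    "<chapter><title>I</title>"
    "<para>Call me <emphasis>Ishmael</emphasis>.</para></chapter>"
)
print(to_tei(sample))
```

Because only the names change, the same table turned around would take
you from the TEI names back to your own.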

This is actually a project very dear to me -- I had wanted so badly
to set up an archive of marked-up texts (then in SGML) when I
worked at SoftQuad, but the world wasn't ready for it.  The texts
at the Oxford Text Archive were not freely redistributable, and there
were few other sources at that time.


Liam Quin, Open Source / SGML / XML / Unix / Perl consultant, Toronto
http://www.holoweb.net/~liam/ Ankh on irc.sorcery.net
barefoot on the net!  co-author, The XML Specification Guide, Wiley Inc.
forthcoming: The Open Source XML Database Toolkit...
