[
Lists Home |
Date Index |
Thread Index
]
- From: Jon.Bosak@eng.Sun.COM (Jon Bosak)
- To: xml-dev@ic.ac.uk
- Date: Fri, 28 Nov 1997 10:00:07 -0800
[Peter Murray-Rust:]
| Jon Bosak's Shakespeare, and religion are pre-eminent and are a good
| test for whether a system can cope with 'real documents'. I haven't
| looked at religion, but Shakespeare has a clean and natural markup
| without attributes. So it's not a torture test. (I don't think there
| are DTDs - I think I hacked my own). I don't think there is any mixed
| content in Shakespeare
The current distributions at
http://sunsite.unc.edu/pub/sun-info/standards/xml/eg/shakespeare.1.02.xml.zip
http://sunsite.unc.edu/pub/sun-info/standards/xml/eg/religion.1.02.xml.zip
include the DTDs. For the curious, I append them below; they are
achingly simple. Frankly, I have lost track of whether they are
conformant with our current case rules; I think so, but I would be
grateful for any corrections from the parser writers.
Jon
========================================================================
<!-- DTD for Shakespeare J. Bosak 1994.03.01, 1997.01.02 -->
<!-- Revised for case sensitivity 1997.09.10 -->
<!ENTITY amp "&">
<!ELEMENT PLAY (TITLE, FM, PERSONAE, SCNDESCR, PLAYSUBT, INDUCT?,
PROLOGUE?, ACT+, EPILOGUE?)>
<!ELEMENT TITLE (#PCDATA)>
<!ELEMENT FM (P+)>
<!ELEMENT P (#PCDATA)>
<!ELEMENT PERSONAE (TITLE, (PERSONA | PGROUP)+)>
<!ELEMENT PGROUP (PERSONA+, GRPDESCR)>
<!ELEMENT PERSONA (#PCDATA)>
<!ELEMENT GRPDESCR (#PCDATA)>
<!ELEMENT SCNDESCR (#PCDATA)>
<!ELEMENT PLAYSUBT (#PCDATA)>
<!ELEMENT INDUCT (TITLE, SUBTITLE*, (SCENE+|(SPEECH|STAGEDIR|SUBHEAD)+))>
<!ELEMENT ACT (TITLE, SUBTITLE*, PROLOGUE?, SCENE+, EPILOGUE?)>
<!ELEMENT SCENE (TITLE, SUBTITLE*, (SPEECH | STAGEDIR | SUBHEAD)+)>
<!ELEMENT PROLOGUE (TITLE, SUBTITLE*, (STAGEDIR | SPEECH)+)>
<!ELEMENT EPILOGUE (TITLE, SUBTITLE*, (STAGEDIR | SPEECH)+)>
<!ELEMENT SPEECH (SPEAKER+, (LINE | STAGEDIR | SUBHEAD)+)>
<!ELEMENT SPEAKER (#PCDATA)>
<!ELEMENT LINE (STAGEDIR | #PCDATA)+>
<!ELEMENT STAGEDIR (#PCDATA)>
<!ELEMENT SUBTITLE (#PCDATA)>
<!ELEMENT SUBHEAD (#PCDATA)>
========================================================================
<!-- DTD for testaments J. Bosak 1992.11.15, 1994.03.26,
1994.04.01, 1997.01.02 -->
<!ENTITY amp "&">
<!ELEMENT p - - (#PCDATA)>
<!ELEMENT tstmt - - (ttitle,fm,sbttitle?,preface?,(sura | book)+)>
<!ELEMENT ttitle - - ((#PCDATA),ttitle2?)*>
<!ELEMENT ttitle2 - - (#PCDATA)>
<!ELEMENT fm - - (p)+>
<!ELEMENT sbttitle - - (p)+>
<!ELEMENT preface - - (ptitle, p+)+>
<!ELEMENT ptitle - - (#PCDATA)>
<!ELEMENT book - - (bktlong, bktshort, epigraph?, bksum?, chapter+)>
<!ELEMENT sura - - (bktlong, bktshort, epigraph?, bksum?, v+)>
<!ELEMENT bktlong - - (#PCDATA)>
<!ELEMENT bktshort - - (#PCDATA)>
<!ELEMENT bksum - - (p)+>
<!ELEMENT epigraph - - (p)+>
<!ELEMENT chapter - - (chtitle, epigraph?, chsum?, v+)>
<!ELEMENT chtitle - - (#PCDATA)>
<!ELEMENT chsum - - (p)+>
<!ELEMENT v - - (vn, p)>
<!ELEMENT vn - - (#PCDATA)>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
|