OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

 


 

   RE: intercepting internal entities?

[ Lists Home | Date Index | Thread Index ]
  • From: David Megginson <david@megginson.com>
  • To: "'xml-dev@ic.ac.uk'" <xml-dev@ic.ac.uk>
  • Date: Thu, 29 Apr 1999 15:15:22 -0400 (EDT)

DuCharme, Robert writes:

 > Well, almost all. I couldn't find a mapping that let me output to
 > 7-bit ASCII (e.g. map auml, agrave, aacute, and acirc all to "a")
 > but I can see from the AnselInputStreamReader class that comes with
 > Mike Kay's GedML package (which demos his SAXON library) how to
 > derive my own StreamReader with my own mapping table.  Look out
 > umlauts.

Actually, for German, a-umlaut should map to 'ae', not 'a' (and
o-umlaut to 'oe', and u-umlaut to 'ue').  Likewise, 'ß' should map to
'ss'.  So, if you want to recode Goethe's line

  ... daß beide Männer recht haben möchten ...

into US-ASCII, you would need to put out

  ... dass beide Maenner recht haben moechten ...

The alternative, "das beide Manner recht haben mochten" looks very
wrong to me, though I'm not a native (or even a very good) German
speaker, and perhaps tastes have changed.  It's interesting to note,
though, that the '¨' in old or middle high German was (I think)
originally just a tiny 'e' writting above the vowel by the scribe.

This stuff is always harder than you expect.  I don't know the rules
in Finnish for example, but I'm willing to bet that they handle
ASCII-fication completely differently.  And, of course, once you start
transliterating non-Roman characters, the real fun begins.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)





 

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS