- To: ricko@allette.com.au (Rick Jelliffe)
- Subject: Re: [xml-dev] Re: English sentences, was: Re: [xml-dev] Announce: XML Schema,
- From: John Cowan <jcowan@reutershealth.com>
- Date: Thu, 4 Jul 2002 22:33:33 -0400 (EDT)
- Cc: xml-dev@lists.xml.org ('xml-dev')
- In-reply-to: <024701c21de3$8515df20$4bc8a8c0@AlletteSystems.com> from "Rick Jelliffe" at Jun 28, 2002 12:04:14 AM
Rick Jelliffe scripsit:
> See http://www.alis.com/castil/silc/?AlisTargetHost=http://www.alis.com:8080
> for the commercialization.
For which they want US$15K per box.....
> I have just been looking for public domain tables giving the likelihood of
> various trigrams [...]
> Lots of papers reference them, but it looks like a definitive collection
> has not come yet. (One good approach to doing this would be to take the
> spelling tables from aspell and generate them.)
Actually I think not: for this purpose you want frequencies in running
text, not in spelling lists, which obviously have far too many rare
words in them.
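
A minimal sketch of the difference, in Python. The sample sentence and the function names are invented for illustration, and nothing here is taken from aspell or from any published trigram table. It counts letter trigrams within words, once over running text and once over the deduplicated word list built from the same text, so the two sets of relative frequencies can be compared.

```python
from collections import Counter


def trigram_counts(text):
    """Count overlapping letter trigrams inside each whitespace-separated word."""
    counts = Counter()
    for word in text.lower().split():
        # Keep letters only, so punctuation does not form trigrams.
        letters = "".join(ch for ch in word if ch.isalpha())
        for i in range(len(letters) - 2):
            counts[letters[i:i + 3]] += 1
    return counts


def relative_frequencies(counts):
    """Turn raw counts into fractions of the total trigram count."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tri: n / total for tri, n in counts.items()}


# Hypothetical sample data: a running text, and the word list derived from it
# (each distinct word once, as in a spelling dictionary).
running_text = "the cat and the dog saw the zebra"
word_list = " ".join(sorted(set(running_text.split())))

text_freq = relative_frequencies(trigram_counts(running_text))
list_freq = relative_frequencies(trigram_counts(word_list))

for tri in ("the", "zeb"):
    print(tri, text_freq[tri], list_freq[tri])
```

In this toy sample "the" accounts for 3 of the 10 trigrams in the running text but only 1 of 8 in the word list, while the trigrams of the rare word "zebra" gain weight in the list. A real table would need a large corpus, but the skew runs the same way.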
--
John Cowan jcowan@reutershealth.com
At times of peril or dubitation, http://www.ccil.org/~cowan
Perform swift circular ambulation, http://www.reutershealth.com
With loud and high-pitched ululation.