OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   Re: [xml-dev] [OT] Looking for a text algorithm

[ Lists Home | Date Index | Thread Index ]

Rick Jelliffe writes:

 > The soundex and multiphone algorithms convert strings to sound
 > equivalents: a kind of multiphone algorithm is probably similar to
 > what you are looking for.  These were created to allow hashed
 > lookup up family names based on sounds, and (in the case of soundex
 > at least) work on the assumption that spelling mistakes are more
 > common later in words than earlier.

Thank you.  I am familiar with soundex, but unlike a hashing
algorithm, it does not produce a fixed-length key -- all other things
(proportion of vowels and significant clusters) being the same,
a longer string will have a longer key.

All the best,


David Megginson, david@megginson.com, http://www.megginson.com/


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS