> On Jul 2, 2015, at 9:11 AM, Ihe Onwuka <ihe.onwuka@gmail.com> wrote:
>
> The Zorba cleaning library is the most immediately interesting.
Note for other people since I had this discussion privately with Ihe: Zorba had a data cleaning
library good for similarity search, edit distance, similar sounds, spelling mistakes, etc.
It’s useful, implemented in pure XQuery 1.0 and Apache license., hence could be used in any XSLT/Xquery engine.
So if it’s useful to you, please use it.
Best
Dana