Reputation: 756
We have a database filled with OCRed data and manually typed data.
When searching with the CONTAINS command, not all of the expected results appear. For example, a search for "monkey man" doesn't return records that contain "m0nkey man" or "momkey man" in the data.
Is there a way to allow for these issues in the data?
I've had a cursory glance at Lucene.NET and Soundex but can't see these being of much use.
Thanks for any ideas
Upvotes: 7
Views: 13094
Reputation: 905
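I believe you are looking for something called fuzzy matching: instead of requiring an exact match, you accept any record whose text is within a small edit distance of the search term.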
Similar post:
Maybe useful:
http://web.archive.org/web/20100209050309/http://anastasiosyal.com/archive/2009/01/11/18.aspx
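To give a rough idea of what fuzzy matching means here, below is a minimal sketch in Python using Levenshtein (edit) distance. This is not taken from the linked article, and it runs outside the database rather than inside a CONTAINS query. The function names `levenshtein` and `fuzzy_contains` and the `max_distance` threshold are my own, chosen just for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character edits needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution
            ))
        previous = current
    return previous[-1]


def fuzzy_contains(text: str, query: str, max_distance: int = 1) -> bool:
    """True if some run of consecutive words in text matches the query,
    allowing up to max_distance edits per word (hypothetical threshold)."""
    words = text.lower().split()
    query_words = query.lower().split()
    n = len(query_words)
    for start in range(len(words) - n + 1):
        window = words[start:start + n]
        if all(levenshtein(w, q) <= max_distance
               for w, q in zip(window, query_words)):
            return True
    return False


records = ["the m0nkey man was here", "a momkey man appeared", "donkey kong"]
matches = [r for r in records if fuzzy_contains(r, "monkey man")]
```

With a threshold of one edit per word, both "m0nkey man" and "momkey man" match "monkey man", while "donkey kong" does not. Comparing every record this way is slow on a large table, so in practice you would probably narrow the candidates first and only then apply the distance check.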
Upvotes: 7