I have a table containing 3 million people records on which I want to perform fuzzy matching using q-grams (on surname for instance). I have created a table of 2-grams linking t
Interesting paper about indexing DNA q-grams so you don't have to scan the entire table:
www.comp.nus.edu.sg/~atung/publication/qgram_edit.pdf