I\'m trying to come up with a method of finding duplicate addresses, based on a similarity score. Consider these duplicate addresses:
addr_1 = \'# 3 FAIRMONT LIN
This should be helpful in building your dictionary of abbreviations:
http://www.usps.com/ncsc/lookups/usps_abbreviations.html