For strings you can use approximate matching algorithm.
- Generate a random string
- For all the strings compute their distance from that random shared string using an algorithm like http://www.dotnetperls.com/levenshtein
If the strings are equidistant from a reference string then chances are that they are similar to each other. And there you go you have a locality senitive hash implementation for strings.
You can create different hash buckets for a range of distances.
EDIT: You can try other variations of string distance. A simpler algorithm would just return no. of common characters between two strings.