I have a large number of sets of numbers. Each set contains 10 numbers, and I need to remove every set that shares 5 or more numbers (in any order) with any other set.
You should find the Pearson correlation coefficient between each pair of sets. This method will make your program easily scalable to huge data sets.
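For comparison, here is a minimal sketch of the matching rule exactly as the question states it. Note that it counts shared numbers directly with a set intersection; it is not the Pearson coefficient. The function name `remove_overlapping`, the `threshold` parameter, and the sample data are made up for illustration, and the code assumes that when two sets match, both are removed.

```python
from itertools import combinations


def remove_overlapping(sets, threshold=5):
    """Return the sets that share fewer than `threshold` numbers with every other set."""
    # frozenset ignores order and makes the intersection cheap to compute.
    frozen = [frozenset(s) for s in sets]
    flagged = set()  # indices of sets that matched at least one other set

    # Compare every unordered pair once.
    for (i, a), (j, b) in combinations(enumerate(frozen), 2):
        if len(a & b) >= threshold:
            # Assumption: both members of a matching pair are removed.
            flagged.update((i, j))

    return [s for k, s in enumerate(sets) if k not in flagged]


# Hypothetical sample data: four sets of 10 numbers each.
groups = [
    [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
    [6, 7, 8, 9, 10, 11, 12, 13, 14, 15],      # shares 5 numbers with the first set
    [21, 22, 23, 24, 25, 26, 27, 28, 29, 30],  # shares nothing with the others
    [1, 2, 3, 4, 31, 32, 33, 34, 35, 36],      # shares only 4 numbers with the first set
]

kept = remove_overlapping(groups)
```

This compares every pair, so the work grows quadratically with the number of sets. For a very large collection you would probably want something smarter, such as an index from each number to the sets that contain it, so that only sets with at least one number in common are ever compared.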