I need to run this code, that returns the most similar domain in y for each sublist in x. The problem is that my real x list has 40k sublists that vary in size and my y list