I am trying to calculate Jaccard distance between certain ids with their attributes in the form of SparseVectors.
from pyspark.ml.feature import MinHashLSH fr