I'm working with PySpark and I need to compare the contents of two files, i.e. do a diff check. So I split my tests into two: