I am already compressing RDDs using conf.set("spark.rdd.compress","true") and persist(MEMORY_AND_DISK_SER). Will using Kryo serialization make the
Another point to consider: Kryo is faster than the default Java serialization at both serializing and deserializing, so it is generally worth using. The gain may be smaller than advertised, though, because other factors also affect program speed, such as how you write your Spark code and which libraries you choose.
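
For reference, here is a minimal sketch of turning Kryo on alongside the settings from the question. It assumes a Scala application running in local mode. The Event class, the app name and the sample data are made up for illustration and are not from the original post. Registering classes is optional, but it usually keeps the serialized data smaller because Kryo does not have to store the full class name with each object.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.storage.StorageLevel

// Hypothetical record type, used only for illustration.
case class Event(id: Long, payload: String)

object KryoExample {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("kryo-example") // hypothetical app name
      .setMaster("local[*]")      // assumption: local run for testing
      // Replace the default Java serializer with Kryo.
      .set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
      // Same compression setting as in the question.
      .set("spark.rdd.compress", "true")
      // Optional: register the classes that will be serialized.
      .registerKryoClasses(Array[Class[_]](classOf[Event]))

    val sc = new SparkContext(conf)
    try {
      // Made-up sample data.
      val events = sc.parallelize(1L to 100000L).map(i => Event(i, s"payload-$i"))
      // Serialized storage level, so cached partitions go through Kryo.
      events.persist(StorageLevel.MEMORY_AND_DISK_SER)
      println(events.count())
    } finally {
      sc.stop()
    }
  }
}
```

To see whether it helps in your case, run the same job with and without the spark.serializer line and compare the cached size in the Storage tab of the Spark UI and the total run time.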