Why does the Mongo Spark connector return different and incorrect counts for a query?

忘掉有多难 2021-01-12 11:07

I'm evaluating the Mongo Spark connector for a project and I'm getting inconsistent results. I use MongoDB server version 3.4.5, Spark (via PySpark) version 2.2.0, Mongo S

2 Answers
  •  说谎 (original poster)
    2021-01-12 11:27

    This issue was mostly due to the SPARK-151 bug in version 2.2.0 of the Mongo Spark Connector. It is resolved in version 2.2.1, which I have confirmed. You can continue to use the default partitioner with 2.2.1.
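
One way to check the upgrade on your own data is to run the same count several times and confirm the results agree. The following is only a rough sketch, not the exact code from the question: the helper names (`repeated_counts`, `counts_are_consistent`, `mongo_count`) and the app name are made up for illustration, and the package coordinate assumes the Scala 2.11 build of the connector, which you should adjust to match your Spark distribution.

```python
from typing import Callable, List

# Assumption: Scala 2.11 build of the connector; change the suffix if your
# Spark distribution was built against a different Scala version.
CONNECTOR_PACKAGE = "org.mongodb.spark:mongo-spark-connector_2.11:2.2.1"


def repeated_counts(count_fn: Callable[[], int], runs: int = 5) -> List[int]:
    """Call count_fn `runs` times and return every result (hypothetical helper)."""
    return [count_fn() for _ in range(runs)]


def counts_are_consistent(counts: List[int]) -> bool:
    """Return True when every run produced the same count."""
    return len(set(counts)) <= 1


def mongo_count(uri: str) -> int:
    """Count the documents of the collection named in `uri` via the connector.

    pyspark is imported lazily so the helpers above can be used and tested
    on a machine without Spark installed.
    """
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .appName("mongo-count-check")  # hypothetical app name
        .config("spark.jars.packages", CONNECTOR_PACKAGE)
        .getOrCreate()
    )
    df = (
        spark.read.format("com.mongodb.spark.sql.DefaultSource")
        .option("uri", uri)
        .load()
    )
    return df.count()
```

With a reachable MongoDB instance, something like `counts_are_consistent(repeated_counts(lambda: mongo_count(uri)))` should return `True` on 2.2.1 if the bug was the cause, where `uri` is your own connection string including database and collection.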
