Unexpected behaviour of pyspark `samplingRatio` while reading csv

前端 未结 0 1232
半阙折子戏
半阙折子戏 2021-01-17 11:45

I want to read a billions-of-rows csv file while also inferring the schema:

df = spark.read.csv(\'s3://bucket/data/*\', inferSchema=True, samplingRatio=0.0001         


        
相关标签:
回答
  • 消灭零回复
提交回复
热议问题