The "old" SparkContext.hadoopFile takes a minPartitions argument, which is a hint for the number of partitions:
def hadoopFile[K, V](
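As a rough illustration of passing that hint, here is a minimal sketch using the old `org.apache.hadoop.mapred` input format. The input path, the local master, the app name, and the value 16 are all assumptions chosen for the example. Since minPartitions is only a hint, the resulting partition count may differ.

```scala
import org.apache.hadoop.io.{LongWritable, Text}
import org.apache.hadoop.mapred.TextInputFormat // old-API input format
import org.apache.spark.{SparkConf, SparkContext}

object MinPartitionsSketch {
  def main(args: Array[String]): Unit = {
    // Local master and app name are placeholders for this sketch.
    val sc = new SparkContext(
      new SparkConf().setAppName("min-partitions-sketch").setMaster("local[*]"))

    val rdd = sc.hadoopFile(
      "hdfs:///path/to/input",   // hypothetical path
      classOf[TextInputFormat],
      classOf[LongWritable],
      classOf[Text],
      minPartitions = 16)        // assumed value; a hint, not a guarantee

    // Inspect how many partitions were actually created.
    println(s"partitions: ${rdd.partitions.length}")
    sc.stop()
  }
}
```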
The newAPIHadoopFile function lets you pass a Configuration object, in which you can set mapred.max.split.size.
Even though this property is in the mapred namespace, there is seemingly no newer equivalent, so I would expect the new API to respect it.
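The configuration-based approach might look like the sketch below. The input path, the 64 MB cap, the local master, and the app name are assumptions for illustration. Whether mapred.max.split.size is honored depends on the Hadoop version in use, so it is worth checking the resulting partition count rather than taking it for granted.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.io.{LongWritable, Text}
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat // new-API input format
import org.apache.spark.{SparkConf, SparkContext}

object MaxSplitSizeSketch {
  def main(args: Array[String]): Unit = {
    // Local master and app name are placeholders for this sketch.
    val sc = new SparkContext(
      new SparkConf().setAppName("max-split-size-sketch").setMaster("local[*]"))

    // Copy the context's Hadoop settings so the override applies to this read only.
    val hadoopConf = new Configuration(sc.hadoopConfiguration)
    // Assumed value: cap each input split at 64 MB.
    hadoopConf.setLong("mapred.max.split.size", 64L * 1024 * 1024)

    val rdd = sc.newAPIHadoopFile(
      "hdfs:///path/to/input",   // hypothetical path
      classOf[TextInputFormat],
      classOf[LongWritable],
      classOf[Text],
      hadoopConf)

    // Verify the effect empirically: smaller splits should yield more partitions.
    println(s"partitions: ${rdd.partitions.length}")
    sc.stop()
  }
}
```

Copying the configuration rather than mutating `sc.hadoopConfiguration` keeps the split-size override from leaking into other reads on the same context.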