问题 I have a Dataset that i created from a RDD and try to join it with another Dataset which is created from my Phoenix Table : val dfToJoin = sparkSession.createDataset(rddToJoin) val tableDf = sparkSession .read .option("table", "table") .option("zkURL", "localhost") .format("org.apache.phoenix.spark") .load() val joinedDf = dfToJoin.join(tableDf, "columnToJoinOn") When i execute it, it seems that the whole database table is loaded to do the join. Is there a way to do such a join so that the