How to convert List to JavaRDD

隐瞒了意图╮ 2021-02-11 15:48

We know that Spark has a method, rdd.collect, which converts an RDD to a list.

List f= rdd.collect();
String[] array = f.toArray(new String[f.si

2 Answers
  • 2021-02-11 16:34

    You're looking for JavaSparkContext.parallelize(List) and similar. This is just like in the Scala API.
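
    As a minimal sketch of that call, assuming a local master and a Spark dependency on the classpath: the class name ListToJavaRDD, the helper toRdd, the app name, and the sample values are all hypothetical and only there for illustration.

```java
import java.util.Arrays;
import java.util.List;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

public class ListToJavaRDD {
    // Hypothetical helper: distributes a local list across the cluster as a JavaRDD.
    static JavaRDD<String> toRdd(JavaSparkContext sc, List<String> input) {
        return sc.parallelize(input);
    }

    public static void main(String[] args) {
        // Assumption: run locally with two worker threads; use your own master in practice.
        SparkConf conf = new SparkConf().setAppName("list-to-rdd").setMaster("local[2]");
        JavaSparkContext sc = new JavaSparkContext(conf);
        try {
            List<String> words = Arrays.asList("a", "b", "c");
            JavaRDD<String> rdd = toRdd(sc, words);

            // The reverse direction from the question: RDD back to a list, then an array.
            String[] array = rdd.collect().toArray(new String[0]);
            System.out.println(rdd.count() + " elements: " + String.join(",", array));
        } finally {
            sc.stop();
        }
    }
}
```

    parallelize also has an overload that takes a number of slices as a second argument, which may be useful when the list is large.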

  • 2021-02-11 16:36

    Adding to Sean Owen's and others' solutions:

    You can use JavaSparkContext#parallelizePairs for a List of Tuple2:

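    import java.util.ArrayList;
    import java.util.List;
    import org.apache.spark.api.java.JavaPairRDD;
    import org.apache.spark.api.java.JavaSparkContext;
    import scala.Tuple2;

    List<Tuple2<Integer, Integer>> pairs = new ArrayList<>();
    pairs.add(new Tuple2<>(0, 5));
    pairs.add(new Tuple2<>(1, 3));

    // The no-arg constructor loads its settings from system properties (e.g. under spark-submit)
    JavaSparkContext sc = new JavaSparkContext();

    JavaPairRDD<Integer, Integer> rdd = sc.parallelizePairs(pairs);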
    