I have a class ImageInputFormat in Hadoop which reads images from HDFS. How to use my InputFormat in Spark?
ImageInputFormat
Here is my ImageInputFormat:
The SparkContext has a method called hadoopFile. It accepts classes implementing the interface org.apache.hadoop.mapred.InputFormat
hadoopFile
org.apache.hadoop.mapred.InputFormat
Its description says "Get an RDD for a Hadoop file with an arbitrary InputFormat".
Also have a look at the Spark Documentation.