问题
I am trying to read form a very big databse which consists of geo-referenced time series data. SO I have the file in the following format:
latitude,longitude,value@time1,value@time2,....value@timeN.
So this is the data for the entire earth. Now for my work I need to get the latitude,longitude as the key and the time series values as the value. As far as I know hadoop has KeyValueInputFormat but it considers first tab as the delimiter. Is there a way to customize it.
Really need a solution for this.
Thanks Ayush
回答1:
Play around with
key.value.separator.in.input.line
in job config.
来源:https://stackoverflow.com/questions/11689972/customizing-inputformat-in-hadoop