Is it possible to save files in Hadoop without saving them in local file system? I would like to do something like shown below however I would like to save file directly in
Hadoop has REST APIs that allow you to create files via WebHDFS.
So you could write your own create
based on the REST APIs using a python library like requests
for doing the HTTP. However, there are also several python libraries that support Hadoop/HDFS and already use the REST APIs or that use the RPC mechanism via libhdfs
.
Just make sure you look for how to create a file rather than having the python library call hdfs dfs -put
or hadoop fs -put
.
See the following for more information: