Reputation: 603
I want to fetch end-of-day (EOD) stock prices from Yahoo/Google Finance every day and store them directly in a file in HDFS.
I can later create an external table on top of that file (using Hive) and use it for further analysis.
So I am not looking for basic MapReduce, since I don't have an input file as such. Are there any connectors available in Python that can write data to Hadoop?
Upvotes: 1
Views: 2894
Reputation: 9067
Start with dumping your data in a local file. Then find a way to upload the file to HDFS.
If the Hadoop client is installed on the machine running your job, use the HDFS command line:
hdfs dfs -put data.txt /user/johndoe/some/hdfs/dir/
Otherwise, use an HTTP client (for example the curl command line) to connect to the HDFS REST service, which could be either WebHDFS or HttpFS depending on how the cluster has been set up, and upload the file with a PUT request:
http://namenode:port/webhdfs/v1/user/johndoe/some/hdfs/dir/data.txt?op=CREATE&overwrite=false
(with the content of "data.txt" as the payload, of course)
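Since the question asks about Python, here is a minimal sketch of both upload routes using only the standard library. The function names (put_with_cli, build_create_url, put_with_webhdfs) are made up for illustration, and the host and port are whatever your cluster uses. The sketch assumes plain WebHDFS without Kerberos, where the CREATE call is typically answered with a 307 redirect to a datanode and the file content goes in a second PUT to that location; HttpFS and secured clusters may need extra parameters that are not handled here.

```python
import http.client
import subprocess
from urllib.parse import quote, urlencode, urlsplit


def put_with_cli(local_path, hdfs_dir):
    """Route 1: call the HDFS client (requires Hadoop installed locally)."""
    subprocess.run(["hdfs", "dfs", "-put", local_path, hdfs_dir], check=True)


def build_create_url(host, port, hdfs_path, overwrite=False):
    """Build the WebHDFS CREATE URL for an absolute HDFS path."""
    query = urlencode({"op": "CREATE", "overwrite": str(overwrite).lower()})
    return f"http://{host}:{port}/webhdfs/v1{quote(hdfs_path)}?{query}"


def _put(url, body=None):
    """Send one PUT without following redirects; return (status, Location)."""
    parts = urlsplit(url)
    conn = http.client.HTTPConnection(parts.hostname, parts.port)
    try:
        target = parts.path + ("?" + parts.query if parts.query else "")
        headers = {}
        if body is not None:
            headers["Content-Type"] = "application/octet-stream"
        conn.request("PUT", target, body=body, headers=headers)
        resp = conn.getresponse()
        resp.read()  # drain the response before closing the connection
        return resp.status, resp.getheader("Location")
    finally:
        conn.close()


def put_with_webhdfs(local_path, host, port, hdfs_path):
    """Route 2: two-step WebHDFS upload (assumes no authentication)."""
    # Step 1: ask the namenode where to write; no file data is sent yet.
    status, location = _put(build_create_url(host, port, hdfs_path))
    if status != 307 or not location:
        raise RuntimeError(f"expected a 307 redirect, got HTTP {status}")
    # Step 2: send the file content to the location we were redirected to.
    with open(local_path, "rb") as f:
        status, _ = _put(location, body=f.read())
    if status != 201:
        raise RuntimeError(f"upload failed with HTTP {status}")
```

With a daily dump saved as data.txt, a call would look like put_with_webhdfs("data.txt", "namenode", port, "/user/johndoe/some/hdfs/dir/data.txt"), with port replaced by your cluster's WebHDFS port. The file is read fully into memory, which is fine for a small EOD price file but not for large uploads.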
Upvotes: 1