Reputation: 455
I need to load some data from HDFS to Hive. But I need to some aggregations between the files that I've in HDFS. I read that Sqoop can do that but only using MySQL. Which another choices I've to do this?
Thanks!
Upvotes: 0
Views: 91
Reputation: 2954
Your best option would be to create an external table in Hive that sources from your files in HDFS. Then you can create a Hive table to store your aggregated data and some Hive SQL to do the insert into that table.
Upvotes: 1