SaCvP

Reputation: 455

Which component is best for moving data from HDFS into Hive with some data transformation?

I need to load some data from HDFS into Hive, but I also need to do some aggregations across the files that I have in HDFS. I read that Sqoop can do that, but only with MySQL. What other options do I have?

Thanks!

Upvotes: 0

Views: 91

Answers (1)

Jared

Reputation: 2954

Your best option would be to create an external table in Hive that points at your files in HDFS. Then create a second Hive table to hold the aggregated data, and populate it with a Hive SQL `INSERT ... SELECT` that does the aggregation.
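A minimal sketch of that approach follows. The question does not show the file layout, so the HDFS path, the table names (`raw_events`, `customer_totals`), the columns, and the comma delimiter are all hypothetical and would need to match the real data.

```sql
-- Assumption: comma-delimited text files live under this HDFS directory.
-- LOCATION must point at a directory, not at a single file.
CREATE EXTERNAL TABLE IF NOT EXISTS raw_events (
  customer_id STRING,
  amount      DOUBLE
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/data/raw/events';

-- Managed table that will hold the aggregated result.
CREATE TABLE IF NOT EXISTS customer_totals (
  customer_id  STRING,
  total_amount DOUBLE,
  event_count  BIGINT
)
STORED AS ORC;

-- Aggregate across all files in the directory and store the result.
INSERT OVERWRITE TABLE customer_totals
SELECT customer_id,
       SUM(amount) AS total_amount,
       COUNT(*)    AS event_count
FROM raw_events
GROUP BY customer_id;
```

Because the first table is external, dropping it later removes only the Hive metadata and leaves the files in HDFS untouched.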

Upvotes: 1
