adithyan .p
adithyan .p

Reputation: 121

how hdfs and hive are related?

I have psudeo distributed hadoop environment and using sqoop and hive for data importing and analysis.

I used sqoop to import data from mysql to hive and it worked, i got data in my hive location which is /user/hive/warehouse/ and i can able to see the data in hive table.

my doubts are :

  1. will the data stored in hadoop when i import from mysql to hive(build on top of hadoop)?
  2. why it is creating /user/hive/warehouse in my hdfs?
  3. how hive would work on top of data?
  4. if i update data in mysql , i can use sqoop incremental import to update the same in hdfs but if i directly use hive import it will reflect in hive table but not not in hdfs, is my understanding correct?

I am confuse while performing data ingestion using sqoop and hdfs concept

Can somebody help me on this?

Upvotes: 0

Views: 240

Answers (1)

sandeep rawat
sandeep rawat

Reputation: 4957

Answers

1- Data is stored in HDFS only.

2- Property "hive.metastore.warehouse.dir" at hive-default.xml value tells where data will be stored default value is "/user/hive/warehouse"

3- User Query are converted on MapReduse job more detail

4- You can use SQOOP for incremental data load and all data will be stored on HDFS and if you have created table on the it will reflect.

Upvotes: 2

Related Questions