Heta Desai
Heta Desai

Reputation: 57

can we write MapReduce job to read data from oracle and MySql?

I want to read data from Oracle and Mysql. Then I need to Compare data which are read from Oracle and Mysql and the output will be stored in MSSQL server. I want to perform all this using MapReduce job in c#.

If I have configured Hadoop multinode cluster and I am executing MapReduce job to read data from Oracle and MySQL, data which MapReduce reads will distribute across the memory of multiple nodes in the cluster.?

Upvotes: 0

Views: 102

Answers (1)

KrazyGautam
KrazyGautam

Reputation: 2692

Yess you can use MR to do all you have said OR you can use some inbuilt tools .

  1. Use Sqoop to read data from DB and dump it on to HDFS.
  2. Use MR to read data from HDFS and perform the business logic.
  3. The output of MR is saved in HDFS and again using SQoop pushed into DB.

Sqoop is a wrapper over MapReduce to pull/push data to Database. You can always write your own custom MapReduce to do the same.

Upvotes: 0

Related Questions