Reputation: 79
As of my knowledge, I know only one diffence between Hadoop 1 and 2.
Its active and passive Secondary Name Nodes.
Could some one list me the difference between Hadoop 1 and 2?
Upvotes: 3
Views: 14422
Reputation: 461
Hadoop 1
Hadoop 2
Upvotes: 15
Reputation: 11
1) Scalability - Decreasing the load on the Resource Manager(RM) by delegating the work of handling the tasks running on slaves to application Master, RM can now handle more requests than Job tracker facilitating addition of more nodes.
2) Unlike MPv1 which is strongly coupled with the MapReduce , YARN supports many kinds of code running on them like MR2,Tez, Storm, Spark etc
3) Optimized resource allocation - There are no fixed number of slots separately allocated for Mapper and Reducers in YARN, which is the case in MRv1. So the available capacity of the nodes can be used to any task which needs resources.
4) When Resource manager fails , the jobs running on the cluster need not be restarted again after the recovery of Resource Manager.
5) Fail-over mechanism is implemented by ZK which is already part of Resource manager which says, we don't need to run another daemon.
Please Look for more details here..
Upvotes: 1
Reputation: 99
There is major improvement in Hadoop Architecture in Hadoop2. Introduced distributed operating system layer known as YARN (Yet Another Resource Negotiator). Now the resource (Memory and CPU) management is handled by YARN.
Also, HA introduced for NameNode.
Upvotes: 2