Reputation: 903
I'm a beginner on Hadoop and I can't understand how we distribute data over HDFS:
1- Manually
2- with YARN
3- other ways to do it
4- Automatically
Upvotes: 1
Views: 39
Reputation: 5541
Data is automatically replicated when it is added to HDFS. Simply executing the command
hdfs dfs -copyFromLocal <local-file> <hdfs-location>
will place a file on the HDFS with a replication factor of 3 (by default). Replication is completely automated and abstracted away from the user.
Upvotes: 1