Amal Kostali Targhi

Reputation: 903

Is YARN responsible for distributing data over HDFS?

I'm a beginner with Hadoop and I can't understand how data gets distributed over HDFS. Is it done:

1- Manually
2- With YARN
3- Some other way
4- Automatically

Upvotes: 1

Views: 39

Answers (1)

Ben Watson

Reputation: 5541

Data is automatically replicated when it is added to HDFS. Simply executing the command

hdfs dfs -copyFromLocal <local-file> <hdfs-location>

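will place the file on HDFS with a replication factor of 3 (by default). Replication is completely automated and abstracted away from the user. YARN plays no part in this: it schedules compute resources for applications, while the HDFS NameNode decides which DataNodes hold each block replica.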
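To see the replication for yourself, something like the following sketch may help. The wrapper function names (`put_with_replication`, `show_replication`, and so on) are hypothetical helpers made up for illustration; only the `hdfs` commands inside them come from Hadoop. It assumes the `hdfs` client is on your PATH and configured for your cluster.

```shell
#!/bin/sh
# Sketch only: these wrapper names are hypothetical, not part of Hadoop.
# Each one just forwards to a standard hdfs command.

# Upload a file with an explicit replication factor for this one command,
# overriding the configured dfs.replication default.
put_with_replication() {
    hdfs dfs -D dfs.replication="$1" -copyFromLocal "$2" "$3"
}

# Print the replication factor recorded for a file (%r).
show_replication() {
    hdfs dfs -stat %r "$1"
}

# Change the replication factor of an existing file;
# -w waits until the change has been applied.
set_replication() {
    hdfs dfs -setrep -w "$1" "$2"
}

# List the file's blocks and the DataNodes holding each replica.
show_block_locations() {
    hdfs fsck "$1" -files -blocks -locations
}
```

For example, `show_replication /user/me/data.csv` (a made-up path) should print the file's replication factor, and `show_block_locations` on the same path shows which DataNodes each block landed on. None of these steps involve YARN.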

Upvotes: 1
