Harisyam

Reputation: 107

How to find the datanode where a particular file is stored and read from while running an MR Job?

I have 9 files stored in Hadoop, each equal in size to the cluster's block length. I need to get the addresses of the datanodes where the files are stored. The replication factor is 3.

Is there a Hadoop API to do this, or any other possible way?

Upvotes: 1

Views: 1918

Answers (2)

Tanveer Dayan

Reputation: 506

To do this from Java, you can use the following class:

org.apache.hadoop.hdfs.tools.DFSck

via this method:

doWork(final String[] args)

It builds a URI internally and prints all the details to System.out.
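A minimal sketch of driving this class from Java, assuming the Hadoop client jars and the cluster's *-site.xml configs are on the classpath. Note that doWork is private in the Hadoop releases I'm aware of, so the public entry point is the Tool interface via ToolRunner, whose run() method delegates to doWork(). The path below is just the example file from the other answer.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hdfs.tools.DFSck;
    import org.apache.hadoop.util.ToolRunner;

    public class FsckBlockReport {
        public static void main(String[] args) throws Exception {
            // Picks up fs.defaultFS etc. from core-site.xml / hdfs-site.xml on the classpath
            Configuration conf = new Configuration();
            // Same arguments as the fsck CLI; the report is printed to System.out
            String[] fsckArgs = {"/user/tom/part-00007", "-files", "-blocks", "-racks"};
            int exitCode = ToolRunner.run(new DFSck(conf), fsckArgs);
            System.exit(exitCode);
        }
    }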

Upvotes: 0

Tanveer Dayan

Reputation: 506

The command to find the blocks and datanodes of a file is given below:

 hadoop fsck /user/tom/part-00007 -files -blocks -racks

This displays a result like the following:

/user/tom/part-00007 25582428 bytes, 1 block(s): OK
0. blk_-3724870485760122836_1035 len=25582428 repl=3 [/default-rack/10.251.43.2:50010,
/default-rack/10.251.27.178:50010, /default-rack/10.251.123.163:50010]

The bracketed list shows the datanodes where each block's replicas are placed.
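For a programmatic equivalent, which may be closer to the "Hadoop API" the question asks for, the same block-to-datanode mapping is exposed through FileSystem#getFileBlockLocations. A minimal sketch, again using the example path above in place of one of your 9 files:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import java.util.Arrays;

    public class BlockLocations {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            FileStatus status = fs.getFileStatus(new Path("/user/tom/part-00007"));
            // One BlockLocation per block, covering the whole file (offset 0 to length)
            BlockLocation[] blocks = fs.getFileBlockLocations(status, 0, status.getLen());
            for (BlockLocation block : blocks) {
                // getHosts() returns the hostnames of the datanodes holding each replica
                System.out.println(Arrays.toString(block.getHosts()));
            }
            fs.close();
        }
    }

Since each of your files fits in a single block, this should print one line per file with the 3 replica hosts.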

Upvotes: 2
