Reputation: 51
The checksum of an HDFS block is stored in a local file alongside the raw content of the block, on each datanode that holds a replica.
I am wondering: is the checksum of a block also stored in the namenode, as part of the file's metadata?
Upvotes: 1
Views: 1149
Reputation: 872
The short answer: checksums are stored on the datanodes.
Explanation:
See "Hadoop: The Definitive Guide", 4th edition, page 98.
Upvotes: 0
Reputation: 2384
No. The checksum is stored only alongside the blocks on the slave nodes (also called DataNodes).
From the Apache documentation for HDFS:
Data Integrity
It is possible that a block of data fetched from a DataNode arrives corrupted. This corruption can occur because of faults in a storage device, network faults, or buggy software.
It works in the following manner.
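To make the idea concrete, here is a minimal sketch of per-chunk checksum verification. It is not HDFS code: the function names are made up for illustration, the 512-byte chunk size is an assumption (I believe it matches the default of `dfs.bytes-per-checksum`), and `zlib.crc32` stands in for whatever checksum type the cluster is configured with.

```python
import zlib

# Assumed chunk size for this sketch; the real value is a cluster setting.
BYTES_PER_CHECKSUM = 512


def chunk_checksums(data, chunk_size=BYTES_PER_CHECKSUM):
    """Return one CRC32 value per fixed-size chunk of the block data."""
    return [
        zlib.crc32(data[i:i + chunk_size])
        for i in range(0, len(data), chunk_size)
    ]


def find_corrupt_chunks(data, stored, chunk_size=BYTES_PER_CHECKSUM):
    """Recompute the checksums and return the indices of chunks that differ."""
    actual = chunk_checksums(data, chunk_size)
    if len(actual) != len(stored):
        raise ValueError("checksum count does not match data length")
    return [i for i, (a, s) in enumerate(zip(actual, stored)) if a != s]


# Hypothetical block content: 2048 bytes, so 4 chunks of 512 bytes.
block = bytes(range(256)) * 8
stored = chunk_checksums(block)      # what would sit next to the block on disk

# Simulate corruption of a single byte inside the second chunk.
damaged = bytearray(block)
damaged[600] ^= 0xFF
bad_chunks = find_corrupt_chunks(bytes(damaged), stored)
```

The point of the sketch is that verification only needs the block data and the checksums stored next to it, which is why nothing about it has to live in the namenode's file metadata.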
Upvotes: 2