Reputation: 11691
Is there anything like HDFS's DataBlockScanner for Cassandra, ie. an automatic mechanism that checks for corrupted data that doesn't get read often?
Upvotes: 2
Views: 557
Reputation: 16400
Best bet is to use nodetool verify
to compare the hash of the sstable with the content. Particularly with nodetool verify -e
to walk the individual cells.
https://docs.datastax.com/en/cassandra/3.0/cassandra/tools/toolsVerify.html
Upvotes: 0
Reputation: 11691
No.
Cassandra doesn't do that automatically - it can guarantee consistency on read or write via ConsistencyLevel on each query, and it can run active (AntiEntropy) repairs. But active repairs must be scheduled (by human or cron or by third party script like http://cassandra-reaper.io/), and to be pedantic, repair only fixes consistency issue, there's some work to be done to properly address/support fixing corrupted replicas (for example, repair COULD send a bit flip from one node to all of the others)
Big data as a trash can. Cool.
Upvotes: 2