Does Flink RocksDB statebackend help restoring state?

Question

I'm considering using RocksDB as a statebackend of flink job which has state size up to 1TB.

My environment

checkpoint dir: hdfs
flink job submit: yarn-per-job (per-job mode on yarn cluster)

If the job fails and retry attempts exceed maximum retry count and the job completely dies (or canceling the job), I think the checkpoint and the rocksdb file will be deleted(because I'm deploying job as per-job-mode and the task manager would also terminate).
Here, I think I lose all state and have no way to restore the state but I expect using RocksDB would help something to restore the state because it is a disk based statebackend. If not, what is the advantage of using RocksDB statebackend?

Would retaining the checkpoint on cancellation and restart the job from the checkpoint(or savepoint) help in this case?
Thank you

Does Flink RocksDB statebackend help restoring state?

Answers (1)

Related Questions