Reputation: 11
i have a job ,use flink ingest data and send file format by parquet to HDFS,because of i`m use streamFileSink in Flink ,only checkpoint is success ,the file will be finished.I want to konw how long should be seted for checkpoit,What kind of parameters can refer to?
Upvotes: 1
Views: 149
Reputation: 43454
The checkpoint interval will determine
Choose whatever makes sense, given your tolerance for longer recovery times, increased latency for other processes waiting for these files to be finalized, and for larger output files.
Checkpointing also imposes some overhead on the cluster. More frequent checkpointing will impact performance.
Upvotes: 1