Skip to content
Advertisement

How do I time the checkpointing in Apache Flink streaming?

I am running the Fraud Detector example of Apache Flink with RocksDB as my state backend. I want to know how long does Apache Flink takes to checkpoint the state.

My approach is to print time before and after the checkpoint functions.

I could not find the function/class or any piece of code that checkpoints the state I tried debugging through the IDE but in vain.

This is what I have gone through so far:

JavaScript

I step into the execute function and find many places where the code sets the config for checkpoint (such as checking the timeout interval, etc). However, I could not find the function that actually checks executes the checkpointing.

Advertisement

Answer

It won’t be easy to measure this on your own, since checkpointing is done partly in the thread that runs your user functions, and partly asynchronously in another thread.

The best way to get some information about the impact of checkpointing is to look at the checkpointing metrics, which are conveniently collected together and displayed in the web UI:

enter image description here

Normally the web UI isn’t available when you are running in the IDE, but you can change this:

JavaScript

For this to work, you’ll also have to add this dependency:

JavaScript
User contributions licensed under: CC BY-SA
2 People found this is helpful
Advertisement