Quarkus resilience best practice

Question

I have a use case like the following:

One Quarkus microservice is responsible for talking with several other fixed APIs (e.g. ArgoCD REST API, Standard Corporate Driven API) to bring the whole system in the desired state.

The whole request needs to transactional, which means that either all API requests need to be successful or rolled back in case of any error.

If the APIs give errors back, the case is clear for me. "Just revert everything you had done before", but what happens if my Quarkus application crashes?

An example:

Quarkus application receives a POST request on an endpoint which starts the following tasks:

read a resource on REST-API 1
create a resource on REST-API 2 (ArgoCD API)
create a resource on REST-API 3 (Standard Corporate API)

If my Quarkus application dies after step two, it will leave my application in an inconsistent state.

For example, my second request will create an ArgoCD-Application using its REST API, if then the 3rd request fails, I have to delete the created application again to bring the system back to a consistent state.

The LRA approach is not applicable here because the ArgoCD Rest API does not implement the LRA API.

So at least, i have to maintain the state and the compensation logic within my Quarkus application. However, I need to persist the state of my transaction anywhere to recover from it after a failure.

My current solution uses a Redis database besides to persist the state of each transactions until it is finished, but I was wondering if I missed some standard solution matching my use case.

Quarkus resilience best practice

An example:

Answers (1)

Related Questions