Design patterns to recover system from intermediate state after crashing

Question

My system is made up of couple of components, a request typically goes through all components and each component uses own DB table to track system states.

For example, when a request arrives, component A creates a resource R by: 1. create DB row for R, marking state as "Creating" 2. application layer does the real work which may takes up to couple of minutes or hours. 3. update DB row for R, marking state as "Ready"

every component does similar things.

The problem is, the system may crash at any time and leave the system in an intermediate state. For example, resource R may remain in "Creating" after system failure.

My question is, for system like this which can not use a transaction to cover all steps(either the transaction is too long or the system is distributed), what're the design patterns or best practice to recover system?

I thought this case is very common in ERP system or any system that uses SOA.

UPDATE: The request can be resent, but the resource R which is in intermediate state 'Creating' which may have been created in real world, this is somehow like in a distributed system, a component crash causes whole system states inconsistent. what's some practice to design a system that can resync system after failure?

Design patterns to recover system from intermediate state after crashing

Answers (1)

Related Questions