Mikhas

Reputation: 882

Snapshot taking and restore strategies

I've been reading about CQRS+EventSourcing patterns (which I wish to apply in the near future), and one point common to all the decks and presentations I found is to take snapshots of your model state in order to restore it, but none of them shares patterns/strategies for doing that.

I wonder if you could share your thoughts and experience in this matter.

TL;DR: How have you implemented Snapshotting in your CQRS+EventSourcing application? Pros and Cons?

Upvotes: 24

Views: 8656

Answers (2)

VoiceOfUnreason

Reputation: 57287

  • Rule #1: Don't.
  • Rule #2: Don't.

Snapshotting an event sourced model is a performance optimization. The first rule of performance optimization? Don't.

Specifically, snapshotting reduces the amount of time you lose in your repository trying to reload the history of your model from your event store.

If your repository can keep the model in memory, then you aren't going to be reloading it very often. So the win from snapshotting will be small. Therefore: don't.

If you can decompose your model into aggregates, which is to say that you can decompose the history of your model into a number of entities that have non-overlapping histories, then your one long model history becomes many short histories that each describe the changes to a single entity. Each entity history that you need to load will be pretty short, so the win from a snapshot will be small. Therefore: don't.

The kind of systems I'm working on today require high performance but not 24x7 availability. So in a situation where I shut down my system for maintenance and restart it, I'd have to load and reprocess my entire event store, as my fresh system doesn't know which aggregate IDs to process the events for. I need a better starting point so that restarting my systems is more efficient.

You are worried about missing a write SLA when the repository memory caches are cold, and you have long model histories with lots of events to reload. Bolting on snapshotting might be a lot more reasonable than trying to refactor your model history into smaller streams. OK....

The snapshot store is a read model -- at any point in time, you should be able to blow away the model and rebuild it from the persisted history in the event store.

From the perspective of the repository, the snapshot store is a cache; if no snapshot is available, or if the store itself doesn't respond within the SLA, you want to fall back to reprocessing the entire event history, starting from the initial seed state.

The service provider interface is going to look something like

interface SnapshotClient {
    // Returns the information the repository needs to consume the snapshot for this id.
    SnapshotRecord getSnapshot(Identifier id);
}

SnapshotRecord is going to provide the repository with the information it needs to consume the snapshot. That's going to include, at a minimum:

  1. a memento that allows the repository to rehydrate the snapshotted state
  2. a description of the last event processed by the snapshot projector when building the snapshot.
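In code, such a record might look something like the sketch below; the field names are illustrative only, not prescribed by anything above:

// Illustrative sketch of a snapshot record; the exact shape is up to you.
class SnapshotRecord {
    final byte[] memento;         // opaque state the repository knows how to re-hydrate
    final long lastEventVersion;  // identifies the last event folded into this snapshot

    SnapshotRecord(byte[] memento, long lastEventVersion) {
        this.memento = memento;
        this.lastEventVersion = lastEventVersion;
    }
}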

The model will then re-hydrate the snapshotted state from the memento, load the history from the event store, scan backwards (i.e., starting from the most recent event) looking for the event documented in the SnapshotRecord, and then apply the subsequent events in order.
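Putting those pieces together, the repository's load path might look roughly like the sketch below. SnapshotClient and SnapshotRecord are as above, while EventStore, Model, Event, and their methods are assumed for illustration; for brevity it reads forward from the recorded version rather than scanning backwards:

// Sketch only: the snapshot store is treated as a cache, with full replay as the fallback.
class SnapshotAwareRepository {
    private final SnapshotClient snapshotClient;
    private final EventStore eventStore;

    SnapshotAwareRepository(SnapshotClient snapshotClient, EventStore eventStore) {
        this.snapshotClient = snapshotClient;
        this.eventStore = eventStore;
    }

    Model load(Identifier id) {
        SnapshotRecord snapshot = snapshotClient.getSnapshot(id);  // may be null: cache miss
        Model model;
        long fromVersion;
        if (snapshot != null) {
            model = Model.fromMemento(snapshot.memento);           // re-hydrate the snapshotted state
            fromVersion = snapshot.lastEventVersion + 1;           // resume just after the snapshot
        } else {
            model = Model.seed(id);                                // fall back to the initial seed state
            fromVersion = 0;                                       // and replay the whole history
        }
        for (Event e : eventStore.readStream(id, fromVersion)) {
            model.apply(e);                                        // apply the remaining events in order
        }
        return model;
    }
}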

The SnapshotRepository itself could be a key-value store (at most one record for any given id), but a relational database with blob support will work fine too:

select * 
from snapshots s 
where id = ? 
order by s.total_events desc 
limit 1

The snapshot projector and the repository are tightly coupled -- they need to agree on what the state of the entity should be for all possible histories, they need to agree how to de/re-hydrate the memento, and they need to agree which id will be used to locate the snapshot.

The tight coupling also means that you don't need to worry particularly about the schema for the memento; a byte array will be fine.

They don't, however, need to agree with previous incarnations of themselves. Snapshot Projector 2.0 discards/ignores any snapshots left behind by Snapshot Projector 1.0 -- the snapshot store is just a cache after all.
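One cheap way to get that behaviour is to stamp every record with the version of the projector that wrote it and treat anything else as a cache miss. A sketch, assuming the SnapshotRecord also carries a projectorVersion field and that the two helper methods exist:

// Sketch: snapshots written by an older projector version are just cache misses.
static final int PROJECTOR_VERSION = 2;

Model load(Identifier id) {
    SnapshotRecord snapshot = snapshotClient.getSnapshot(id);
    if (snapshot == null || snapshot.projectorVersion != PROJECTOR_VERSION) {
        return replayFromSeed(id);                // hypothetical helper: full replay, ignoring the snapshot
    }
    return replayFromSnapshot(snapshot, id);      // hypothetical helper: re-hydrate, then apply the tail
}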

I'm designing an application that will probably generate millions of events a day. What can we do if we need to rebuild a view six months later?

One of the more compelling answers here is to model time explicitly. Do you have one entity that lives for six months, or do you have 180+ entities that each live for one day? Accounting is a good domain to reference here: at the end of the fiscal year, the books are closed, and the next year's books are opened with the carryover.

Yves Reynhout frequently talks about modeling time and scheduling; "Evolving a Model" may be a good starting point.
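A sketch of the carryover idea (stream names, event types, and the eventStore.append call here are hypothetical): close one period's stream with its final balance and open the next period's stream with that balance, so rebuilding a view never has to replay more than one period's history.

// Hypothetical sketch: one bounded stream per fiscal year instead of one unbounded stream.
String streamFor(String accountId, int fiscalYear) {
    return "account-" + accountId + "-" + fiscalYear;              // e.g. "account-42-2024"
}

void closeBooks(String accountId, int fiscalYear, long closingBalance) {
    eventStore.append(streamFor(accountId, fiscalYear), new BooksClosed(closingBalance));
    eventStore.append(streamFor(accountId, fiscalYear + 1), new BooksOpened(closingBalance));
}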

Upvotes: 26

Charles

Reputation: 3774

There are few instances where you truly need to snapshot. But there are a couple - a common example is an account in a ledger. You'll have thousands, maybe millions, of credit/debit events producing the final BALANCE state of the account - it would be insane not to snapshot that every so often.
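To make that concrete (the event and field names below are made up for illustration, and java.util.List plus the event types are assumed), the snapshot of such an account is just the fold of the credit/debit events up to some version:

// Hypothetical sketch: fold credit/debit events into the balance that gets snapshotted.
long balanceFrom(long startingBalance, List<LedgerEvent> events) {
    long balance = startingBalance;
    for (LedgerEvent e : events) {
        if (e instanceof Credited c)       balance += c.amount();
        else if (e instanceof Debited d)   balance -= d.amount();
    }
    return balance;  // persist this, plus the version of the last event folded in, as the snapshot
}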

My approach to snapshotting when I designed Aggregates.NET was that it's off by default; to enable it, your aggregates or entities must inherit from AggregateWithMemento or EntityWithMemento, which in turn requires your entity to define a RestoreSnapshot, a TakeSnapshot, and a ShouldTakeSnapshot.

The decision whether to take a snapshot or not is left up to the entity itself. A common pattern is

Boolean ShouldTakeSnapshot() {
    return this.Version % 50 == 0;
}

Which of course would take a snapshot every 50 events.
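In generic terms - this is a sketch of the memento pattern, not Aggregates.NET's actual signatures - the three hooks fit together roughly like this:

// Generic sketch of the three memento hooks; not the real Aggregates.NET API.
interface SnapshottingEntity<TMemento> {
    TMemento takeSnapshot();           // capture the current state as an opaque memento
    void restoreSnapshot(TMemento m);  // rebuild the state from a previously stored memento
    boolean shouldTakeSnapshot();      // policy hook, e.g. every 50 events as above
}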

When reading the entity stream, the first thing we do is check for a snapshot, then read the rest of the entity's stream from the moment the snapshot was taken. I.e., don't ask for the entire stream, just the part we have not snapshotted.

As for the store - you can use literally anything. VOU is right, though: a key-value store is best, because you only need to (1) check whether a snapshot exists and (2) load the entire thing, which is ideal for KV.

For system restarts - I'm not really following what your described problem is. There's no reason for your domain server to be stateful in the sense that it's doing something different at different points in time. It should do just one thing - process the next command. In the process of handling a command, it loads data from the event store, including a snapshot, and runs the command against the entity, which either produces a business exception or domain events that are recorded to the store.
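Sketched out (the names here are illustrative, not from any particular framework), that stateless loop is just:

// Sketch of the stateless command-handling loop described above.
void handle(Command command) {
    Entity entity = repository.load(command.entityId());    // snapshot (if any) + remaining events
    List<DomainEvent> produced = entity.execute(command);   // business rules; may throw a business exception
    eventStore.append(command.entityId(), produced);        // record the resulting domain events
}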

I think you may be trying to optimize too much with this talk of caching and cold starts.

Upvotes: 9
