Can cassandra guarantee the replication factor when a node is resyncing it's data?

Question

Let's say I have a 3 node cluster.

I am writing to node #1.

If node #2 in that cluster goes down, and then comes back up and is resyncing the data from the other nodes, and I continue writing to node #1, will the data be synchronously replicated to node #2? That is, is the replication factor of that write honored synchronously or is it behind the queue post resync?

Thanks Steve

Andy Tolbert · Accepted Answer

Yes granted that you are reading and writing at a consistency level that can handle 1 node becoming unavailable.

Consider the following scenario:

You have a 3 node cluster with a keyspace 'ks' with a replication factor of 3.
You are writing at a Consistency Level of 'QUORUM'
You are reading at a Consistency level of 'QUORUM'.
Node 2 goes down for 10 minutes.
Reads and Writes can successfully continue while node is down since 'QUORUM' only requires 2 (3/2+1=2) nodes to be available. While Node 2 is down, both Node 1 and 3 maintain 'hints' for Node 2.
Node 2 comes online. Node 1 and 3 send hints they recorded while Node 2 was down to Node 2.

If a read happens and the coordinating cassandra node detects that nodes are missing data/not consistent, it may execute a 'read repair'

If Node 2 was down for a long time, Node 1 and Node 3 may not retain all hints destined for it. In this case, an operator should consider running repairs on a scheduled basis.

Also note that when doing reads, if Cassandra finds that there is a data mismatch during a digest request, it will always consider the data with the newest timestamp as the right one (see 'Why cassandra doesn't need vector clocks').

Can cassandra guarantee the replication factor when a node is resyncing it's data?

Answers (2)

Related Questions

Can cassandra guarantee the replication factor when a node is resyncing it&#39;s data?

Answers (2)

Related Questions

Can cassandra guarantee the replication factor when a node is resyncing it's data?