Reputation: 8288
C++ atomic semantics only guarantee visibility (through the happens-before relation) of memory operations performed by the last thread that did a release write (a plain store or a read-modify-write operation).
Consider
int x, y;
atomic<int> a;
Thread 1:
x = 1;
a.store(1,memory_order_release);
Thread 2:
y = 2;
if (a.load(memory_order_relaxed) == 1)
a.store(2,memory_order_release);
Then observing a == 2 implies visibility of thread 2's operations (y == 2) but not thread 1's (one cannot even safely read x).
As far as I know, real implementations of multithreading use the concepts of fences (and sometimes release stores), not happens-before or release sequences, which are high-level C++ concepts; I fail to see what real hardware details these concepts map to.
How can a real implementation not guarantee visibility of thread 1's memory operations when the value 2 in a is globally visible?
In other words, is there any good in the release-sequence definition? Why wouldn't the release-sequence extend to every subsequent modification in the modification order?
Consider in particular silly-thread 3:
if (a.load(memory_order_relaxed) == 2)
a.store(2,memory_order_relaxed);
Can silly-thread 3 ever suppress any visibility guarantee on any real hardware? In other words, if the value 2 is already globally visible, how would making it globally visible again break any ordering?
Is my mental model of real multiprocessing incorrect? Can a value be partially visible, on some CPU but not another?
(Of course I assume non-crazy semantics for relaxed writes, as writes that go back in time make the language semantics of C++ absolutely nonsensical, unlike safe languages such as Java, which always have bounded semantics. No real implementation can have crazy, non-causal relaxed semantics.)
Upvotes: 4
Views: 404
Reputation: 13040
Let's first answer your question:
Why wouldn't the release-sequence extend to every subsequent modification in the modification order?
Because if so, we would lose some potential optimization. For example, consider the thread:
x = 1; // #1
a.store(1,memory_order_relaxed); // #2
Under the current rules, the compiler is able to reorder #1 and #2. After the proposed extension of release sequences, however, the compiler would no longer be allowed to reorder the two lines, because another thread (like your thread 2) might create a release sequence starting at #2 and ending with a release operation, so some acquire read in yet another thread could then synchronize with #2.
You give a specific example and claim that all implementations would produce a specific outcome even though the language rules do not guarantee it. This is not a problem, because the language rules are intended to handle all cases, not just your specific example. Of course the rules could be improved so that they guarantee the expected outcome for your example, but that is not trivial work. At the very least, as argued above, simply extending the definition of release sequence is not an acceptable solution.
Upvotes: 4