Effect of distance between CUDA threads in block?

Question

I have a naive question about GPU programming. (ChatGPT and Claude didn't really give me a convincing answer. Maybe I'm prompting badly.)

GPU programming languages like CUDA and OpenCL organise threads in (using Nvidia terminology) a 3D block structure, and blocks in a 3D grid. I know that this is convenient and natural for computer graphics. But I wonder if the 'distance' (see below for a definition) of two threads in a block, or two blocks in a grid, has any technical effects for the performance of thread-execution?

What I mean is that there is a natural distance between two threads T1, and T2 in the same block at block indices

T1 at (x1, y1, z1)
T2 at (x2, y2, z2)

The natural distance is the 3-dimensional euclidean distance (but other choices are possible). Does this distance have any hardware effects? (E.g. if T1 and T2 are close then they can commuicate faster?) I think the answer is negative, but I could not find a convincing explanation online.

A similar question can be asked about the distance of blocks in a grid.

Effect of distance between CUDA threads in block?

Answers (1)

Related Questions