Inconsistency between OpenGL and CUDA maximum number of threads

Question

My GPU is NVIDIA GeForce GT440, whose compute capability version is 2.x. NVIDIA's official CUDA_C_Programming_Guide points out

Limit 1. Maximum number of threads per block = 1024
Limit 2. Maximum number of resident threads per multiprocessor = 1536

However, two of the OpenGL computer shader implementation limits are

Limit 3. GL_MAX_COMPUTE_WORK_GROUP_INVOCATIONS = 1536

My questions are
1. Why Limit 1 is not equal to Limit 2 and Limit 3?
2. Should the real threads/block (invocations/workgroup) be 1024 or 1536?

Answers (1)