StackOverflow Questions for Tag: cuda

Shui_
Shui_

Reputation: 13

4090's four card has high P2P bandwidth(52GB/s), but the bandwidth of the lstopo intermediate node is low(8GB/s)

Score: -4

Views: 41

Answers: 0

Read More
naruto98
naruto98

Reputation: 11

Extending a Single-Pass Scan Kernel for Independent Row-wise Scan in CUDA

Score: 1

Views: 35

Answers: 0

Read More
Lifu Huang
Lifu Huang

Reputation: 12869

What is the maximum size of a global memory transaction in CUDA?

Score: 1

Views: 58

Answers: 0

Read More
Alexander Dzhoganov
Alexander Dzhoganov

Reputation: 236

How to optimize Conway's game of life for CUDA?

Score: 12

Views: 4545

Answers: 4

Read More
user0002128
user0002128

Reputation: 2931

The behavior of __CUDA_ARCH__ macro

Score: 6

Views: 12361

Answers: 2

Read More
R3dy
R3dy

Reputation: 9

CUDA Streams are NOT Asynchronous

Score: 0

Views: 70

Answers: 0

Read More
Santiago
Santiago

Reputation: 83

Use thrust::reduce for multplying a sequence of matrices

Score: 1

Views: 28

Answers: 0

Read More
pmcr
pmcr

Reputation: 135

CUDA streams not overlapping

Score: 10

Views: 4277

Answers: 2

Read More
mluerig
mluerig

Reputation: 749

Issues with CUDA installation via `cuda-toolkit` on win 11 - cannot find VS C++ tools?

Score: 5

Views: 1093

Answers: 1

Read More
Pannaga Sudarshan
Pannaga Sudarshan

Reputation: 1

How do I install a different CUDA-toolkit version in my conda env (that is different from base)?

Score: 0

Views: 37

Answers: 0

Read More
Lifu Huang
Lifu Huang

Reputation: 12869

Why CUDA allocated more registers per thread than max live registers?

Score: 1

Views: 33

Answers: 0

Read More
Mineral
Mineral

Reputation: 407

CUDA compile problems on Windows, Cmake error: No CUDA toolset found

Score: 13

Views: 44949

Answers: 11

Read More
Rich Tanenbaum
Rich Tanenbaum

Reputation: 49

Why does setting a class member not work consistently inside a kernel

Score: 3

Views: 103

Answers: 1

Read More
einpoklum
einpoklum

Reputation: 132128

The CUDA "driver version" looks like the CUDA runtime version - so what's the difference?

Score: 25

Views: 48572

Answers: 1

Read More
Vadim Kashtanov
Vadim Kashtanov

Reputation: 57

Cuda gdb print constant

Score: 0

Views: 271

Answers: 2

Read More
Thiago Conrado
Thiago Conrado

Reputation: 863

__threadfence_block() and volatile + shared memory to fight registers

Score: 0

Views: 422

Answers: 1

Read More
Blue_Black
Blue_Black

Reputation: 317

Is cudaFree() asynchronous?

Score: 12

Views: 6596

Answers: 4

Read More
Matan
Matan

Reputation: 179

Cannot View CUDA Device Variables during Debug

Score: 0

Views: 44

Answers: 0

Read More
mEm
mEm

Reputation: 377

`cuModuleLoadDataEx` returns `CUDA_ERROR_UNSUPPORTED_PTX_VERSION`

Score: 0

Views: 70

Answers: 1

Read More
Dan Stahlke
Dan Stahlke

Reputation: 1469

CUDA.jl mapreduce integer sequence without creating intermediate array

Score: 0

Views: 32

Answers: 0

Read More
PreviousPage 2Next