Maria Chiara
Maria Chiara

Reputation: 113

What's the newer version or the alternative to Cuda Occupancy Calculator?

I'm working with a Tesla P100 having compute capability 6.0. I'd like to find a tool to automatically get the best grid and block sizes w.r.t. my kernel code.

I recently discovered the CUDA Occupancy Calculator (the .xls spreadsheet). But I realized it's a bit outdated (the Capability was until 2.1).

Also, can CUDA profilers (nvprof or Nsight) do estimations about occupancy and give some optimal block/grid size?

I'm a quite new about these tools.

Upvotes: 1

Views: 1536

Answers (2)

Paulo Santos
Paulo Santos

Reputation: 59

There is a very complete tool that help you find best configuration Check my configuration and the graphs You want the red dots to be on the peak of each graph

You can check it out https://xmartlabs.github.io/cuda-calculator/

Upvotes: -1

Robert Crovella
Robert Crovella

Reputation: 152269

An updated version of the CUDA occupancy calculator spreadsheet ships with the CUDA toolkit, so when you install the CUDA toolkit, the excel spreadsheet is also installed on your machine. Maybe easiest just to use a file find utility for your OS to find it.

The CUDA occupancy API allows you to make the same calculations at runtime.

NVIDIA profilers offer some capability to inspect achieved occupancy. For example, nvvp can display achieved occupancy, and there is a metric for achieved occupancy which you can gather with nvprof. You may wish to simply search the profiler docs for the word "occupancy". These tools don't make estimations of optimal block and grid sizes, but they may give an indication as to whether occupancy may be a performance limiter for your application. These tools can also report the actual block and grid sizes for each kernel launch.

Upvotes: 5

Related Questions