Reputation: 1
I am trying to run Ollama on WSL2 (Ubuntu 22.04) with GPU acceleration (CUDA), but it still heavily relies on CPU instead of utilizing only the NVIDIA GPU.
System Details:
Problem
I have installed CUDA and Nvidia drivers.
root@PC01:/mnt/c/Windows/system32# nvcc --version nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2025 NVIDIA Corporation Built on Wed_Jan_15_19:20:09_PST_2025 Cuda compilation tools, release 12.8, V12.8.61 Build cuda_12.8.r12.8/compiler.35404655_0
I tried changing the driver versions.
updating the sudo nano ~/.ollama/config.yaml
to include use_cuda: true num_threads: 1
also exporting these values
echo 'export CUDA_VISIBLE_DEVICES=0' >> ~/.bashrc echo && 'export OLLAMA_USE_CUDA=1' >> ~/.bashrc && echo 'export OLLAMA_NUM_THREADS=1' >> ~/.bashrc && source ~/.bashrc
Upvotes: 0
Views: 40