amateur

Reputation: 21

TensorFlow not fully utilizing GPU in GPT-2 program

I am running the GPT-2 code for the large model (774M). It is used to generate text samples through interactive_conditional_samples.py, link: here

I've given it an input file containing prompts, which are automatically selected to generate output. That output is also automatically copied into a file. In short, I'm not training the model; I'm only using it to generate text. Also, I'm using a single GPU.

The problem I'm facing is that the code is not fully utilizing the GPU.

By running the nvidia-smi command, I captured the output shown in the image below:

https://i.sstatic.net/f02p7.jpg
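(For a live view of the same figures while the script is generating, nvidia-smi can be run in loop mode; this is a standard nvidia-smi option, not something from the linked screenshot:)

    # refresh the utilization/memory readout every second
    nvidia-smi -l 1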

Upvotes: 2

Views: 831

Answers (1)

user11530462


It depends on your application. It is not unusual to see low GPU utilization when the batch_size is small. Try increasing the batch_size to get more GPU utilization.

In your case, you have set batch_size=1 in your program. Increase the batch_size to a larger number and verify the GPU utilization.
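If you are running the stock openai/gpt-2 script, batch_size (together with a matching nsamples, which must be divisible by batch_size) can be passed on the command line. A minimal sketch, assuming the default repository layout:

    # generate 8 samples per prompt, 8 at a time, instead of the default batch_size=1
    python3 src/interactive_conditional_samples.py --model_name=774M --nsamples=8 --batch_size=8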

Let me explain using MNIST-sized networks. They are tiny, and it's hard to achieve high GPU (or CPU) efficiency for them. You will get higher computational efficiency with a larger batch size, meaning you can process more examples per second, but you will also get lower statistical efficiency, meaning you need to process more examples in total to reach the target accuracy. So it's a trade-off. For tiny character models, the statistical efficiency drops off very quickly beyond a batch_size of about 100, so it's probably not worth growing the batch size further for training. For inference, you should use the largest batch size you can.
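As a rough illustration of the computational side of that trade-off, the snippet below (a sketch, not taken from your program; it assumes TensorFlow with the bundled Keras API and measures throughput only) trains a tiny MNIST classifier for one epoch at several batch sizes and prints examples per second:

    import time
    import tensorflow as tf

    # small MNIST subset so the batch_size=1 run finishes quickly
    (x_train, y_train), _ = tf.keras.datasets.mnist.load_data()
    x_train = x_train[:10000].astype("float32") / 255.0
    y_train = y_train[:10000]

    def build_model():
        return tf.keras.Sequential([
            tf.keras.layers.Flatten(input_shape=(28, 28)),
            tf.keras.layers.Dense(128, activation="relu"),
            tf.keras.layers.Dense(10, activation="softmax"),
        ])

    for batch_size in (1, 32, 512):
        model = build_model()
        model.compile(optimizer="adam",
                      loss="sparse_categorical_crossentropy",
                      metrics=["accuracy"])
        start = time.time()
        model.fit(x_train, y_train, batch_size=batch_size, epochs=1, verbose=0)
        print("batch_size=%d: ~%.0f examples/sec"
              % (batch_size, len(x_train) / (time.time() - start)))

Throughput (and with it GPU utilization) climbs sharply as the batch size grows, which is the same effect you should see in the GPT-2 sampling script when batch_size is raised above 1.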

Hope this answers your question. Happy Learning.

Upvotes: 1
