Reputation: 2512
I am using TensorFlow 2.14. I have a dataset created using from_generator, which I then batched to a fixed batch size of 1024. The code used is close to:

d1024 = from_generator(...).batch(1024).cache('cache_file').prefetch(AUTOTUNE)
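For reference, the full pipeline looks roughly like this (the generator body and output signature below are placeholders for illustration, not my real data):

import numpy as np
import tensorflow as tf

def sample_gen():
    # Placeholder generator; the real one yields my actual samples.
    for _ in range(1_000_000):
        yield np.zeros(32, np.float32), 0

d1024 = (
    tf.data.Dataset.from_generator(
        sample_gen,
        output_signature=(
            tf.TensorSpec(shape=(32,), dtype=tf.float32),
            tf.TensorSpec(shape=(), dtype=tf.int32),
        ),
    )
    .batch(1024)
    .cache('cache_file')
    .prefetch(tf.data.AUTOTUNE)
)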
Now if I do model.predict(d1024.take(n)) for a big n, I see GPU memory usage gradually increase as more batches are processed, eventually resulting in an out-of-memory error if n is big enough.
In my understanding, TensorFlow will do the prediction batch by batch. Given that the batch size is fixed at 1024, memory usage should be determined by the batch size rather than by n. Is this correct?
Why does a bigger n need more memory, and how can I mitigate this problem when n is big?
I googled and found this issue in the TensorFlow GitHub, but I don't know whether it is related to this problem.
Upvotes: 1
Views: 41
Reputation: 11
I also had similar issues with GPU memory when loading the dataset into memory.
I see you're using a generator, which is fine in your example, but why would you use model.predict(d1024.take(n)) instead of simply model.predict(d1024)?
When you use dataset.take(n), it creates a new dataset containing n batches, so it will not process the entire dataset. Furthermore, it will try to load all n batches of your dataset into the GPU at once, which explains why you get memory problems.
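If you really only need the first n batches, one workaround is to pull batches one at a time on the Python side and predict on each, so only a single batch is resident at once. A sketch, assuming each element of d1024 is an (X, Y) pair:

import itertools
import numpy as np

preds = []
# Slice off n batches on the Python side instead of passing take(n) to predict.
for batch_x, _ in itertools.islice(iter(d1024), n):
    # predict_on_batch runs exactly one batch through the model.
    preds.append(model.predict_on_batch(batch_x))
predictions = np.concatenate(preds)  # combined on the host, not the GPU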
I found the best approach for me is using a custom generator that yields batches of data, so only one batch at a time is loaded into memory. Something like:
def gen():
    while True:
        ...             # build the next batch of data here
        yield X, Y      # X and Y are NumPy arrays holding one batch each
You'll be sure of having no memory problems, and by using X and Y as NumPy arrays instead of a tf.data.Dataset you also get more flexibility.
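Since a generator like this never terminates, you tell predict how many batches to run via steps (the value n here is just whatever batch count you need):

# steps bounds the infinite generator; without it predict would never stop.
predictions = model.predict(gen(), steps=n)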
Upvotes: 0