Qianyi Zhang

Reputation: 184

Tensorflow: How to use the "new" Dataset API with QueueRunner

Basically I have a list of images to be processed. I need to do some pre-processing (data augmentation) after loading and then feed them into the main TF graph. Currently I am working with a customized generator that takes a list of paths, yields a pair of tensors (images), and feeds them to the network via placeholders. This sequential processing takes ~0.5s per batch.

I just read about the Dataset API, which I could use directly via the .from_generator() function, and whose .get_next() output I could use directly as input.

But how does the QueueRunner fit into this framework? Does Dataset implicitly use enqueue/dequeue to maintain its generator/get_next pipeline, or does it require me to explicitly feed into a FIFOQueue afterwards? If the latter, what is the best practice for maintaining the pipeline to train and validate over multiple randomly shuffled epochs? (That is, how many Datasets/QueueRunners do I need to maintain, and where do I set the shuffling and the number of epochs?)
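For reference, the pipeline I have in mind looks roughly like the sketch below (image_generator, image_paths, load_and_augment and label_for stand in for my actual code, and the image shape is just an assumption):

 import tensorflow as tf

 def image_generator():
     # stand-in for my customized generator: loads a path, augments it,
     # and yields an (image, label) pair of numpy arrays
     for path in image_paths:
         yield load_and_augment(path), label_for(path)

 dataset = tf.data.Dataset.from_generator(
     image_generator,
     output_types=(tf.float32, tf.int32),
     output_shapes=(tf.TensorShape([224, 224, 3]), tf.TensorShape([])))
 dataset = dataset.batch(32)

 images, labels = dataset.make_one_shot_iterator().get_next()
 # images/labels would replace my placeholders as inputs to the main graph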

Upvotes: 1

Views: 1031

Answers (1)

CNugteren

Reputation: 890

You don't have to use a QueueRunner to get queues/buffers if you are using the Dataset API. The Dataset API creates queues/buffers itself and can pre-process data and train a network concurrently. Given a dataset, you create a queue/buffer with either the prefetch function or the shuffle function.

For more information, see the official tutorial on the Dataset API.

Here is an example of using a prefetch buffer with pre-processing on the CPU:

 import tensorflow as tf

 NUM_THREADS = 8
 BUFFER_SIZE = 100

 data = ...    # input data, e.g. a numpy array
 labels = ...  # matching labels
 inputs = (data, labels)

 def pre_processing(data_, labels_):
     # pin the pre-processing to the CPU so the GPU stays free for training
     with tf.device("/cpu:0"):
         # do some pre-processing here
         return data_, labels_

 dataset_source = tf.data.Dataset.from_tensor_slices(inputs)
 dataset = dataset_source.map(pre_processing, num_parallel_calls=NUM_THREADS)

 dataset = dataset.repeat(1)  # repeats for one epoch
 dataset = dataset.prefetch(BUFFER_SIZE)  # the buffer: pre-processes ahead of the consumer

 iterator = tf.data.Iterator.from_structure(dataset.output_types,
                                            dataset.output_shapes)
 next_element = iterator.get_next()
 init_op = iterator.make_initializer(dataset)

 with tf.Session() as sess:
     sess.run(init_op)
     while True:
         try:
             sess.run(next_element)
         except tf.errors.OutOfRangeError:  # raised once the epoch is exhausted
             break
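
To answer the shuffling/epochs part of the question: shuffling is just another Dataset transformation, so a single dataset per split and a single re-initializable iterator are enough, with no QueueRunner involved. A minimal sketch building on the code above (NUM_EPOCHS, SHUFFLE_BUFFER and valid_inputs are assumed values, not from the example):

 NUM_EPOCHS = 10
 SHUFFLE_BUFFER = 1000  # ideally >= number of examples for a full shuffle

 train_dataset = dataset_source.map(pre_processing, num_parallel_calls=NUM_THREADS)
 train_dataset = train_dataset.shuffle(SHUFFLE_BUFFER)  # reshuffled on every pass by default
 train_dataset = train_dataset.repeat(NUM_EPOCHS)
 train_dataset = train_dataset.prefetch(BUFFER_SIZE)

 valid_dataset = tf.data.Dataset.from_tensor_slices(valid_inputs)
 valid_dataset = valid_dataset.prefetch(BUFFER_SIZE)

 # one re-initializable iterator serves both pipelines
 iterator = tf.data.Iterator.from_structure(train_dataset.output_types,
                                            train_dataset.output_shapes)
 next_element = iterator.get_next()
 train_init_op = iterator.make_initializer(train_dataset)
 valid_init_op = iterator.make_initializer(valid_dataset)

Running train_init_op (re)starts the shuffled training pass; running valid_init_op points the same next_element at the validation data instead.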

Upvotes: 1
