Reputation: 2507
I am using tf.data.experimental.make_csv_dataset in TensorFlow (TF 1.14 and TF 2.0) to read a CSV file consisting of 3 columns: index, column1, and column2. Only column1 and column2 matter to me. Each element in column1 is an array of shape (1, 4) and each element in column2 has shape (1, 1). When I call dataset.shuffle(buffer_size=some_number) on this dataset, the shuffling takes a long time, with the message "Filling up shuffle buffer".
My question is: is there a way to shuffle the dataset using only the indices of column1/column2? That might take much less time, since only the indices would need to be shuffled.
Upvotes: 1
Views: 846
Reputation: 14495
My question is: is there a way to shuffle the dataset using only the indices of column1/column2? That might take much less time, since only the indices would need to be shuffled.
No, unfortunately not. At least not in that way.
The reason is that a tf.data.Dataset object is inherently lazily loaded. This is deliberate: because a dataset can represent arbitrarily large (even infinite) data, it wouldn't make sense to load it all into memory or do all the pre-processing up front.
This means that, whilst it would of course be feasible to read and shuffle the index column, we could not then access the nth element of the original dataset (at least not cheaply).
It's worth mentioning that the shuffle buffer only needs to be filled once, so the delay will only happen at the start of training (and at the start of each epoch, if reshuffling each epoch).
A sensible workaround, which you may well have already considered, is to load the dataset once, shuffle it, and then write it out somewhere (perhaps in TFRecord format) with all the rows pre-shuffled.
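A minimal sketch of the pre-shuffle idea, using only the Python standard library rather than TFRecords: shuffle the rows of the CSV once up front, so every later training run can read the file sequentially without paying the shuffle-buffer cost. The function name and paths are illustrative, and this in-memory approach assumes the file fits in RAM; for larger data the same idea applies, but you would shard the shuffled output (e.g. into TFRecord files) instead.

```python
import csv
import random

def preshuffle_csv(src_path, dst_path, seed=None):
    """Read all rows of a CSV, shuffle them once, and write them back out.

    The header row is kept in place; only the data rows are permuted.
    """
    with open(src_path, newline="") as f:
        reader = csv.reader(f)
        header = next(reader)   # preserve the header row
        rows = list(reader)     # load all data rows into memory

    random.Random(seed).shuffle(rows)  # one up-front shuffle

    with open(dst_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
```

After this one-time step, make_csv_dataset can read the pre-shuffled file directly, and you can drop (or greatly shrink) the dataset.shuffle buffer.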
Upvotes: 1