Should Augmented Data be Added to Existing Data or Used as Complete Replacement in CNN Models?

Question

I need help with the optimal approach for integrating augmented data into Convolutional Neural Network (CNN) models. Specifically, should augmented data be added to the existing dataset to expand its size, or should it replace the original data entirely? I would appreciate any advice, best practices, or experiences regarding this matter. Thank you!

I checked the tutorials about Data Augmentation on the TensorFlow website, they just make it Sequential and either use it as a part of the layer in the model or replace the original dataset with the augmented one. However, one of the reasons for augmenting data I found is for smaller datasets. So wouldn't adding augmented data to the original one would be more suitable for that benefit? Or is augmenting data to make the model able to get more meaningful features which results in better results even with less data?

Should Augmented Data be Added to Existing Data or Used as Complete Replacement in CNN Models?

Answers (1)

Related Questions