Not understanding normalization when using ImageDataGenerator

Question

Im trying to build a simple image classifier using Keras with Tensorflow as backend. However im having a very hard time understanding how nomalization is done in Keras.

It is my understanding that in Machine Learning you calculate the mean and std of the training + validation set and then reuse the mean and std when normalizing the test set and when doing prediction on new data. So whit this in mind I will explain what Im not understanding in each part of Keras.

train_datagen = ImageDataGenerator(rescale=1./255, samplewise_center=True, samplewise_std_normalization=True, shear_range=0.2, zoom_range=0.2)
test_datagen = ImageDataGenerator(rescale=1./255, samplewise_center=True, samplewise_std_normalization=True)
batch_size = 1024
train_generator = train_datagen.flow(X_train, one_hot_train_labels, batch_size=batch_size, shuffle=True)
validation_generator = test_datagen.flow(X_valid, one_hot_valid_labels, batch_size=batch_size)

First questions are with regards to ImageDataGenerator. In the documentation it says that the flow function normalizes the data, then I have tree questions regarding this:

What is the effect of samplewise_std_normalization and samplewise_center if it is the flow function that does the normalization?
Why use rescale if I also do normalization?
How can Keras do normalization on augmentad data that is generated at runtime so the mean and std is not know before start?

result = model.evaluate(X_test, one_hot_test_labels)

When we run evaluate I have one question:

How is the normalization handled here? I dont have access to the mean and std so I cant apply them to also the testing set?

predict_softmaxs = model.predict(np.array(resized_images))

When I run predict I have one question:

Again I dont have access to the mean and std so I cant apply it to the prediction image?

Not understanding normalization when using ImageDataGenerator

Answers (1)

Evaluate and predict:

Related Questions