Neural net with duplicated inputs - Keras

Question

I have a dataset of N videos each video is characterized by some metrics (that will be inputs for a neural net) my goal is to predict the score that a person will give when he or she watches the video.

The problem is that in my dataset each video was watched more than once by different subjects, so I was forced to duplicate the same metrics (inputs) the number of time the video was watched to keep all the scores given by the subjects.

I built an MLP model to predictet the scores. But when I calculate the RMSE it's always higher than 0.7.

I want to know if having a dataset like that would affect the performance of my model ? And how can I deal with it ?

Here is how the dataset looks like:

The first 5 columns are the inputs and the last one is the score of subjects. Note that all of them are normalized.

Here is my Model:

 def mlp_model():
    # create model
    model = Sequential()
    model.add(Dense(100,input_dim=5, kernel_initializer='normal', activation='relu'))
    model.add(Dense(100, kernel_initializer='normal', activation='relu'))
    model.add(Dense(100, kernel_initializer='normal', activation='relu'))
    model.add(Dense(100, kernel_initializer='normal', activation='relu'))
    model.add(Dense(1, kernel_initializer='normal'))
    # Compile model
    model.compile(loss='mean_squared_error', optimizer='adam')
    return model

    seed = 100 
    numpy.random.seed(seed)
    myModel = mlp_model()
    myModel.fit(x=x_train, y=y_train, batch_size=10, epochs=45, validation_split=0.3, shuffle=True,callbacks=[plot_losses])
    predictions = myModel.predict(x_test)
    print predictions

Neural net with duplicated inputs - Keras

Answers (1)

Related Questions