Reputation: 362
I'm predicting daily sales data for a company using a Keras LSTM. The original shape of the data is [60 shops x 1034 days x 207 products]. I created a one-hot representation of the day of the week (7 columns).
This representation is appended to the data of every shop, so the new shape is [60 shops x 1034 days x (207 products + 7 day columns)].
I also added one more column, S_day, which marks days of great significance with a value of either 0 or 1.
So the final shape of the data is [60 x 1034 x 215].
I use the first 973 days for training and the last 61 days for testing, for every shop:
train_x: [60 x 973 x 215]
train_y: [60 x 973 x 215]
test_x: [60 x 973 x 215] (actual data shape is [60 x 61 x 215], zero-padded to match)
test_y: [60 x 973 x 215] (actual data shape is [60 x 61 x 215], zero-padded to match)
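The split and zero-padding described above can be sketched as follows; the array here is a random placeholder with the stated shape, and the layout (time on axis 1) is assumed:

```python
import numpy as np

# Split 1034 days into 973 training days and 61 test days, then pad the
# test block with zeros along the time axis so its shape matches training.
data = np.random.rand(60, 1034, 215)     # placeholder for the real data
train = data[:, :973, :]                 # first 973 days
test = data[:, 973:, :]                  # last 61 days -> (60, 61, 215)

test_padded = np.zeros_like(train)       # (60, 973, 215), all zeros
test_padded[:, :61, :] = test            # real data first, zeros after

print(train.shape, test_padded.shape)
```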
The y data are the x data shifted by a lag of -1, so the prediction target is the next day.
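The lag -1 shift can be sketched like this (random placeholder data; 974 days are used here so that the shift yields exactly 973 x/y pairs):

```python
import numpy as np

# Build x/y pairs where y is x shifted by one day, so the target at
# day t is the data for day t+1.
full = np.random.rand(60, 974, 215)      # 974 days -> 973 (x, y) pairs
train_x = full[:, :-1, :]                # days 0 .. 972
train_y = full[:, 1:, :]                 # days 1 .. 973, the "next day"

print(train_x.shape, train_y.shape)
```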
My problem is that I need to exclude those 8 extra columns from my final prediction.
# design model
model = Sequential()
model.add(LSTM(100, input_shape=(train_x.shape[1], train_x.shape[2]), return_sequences=True))
model.add(Dense(train_x.shape[2]))
model.compile(loss='mean_squared_error', optimizer='adam', metrics=['accuracy'])
# fit model
history = model.fit(train_x, train_y,
                    epochs=10,
                    batch_size=2,
                    validation_data=(test_x, test_y),
                    verbose=2,
                    shuffle=False)
# make a prediction
test_pred = model.predict(test_x)
Upvotes: 1
Views: 1710
Reputation: 362
I found the way to solve it: I only had to drop the 8 extra columns from train_x and train_y so the data has 207 features, and set the Dense layer to 207 units.
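The fix described above can be sketched as follows; the arrays are random placeholders, and it is assumed (per the concatenation order in the question) that the 207 product columns come first:

```python
import numpy as np

# Drop the 8 calendar columns (7 one-hot day-of-week columns + S_day)
# so only the 207 product columns remain.
n_products = 207
train_x = np.random.rand(60, 973, 215)   # placeholder for the real data
train_y = np.random.rand(60, 973, 215)

train_x = train_x[:, :, :n_products]     # (60, 973, 207)
train_y = train_y[:, :, :n_products]     # (60, 973, 207)
# ...and size the output layer to match: model.add(Dense(n_products))

print(train_x.shape, train_y.shape)
```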
Upvotes: 0
Reputation: 23
I think you have made a mistake in your dimensions: it should be (60 x 1034 x 7 x 207 x 1), so the network has a 4-dimensional input, and you flatten before passing to the Dense layer and taking the output. If you could post the code you use to load the data, this could be made a bit clearer.
The output can be converted into whatever you want using a one_hot_decoder()
Upvotes: 0