Reputation: 854
I looked at several answers, and was not able to see a clear solution to what I'm trying to do.
I have an LSTM for binary text classification that takes the top 40k words in a corpus, then operates on the first 50 tokens. Prepared like this:
from keras.preprocessing.text import Tokenizer
from keras.preprocessing import sequence

max_words = 40000
max_review_length = 50
embedding_vector_length = 100
batch_size = 128
epochs = 10

all_texts = combo.title.tolist()
lstm_text_tokenizer = Tokenizer(num_words=max_words)  # num_words replaces the old nb_words argument
lstm_text_tokenizer.fit_on_texts(all_texts)

x_train = lstm_text_tokenizer.texts_to_sequences(x_train.title.tolist())
x_test = lstm_text_tokenizer.texts_to_sequences(x_test.title.tolist())
x_train = sequence.pad_sequences(x_train, maxlen=max_review_length)
x_test = sequence.pad_sequences(x_test, maxlen=max_review_length)
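For reference, here is a minimal, self-contained illustration of what the tokenize-and-pad pipeline above produces, using a made-up toy corpus (the texts and variable names here are assumptions, not my actual data):

```python
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

# Toy corpus just to show the shapes involved
texts = ["the cat sat", "the dog barked loudly"]

tok = Tokenizer(num_words=40000)
tok.fit_on_texts(texts)

seqs = tok.texts_to_sequences(texts)    # lists of word indices, variable length
padded = pad_sequences(seqs, maxlen=5)  # zero-padded on the left by default

print(padded.shape)  # (2, 5): one fixed-length row of word indices per text
```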
My current model looks like this:
def lstm_cnn_model(max_words, embedding_vector_length, max_review_length):
    model = Sequential()
    model.add(Embedding(max_words, embedding_vector_length, input_length=max_review_length))
    model.add(Conv1D(filters=32, kernel_size=3, padding='same', activation='relu'))
    model.add(MaxPooling1D(pool_size=2))
    model.add(LSTM(100))
    model.add(Dense(num_classes))
    model.add(Activation('softmax'))
    return model
I also have a one-dimensional list of meta data, with one value per example, and I may have more complex meta data to add in the future.
My question is: what is the best way to combine these two inputs when training the model?
Upvotes: 4
Views: 1597
Reputation: 11225
It would be wise to switch to the functional API and create a multi-input network that takes both the text and the meta data:
text_in = Input(shape=(max_review_length,))
meta_in = Input(shape=(1,))  # one meta feature per review

# Process the text however you like, for example:
embedded_text = Embedding(max_words, embedding_vector_length)(text_in)
text_features = LSTM(100)(embedded_text)

merged = concatenate([text_features, meta_in])  # shape (samples, 101)
text_class = Dense(num_classes, activation='softmax')(merged)

model = Model([text_in, meta_in], text_class)
The idea is that the functional API lets you build computation graphs that use both inputs in a non-sequential way. You can extract features from the text, extract features from the meta data, and merge them to see whether that improves classification. You may also want to look into how best to encode your meta data before feeding it in.
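To make this concrete, here is a minimal end-to-end sketch that mirrors your Conv1D/LSTM stack in the functional API. The hyperparameter values and the names `meta_train` and `y_train` in the commented `fit` call are assumptions for illustration, not part of your code:

```python
from tensorflow.keras.layers import (Input, Embedding, Conv1D, MaxPooling1D,
                                     LSTM, Dense, concatenate)
from tensorflow.keras.models import Model

# Assumed values matching the question
max_words, embedding_vector_length, max_review_length, num_classes = 40000, 100, 50, 2

text_in = Input(shape=(max_review_length,))
meta_in = Input(shape=(1,))  # one meta feature per review

# Same text branch as the Sequential model, written functionally
x = Embedding(max_words, embedding_vector_length)(text_in)
x = Conv1D(filters=32, kernel_size=3, padding='same', activation='relu')(x)
x = MaxPooling1D(pool_size=2)(x)
text_features = LSTM(100)(x)

# Merge the 100 LSTM features with the 1 meta feature -> 101 features
merged = concatenate([text_features, meta_in])
out = Dense(num_classes, activation='softmax')(merged)

model = Model([text_in, meta_in], out)
model.compile(loss='sparse_categorical_crossentropy', optimizer='adam',
              metrics=['accuracy'])

# Training takes a list of arrays, one per Input layer (names assumed):
# model.fit([x_train, meta_train], y_train, batch_size=128, epochs=10)
```

Note that the order of arrays passed to `fit` must match the order of the `Input` layers given to `Model`.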
Upvotes: 4