Reputation: 11
I implemented the following tutorial in Keras:
In the intro the author says the setup is good for mapping input sequences of varying lengths to output sequences of varying lengths. I am confused because I do not see how to generate output sentences that are a different length than the input sentence.
Let's assume that the inputs are English sentences and the outputs are French sentences, as in the tutorial.
My current understanding is as follows:
The encoder input is the English sentence as a sequence of integers to be embedded. The decoder input is the French sentence as a sequence of integers delayed by one time step, with the first integer in the sequence representing a null (start) value. This layer is also embedded.
The target is the French sentence as a sequence of integers, not delayed. I seem to need to add an integer at the end to represent end-of-sequence, otherwise the length does not match up with the embedded decoder input and Keras throws an error.
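The shift-and-append arrangement described above can be sketched in plain Python. The token ids here (`START`, `EOS`) are hypothetical placeholders, not values from the tutorial:

```python
# Sketch of teacher-forcing data preparation for the decoder.
# START and EOS are hypothetical token ids, not values from any tutorial.
START, EOS = 0, 2

def make_decoder_arrays(target_ids):
    """Shift the target sequence right by one step for the decoder input
    and append EOS to the target, so both sequences have equal length."""
    decoder_input = [START] + target_ids   # delayed by one time step
    decoder_target = target_ids + [EOS]    # ends with end-of-sequence
    assert len(decoder_input) == len(decoder_target)
    return decoder_input, decoder_target

inp, tgt = make_decoder_arrays([5, 9, 4])
# inp = [0, 5, 9, 4], tgt = [5, 9, 4, 2]
```

This is why the extra integer at the end is needed: without it the target is one step shorter than the shifted decoder input.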
When making predictions, what exactly do you feed it? It doesn't seem possible to get an output of a different length than the input. Is that the case?
Upvotes: 0
Views: 1097
Reputation: 836
As far as I understand this paper https://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf, the idea is that your decoder predicts tokens (words) one at a time until it predicts a specific token (e.g. "EOS", which is the abbreviation of end of sequence). To my understanding, that is the reason why the output length is not fixed. Of course, your training data has to be prepared accordingly, with the target sequences annotated with the "EOS" tag.
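That stopping rule can be illustrated with a minimal greedy decoding loop. `predict_next` below is a toy stand-in for a trained decoder step (which in Keras would call the decoder model on the previous token and its state), so the loop structure is the point, not the prediction:

```python
# Sketch of inference: start from a start token and keep predicting
# until the model emits EOS or a length cap is reached.
# START, EOS, and predict_next are hypothetical stand-ins.
START, EOS, MAX_LEN = 0, 2, 20

def predict_next(prev_token, step):
    # Toy stand-in for a trained decoder: emits 5, 6, then EOS.
    return [5, 6, EOS][min(step, 2)]

def greedy_decode():
    output, token = [], START
    for step in range(MAX_LEN):
        token = predict_next(token, step)
        if token == EOS:
            break          # model decided the sentence is finished
        output.append(token)
    return output
```

Because the loop runs until EOS (with `MAX_LEN` as a safety cap), the output length is decoupled from the input length: the encoder's input only sets the initial state, and the decoder decides when to stop.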
Upvotes: 0