Reputation: 61
I use HuggingFace's Transformers library to build a sequence-to-sequence model based on BART and T5. I have carefully read the documentation and the research papers, but I can't find what the input to the decoder (decoder_input_ids) should be for sequence-to-sequence tasks.
Should the decoder input for both models (BART and T5) be the same as lm_labels (the output of the LM head), or should it be the same as input_ids (the input to the encoder)?
Upvotes: 6
Views: 8771
Reputation: 61
The decoder_input_ids argument is optional and corresponds to labels; passing labels is the preferred way to provide it. See https://huggingface.co/transformers/glossary.html#decoder-input-ids
This is because, internally, if decoder_input_ids is None it is derived by shifting labels one position to the right, so you don't have to do the shifting yourself.
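As a minimal sketch of what this looks like in practice (using T5 here, but the same pattern applies to BART; the t5-small checkpoint and the example sentences are just placeholders):

```python
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Encoder input: the source sequence.
input_ids = tokenizer(
    "translate English to German: The house is wonderful.",
    return_tensors="pt",
).input_ids

# Target sequence is passed as labels. decoder_input_ids is omitted,
# so the model builds it internally by shifting labels to the right.
labels = tokenizer("Das Haus ist wunderbar.", return_tensors="pt").input_ids

outputs = model(input_ids=input_ids, labels=labels)
print(outputs.loss)  # training loss computed against labels
```

So the decoder input is neither the encoder's input_ids nor literally the labels: it is the target sequence shifted right (with the appropriate start token prepended), which the model handles for you when you supply labels.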
Upvotes: 1