Pytorch build seq2seq MT model ,But how to get the translation results from the output tensor?

Question

I am try to implement my own MT engine, i am following the steps in https://github.com/bentrevett/pytorch-seq2seq/blob/master/1%20-%20Sequence%20to%20Sequence%20Learning%20with%20Neural%20Networks.ipynb

SRC = Field(tokenize=tokenize_en,
            init_token='',
            eos_token='',
            lower=True)

TRG = Field(tokenize=tokenize_de,
            init_token='',
            eos_token='',
            lower=True)

After training the model,the link only share a way to batch evaluate but i want to try single string and get the translation results. for example i want my model to translate the input "Boys" and get the German translations.

savedfilemodelpath='./pretrained_model/2020-09-27en-de.pth'
model.load_state_dict(torch.load(savedfilemodelpath))
model.eval()
inputstring = 'Boys'
processed=SRC.process([SRC.preprocess(inputstring)]).to(device)
output=model(processed,processed)
output_dim = output.shape[-1]
outputs = output[1:].view(-1, output_dim)
for item in outputs:
    print('item shape is {} and item.argmax is {}, and words is {}'.format(item.shape,item.argmax(),TRG.vocab.itos[item.argmax()]))

So my question is that it it right to get the translation results by:
First: convert the string to tensor

inputstring = 'Boys'
processed=SRC.process([SRC.preprocess(inputstring)]).to(device)

Second: send the tensor to the model. As the model have a TRG param.I have to give the tensor,am i able not given the TRG tensor?

output=model(processed,processed)
output_dim = output.shape[-1]
outputs = output[1:].view(-1, output_dim)

Third：through the return tensor, i use the argmax to get the translation results? is it right?

Or how can i get the right translation results?

for item in outputs:
        print('item shape is {} and item.argmax is {}, and words is {}'.format(item.shape,item.argmax(),TRG.vocab.itos[item.argmax()+1]))

Pytorch build seq2seq MT model ,But how to get the translation results from the output tensor?

Answers (1)

Related Questions