Reputation: 4083
I want to predict computer actions from dialog data between a human and a computer. I have 1000 dialogs for training. Each dialog has a different number of turns.
My reference paper (https://arxiv.org/abs/1702.03274) explains the training as quoted below. It uses a basic LSTM.
In training, each dialog formed one minibatch, and updates were done on full rollouts (i.e., non-truncated back propagation through time).
I have two questions about this.
I am not a native English speaker and not an expert in machine learning. Any help will be appreciated. Thank you.
I added more details. User inputs are translated to feature vectors, and system actions are translated to one-hot vectors, so this is a multi-class classification problem. In the paper, this task is tackled with a single LSTM model.
dialog 1
t1: hello ([1,0,1,0]) -> hi ([0,0,1,0])
t2: how are you ([0,1,1,0]) -> fine ([0,1,0,0])
dialog 2
t1: hey ([1,0,1,0]) -> hi ([0,0,1,0])
...
dialog 1000
...
So the problem is to predict y from x:
dialog_list = [[(x1, y1), (x2, y2)], [(x1, y1)], ...]  # length is 1000
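For concreteness, here is a minimal sketch of that layout in Python (numpy and all names besides dialog_list are my own choices):

    import numpy as np

    # dialog_list: 1000 dialogs, each a list of (x, y) turn pairs
    dialog_list = [
        [  # dialog 1: two turns
            (np.array([1, 0, 1, 0], dtype=np.float32),   # "hello" features
             np.array([0, 0, 1, 0], dtype=np.float32)),  # action "hi"
            (np.array([0, 1, 1, 0], dtype=np.float32),   # "how are you" features
             np.array([0, 1, 0, 0], dtype=np.float32)),  # action "fine"
        ],
        # ... 999 more dialogs, each with its own number of turns
    ]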
Upvotes: 1
Views: 439
Reputation: 2189
Let me explain your quote. First, let's make an assumption about the data. I assume that a dialog with 4 turns means person A says something, then B responds, then A, then B. You could then format the data as (input, response) pairs, along these lines:
[A speaks sequence 1] -> [B speaks sequence 2]
[B speaks sequence 2] -> [A speaks sequence 3]
[A speaks sequence 3] -> [B speaks sequence 4]
Notice this dialog now has duplications; we do this so that each "response", i.e. the second sentence of a pair, is linked to the sentence before it. This way of formatting your data is useful for an Encoder/Decoder LSTM: the first sequence goes into the Encoder, the second into the Decoder. Each pair is one data sample, so this dialog gives 3 samples.
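As an illustration (my code, not the paper's), building those pairs from a list of turns:

    # One 4-turn dialog -> 3 (input, response) samples with duplication
    turns = ["A speaks sequence 1", "B speaks sequence 2",
             "A speaks sequence 3", "B speaks sequence 4"]
    samples = [(turns[i], turns[i + 1]) for i in range(len(turns) - 1)]
    # [('A speaks sequence 1', 'B speaks sequence 2'),
    #  ('B speaks sequence 2', 'A speaks sequence 3'),
    #  ('A speaks sequence 3', 'B speaks sequence 4')]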
In training, each dialog formed one minibatch,
The dialog above can be a batch of 3 samples, and we can do this for all dialogs, so each dialog is a batch. With mini-batch training, a batch goes through the network in the forward pass, after which you immediately perform backpropagation and update your parameters. So yes, 1000 dialogs means 1000 mini-batches.
and updates were done on full rollouts (i.e., non-truncated back propagation through time).
As I explained above, updates are done immediately after the forward pass of your batch. That means for one epoch (i.e. going through all your data once) there are 1000 updates.
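Here is a minimal full-rollout training sketch. PyTorch is my own choice (the paper does not prescribe a framework), and I follow your framing of one action label per turn, using the dialog_list structure from the question:

    import torch
    import torch.nn as nn

    n_features, n_actions, hidden_size = 4, 4, 32
    lstm = nn.LSTM(n_features, hidden_size, batch_first=True)
    head = nn.Linear(hidden_size, n_actions)
    opt = torch.optim.Adam(list(lstm.parameters()) + list(head.parameters()))
    loss_fn = nn.CrossEntropyLoss()

    for epoch in range(10):
        for dialog in dialog_list:  # 1000 dialogs -> 1000 updates per epoch
            x = torch.stack([torch.as_tensor(f) for f, _ in dialog])           # (T, n_features)
            y = torch.stack([torch.as_tensor(a) for _, a in dialog]).argmax(1)  # (T,) class ids
            out, _ = lstm(x.unsqueeze(0))   # full rollout over all T turns
            logits = head(out.squeeze(0))   # (T, n_actions)
            loss = loss_fn(logits, y)
            opt.zero_grad()
            loss.backward()                 # non-truncated BPTT over the whole dialog
            opt.step()                      # exactly one update per dialog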
When working with RNNs, if sequences are too long, we can break them up. For example, we could split the sample [A speaks sequence 1. B speaks sequence 2.] into 3 chunks: [A speaks sequence], [1. B speaks], [sequence 2.]. We feed the first chunk into the network and backpropagate, then feed the second chunk and backpropagate, then the third. However, we need to save the last hidden state of chunk 1 to give to the beginning of chunk 2, and save the last state of chunk 2 to give to chunk 3. This is known as truncated backpropagation through time (TBPTT). If you "fully unroll" the sequence before backpropagating, you are not doing TBPTT: for each batch you update the network only once, not 3 times as in my example.
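For contrast, a TBPTT sketch, reusing the model, optimizer, and loss function from the sketch above; here x is one long input of shape (1, T, n_features), y holds T per-step class ids, and chunk_len is an arbitrary choice of mine:

    chunk_len = 20
    state = None
    for start in range(0, x.size(1), chunk_len):
        chunk = x[:, start:start + chunk_len]      # next slice of the long sequence
        out, state = lstm(chunk, state)            # carry hidden state across chunks
        state = tuple(s.detach() for s in state)   # but cut the gradient history here
        logits = head(out.squeeze(0))
        loss = loss_fn(logits, y[start:start + chunk_len])
        opt.zero_grad()
        loss.backward()                            # backprop only within this chunk
        opt.step()                                 # one update per chunk, not per sequence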
Hope that helps.
Upvotes: 1