Reputation: 73
I have an encoder-decoder network with 3 BLSTMs in the encoder and 2 vanilla LSTMs in the decoder, connected through a multi-head attention layer with 4 heads. The latent dimension is 32 and my dataset has shape (10000, 400, 128). The encoder uses a dropout of 0.2 and the decoder a dropout of 0.3. I'm using the Adam optimizer with a learning rate of 0.001 and Mean Squared Error loss, with a validation split of 0.3. I rented an Nvidia Titan V (with a Core™ i9-9820X, 5.0/20 cores and 16/64 GB total effective shared RAM) on Vast.ai, and each epoch takes ~6 minutes when I train on everything (7000 training and 3000 validation samples).
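For reference, a rough Keras sketch of the architecture (the exact layer placement and the attention wiring are approximations of what I described above):

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

latent_dim = 32
timesteps, features = 400, 128

enc_in = layers.Input(shape=(timesteps, features))
x = enc_in
# 3 stacked bidirectional LSTMs in the encoder, dropout 0.2
for _ in range(3):
    x = layers.Bidirectional(
        layers.LSTM(latent_dim, return_sequences=True, dropout=0.2))(x)

# multi-head attention with 4 heads bridging encoder and decoder
attn = layers.MultiHeadAttention(num_heads=4, key_dim=latent_dim)(x, x)

# 2 vanilla LSTMs in the decoder, dropout 0.3
y = layers.LSTM(latent_dim, return_sequences=True, dropout=0.3)(attn)
y = layers.LSTM(latent_dim, return_sequences=True, dropout=0.3)(y)
out = layers.TimeDistributed(layers.Dense(features))(y)

model = Model(enc_in, out)
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001), loss="mse")
```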
I was hoping to find ways to reduce the total training time. Any suggestions would be great.
Upvotes: 0
Views: 2258
Reputation: 132
The first things that come to mind are early stopping callbacks and changing the batch size.
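As a sketch of what that could look like in Keras (the patience value and batch size here are just starting points to tune, and `x_train`/`y_train` stand in for your data):

```python
from tensorflow.keras.callbacks import EarlyStopping

early_stop = EarlyStopping(monitor="val_loss", patience=5, restore_best_weights=True)

model.fit(
    x_train, y_train,
    validation_split=0.3,
    batch_size=128,          # try larger batches to keep the GPU busy
    epochs=100,
    callbacks=[early_stop],  # stop once val_loss stops improving
)
```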
Although I haven't tried it myself, batch normalization is also considered to make training more efficient.
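A rough sketch of inserting batch normalization between the recurrent layers (whether it helps, and the exact placement, is something you'd have to experiment with):

```python
from tensorflow.keras import layers

inputs = layers.Input(shape=(400, 128))
x = layers.Bidirectional(layers.LSTM(32, return_sequences=True, dropout=0.2))(inputs)
x = layers.BatchNormalization()(x)   # normalize activations between recurrent layers
x = layers.Bidirectional(layers.LSTM(32, return_sequences=True, dropout=0.2))(x)
x = layers.BatchNormalization()(x)
```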
In my (not so relevant) case, I saw a great improvement in training speed and quality after normalizing the data. So, maybe data normalization/standardization could help a bit.
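For instance, a simple per-feature standardization using training-set statistics only (assuming your data keeps the (samples, timesteps, features) shape you described):

```python
import numpy as np

# x_train, x_val: arrays shaped (samples, 400, 128)
mean = x_train.mean(axis=(0, 1), keepdims=True)
std = x_train.std(axis=(0, 1), keepdims=True) + 1e-8  # avoid division by zero

x_train = (x_train - mean) / std
x_val = (x_val - mean) / std  # reuse the training statistics on validation data
```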
Last but not least, GRU networks tend to train faster, but in some cases they underperform compared to LSTM networks. I don't know if you are willing to change your model, but I thought I should mention it. A sketch of the swap is below (just an illustration, keeping your layer sizes and dropout unchanged).
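```python
from tensorflow.keras import layers

inputs = layers.Input(shape=(400, 128))
# bidirectional GRU in place of a BLSTM encoder layer
x = layers.Bidirectional(layers.GRU(32, return_sequences=True, dropout=0.2))(inputs)
# vanilla GRU in place of a decoder LSTM
y = layers.GRU(32, return_sequences=True, dropout=0.3)(x)
```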
Upvotes: 1