Difference between Tensorflow's tf.keras.layers.Dense and PyTorch's torch.nn.Linear?

Question

I have a quick (and possibly silly) question about how TensorFlow defines its Linear layer. In PyTorch, a Linear (or Dense) layer is defined as y = x A^T + b, where A and b are the layer's weight matrix and bias vector (see here).

However, I can't find an equivalent equation for TensorFlow. Is it the same as in PyTorch, or is it just y = x A + b?

Thank you in advance!

Alex · Accepted Answer

tf.keras.layers.Dense is defined here in the TensorFlow source code:

https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/keras/layers/core.py#L1081

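Following the references in its call function leads to the operation below, which is indeed a matrix multiplication of the inputs and the kernel plus a bias vector, as expected. The kernel is not transposed: Dense stores it with shape (input_dim, units) and computes y = x W + b, whereas PyTorch stores A with shape (out_features, in_features), so the two layers compute the same thing with W = A^T. The operation is defined here: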

https://github.com/tensorflow/tensorflow/blob/a68c6117a1a53431e739752bd2ab8654dbe2534a/tensorflow/python/keras/layers/ops/core.py#L74

outputs = gen_math_ops.MatMul(a=inputs, b=kernel)
...
outputs = nn_ops.bias_add(outputs, bias)
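
To compare the two weight layouts side by side, here is a minimal, dependency-free sketch in plain Python rather than either framework. The helper names (matmul, transpose, add_bias, linear_pytorch_style, dense_keras_style) are made up for illustration and are not part of TensorFlow or PyTorch. It assumes the usual shapes: (out_features, in_features) for the PyTorch weight and (in_features, out_features) for the Keras kernel.

```python
# Illustrative only: nested lists stand in for tensors, and all helper
# names are hypothetical, not TensorFlow or PyTorch APIs.

def matmul(x, w):
    """Multiply an (n, k) matrix by a (k, m) matrix given as nested lists."""
    return [[sum(xi * wi for xi, wi in zip(row, col)) for col in zip(*w)]
            for row in x]

def transpose(m):
    """Swap the rows and columns of a nested-list matrix."""
    return [list(col) for col in zip(*m)]

def add_bias(y, b):
    """Add the bias vector b to every row of y."""
    return [[v + bj for v, bj in zip(row, b)] for row in y]

def linear_pytorch_style(x, A, b):
    # A has shape (out_features, in_features): y = x A^T + b
    return add_bias(matmul(x, transpose(A)), b)

def dense_keras_style(x, kernel, b):
    # kernel has shape (in_features, out_features): y = x kernel + b
    return add_bias(matmul(x, kernel), b)

x = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0]]        # batch of 2, in_features = 3
A = [[1.0, 0.0, -1.0],
     [0.5, 2.0, 0.0]]        # PyTorch layout: (out_features=2, in_features=3)
b = [0.1, -0.2]
kernel = transpose(A)        # Keras layout: (in_features=3, out_features=2)

y_torch = linear_pytorch_style(x, A, b)
y_keras = dense_keras_style(x, kernel, b)
print(y_torch == y_keras)
```

In practice this means that copying weights from one framework to the other requires transposing the weight matrix, while the bias vector can be copied unchanged.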
