Yorian

Reputation: 2062

Neural network: weights and biases convergence

I've been reading up on a few topics regarding machine learning, neural networks, and deep learning, partly through this (in my opinion) excellent online book: http://neuralnetworksanddeeplearning.com/chap1.html

For the most part I've come to understand the workings of a neural network, but there is one question that still bugs me (based on the example on that website): consider a three-layer neural network with an input layer, a hidden layer, and an output layer. Say these layers have 2, 3 and 1 neurons respectively (although the exact numbers don't really matter).

Now an input is given: x1 and x2. Because the network is [2, 3, 1], the weights are randomly generated the first time, as a list containing a 2x3 and a 3x1 matrix. The biases are a list of a 3x1 and a 1x1 matrix.
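For reference, this is roughly how the book's code sets that up (note that it stores the weight matrices transposed relative to the 2x3 / 3x1 description, i.e. shapes (3, 2) and (1, 3), so that w·x + b works with column vectors; the bias shapes are the 3x1 and 1x1 ones):

import numpy as np

sizes = [2, 3, 1]  # input, hidden and output layer sizes

# one bias column vector per non-input layer: shapes (3, 1) and (1, 1)
biases = [np.random.randn(y, 1) for y in sizes[1:]]

# one weight matrix per pair of adjacent layers: shapes (3, 2) and (1, 3)
weights = [np.random.randn(y, x) for x, y in zip(sizes[:-1], sizes[1:])]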

Now the part I don't get: the formula calculated in the hidden layer:

weights x input - biases = 0
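Just to pin down the notation: the book writes the weighted input as w·x + b and then applies the sigmoid, so the minus sign above is essentially the same thing with the bias treated as a threshold. In NumPy the hidden-layer computation would look something like this sketch (using the column-vector convention from the snippet above):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([[1.0], [2.0]])   # input column vector (x1, x2)
W1 = np.random.randn(3, 2)     # hidden-layer weights, one row per hidden neuron
b1 = np.random.randn(3, 1)     # hidden-layer biases

z1 = np.dot(W1, x) + b1        # weighted input of each hidden neuron
a1 = sigmoid(z1)               # hidden-layer activations, shape (3, 1)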

On every iteration the weights and biases are changed slightly, based on the derivative, in order to find an optimum. If this is the case, why don't the weights and biases of every neuron converge to the same values?
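To be concrete, by "changed slightly based on the derivative" I mean the plain gradient-descent update, something like this sketch (eta is the learning rate; grad_w and grad_b would come from backpropagation):

def gradient_descent_step(weights, biases, grad_w, grad_b, eta=0.1):
    # nudge every weight and bias a little in the direction that lowers the cost
    weights = [w - eta * gw for w, gw in zip(weights, grad_w)]
    biases = [b - eta * gb for b, gb in zip(biases, grad_b)]
    return weights, biases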

Upvotes: 0

Views: 486

Answers (1)

Yorian

Reputation: 2062

I think I found the answer by doing some tests as well as finding some information on the internet. The answer lies in having random initial weights and biases. If all "neurons" were equal, they would all come to the same result, since the weights, biases and inputs would be equal. Having random weights allows for different answers:

x1 = 1
x2 = 2
x3 = 3

w1 = [0, 0, 1], giving w dot x = 3
w2 = [3, 0, 0], giving w dot x = 3
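A quick numerical way to see the symmetry point (my own sketch, not from the book): if every weight and bias starts out identical, all hidden neurons compute the same activation and, by symmetry, receive the same gradient, so gradient descent keeps them identical forever. Random initialization breaks that tie:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([[1.0], [2.0], [3.0]])  # the x1, x2, x3 from the example above

# all three hidden neurons identical: same weight row, same bias
W_equal = np.ones((3, 3))
b_equal = np.zeros((3, 1))
print(sigmoid(np.dot(W_equal, x) + b_equal).ravel())  # three identical activations

# random initialization: every hidden neuron computes something different
rng = np.random.default_rng(0)
W_rand = rng.standard_normal((3, 3))
b_rand = rng.standard_normal((3, 1))
print(sigmoid(np.dot(W_rand, x) + b_rand).ravel())    # three different activations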

If anyone can confirm, please do so.

Upvotes: 0
