Reputation: 1159
I read somewhere that mean squared error loss is good for regression, and cross-entropy loss for classification.
When I tried to train XOR as a classification problem with cross-entropy loss, the network failed to converge.
My setting:
The network is 2-2-2.
The first output signals class 0 and the second class 1 (so two classes of inputs).
Cross entropy is used to calculate the error in the output layer of the network instead of mean squared error.
As the activation function, I'm using logsig.
Apparently I'm missing something. Where is my mistake?
Upvotes: 0
Views: 530
Reputation: 15867
Here's an implementation of this network in Mathematica:
net = NetChain[{2, Tanh, 2, Tanh, 1, LogisticSigmoid}, "Input" -> {2}];
eps = 0.01;
data = {{0, 0} -> {eps}, {1, 0} -> {1 - eps}, {0, 1} -> {1 - eps}, {1, 1} -> {eps}};
trained =
 NetTrain[net, data, CrossEntropyLossLayer["Binary"],
  MaxTrainingRounds -> Quantity[5, "Minutes"], TargetDevice -> "GPU"]
It converges after a few thousand rounds. So, I don't think you're missing anything; there's probably a bug in your library.
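For comparison, the same experiment works with the exact 2-2-2 layout from the question (two outputs, one per class). Below is a minimal NumPy sketch, not your actual code: logistic hidden units, a softmax output, cross-entropy loss, and plain gradient descent. The learning rate, epoch count, and random initialisation are all assumptions chosen to make the toy problem converge.

```python
import numpy as np

def train_xor(seed, epochs=20000, lr=1.0):
    """Train a 2-2-2 network (logistic hidden units, softmax output,
    cross-entropy loss) on XOR and return the predicted classes."""
    rng = np.random.default_rng(seed)
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    # one-hot targets: first output signals class 0, second signals class 1
    T = np.array([[1, 0], [0, 1], [0, 1], [1, 0]], dtype=float)

    W1 = rng.normal(size=(2, 2)); b1 = np.zeros(2)
    W2 = rng.normal(size=(2, 2)); b2 = np.zeros(2)

    for _ in range(epochs):
        H = 1.0 / (1.0 + np.exp(-(X @ W1 + b1)))    # logistic hidden layer
        Z = H @ W2 + b2
        Z -= Z.max(axis=1, keepdims=True)           # numerical stability
        P = np.exp(Z) / np.exp(Z).sum(axis=1, keepdims=True)  # softmax

        # softmax + cross-entropy gives the simple output gradient P - T
        dZ = (P - T) / len(X)
        dW2 = H.T @ dZ; db2 = dZ.sum(axis=0)
        dH = dZ @ W2.T
        dZ1 = dH * H * (1 - H)                      # logistic derivative
        dW1 = X.T @ dZ1; db1 = dZ1.sum(axis=0)

        W2 -= lr * dW2; b2 -= lr * db2
        W1 -= lr * dW1; b1 -= lr * db1

    return P.argmax(axis=1)

# With only two hidden units, XOR training can stall in a local minimum,
# so retry with a few random initialisations.
for seed in range(10):
    pred = train_xor(seed)
    if list(pred) == [0, 1, 1, 0]:
        break
print(pred)
```

Note the restart loop: a 2-2-2 network is the minimal architecture for XOR, and some initialisations do get stuck, so a single failed run is not by itself evidence of a bug.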
Upvotes: 1