moonwalker7

Reputation: 1288

MNIST: Trying to obtain high accuracy

I'm currently working on the handwritten digits recognition problem.

To start with, I have tested sample handwritten digits against MNIST dataset.

I'm getting an accuracy of 53% and I need above 90% accuracy.

Following are the things that I have tried so far to increase the accuracy.

  1. Created my own dataset

    • I have created 41,000 examples. To get started, I made a small dataset of 10,000 examples (1,000 per digit).

    • The dataset was created along the lines of the MNIST format (with the idea of combining my dataset with MNIST at a later stage). The accuracy of a model built on this was close to 65%.

  2. Approach

    • I have been testing my input data with a softmax classifier, but it didn't help much with accuracy, so I have started trying a CNN.
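For context, the softmax step above amounts to something like the following (a minimal NumPy sketch on random toy data; the weights and inputs here are illustrative, not from the actual script):

```python
import numpy as np

def softmax(z):
    # Subtract the row max for numerical stability before exponentiating
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

# Toy "images": 4 samples of 784 flattened pixels, random untrained weights
rng = np.random.default_rng(0)
X = rng.random((4, 784))
W = rng.standard_normal((784, 10)) * 0.01
b = np.zeros(10)

probs = softmax(X @ W + b)      # per-digit probabilities, rows sum to 1
preds = probs.argmax(axis=1)    # predicted digit per sample
print(probs.shape, preds.shape)  # (4, 10) (4,)
```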

So my question is:

  1. Is there another approach/algorithm that would detect the digits more accurately?

  2. Do I need to train the model more?

  3. Do I need to clean the images?

  4. I am working on combining the MNIST dataset and my dataset (41,000 digits) to see if that increases the accuracy.

Code

To test my images against MNIST (run the MNIST training before this code).

You can find the IPython notebooks for:

  1. Testing my sample digits against MNIST (Script 1)

  2. Testing my sample digits against my dataset (Script 2)

The scripts and the images are available at this link

Upvotes: 1

Views: 2165

Answers (2)

Orr Levy

Reputation: 11

You can try converting the picture to monochrome (in MNIST each pixel value is between 0 and 255) and using the test "is pixel i > 0". This raised our accuracy to 80%. You can also try dividing the picture into frames (try 4 or 8). Moreover, you can build tests based on lines, curves, etc.
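That thresholding test can be sketched like this (a minimal NumPy example; the sample array and zero threshold are just illustrative):

```python
import numpy as np

def binarize(images, threshold=0):
    # Map each grayscale pixel (0-255) to 0 or 1 via "is pixel > threshold"
    return (images > threshold).astype(np.uint8)

img = np.array([[0, 12, 255],
                [0,  0, 128]], dtype=np.uint8)
print(binarize(img))
# [[0 1 1]
#  [0 0 1]]
```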

You can take a look at my implementation, which got me up to 91%: https://github.com/orlevy08/Data-Analysis

Upvotes: 0

Sorin

Reputation: 11968

First a few notes:

  • A CNN and softmax are not mutually exclusive. You can use the CNN for the lower layers and softmax for the prediction layer (it tends to work best because there is only one correct answer).
  • Hard to say anything about your training. Please consider posting a graph of the loss on the training set and on an eval set. Normally both curves should go down and then flatten out. If they don't flatten out, you should train longer. If the two curves start diverging, you need more regularization or to stop training earlier.
  • You should always try various parameters (number of layers, number of neurons, activation functions, etc.). This is called hyper-parameter tuning; there are tools out there to help you, and it usually improves results a lot.
  • You should also try a plain deep neural network in addition to the CNN. I've seen good results with the canned DNNClassifier.
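The first point above, a CNN for the lower layers with a softmax prediction head, can be sketched in Keras like this (the layer sizes here are common defaults for 28x28 MNIST inputs, not tuned values):

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Conv layers extract features; the final Dense softmax layer turns
# them into a probability distribution over the 10 digit classes.
model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(10, activation="softmax"),  # softmax prediction head
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
print(model.output_shape)  # (None, 10)
```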

Assuming you do all this and don't see improvements, it might imply there's an issue with your data. Check a confusion matrix to see where the model is having trouble, and look at some examples that are misclassified. From my experience, I've seen 1s and 7s in datasets that are almost indistinguishable. This is not exactly a solution, but it should point you in the right direction on what you need to fix.
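A confusion matrix is easy to build by hand (a minimal NumPy sketch; the labels below are made up to show a 7-mistaken-for-1 case):

```python
import numpy as np

def confusion_matrix(y_true, y_pred, n_classes=10):
    # cm[i, j] = number of examples whose true label is i
    # but which the model predicted as j
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        cm[t, p] += 1
    return cm

y_true = [1, 7, 1, 7, 3]
y_pred = [1, 1, 1, 7, 3]   # one 7 mistaken for a 1
cm = confusion_matrix(y_true, y_pred)
print(cm[7, 1])  # 1 -> a single true-7/predicted-1 confusion
```

Large off-diagonal entries tell you which digit pairs the model confuses, which is where to look for bad or ambiguous training examples.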

Upvotes: 3
