Python Scikit-learn - An attempt at a low level OCR

Question

I want to train a SVM to perform a classification of Images of Digits (0-9) and then use it to read images with numerical values(a low level OCR).

My idea is to read images one by one and store them in numpy array and then to put all those arrays in an array, so as to make my sample_array.

As from the Scikit-Learn documentation " As other classifiers, SVC, NuSVC and LinearSVC take as input two arrays: an array X of size [n_samples, n_features] holding the training samples, and an array y of class labels (strings or integers), size [n_samples]:"

My question is what should be the features of the Images, and how to define them?

I read the Handwriting Recognition example on the tutorial page of the scikit-learn page, but over there the data is already in a dataset(even after a searching the web quite intensively I still don't know how to get my PICS to convert to a dataset) so you see it's a different situation.

A Secondary question: will scikit-image be helpful to use in this situation? I'm using standard PIL for reading images.

Python Scikit-learn - An attempt at a low level OCR

Answers (1)

Related Questions