Reputation: 343
I am using this for OCR in my android application.
I am using the default camera app for taking pictures and feeding the same image to the tesseract library. I am using this sample project currently. When I am processing any image I am getting inaccurate results along with many random single alphabets. I have read many posts and found that I need to do some preprocessing on the picture taken by me, so please help me with this. Any detailed explanation on processing of image will be extremely helpful.
Thanks.
Upvotes: 2
Views: 1497
Reputation: 1183
I got a pretty good result (85%) through applying a threshold filter.
Note, your input image is not the best it could be.
If you could ensure the page was head-on, that no text bleeds from the page underneath and that the image is focussed, then you should look at applying an Otsu or adaptive threshold prior to Tesseract with OpenCV. I have had better results performing my own thresholding often than leaving it to Tesseract
http://docs.opencv.org/trunk/doc/py_tutorials/py_imgproc/py_thresholding/py_thresholding.html
Upvotes: 1