nihartrivedi810
nihartrivedi810

Reputation: 343

Android tesseract OCR improving results

I am using this for OCR in my android application.

I am using the default camera app for taking pictures and feeding the same image to the tesseract library. I am using this sample project currently. When I am processing any image I am getting inaccurate results along with many random single alphabets. I have read many posts and found that I need to do some preprocessing on the picture taken by me, so please help me with this. Any detailed explanation on processing of image will be extremely helpful.

Thanks.

This is the sample image

Upvotes: 2

Views: 1497

Answers (1)

PorridgeBear
PorridgeBear

Reputation: 1183

I got a pretty good result (85%) through applying a threshold filter.

Note, your input image is not the best it could be.

  • It is blurry
  • There appears to be text from the back side of the page coming through
  • The page is at an angle

If you could ensure the page was head-on, that no text bleeds from the page underneath and that the image is focussed, then you should look at applying an Otsu or adaptive threshold prior to Tesseract with OpenCV. I have had better results performing my own thresholding often than leaving it to Tesseract

http://docs.opencv.org/trunk/doc/py_tutorials/py_imgproc/py_thresholding/py_thresholding.html

Upvotes: 1

Related Questions