Python OCR Tesseract cannot recognize Single Characters

Question

I have two TIF images. First Image(a.tif) is:

and Second Image(bcd.tif) is

When I am using "tesseract a.tif a.txt" it is not reading that Character and The same command "tesseract bcd.tif bcd.txt" is working.I have seen some answers in stackoverflow they they didn't gave solution how to run that.If we need to add any parameters what are those?

Nisarg Shah · Accepted Answer

Seems like the issue has something to do with there being only a single character in the image. For instance I tried these two images:

This one works fine. Tesseract reports 95% confidence in the result:

This one doesn't work.

I also tried scanning that image with PageSegMode set to SingleChar, and then it is scanned fine.

The command line argument for that should be -psm 10. See this: https://stackoverflow.com/a/26418458/5894241

Python OCR Tesseract cannot recognize Single Characters

Answers (2)

Related Questions