Python 3 OCR with custom characters

Question

I have images of roughly this format that I would like to parse into numbers:

I have attempted to use the pytesseract module but found the results to be lacking. Occasionally a 5 would be read as a 6 and so on. I was also forced to manually detect the colored circles, as they were generally interpreted as 0.

Sample code used:

import pytesseract
from PIL import Image
img = Image.open("foo.png")
print(pytesseract.image_to_string(img))

> 150150150

Is there a way that allows me to specify that, for instance, the yellow circle would map to a custom character that would be represented as, say, yellow? An expected result of parsing the sample image would result in something like 15 yellow 15 gray 15 brown

Also, since the font is mostly constant and only background color varies slightly, is there a way to train tesseract with images of digits that I would manually feed it before using it to identify actual images?

Python 3 OCR with custom characters

Answers (1)

Related Questions