Parse only a specific part of image with Tesseract

Question

I am trying to use Tesseract OCR on Android to read the state of a gas meter when you take a picture of it:

This is the output when I parse this image:

vb"
22% BK-G4T ||||||||I||||I|||ii\|||\
’ 64 2007
22?: 06.0"! 'm'lm Mm. 23212274 ,
v 2,0 dm’ 1
pmn 0_5 bar tm ~25°C v‘40"(1 I
1amp é 0_o1m’ sb15°cl :Sp 20°c l
'I ELSTEQ~I¢¢>>InstrogwnSs HB Z _ 18 _ 1013 . ‘
a, 069373593435- 3 I
i'23212214 Y _ w w V'
g

The idea is to extract the first 5 digits of the state of the gas meter ( 06937 on this image ).

My question is, is there a way to train Tesseract to only parse this part of the image? Absolute coordinates are not an option since every picture would be different. I am guessing the best logic would be something like: parse only white numbers on black background.

Parse only a specific part of image with Tesseract

Answers (1)

Related Questions