Python tesseract OCR detecting text improperly

Question

testimg.png:

I am trying to detect the Text in this image and it gives me nothing, it works for some other text barely. It gives me some of the word replaced with random letters and i have to compare it to a word list to get the right one

This is my image i am trying to detect the text ive tried tons of stuff with greyscale and inversion using tesseract but nothing seems to work it keeps giving me nothing. How can I fix it or train it?

Here is my current code

pyautogui.screenshot("testimg.png", region=(1050, 30, 455, 50)) #name

originalImage = cv2.imread('testimg.png')

# Convert the image to grayscale
grayImage = cv2.cvtColor(originalImage, cv2.COLOR_BGR2GRAY)

# Apply a threshold to get a binary image
(_, blackAndWhiteImage) = cv2.threshold(grayImage, 127, 255, cv2.THRESH_BINARY_INV)


custom_config = r'--psm 7'
text = pytesseract.image_to_string(blackAndWhiteImage, config=custom_config)
print('Extracted Text: ', text)

Python tesseract OCR detecting text improperly

Answers (1)

Related Questions