Arunesh Singh

Reputation: 3535

Image recognition from Computer Screen

I am trying to extract text from the image below. I tried OCR in Python, but it gives me incorrect results.

Test image

I preprocessed the image: removed the underline, applied the Canny edge detector, increased the contrast, and then fed it to OCR. Still, I am not getting the expected output.

With my limited knowledge, I tried to separate the characters out of the image after increasing the contrast.

import cv2
import numpy as np
import os

image_path = os.path.join(os.path.dirname(__file__), "image.png")

im = cv2.imread(image_path)

gray = cv2.cvtColor(im,cv2.COLOR_BGR2GRAY)


# binarize: map dark pixels to black, everything else to white
gray[gray < 100] = 0
gray[gray >= 100] = 255

# trim all-white rows/columns and drop the all-black underline row
gray = gray[~np.all(gray == 255, axis=1)]
gray = gray[:, ~np.all(gray == 255, axis=0)]
gray = gray[~np.all(gray == 0, axis=1)]

# debug: inspect the split positions
print(np.where(np.all(gray == 255, axis=0)))
print(gray[:, 20:33])

# split the image at every all-white column
words = np.hsplit(gray, np.where(np.all(gray == 255, axis=0))[0])

i = 0
for word in words:
    word = word[:, ~np.all(word == 255, axis=0)]
    if word.size:
        print(word.shape)
        i = i + 1
        cv2.imwrite("temp" + str(i) + ".png", word)
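The `hsplit` call above splits at every all-white column index, so it produces many empty or all-white fragments that the `word.size` check then filters out. A minimal synthetic illustration of that behavior (the array values here are made up for the demo):

```python
import numpy as np

# Synthetic binarized "image": two dark blobs separated by white (255) columns.
img = np.full((5, 9), 255, dtype=np.uint8)
img[:, 1:3] = 0   # first "character"
img[:, 6:8] = 0   # second "character"

# Split at every column that is entirely white.
split_cols = np.where(np.all(img == 255, axis=0))[0]
pieces = np.hsplit(img, split_cols)

# Drop all-white columns inside each piece and keep only non-empty pieces.
chars = []
for piece in pieces:
    piece = piece[:, ~np.all(piece == 255, axis=0)]
    if piece.size:
        chars.append(piece)

print(len(chars))               # → 2
print([c.shape for c in chars]) # → [(5, 2), (5, 2)]
```

Two clean character blocks come out, everything else is filtered away.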

It became like this

Cropped images

Then I gave these crops as input to pytesseract, and it returned blank output.

Here are my doubts.

  1. Is there a better mechanism to separate characters from an image at whitespace? The current approach seems very fragile to me.
  2. How can we preprocess the image so it is better recognized by OCR?
  3. Can we use neural networks or an SVM here, as is done for the MNIST digits dataset?

Short pointers are fine if this seems too broad. What is the best approach to tackle this kind of problem?

Upvotes: 2

Views: 3577

Answers (1)

Nikolas Rieble

Reputation: 2601

This answer implements what is said in my comment.

I changed your code a little and refrained from using OpenCV. The code is written for Python 3.5.

To extract the digits, I sum the image column-wise and scale the resulting array to obtain `check`. I operate on the gray image that you already cropped, which effectively gets rid of the underline.

x_sum = np.sum(gray, axis = 0)
check = ((x_sum)/np.max(x_sum)*10)

This array can now be compared against a threshold to identify the regions where a letter/digit is located:

plt.imshow(gray, cmap='gray')
x_sum = np.sum(gray, axis = 0)
check = ((x_sum)/np.max(x_sum)*10)
plt.plot((check<8).astype(int))
plt.show()
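The idea is that columns containing dark (letter) pixels sum to less than all-white columns, so the scaled profile dips below the threshold exactly where glyphs sit. A small self-contained check of that logic (the threshold of 8 mirrors the answer, but the array here is synthetic):

```python
import numpy as np

gray = np.full((10, 8), 255, dtype=np.uint8)
gray[2:8, 2:4] = 0                 # one dark glyph spanning columns 2-3

x_sum = np.sum(gray, axis=0)       # all-white columns sum to 2550 here
check = x_sum / np.max(x_sum) * 10 # 10.0 for white columns, lower where ink is

mask = (check < 8).astype(int)     # 1 where a glyph column is
print(mask)                        # → [0 0 1 1 0 0 0 0]
```

The mask marks exactly the two glyph columns.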

[Plot: the binarized column-sum profile over the gray image]

Now we use this information to modify the image, erasing the columns where the thresholded check array is 0 (i.e., the all-white background columns):

for idx,i in enumerate((check<8).astype(int)):     
    if i < 1:
        gray[:,idx] = 255

Therefore we have this image:

[Image: the digits with the background columns set to pure white]

This can be further processed just as you are already doing. It yields separated letters/digits, which can then be post-processed for learning.

The next step would be scaling/resizing the letters/digits so that each one is described by the same number of features.
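A simple way to bring every crop to the same feature length is to resize each one to a fixed grid, e.g. 28x28 as in MNIST, and flatten it. A sketch using PIL's resize (the target size and the helper name are assumptions, not part of the answer's code):

```python
import numpy as np
from PIL import Image

def to_feature_vector(word, size=(28, 28)):
    """Resize a 2-D uint8 crop to a fixed grid and flatten to a 1-D vector."""
    img = Image.fromarray(word.astype(np.uint8))
    img = img.resize(size, Image.BILINEAR)
    return np.asarray(img, dtype=np.float32).ravel() / 255.0

crop = np.zeros((13, 7), dtype=np.uint8)  # dummy crop of arbitrary shape
vec = to_feature_vector(crop)
print(vec.shape)                          # → (784,)
```

Crops of any shape now map to vectors of identical length, which is what a classifier needs.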

Finally, you can use a pretrained classifier to predict the most probable letter/digit.
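With fixed-length vectors, any standard classifier works. A hedged sketch with scikit-learn's `SVC`, one of the options the question asks about (the training data here is random noise standing in for labeled glyphs, so the fitted model itself is meaningless):

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Stand-in training set: 20 flattened 28x28 "glyphs" with fake labels 0/1.
X_train = rng.random((20, 784))
y_train = np.repeat([0, 1], 10)

clf = SVC(kernel="rbf")          # an SVM, as asked about in the question
clf.fit(X_train, y_train)

pred = clf.predict(rng.random((3, 784)))
print(pred.shape)                # → (3,)
```

In practice you would train on real labeled character crops (or use a model pretrained on something like MNIST/EMNIST for digits and letters).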

The full code is provided here:

import numpy as np
import matplotlib.pyplot as plt
from matplotlib import gridspec
from PIL import Image

image = Image.open("testl.png")
f = image.convert('I')   # 32-bit integer grayscale

gray = np.array(f)
# binarize
gray[gray < 200] = 0
gray[gray >= 200] = 255

# trim all-white rows/columns and drop the all-black underline row
gray = gray[~np.all(gray == 255, axis=1)]
gray = gray[:, ~np.all(gray == 255, axis=0)]
gray = gray[~np.all(gray == 0, axis=1)]

# column-sum profile, scaled to [0, 10]
plt.imshow(gray, cmap='gray')
x_sum = np.sum(gray, axis=0)
check = x_sum / np.max(x_sum) * 10
plt.plot((check < 8).astype(int))
plt.show()

plt.matshow(gray)
plt.show()

# erase the background columns (where the thresholded profile is 0)
for idx, i in enumerate((check < 8).astype(int)):
    if i < 1:
        gray[:, idx] = 255

plt.matshow(gray)
plt.show()

# split at every all-white column
words = np.hsplit(gray, np.where(np.all(gray >= 200, axis=0))[0])

gs = gridspec.GridSpec(1, len(words))
fig = plt.figure(figsize=(len(words), 1))

i = 0
for word in words:
    word = word[:, ~np.all(word >= 230, axis=0)]
    if word.size:
        ax = fig.add_subplot(gs[i])
        print(word.shape)
        i = i + 1
        ax.matshow(word, aspect='auto')
plt.show()

This finally yields all the separated letters/digits:

[Image: the individual letters/digits plotted side by side]

Upvotes: 1
