Extract data from image containing table grid using python

Question

I have images such as the one attached below. I need to extract the data within the grid along with the tabular structure and transform it into a dataframe/csv.

I am using OCR to extract the text along with the coordinates but in order to extract the table structure I would like to extract the horizontal and vertical grid lines.

Is there a method in OpenCV to do that that would generalize well ?

So far the approaches I've come across are : 1. Hough Lines 2. Extracting Rectangular contours 3. Drawing vertical and horizonal contours

Chrys Bltr · Accepted Answer

You can define a grid structure and extract information from all separate area with openCV, check this article A Box detection algorithm for any image containing boxes

Everything is perfectly explained

Extract data from image containing table grid using python

Answers (2)

Related Questions