atulsri
atulsri

Reputation: 33

Azure Computer Vision API not extracting text from cheque image correctly

I used Azure Cognitive Vision API to extract the text from a cheque image. But, it is not correctly extracting the text from cheque. Example MICR code having characters like " || are incorrectly read into some other digits. Is there any way we can work on to improve the accuracy or set some context to specifically extract text from cheque images?

Sample Image used for Vision API: enter image description here

Output of Vision API:

MDBS
Date
2 16
NOT NEG
AS PAYES
D
D
M
M
Y
Y
Pay
Hexagon metrology Asia Pacific
or Bearer
SECURA 3011678/2 06/19
Singapore Siahd hundred and niudy- S$ 896<
Dollars
Sit only
2M PRECISION ENGRG
DBS Bank Ltd
PTE LTD
Cheque No.
Bank/Branch Code
Account No.
Please sign above this line
1;91 308 2081717 00540 0549001996

Upvotes: 1

Views: 892

Answers (2)

Deeks
Deeks

Reputation: 1

Not just Azure, even Google cloud Vision API or AWS Rekognition Text Detection has the same issues while reading the MICR code. The separators are predicted as either 1 or : (colon). We've built a custom model to overcome such issues. https://api.arya.ai/cheque-extraction

Upvotes: 0

Ram
Ram

Reputation: 2754

This custom tags is currently not available. However, our engineers are working to bring this functionality to Computer Vision. You can use the custom vision to detect.

Azure Form Recognizer does a fantastic job in creating a viable solution with just five sample documents. It performs end-to-end Optical Character Recognition (OCR) on handwritten as well as digital documents with an amazing accuracy score and in just three seconds.

If you are extracting only text, tables and selection marks from documents you should use layout, if you also need to extract key value pairs you can train a custom model or use a pre-built (Invoice, Receipts, Business Cards). Layout results (text, tables and selection marks) are included in all the Analyze outputs (custom and pre-built) in the readResults (text) and pageResults (tables) of the JSON output.

• Layout – extract text, tables selection marks no training required

• Pre-built – Invoice, Receipts, Business Cards – extract values of interest from these type of documents

• Custom – Extract key value pairs trained on your own documents

Here is the snapshot of output text computer vision API. enter image description here.

Please follow the Computer Vision API Frequently Asked Questions that could help.

Upvotes: 1

Related Questions