Jana
Jana

Reputation: 189

OCR algorithm- distinguish between textual image and object image

Am writing a program that extracts the contents from the logo of different websites.. i am using OCR to extract the text from the logo but i want to optimize the program and want to apply OCR only on those logos which have text but i dont know how to determine if a logo contains text or not??? any method??

Upvotes: 4

Views: 1414

Answers (1)

Abid Rahman K
Abid Rahman K

Reputation: 52646

this is a case where we need to know if an image has text in it. It is different from OCR.

The algorithm which is considered to be best to date is Stroke Width Transform. It was designed by Ephstein under Microsoft in 2010. It doesn't use any machine learning purposes.

You can get more details from this paper : Detecting Text in Natural Scenes with Stroke Width Transform

Or watch a video about this.

There is an implementation of this algorithm here.

Upvotes: 3

Related Questions