Reputation: 189
I am writing a program that extracts content from the logos of different websites. I am using OCR to extract the text from each logo, but I want to optimize the program by applying OCR only to logos that actually contain text. I don't know how to determine whether a logo contains text. Is there a method for this?
Upvotes: 4
Views: 1414
Reputation: 52646
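This is a text detection problem: deciding whether an image has text in it at all. That is different from OCR, which reads text once you know it is there.
The algorithm widely regarded as one of the best for this is the Stroke Width Transform (SWT). It was developed by Epshtein, Ofek, and Wexler at Microsoft and published in 2010. It does not use machine learning; it relies on the observation that text strokes tend to have a nearly constant width.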
You can find more details in the paper "Detecting Text in Natural Scenes with Stroke Width Transform".
There is an implementation of this algorithm here.
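To give a feel for the idea, here is a rough sketch in plain Python. It is not the SWT from the paper (which casts rays between edge pixels along the gradient direction); it is a much cruder stand-in that approximates stroke width on an already binarized image as the shorter of the horizontal and vertical foreground runs through each pixel, and then checks how much that width varies. The function names and the `max_variation` threshold are made up for illustration and would need tuning on real logos.

```python
from statistics import mean, pstdev


def run_lengths(line):
    """For a 1-D sequence of 0/1 values, return for each position the
    length of the foreground run it belongs to (0 for background)."""
    out = [0] * len(line)
    i = 0
    while i < len(line):
        if line[i]:
            j = i
            while j < len(line) and line[j]:
                j += 1
            for k in range(i, j):
                out[k] = j - i
            i = j
        else:
            i += 1
    return out


def stroke_widths(binary):
    """Approximate stroke width per foreground pixel as
    min(horizontal run length, vertical run length)."""
    rows = [run_lengths(r) for r in binary]        # rows[y][x]
    cols = [run_lengths(c) for c in zip(*binary)]  # cols[x][y]
    widths = []
    for y, row in enumerate(binary):
        for x, value in enumerate(row):
            if value:
                widths.append(min(rows[y][x], cols[x][y]))
    return widths


def stroke_width_variation(binary):
    """Coefficient of variation (std / mean) of the approximate stroke
    widths, or None if the image has no foreground pixels."""
    widths = stroke_widths(binary)
    if not widths:
        return None
    return pstdev(widths) / mean(widths)


def looks_like_text(binary, max_variation=0.25):
    """Hypothetical decision rule: low width variation suggests text.
    The 0.25 default is an arbitrary assumption, not a published value."""
    variation = stroke_width_variation(binary)
    return variation is not None and variation <= max_variation


# A thin vertical bar (constant width) versus a filled triangle (varying width).
bar = [[0, 1, 1, 0] for _ in range(10)]
triangle = [[1 if x <= y else 0 for x in range(10)] for y in range(10)]
print(looks_like_text(bar), looks_like_text(triangle))
```

You would only run OCR on logos for which `looks_like_text` returns true. Note that this run-length approximation overestimates the width at corners and junctions of strokes, so real glyphs score worse than the ideal bar above; the ray-based measure in the paper is far more robust.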
Upvotes: 3