Reputation: 39
I was trying to analyze the following image, and convert it to text, using Google Cloud Vision
I realize that it has low quality, and that some words aren't exactly uniquely identifiable. My current strategy was to threshold and splice into sections using numpy and opencv. Then, for each section I use the document_text_detection
function to get the results for each section of the sentence above. However, this gives extremely poor quality. I am not looking for 100% accuracy, but just something that looks fairly close to what is written in this line. Any suggestions to improve the result? Are there other tools than GCV that could be useful?
Upvotes: 3
Views: 954
Reputation: 1060
Vision API has a new OCR model available for opt-in. You can try it out by specifying the model "builtin/latest".
More info here: https://cloud.google.com/vision/docs/release-notes#January_21_2022
Upvotes: 4
Reputation: 2337
You could give Transkribus a try, which has been reported to provide good OCR results on hand-written historical documents.
Upvotes: 0