paratext
paratext

Reputation: 53

Generate hOCR from Microsoft Computer Vision OCR

I'm using the Microsoft Read API to derive OCR data from local images. My script is based on this tutorial: https://learn.microsoft.com/en-us/azure/cognitive-services/computer-vision/quickstarts/python-hand-text?tabs=version-2.

While I can obtain the OCR data in JSON format, I would like to get the output as hOCR. I found this resource: https://learn.microsoft.com/en-us/samples/azure-samples/azure-search-power-skills/azure-hocr-generator-sample/. It's not clear to me how to implement this in relation to the Python script in the tutorial.

Does anybody know how to convert the JSON output to hOCR?

Upvotes: 2

Views: 1023

Answers (0)

Related Questions