Document AI to perform automatic research in large amount of data from pdf files

Question

I need to add a feature for my app to allow my clients to extract text from image texts and parse them to usable data like json format and store them to then be able to perform better data research.

Those image-texts are big pdf files (~150-500 pages) and my clients would want to be able to upload a large amount of those files because for now they have to look for the data they need by manually reading all the pdf files.

For now I'm considering using the google API Cloud Document AI which seems do do exactly what I need really easily, especially combined to the Document AI warehouse API. But I heard here and there that the OCR quality of document AI may not be reliable. Do you have any feedback about that? Or another way to do what I want?

Document AI to perform automatic research in large amount of data from pdf files

Answers (1)

Related Questions