Reputation: 65
I'm working with searchable pdf, there is no need to OCR the document. Is there a rule/action in Datacap that can read the data instead of using OCR?
Upvotes: 0
Views: 174
Reputation: 873
Yes, in IBM Datacap, there is a way to extract data from a searchable PDF without using OCR. This can be done by utilizing the PDF actions that are available in Datacap. Specifically, you can use the PDF action library which provides a set of actions for working with PDF files.
Using PDF Actions in Datacap Here are some of the key actions from the PDF action library that you can use to extract text from a searchable PDF:
PDF::GetPDFText: This action extracts text from a PDF and places it into a specified variable.
PDF::ExtractText: This action extracts text from specific coordinates or zones within the PDF.
Upvotes: 0