Reputation: 1349
I want to convert *.PDF file to searchable *.PDF files using alfresco and tesseract OCR.
tesseract version 3.03 needs to be compiled and i need to generate setup of that using source code.Is there any other solution for the same. Can anyone help for the same?
Upvotes: 1
Views: 2548
Reputation: 2203
you can use another tool which is directly performing pdf to searchable pdf conversion.This tool is using tesseract internally for this conversion.You can find more details on below link and configure same for alfresco.
http://ubuntuforums.org/showthread.php?t=1456756
command
pdfocr -i input.pdf -o output.pdf
Upvotes: 2
Reputation: 8345
You'll need Tesseract 3.03 or later for searchable PDF output feature.
tesseract yourimage.tif out pdf
Upvotes: 4