jazzviolin
jazzviolin

Reputation: 25

Applescript or Automator: Run Acrobat X Pro to batch OCR multiple PDF files, more

I'm using a ScanSnap S1500M to scan all paper documents to the folder /PDF-scans/ – I'd like to use Adobe Acrobat X Professional to OCR the text.

I'd like to automate this process (daily):

Should I use Automator? Is there a script that can do this? Does it have to be tied to iCal's repeating events?

Thank you.

Upvotes: 1

Views: 6152

Answers (1)

fireshadow52
fireshadow52

Reputation: 6516

I would download PDFPen which allows you to ocr documents easily. Once you've done that, you can use this script:

set the PDF_folder to "where:Ever:Your:PDF:folder:is:" as alias
set the OCR_folder to "/where/ever/you/want/the/new/folder/to/be" as POSIX file

tell application "Finder"
    repeat with this_PDF in (every item of the PDF_folder)
        my ocr(this_PDF)
    end repeat
end tell

on ocr(this_PDF)
    tell application "PDFpen"
        open this_PDF as alias
        tell document 1
            ocr --simple
            repeat while performing ocr
                delay 1
            end repeat
            delay 1
        end tell
        set this_PDF to (save document 1 in this_PDF)
        close document 1
    end tell
    tell application "Finder"
        if not exists OCR_folder then set the OCR_folder to (make new folder at (the OCR_folder as alias with properties {name:"ocr"})
        move this_PDF to the OCR_folder with replacing
    end tell
end ocr

Upvotes: 1

Related Questions