Josh Nankin
Josh Nankin

Reputation: 2568

Remove encryption from PDF file using Apache PDFBox

With QPDF, you can simply remove restrictions / encryption from a PDF file like so:

qpdf --decrypt infile outfile

I would like to do the same thing with PDFBox in Java:

PDDocument doc = PDDocument.load(inputFilename);
if (doc.isEncrypted()) {
   // remove the encryption to alter the document
}

I've tried this with StandardDecryptionMaterial, but I have no idea what the owner password is. How does QPDF does this?

Sample document:
https://issues.apache.org/jira/secure/attachment/12514714/in.pdf

Upvotes: 10

Views: 28593

Answers (1)

Josh Nankin
Josh Nankin

Reputation: 2568

This is what you'd need to do (inspired from the PDFBox WriteDecodedDoc command line tool):

if (doc.isEncrypted()) {
    try {
        doc.decrypt("");
        doc.setAllSecurityToBeRemoved(true);
    } catch (Exception e) {
        throw new Exception("The document is encrypted and we can't decrypt it.", e);
    }
}

Note: you may have to include the Bouncy Castle JAR.

Upvotes: 26

Related Questions