Trind
Trind

Reputation: 1599

Extract text from PDF/A in Java

Are there any Java Frameworks, that can extract text from PDF/A? There are many java PDF frameworks, but they don't specify, if they support that format.

Upvotes: 2

Views: 1085

Answers (2)

mark stephens
mark stephens

Reputation: 3184

Leonard Rosenthol did an excellent talk on PDF standards and formats at the itext summit. He explained all the different subsets. You can watch the video online at http://www.parleys.com/#st=5&id=3188&sl=0

There is a list of all the presentations at http://lowagie.com/summit2012presentations

Upvotes: 1

sergiofbsilva
sergiofbsilva

Reputation: 1606

You should check http://itextpdf.com/ (specially this chapter). It is the best opensource framework to work with PDF files in Java I have ever used.

Upvotes: 1

Related Questions