Reputation: 1599
Are there any Java Frameworks, that can extract text from PDF/A? There are many java PDF frameworks, but they don't specify, if they support that format.
Upvotes: 2
Views: 1085
Reputation: 3184
Leonard Rosenthol did an excellent talk on PDF standards and formats at the itext summit. He explained all the different subsets. You can watch the video online at http://www.parleys.com/#st=5&id=3188&sl=0
There is a list of all the presentations at http://lowagie.com/summit2012presentations
Upvotes: 1
Reputation: 1606
You should check http://itextpdf.com/ (specially this chapter). It is the best opensource framework to work with PDF files in Java I have ever used.
Upvotes: 1