Sachin G
Sachin G

Reputation: 139

PDFBox - Accessible PDF - How to check if PDF Tags have properties as per Accessiblity guidelines

Need to check if PDF Tags have properties as per Accessibility guidelines. Examples:

So far I was able to:

To access the Tags, I have tried these options:

Creation of Accessible PDF is done using OpenText so Dev team doesn't know about PDFBox. I am lost here as how to get the access to Tags/Objects (use MarkedContent or something else).

Please suggest how to extract the individual objects(tags) such as P, H1, Table, Figure/Image and validate their properties. Note: Manual validation of these properties are performed using Adobe Acrobat Pro

Upvotes: 6

Views: 2151

Answers (1)

Monte Chan
Monte Chan

Reputation: 1203

Based upon https://issues.apache.org/jira/browse/PDFBOX-7, it appears that you can use PDFMarkedContentExtractor to get the information that you need.

Upvotes: 3

Related Questions