user2331283

How to read PDF bookmarks using Apache Tika?

I'm using the Apache Tika library for reading PDF content. Does the same library allow reading PDF bookmarks as well?

Upvotes: 0

Views: 404

Answers (1)

Stephen C

Reputation: 719576

Does the same library allow reading PDF bookmarks as well?

I think the answer is yes.

  • I found a Tika bug report (fixed) about text extraction from PDF bookmarks not working.

  • Tika uses the Apache PDFBox library for reading PDF files, and the website for that library says it handles PDF bookmarks.

And ...

  • "PDF: Text from bookmarks is now extracted (TIKA-1035)." from the Tika 1.3 release notes.
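If you need the bookmark tree itself rather than bookmark text mixed into Tika's extracted output, one option is to call PDFBox directly. The following is a minimal sketch, assuming PDFBox 2.x is on the classpath; the class name BookmarkLister and its helper methods are hypothetical names chosen for illustration, and this is not how Tika does it internally.

```java
import java.io.File;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDDocumentOutline;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

// Hypothetical helper class: lists the bookmark (outline) titles of a PDF.
public class BookmarkLister {

    // Walks the outline depth-first, indenting each nesting level by two spaces.
    static void collect(PDOutlineNode parent, String indent, List<String> out) {
        PDOutlineItem item = parent.getFirstChild();
        while (item != null) {
            out.add(indent + item.getTitle());
            collect(item, indent + "  ", out);
            item = item.getNextSibling();
        }
    }

    static List<String> listBookmarks(File pdf) throws IOException {
        List<String> titles = new ArrayList<>();
        // Assumes PDFBox 2.x; PDFBox 3.x loads via org.apache.pdfbox.Loader.loadPDF(pdf).
        try (PDDocument document = PDDocument.load(pdf)) {
            PDDocumentOutline outline = document.getDocumentCatalog().getDocumentOutline();
            // The outline is null when the PDF has no bookmarks.
            if (outline != null) {
                collect(outline, "", titles);
            }
        }
        return titles;
    }

    public static void main(String[] args) throws IOException {
        for (String title : listBookmarks(new File(args[0]))) {
            System.out.println(title);
        }
    }
}
```

Run it with the path to a PDF as the only argument. A document without bookmarks produces no output.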

Upvotes: 2
