Reputation: 737
Is there a way to extract all content from mupdf.net? For example the following code using the GetText() method will extract all text in html format:
using MuPDF.NET
var document = new Document("path-to-doc.pdf")
for (int i = 0; i < document.PageCount; i++) {
var htmlContent = page.GetText("html");
}
this will not necessairly include form fields, vector graphics e.t.c. How would i get all of these and their relative positions within the PDF?
Upvotes: 0
Views: 23