Extract all content using MUPDF.net

Question

Is there a way to extract all content from mupdf.net? For example the following code using the GetText() method will extract all text in html format:

using MuPDF.NET

var document = new Document("path-to-doc.pdf")
for (int i = 0; i < document.PageCount; i++) {
           var htmlContent = page.GetText("html");
           
}

this will not necessairly include form fields, vector graphics e.t.c. How would i get all of these and their relative positions within the PDF?

Extract all content using MUPDF.net

Answers (0)

Related Questions