Binod
Binod

Reputation: 313

iTextSharp PDF Reading highlighed text (highlight annotations) using C#

I am developing a C# winform application that converts the pdf contents to text. All the required contents are extracted except the content found in highlighted text of the pdf. Please help to get the working sample to extract the highlighted text found in pdf. I am using the iTextSharp.dll in the project

Upvotes: 2

Views: 1653

Answers (1)

Bruno Lowagie
Bruno Lowagie

Reputation: 77606

Assuming that you're talking about Comments. Please try this:

for (int i = pageFrom; i <= pageTo; i++)
{
    PdfDictionary page = reader.GetPageN(i);
    PdfArray annots = page.GetAsArray(iTextSharp.text.pdf.PdfName.ANNOTS);
    if (annots != null)
        foreach (PdfObject annot in annots.ArrayList)
        {
            PdfDictionary annotation = (PdfDictionary)PdfReader.GetPdfObject(annot);
            PdfString contents = annotation.GetAsString(PdfName.CONTENTS);
            // now use the String value of contents
        }
}

This is written from memory (I'm a Java developer, not a C# developer).

Upvotes: 2

Related Questions