How to split a PDF file by bookmarks using iText 7 when the PDF contains "duplicate bookmarks"

Question

I am trying to split a PDF by its bookmarks using iText 7.

Problem: if the PDF has the same bookmark in more than one place in the outline tree, the later entry overrides the earlier one and the split fails.

Sample code to reproduce the problem:

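import com.itextpdf.kernel.pdf.*;
import com.itextpdf.kernel.pdf.navigation.PdfDestination;
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Visits every top-level outline that has a destination
public void walkOutlines(PdfOutline outline, Map<String, PdfObject> names, PdfDocument pdfDocument, List<String> titles, List<Integer> pageNum) {

    for (PdfOutline child : outline.getAllChildren()) {
        if (child.getDestination() != null) {
            prepareIndexFile(child, names, pdfDocument, titles, pageNum);
        }
    }
}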

// Resolves the page number of an outline and of all its descendants
public void prepareIndexFile(PdfOutline outline, Map<String, PdfObject> names, PdfDocument pdfDocument, List<String> titles, List<Integer> pageNum) {

        String title = outline.getTitle();

        // The destination is expected to be a named destination (a PdfString)
        PdfDestination pdfDestination = outline.getDestination();
        String pdfStr = ((PdfString) pdfDestination.getPdfObject()).toUnicodeString();

        // Look the name up in the flattened name tree; the first array element is the target page
        PdfArray array = (PdfArray) names.get(pdfStr);
        PdfObject pdfObj = array != null ? array.get(0) : null;

        Integer pageNumber = pdfDocument.getPageNumber((PdfDictionary) pdfObj);

        titles.add(title);
        pageNum.add(pageNumber);

        // Recurse into nested bookmarks
        for (PdfOutline child : outline.getAllChildren()) {
            prepareIndexFile(child, names, pdfDocument, titles, pageNum);
        }
}

public boolean splitPdf(String inputFile, final String outputFolder) {

        boolean splitSuccess = true;
        PdfDocument pdfDoc = null;
        try {
            PdfReader pdfReaderNew = new PdfReader(inputFile);
            pdfDoc = new PdfDocument(pdfReaderNew);

            final List<String> titles = new ArrayList<>();
            List<Integer> pageNum = new ArrayList<>();

            PdfNameTree destsTree = pdfDoc.getCatalog().getNameTree(PdfName.Dests);
            Map<String, PdfObject> names = destsTree.getNames(); // core logic for getting names
            PdfOutline root = pdfDoc.getOutlines(false);         // core logic for getting outlines

            walkOutlines(root, names, pdfDoc, titles, pageNum);  // collect bookmarks and page numbers

            if (titles.isEmpty()) {
                splitSuccess = false;
            } else { // proceed if it has bookmarks

                for (int i = 0; i < titles.size(); i++) {
                     String title = titles.get(i);
                     int startPage = pageNum.get(i);
                     // Assumption: the last bookmark runs to the end of the document
                     int endPage = pdfDoc.getNumberOfPages();
                     if (i + 1 < titles.size()) {
                         int nextPage = pageNum.get(i + 1);
                         if (nextPage > startPage) {
                             endPage = nextPage - 1;
                         } else {
                             endPage = nextPage;
                         }
                     }

                     String outFileName = outputFolder + File.separator + getFileName(title) + ".pdf";
                     PdfWriter pdfWriter = new PdfWriter(outFileName);

                     PdfDocument newDocument = new PdfDocument(pdfWriter, new DocumentProperties().setEventCountingMetaInfo(null));
                     pdfDoc.copyPagesTo(startPage, endPage, newDocument);
                     newDocument.close();
                     pdfWriter.close();
                }
            }
        } catch (Exception e) {
            //---log
            splitSuccess = false;
        } finally {
            if (pdfDoc != null) {
                pdfDoc.close();
            }
        }
        return splitSuccess;
}
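
To make the intended page ranges easier to check without a PDF at hand, here is a minimal plain-Java sketch of the start/end computation used in the loop above. The class and method names (PageRanges, compute) are made up for illustration, and it assumes that the bookmarks arrive in ascending page order and that the last bookmark runs to the final page of the document.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of iText: works on plain page numbers only.
class PageRanges {

    // Returns one {startPage, endPage} pair (1-based, inclusive) per bookmark.
    static List<int[]> compute(List<Integer> startPages, int totalPages) {
        List<int[]> ranges = new ArrayList<>();
        for (int i = 0; i < startPages.size(); i++) {
            int start = startPages.get(i);
            int end = totalPages; // assumption: last bookmark runs to the end
            if (i + 1 < startPages.size()) {
                int next = startPages.get(i + 1);
                // Stop one page before the next bookmark, unless both share a page
                end = next > start ? next - 1 : next;
            }
            ranges.add(new int[] {start, end});
        }
        return ranges;
    }

    public static void main(String[] args) {
        // Made-up bookmark pages; two bookmarks point at page 4
        for (int[] r : compute(Arrays.asList(1, 4, 4, 7), 10)) {
            System.out.println(r[0] + "-" + r[1]);
        }
    }
}
```

Each pair could then be passed to copyPagesTo as the start and end page.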

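Root cause I found: in PdfNameTree, the names are collected with items.put(name.toUnicodeString(), names.get(k)); so a second entry with the same name replaces the first one in the map.

How can I overcome this issue?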
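
The effect of that put call can be reproduced with plain Java collections. The sketch below does not use iText at all; the names and page numbers are made up, and flatten and collectAll are hypothetical helpers. It only shows that a map keyed by name keeps the last value for a repeated key, and that collecting the values into a list per key would preserve all of them.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Plain-Java illustration; no iText types are involved.
class DuplicateKeyDemo {

    // Mimics a flat name -> value map: a later entry with the same
    // name replaces the earlier one.
    static Map<String, Integer> flatten(String[] keys, int[] pages) {
        Map<String, Integer> items = new LinkedHashMap<>();
        for (int k = 0; k < keys.length; k++) {
            items.put(keys[k], pages[k]);
        }
        return items;
    }

    // Alternative: keep every value seen for a name, in encounter order.
    static Map<String, List<Integer>> collectAll(String[] keys, int[] pages) {
        Map<String, List<Integer>> items = new LinkedHashMap<>();
        for (int k = 0; k < keys.length; k++) {
            items.computeIfAbsent(keys[k], unused -> new ArrayList<>()).add(pages[k]);
        }
        return items;
    }

    public static void main(String[] args) {
        String[] keys = {"Chapter 1", "Appendix", "Chapter 1"}; // made-up names
        int[] pages = {1, 5, 9};                                  // made-up pages
        System.out.println(flatten(keys, pages));    // "Chapter 1" maps to 9 only
        System.out.println(collectAll(keys, pages)); // "Chapter 1" keeps 1 and 9
    }
}
```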
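
Separately from the name tree, the output path in splitPdf is built from the bookmark title alone, so two bookmarks with the same title would also write to the same file. A minimal sketch of one way to keep the file names distinct, assuming a numeric suffix is acceptable; UniqueNames and uniqueName are hypothetical helpers, not part of iText:

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper: hands out a distinct name for every request.
class UniqueNames {

    private final Set<String> used = new HashSet<>();

    // Returns base on first use; on a clash, appends _2, _3, ...
    // until the candidate has not been handed out before.
    String uniqueName(String base) {
        String candidate = base;
        int n = 1;
        while (!used.add(candidate)) {
            n++;
            candidate = base + "_" + n;
        }
        return candidate;
    }

    public static void main(String[] args) {
        UniqueNames names = new UniqueNames();
        // Made-up bookmark titles with a repeat
        for (String title : new String[] {"Intro", "Summary", "Intro"}) {
            System.out.println(names.uniqueName(title) + ".pdf");
        }
    }
}
```

In splitPdf, the result of getFileName(title) could be passed through uniqueName before ".pdf" is appended.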

Thanks in advance
