Dev Pandu

Reputation: 121

Scrapy crawl data inside pdf file

I would like to know how to extract data from inside a PDF file using Scrapy. Which module should I use, and what is the most effective way to do it? Could you point me to some sample tutorials on this?

Thanks!!

Upvotes: 4

Views: 6616

Answers (1)

GHajba

Reputation: 3691

I suggest you download the PDF with Scrapy and then use PyPDF2 to extract the content inside it.

For a complete but somewhat dated example (it uses pyPDF), take a look at this site.

Upvotes: 4
