nikhil

Reputation: 9363

Crawler in Python

I have a website (my college's) on which each student has a username and password to log in to their account. The site hosts many uploaded files in different formats. I want to programmatically enter my credentials, log in to my account, and download every file that has been uploaded to the site.

Is there a Python module that can authenticate through an HTML form, give access to the pages behind the login, and then download the files linked from those pages?

Upvotes: 0

Views: 468

Answers (1)

Philip Southam

Reputation: 16465

I would look at Scrapy (http://scrapy.org). It's a Python-based crawler/scraper that you can customize to perform any workflow you want. Here's an example of how user authentication is handled: http://doc.scrapy.org/topics/request-response.html?highlight=username#request-usage-examples
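To make the workflow concrete, here is a rough sketch of the same login-then-download idea. Note that this is not Scrapy: it uses only the standard library (`urllib`, `http.cookiejar`, `html.parser`) so the three steps are easy to see. Everything specific in it is an assumption, since the question doesn't give details: the URLs, the form field names `username` and `password`, the list of file extensions, and all function names are placeholders. It also assumes a plain POST form with no hidden fields; if the real form carries something like a CSRF token, you would have to read it from the login page first, which is one reason a tool like Scrapy is convenient.

```python
import http.cookiejar
import os
import urllib.parse
import urllib.request
from html.parser import HTMLParser

# Placeholders, not taken from the question: adjust to the real site.
LOGIN_URL = "https://college.example.edu/login"
FILES_URL = "https://college.example.edu/files"
FILE_EXTENSIONS = (".pdf", ".doc", ".docx", ".ppt", ".zip")


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def file_links(html, base_url):
    """Return absolute URLs of links whose path ends in a known file extension."""
    collector = LinkCollector()
    collector.feed(html)
    urls = [urllib.parse.urljoin(base_url, href) for href in collector.links]
    return [u for u in urls
            if urllib.parse.urlparse(u).path.lower().endswith(FILE_EXTENSIONS)]


def make_session():
    """Build an opener that keeps cookies, so the login persists across requests."""
    jar = http.cookiejar.CookieJar()
    return urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))


def login(opener, username, password):
    """POST the credentials; 'username' and 'password' are assumed field names."""
    data = urllib.parse.urlencode({"username": username, "password": password})
    # Passing a bytes body makes urllib send a POST request.
    opener.open(LOGIN_URL, data.encode("utf-8")).close()


def download_all(opener, page_url, dest_dir="downloads"):
    """Fetch one page and save every file it links to into dest_dir."""
    os.makedirs(dest_dir, exist_ok=True)
    with opener.open(page_url) as resp:
        html = resp.read().decode("utf-8", errors="replace")
    for url in file_links(html, page_url):
        name = os.path.basename(urllib.parse.urlparse(url).path)
        with opener.open(url) as resp, open(os.path.join(dest_dir, name), "wb") as out:
            out.write(resp.read())
```

You would call `make_session()` once, pass the result to `login(...)` with your credentials, and then call `download_all(session, FILES_URL)`. This only handles a single page of links; following links across many pages, retrying, and throttling are the parts a crawler framework takes care of for you.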

Upvotes: 2
