twitu

Reputation: 625

Scrape with Scrapy using URLs taken from a list

import scrapy

class PractiseSpider(scrapy.Spider):
    name = "practise"
    allowed_domains = ["practise.com"]
    start_urls = ['https://practise.com/product/{}/']

    def parse(self, response):
        # do something with the response,
        # then scrape the next url in the list
        pass

My list m contains the IDs that need to be substituted into the URL, like 'product/{}/'.format(m[i]), iteratively. How do I do this? Should I make a new spider call for each URL, or should I write some code so the spider automatically iterates over the list? If the latter, what do I write?

I know there are many answers related to this, for example this one, but I have a fixed and known list of URLs.

Upvotes: 1

Views: 1173

Answers (2)

Granitosaurus

Reputation: 21406

As an alternative to overriding start_urls, you can override the start_requests() method of your spider. This method yields the requests that start off your spider.

By default your spider does this:

def start_requests(self):
    for url in self.start_urls:
        # dont_filter=True skips the duplicate filter for start urls
        yield Request(url, dont_filter=True)

so you can modify this method in your spider to do anything you want:

def start_requests(self):
    # pop_ids_from_db() is a placeholder for however you obtain your ids
    ids = pop_ids_from_db()
    for id in ids:
        url = f'http://example.com/product/{id}'
        yield Request(url, dont_filter=True)
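
Applied to the question's fixed list m, a minimal sketch might look like the following (the ID values are placeholders, assuming m holds product IDs):

import scrapy
from scrapy import Request

class PractiseSpider(scrapy.Spider):
    name = "practise"
    allowed_domains = ["practise.com"]
    m = ["id1", "id2", "id3"]  # placeholder for the known list of product ids

    def start_requests(self):
        # yield one request per product id; Scrapy schedules them all
        for product in self.m:
            url = 'https://practise.com/product/{}/'.format(product)
            yield Request(url, callback=self.parse)

    def parse(self, response):
        # do something with each product page
        pass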

Upvotes: 2

Tomáš Linhart

Reputation: 10210

If you know the URLs beforehand, just populate start_urls. Assuming m is a list of product IDs (that's what I take from what you wrote), it would look like this:

start_urls = ['https://practise.com/product/{}/'.format(product) for product in m]
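
In context, a minimal sketch of the whole spider (assuming m is defined before the class, e.g. hard-coded or loaded from a file) could be:

import scrapy

m = ["id1", "id2", "id3"]  # placeholder for the known list of product ids

class PractiseSpider(scrapy.Spider):
    name = "practise"
    allowed_domains = ["practise.com"]
    # one start url per product; Scrapy requests each and calls parse()
    start_urls = ['https://practise.com/product/{}/'.format(product) for product in m]

    def parse(self, response):
        # do something with each product page
        pass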

Upvotes: 3
