Reputation: 136
I want to scrape a web page which contains a combobox with filtering options. The base URL is the same, but the request payload depends on the selected combobox value. I have a list of the available options and I've created a loop which iterates over the combobox values and executes a request. Code below:
def parse_product_lines(self, response):
    options = json.loads(response.body_as_unicode())
    product_lines = options['products']
    for product_line in product_lines:
        payload = self.prepare_payload(product_line)
        scrapy.Request('http://example.com',
                       method="POST",
                       body=urllib.urlencode(payload),
                       callback=self.parse_items)

def parse_items(self, response):
    print response
but the requests are not executed. Does somebody know what's going on here?
Upvotes: 1
Views: 3272
Reputation: 18799
Scrapy doesn't wait for a Request to finish the way other request libraries do; it dispatches requests asynchronously.
Requests (and items) are handed over to Scrapy by yielding them from a callback: Scrapy treats callback methods as generators, checks whether each yielded object is an item or a request, returns the items, and schedules each request to be handled later by the method specified in its callback parameter.
So don't just call Request, yield Request so that Scrapy schedules it.
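For example, here is the loop from the question with the fix applied (a sketch; the URL and the prepare_payload helper are the question's own):

    def parse_product_lines(self, response):
        options = json.loads(response.body_as_unicode())
        product_lines = options['products']
        for product_line in product_lines:
            payload = self.prepare_payload(product_line)
            # yield hands the request back to Scrapy's scheduler,
            # which will call self.parse_items with each response
            yield scrapy.Request('http://example.com',
                                 method="POST",
                                 body=urllib.urlencode(payload),
                                 callback=self.parse_items)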
Upvotes: 3
Reputation: 747
First, a Spider class uses the parse method as its default callback.
Each callback should return an Item or a dict, or an iterator of them.
You should yield the requests in your parse_product_lines method to tell Scrapy what to handle next.
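For completeness, a minimal spider skeleton illustrating both points; the spider name and start URL are placeholders, and prepare_payload is assumed to exist as in the question:

    import json
    import urllib

    import scrapy

    class ProductSpider(scrapy.Spider):
        name = 'products'                            # placeholder name
        start_urls = ['http://example.com/options']  # placeholder URL

        def parse(self, response):
            # Default callback: called with the responses of start_urls.
            options = json.loads(response.body_as_unicode())
            for product_line in options['products']:
                payload = self.prepare_payload(product_line)
                yield scrapy.Request('http://example.com',
                                     method="POST",
                                     body=urllib.urlencode(payload),
                                     callback=self.parse_items)

        def parse_items(self, response):
            # Callbacks may also yield items; a plain dict works.
            yield {'body': response.body}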
Upvotes: 5