X.F.Liu

Reputation: 11

How to use the `yield Request()` to control the FOR Loop in scrapy?

I'm setting up a Scrapy project. In my project there is a for loop that should be controlled by the crawl results, but `yield Request()` does not return a value. So how do I control the for loop in Scrapy? See the code below for details:

def parse_area_detail(self, response):
    for page in range(100):
        page_url = parse.urljoin(response.url, 'pg' + str(page + 1))
        yield Request(page_url, callback=self.parse_detail)
        # the parse_detail function will get a title list. If the title
        # list is empty, the for loop should be stopped.

def parse_detail(self, response):
    title_list = response.xpath("//div[@class='title']/a/text()").extract()

The parse_detail function gets a title list. I expect the for loop to stop once the title list is empty, but I know my code doesn't work like that. How do I change it so that it does?

Upvotes: 1

Views: 680

Answers (1)

Thiago Curvelo

Reputation: 3740

You could request the next page only after parsing the current one. That way you can decide whether to continue based on whether the title list is empty. E.g.:

start_urls = ['http://example.com/?p=1']
base_url = 'http://example.com/?p={}'

def parse(self, response):
    title_list = response.xpath("//div[@class='title']/a/text()").extract()

    # ... do what you want to do with the list, then ...

    if title_list:
        next_page = response.meta.get('page', 1) + 1
        yield Request(
            self.base_url.format(next_page),
            meta={'page': next_page},
            callback=self.parse
        )
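The control flow above can be sketched without Scrapy at all: request one page, stop as soon as a page comes back with no titles. In this minimal sketch, `fetch_titles` is a hypothetical stand-in for the download-plus-XPath step, and the fake site dict simulates three pages of results followed by an empty page.

```python
def crawl_pages(fetch_titles, base_url, start_page=1, max_pages=100):
    """Fetch pages sequentially, stopping at the first empty title list.

    `fetch_titles` is a placeholder for Scrapy's download + extraction step:
    it takes a URL and returns a list of title strings.
    """
    titles = []
    page = start_page
    while page <= max_pages:
        page_titles = fetch_titles(base_url.format(page))
        if not page_titles:      # empty list -> stop, like the spider above
            break
        titles.extend(page_titles)
        page += 1
    return titles

# Fake fetcher: pages 1-3 have titles, page 4 is empty.
fake_site = {1: ['a', 'b'], 2: ['c'], 3: ['d']}
fetch = lambda url: fake_site.get(int(url.rsplit('=', 1)[1]), [])
result = crawl_pages(fetch, 'http://example.com/?p={}')
```

The key point is the same as in the spider: the decision to fetch page N+1 is made *after* page N has been parsed, rather than queuing all 100 requests up front.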

Upvotes: 3
