Scrapy Pagination XHR 400 Bad Request

Question

I am trying to fetch all url from https://www.magzter.com/magazines/listAllIssues/503

In one set, Page show 12 magzines and scroll paginate and proceed with next 12 magazines

After Debugging, Upcoming request are as follows

https://www.magzter.com/magazines/listAllIssues/503/12
https://www.magzter.com/magazines/listAllIssues/503/24

But get request to https://www.magzter.com/magazines/listAllIssues/503/12 through

400 Bad Request

Is there any implementation of this scenario in scrapy please provide a sample script.

or any other library which stimulate infinite scrolling and work with scrapy framework

Tarun Lalwani · Accepted Answer

The issue is that the request is a AJAX request and not sending it X-Requested-With: XMLHttpRequest header makes it a 400 bad request. There is no way to send headers directly from shell command line, so you need to launch shell and type commands to fetch the request with headers

$ scrapy shell --nolog

>>> from scrapy import Request
>>> req = Request("https://www.magzter.com/magazines/listAllIssues/146/12", headers = {"X-Requested-With" : "XMLHttpRequest"})
>>> fetch(req)
>>> response.body
b'

Scrapy Pagination XHR 400 Bad Request

Answers (1)

Related Questions