Can´t extract the pagination link with scrapy

Question

I want to identify the "next-page-link" with and for scrapy of a multi page website. I have the feeling that I cannot do it the common way as the href-content is empty (href=""). See here:


1
23
...
330
►

I tried response.css('div.page-navigation > a::attr(href)').extract_first()

but it's not working.

I´d appreciate if someone could help me as I´m struggeling with this problem already for a while.

Sohan Das · Accepted Answer

You can simply generate the urls, then parse.

page = 0
for i in range(330):
    page+=1
    url = ('https://www.vdma.org/mitglieder'
        '?p_p_lifecycle=2&p_p_resource_id=getPage&p_p_id'
        '=vdma2publicusers_WAR_vdma2publicusers&s=&page='+str(page))
    print(url)

Can´t extract the pagination link with scrapy

Answers (1)

Related Questions

Can&#180;t extract the pagination link with scrapy

Answers (1)

Related Questions

Can´t extract the pagination link with scrapy