Scaper of ASIN number from an Amazon page using python

Question

I would scrape all the asin numbers from an amazon page. I need that lists to make a scraping for every asin obtained.

I tryed with this code but i could read only 3 asin number as results.

I think i make a wrong regular expression

this is my code:

import requests

###Amazon URL
urls = ['https://www.amazon.it/gp/bestsellers/apparel/', 'https://www.amazon.it/gp/bestsellers/electronics/', 'https://www.amazon.it/gp/bestsellers/books/']

htmltexts = []
for url in urls:
    req = requests.get(url).content
    htmltexts.append(req)

import re
for htmltext in htmltexts:
    text = str(htmltext)
    pattern = re.compile(r"/.*/dp/(.*?)\"")
    s = re.findall(pattern, text)
    print (s)

I expect at least 20 result from every page. The program has built for 3 amazon pages. so i need 60 results at least

Scaper of ASIN number from an Amazon page using python

Answers (1)

Related Questions