smerllo

Reputation: 3375

web-scrape: get H4 attributes & href

I am trying to web-scrape a website, but I can't get access to the attributes of some fields.

Here is the code I used:

    import urllib3
    from bs4 import BeautifulSoup
    import pandas as pd

    scrap_list = pd.DataFrame()


    for path in range(10): # scroll over the categories
        for path in range(10): # scroll over the pages
            url = 'https://www.samehgroup.com/index.php?route=product/category'+str(page)+'&'+'path='+ str(path)
            req = urllib3.PoolManager()
            res = req.request('GET', URL)
            soup = BeautifulSoup(res.data, 'html.parser')
            soup.findAll('h4', {'class': 'caption'})

            # extract names
            scrap_name = [i.text.strip() for i in soup.findAll('h2', {'class': 'caption'})]
            scrap_list['product_name']=pd.DataFrame(scrap_name,columns =['Item_name'])

            # extract prices
            scrap_list['product_price'] = [i.text.strip() for i in soup.findAll('div', {'class': 'price'})]
            product_price=pd.DataFrame(scrap_price,columns =['Item_price'])

I want an output that provides me with each product and its price. I still can't get that right.

Any help would be very much appreciated.

Upvotes: 0

Views: 246

Answers (1)

Phijiwiji

Reputation: 36

I think the problem here was looping through the website's pages. I got the code below working by first building a list of URLs with numbered 'path' values corresponding to the pages on the website, and then looping through that list while applying a page number to each URL. If you only want the products from a certain page, you can select its URL from urlist by index (see the short example after the code).

    from bs4 import BeautifulSoup
    import requests
    import pandas as pd
    import time

    urlist = []    # create a list of usable URLs to iterate through
    for i in range(1, 10):    # 9 paths, equal to the pages on the website
        urlist.append('https://www.samehgroup.com/index.php?route=product/category&path=' + str(i))

    namelist = []
    newprice = []

    for urlunf in urlist:    # first loop to get 'path'
        for n in range(100):    # second loop to get 'pages'; set at 100 to cover the website's max page of 93
            try:    # try catches when pages containing products run out
                url = urlunf + '&page=' + str(n)
                page = requests.get(url).text
                soup = BeautifulSoup(page, 'html.parser')
                products = soup.find_all('div', class_='caption')

                for prod in products:    # loop over the returned list of products for names and prices
                    name = prod.find('h4').text
                    newp = prod.find('p', class_='price').find('span', class_='price-new').text
                    namelist.append(name)    # append data to the lists outside of the loop
                    newprice.append(newp)
                time.sleep(2)
            except AttributeError:    # if there are no more products it will move to the next page
                pass

    df = pd.DataFrame()    # create df and add scraped data
    df['name'] = namelist
    df['price'] = newprice
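
If you only need the products from a single page, you can run the same inner loop against one entry of urlist instead of all of them. Here is a minimal sketch of that idea, reusing the imports and the urlist from the code above (the index 2 and the names url_single, names, prices and df_single are only illustrative):

    url_single = urlist[2]    # pick one URL from urlist by index (2 is just an example)
    names, prices = [], []

    for n in range(100):    # same paging loop as above
        try:
            page = requests.get(url_single + '&page=' + str(n)).text
            soup = BeautifulSoup(page, 'html.parser')
            for prod in soup.find_all('div', class_='caption'):
                name = prod.find('h4').text
                newp = prod.find('p', class_='price').find('span', class_='price-new').text
                names.append(name)
                prices.append(newp)
            time.sleep(2)
        except AttributeError:    # page with no products: skip it
            pass

    df_single = pd.DataFrame({'name': names, 'price': prices})
    print(df_single.head())    # each product on that page next to its price

Either way, the final DataFrame pairs every scraped product name with its price, which is the output asked for in the question.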

Upvotes: 1
