Find div class by substring then extract entire class name

Question

I'm trying to find all divs whose class contains the substring 'auction-results' and then extract the full class name.

I can find all the divs whose class contains 'auction-results' like so:

results = soup.select("div[class*=auction-results]")
type(results)

Out: bs4.element.ResultSet

results

Out: [<div class="auction-results high-bid has-price">
            $700,000
     </div>]

What I want is to store the entire class name 'auction-results high-bid has-price' in a pandas column like so:

import pandas as pd

class_text = ['auction-results high-bid has-price']
scraped_data = pd.DataFrame({'class_text': class_text})
scraped_data

                            class_text
0   auction-results high-bid has-price

I haven't found a solution yet so I hope someone can help me out, thanks!

Jack Fleeting · Accepted Answer

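Beautiful Soup returns the class attribute as a list of its space-separated values, so join the list back into a single string for each match. Try it this way: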

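columns = ['class_text']
rows = []
for result in results:
    # result['class'] is a list of class tokens; join them into one string
    rows.append(' '.join(result['class']))
# Pass rows directly so each matched div becomes its own row;
# wrapping it as [rows] would break as soon as there is more than one match
scraped_data = pd.DataFrame(rows, columns=columns)
scraped_data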

Output:

                            class_text
0   auction-results high-bid has-price
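
For anyone who wants to run this end to end, here is a minimal self-contained sketch. The HTML string is made up for illustration (the page being scraped isn't shown in the question), and the second and third divs are hypothetical, added only to show that several matches give several rows and that non-matching divs are skipped.

```python
from bs4 import BeautifulSoup
import pandas as pd

# Hypothetical markup for illustration only; the real page's HTML is not shown
html = """
<div class="auction-results high-bid has-price">$700,000</div>
<div class="auction-results no-sale">Not sold</div>
<div class="listing-header">Ignored</div>
"""

soup = BeautifulSoup(html, "html.parser")

# [class*="..."] matches any div whose class attribute contains the substring
results = soup.select('div[class*="auction-results"]')

# Each div["class"] is a list of class tokens, so join it back into one string
class_text = [" ".join(div["class"]) for div in results]

# One row per matched div
scraped_data = pd.DataFrame({"class_text": class_text})
print(scraped_data)
```

The list comprehension does the same job as the loop above; building the frame from a dict of column name to list keeps one row per match regardless of how many divs are found.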
