How to extract text from <p> tag?

Question

I would like to scrape reviews from Zomato with the BeautifulSoup library in Python.

However, each review is not wrapped in a div tag, only in a p (paragraph) tag.

When I run this:

 review = soup.find_all("p", attrs={"class": "sc-1hez2tp-0 sc-kQsIoO cCvqWb"})

The output is:

review
[,
 Kalau kesini wajib bangetnih pesen Ikan Gurame, rasanya bener” enakk. Bumbu guramenya pun macem”, mulai dari bumbu spesial gurih7, asem manis, sambal manga, kecombrang, rica-rica, & pecak! Kulit di udang mayonaisenya juga udah dicopotin, jadi lebih enak makannya. 
.
Buat minuman, es kelapa nya seger banget, dagingnya juga gampang diambil, ga kaya es kelapa ditempat lainnya yang dagingnya susah dikerok. Tapi air kelapanya pakai gula, jadi rasanya terlalu manis. Tips kalau pesen es kelapa, request tanpa gula aja biar manisnya pas & lebih segerr👌🏻,
 This restaurant served a west java cuisine that adapt into local taste which i could say is not spicy compare the original recipe. The place is easy to access, 15 minutes go to highway nearby means that people are not difficult to find the location. But the parking area are not so good & not convenience in some place not all,
 Waiters nya kak kikis ramah dan sopan,
 Pelayanan yg responsif, waiter AA Joko baik pisan euy sangat ramah dan responsif. Menunya enak2, gurame goreng kipas dan udang bakar galah madu mantaabbbb!!! Sayur asemnya endeusss hihihi anyway thx zomato gold 🥰🥰🥰 pasti balik lagi donggg 💃💃💃]

I want the text of each paragraph to become one row of a DataFrame with a single column named 'reviews'.

reviews

1. Kalau kesini wajib bangetnih pesen Ikan Gurame, rasanya bener” enakk. Bumbu guramenya pun macem”, mulai dari bumbu spesial gurih7, asem manis, sambal manga, kecombrang, rica-rica, & pecak! Kulit di udang mayonaisenya juga udah dicopotin, jadi lebih enak makannya. 
.
Buat minuman, es kelapa nya seger banget, dagingnya juga gampang diambil, ga kaya es kelapa ditempat lainnya yang dagingnya susah dikerok. Tapi air kelapanya pakai gula, jadi rasanya terlalu manis. Tips kalau pesen es kelapa, request tanpa gula aja biar manisnya pas & lebih segerr👌🏻
2. This restaurant served a west java cuisine that adapt into local taste which i could say is not spicy compare the original recipe. The place is easy to access, 15 minutes go to highway nearby means that people are not difficult to find the location. But the parking area are not so good & not convenience in some place not all
3. ...

I tried:

import pandas as pd
review_text = []
for el in soup.find_all('p', attrs={'class': 'sc-1hez2tp-0 sc-kQsIoO cCvqWb'}):
    komentar = print(el.get_text().encode('utf-8'))
    review_text.append(komentar)

reviews = {'komentar':review_text}
df = pd.DataFrame(reviews, columns=['reviews'])
df

but it returns an empty DataFrame.

How to extract text from <p> tag?

Answers (1)
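
Two things in the attempted code look likely to cause the empty result. First, print() returns None, so komentar is None on every iteration and the list fills with None instead of text. Second, the dict key is 'komentar' while columns=['reviews'] asks pandas for a column that is not in the dict, so the frame comes back empty. A minimal sketch of a fix follows; the html string is a made-up stand-in for the downloaded page (an assumption, not real Zomato markup), and the class string is the one from the question.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Stand-in HTML (an assumption); in practice `soup` is built from the fetched page.
html = """
<div>
  <p class="sc-1hez2tp-0 sc-kQsIoO cCvqWb"> Waiters nya kak kikis ramah dan sopan </p>
  <p class="sc-1hez2tp-0 sc-kQsIoO cCvqWb">Great food, friendly staff</p>
  <p class="other">not a review</p>
</div>
"""
soup = BeautifulSoup(html, "html.parser")

review_text = []
for el in soup.find_all("p", attrs={"class": "sc-1hez2tp-0 sc-kQsIoO cCvqWb"}):
    # get_text() already returns a str; append it directly.
    # Do not wrap it in print(), which returns None, and do not encode to bytes.
    review_text.append(el.get_text(" ", strip=True))

# The dict key becomes the column name, so no separate columns= argument is needed.
df = pd.DataFrame({"reviews": review_text})
print(df)
```

Class names such as sc-1hez2tp-0 look auto-generated and may change between page builds, so the selector may need updating. If find_all returns an empty list on the live page, the reviews may be rendered by JavaScript and absent from the downloaded HTML.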
