Remove HTML tags and unwanted information in Python With BeautifulSoup

Question

I am pretty new to Python and am parsing the contents of a webpage with BeautifulSoup. The webpage is https://www.ranker.com/crowdranked-list/the-greatest-rappers-of-all-time, if that matters. I want to make a list of the top 25 rappers. I managed to find the tags that contain the rappers' names, but I cannot get rid of the HTML tags and other nested information. Is there a way to iterate over the list and display only the name of each artist?

Here is my code:

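import requests
from bs4 import BeautifulSoup

# Download the page and parse it with the built-in HTML parser
r = requests.get('https://www.ranker.com/crowdranked-list/the-greatest-rappers-of-all-time')

soup = BeautifulSoup(r.text, 'html.parser')
# Collect every <meta> tag whose itemprop attribute is "name"
results = soup.find_all('meta', attrs={'itemprop': 'name'})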
results

[ ... 26 <meta itemprop="name"> tags, one per entry ... ]

Basically, I want the output below for all 25 artists. It already works for a single item in the list:

first_result = results[1]

print(first_result['content'])

Eminem
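
One way to get every name is to read the content attribute from each tag in a loop or list comprehension, the same way the single-item lookup above does. The following is a minimal sketch, not a tested scraper for the live page: SAMPLE_HTML, extract_names, and the skip and limit parameters are hypothetical names introduced here so the example runs offline, and the placeholder entries other than "Eminem" are invented. It assumes, based on results[1] being the first artist in the question, that the first matching tag is not an artist and can be skipped.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for r.text so the sketch runs without a network call.
# Only "Eminem" comes from the question; the other values are placeholders.
SAMPLE_HTML = """
<html><head>
<meta itemprop="name" content="List title">
</head><body>
<meta itemprop="name" content="Eminem">
<meta itemprop="name" content="Artist Two">
<meta itemprop="name">
</body></html>
"""


def extract_names(html, skip=1, limit=25):
    """Return the content attribute of each <meta itemprop="name"> tag."""
    soup = BeautifulSoup(html, 'html.parser')
    tags = soup.find_all('meta', attrs={'itemprop': 'name'})
    # Keep only tags that actually carry a content attribute,
    # so a malformed tag does not raise a KeyError.
    names = [tag['content'] for tag in tags if tag.has_attr('content')]
    # Assumption: the first `skip` entries are not artists (index 0 in the question).
    return names[skip:skip + limit]


names = extract_names(SAMPLE_HTML)
for name in names:
    print(name)
```

With the real page, passing r.text instead of SAMPLE_HTML should give the same kind of list, provided the page still exposes the names in those meta tags. If the first entry turns out to be an artist after all, call extract_names(r.text, skip=0).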
