BeautifulSoup: extracting attribute for various items

Question

Let's say we have HTML like this (sorry, I don't know how to copy and paste page info and this is on an intranet):

And I want to get the highlighted portion for all of the questions (this is like a Stack Overflow page). EDIT: to be clearer, what I am interested in is getting a list that has:

['question-summary-39968',
 'question-summary-40219',
 'question-summary-42899',
 'question-summary-34348',
 'question-summary-32497',
 'question-summary-35308',
...]

Now I know that a working solution is a list comprehension where I could do:

[item["id"] for item in html_df.find_all(class_="question-summary")]

But this is not exactly what I want. How can I directly access question-summary-41823 for the first item?
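For the first item specifically, one option is `find`, which returns only the first matching tag rather than a list. This is a minimal sketch: the HTML string below is made-up stand-in markup (the real intranet page is not shown), and the `soup`, `first` and `first_id` names are only illustrative.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the intranet page (the real HTML is not shown).
html = """
<div id="questions">
  <div class="question-summary" id="question-summary-41823"><h3>First</h3></div>
  <div class="question-summary" id="question-summary-39968"><h3>Second</h3></div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# find() returns the first matching Tag, or None if nothing matches.
first = soup.find(class_="question-summary")
first_id = first["id"] if first is not None else None

# The same lookup written with a CSS selector.
first_id_css = soup.select_one(".question-summary")["id"]

print(first_id)
```

Indexing with `first["id"]` raises `KeyError` if the attribute is absent, so the `None` check only guards against the tag itself being missing.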

Also, what is the difference between soup.select and soup.get?
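As far as I understand it, the two do different jobs: `select` takes a CSS selector and returns a list of matching tags, while `get` reads a single attribute from one tag, much like `dict.get`. A small sketch, again on made-up stand-in markup (the HTML and variable names are assumptions, not the real page):

```python
from bs4 import BeautifulSoup

# Hypothetical markup; the real page from the question is not available.
html = """
<div class="question-summary" id="question-summary-39968"></div>
<div class="question-summary" id="question-summary-40219"></div>
"""

soup = BeautifulSoup(html, "html.parser")

# select() searches the tree with a CSS selector and returns a list of Tag objects.
tags = soup.select("div.question-summary")

# get() looks up one attribute on a single Tag and returns None (or a default) if absent.
ids = [tag.get("id") for tag in tags]
missing = tags[0].get("data-missing")
fallback = tags[0].get("data-missing", "n/a")

print(ids)
```

So `select` answers "which tags?" and `get` answers "what is this tag's attribute?"; the two are typically used together, as in the list comprehension here.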
