Beautifulsoup return list for attribute "class" while value for other attribute

Question

Beautifulsoup is handy for html parsing in python, and below code result cofuse me.

from bs4 import BeautifulSoup
tr ="""

    t1
    t2

"""
table = BeautifulSoup(tr,"html.parser")
for row in table.findAll("tr"):
    print row["class"]
    print row["id"]

result:

[u'passed']
row1
[u'failed']
row2

Why the attribute class returns as array ? while id is normal value ?

beautifulsoup4-4.5.0 is used with python 2.7

alecxe · Accepted Answer

class is a special multi-valued attribute in BeautifulSoup:

HTML 4 defines a few attributes that can have multiple values. HTML 5 removes a couple of them, but defines a few more. The most common multi-valued attribute is class (that is, a tag can have more than one CSS class)

Sometimes, this is problematic to deal with - for instance, when you want to apply a regular expression to class attribute value as a whole:

BeautifulSoup returns empty list when searching by compound class names

You can turn this behavior off by tweaking the tree builder, but I would not recommend doing it.

Beautifulsoup return list for attribute "class" while value for other attribute

Answers (2)

Related Questions

Beautifulsoup return list for attribute &quot;class&quot; while value for other attribute

Answers (2)

Related Questions

Beautifulsoup return list for attribute "class" while value for other attribute