How to select tags that have certain attribute type

Question

Here's the Thing

I want to crawl only these tags in the full of other messy html

I tried for the first time with CSS selector selector was

#div_article_contents > tr:nth-child(1) > th:nth-child(1) > table > tbody > tr:nth-child(1) > td > table > tbody > tr > td > a > img

but soup.select('selector') wasn't works. It output empty list. I don't know why

Secondly I tried with tag every that I want to crawl have specific style so I tried:

soup.select('img[style = fixedstyle]')

but it wasn't works. It would be syntax error...

all I want to crawl is list of href links and list of img titles

please help me

Adirmola · Accepted Answer

If the img tag has a specific style value you can use what you tried just delete extra spaces:

from bs4 import BeautifulSoup

html='''

    


    


    

'''

srcs=[]
titles=[]
soup=BeautifulSoup(html,'html.parser')
for img in soup.select('img["style=max-width:222px;max-height:222px"]'):
    srcs.append(img['src'])
    titles.append(img['title'])
print(srcs)
print(titles)

Other wise you can start with the a tag and get down to the img like this:

for a in soup.select('a'):
    srcs.append(a.select_one('img')['src'])
    titles.append(a.select_one('img')['title'])
print(srcs)
print(titles)

How to select tags that have certain attribute type

Answers (1)

Related Questions