how to extract the text from the following HTML code?

Question

I am doing web scraping for a DS project, and i am using BeautifulSoup for that. But i am unable to extract the Duration from "tbody" tag in "table" class. Following is the HTML code :


    
        
            
                Start Date
                Duration
                Stipend
                Posted On
                Apply By
            
        
        
            
                
                    Immediately
                
                1 Month
                 
                1500 /month
                
                26 May'20
                23 Jun'20

Note : for extracting 'Immediately' text, i use the following code :

x = container.find("div", {"class" : "table-responsive"})
x.table.tbody.tr.td.div.text

studio-luke · Accepted Answer

You can use select() function to find tags by css selector.

tds = container.select('div > table > tbody > tr > td')
# or just select('td'), since there's no other td tag

print(tds[1].text)

The return value of select() function is the list of all HTML tags that matches the selector. The one you want to retrieve is second one, so using index 1, then get text of it.

how to extract the text from the following HTML code?

Answers (2)

Related Questions