How to get first child table row from a table in BeautifulSoup ( Python )

Question

Here is the Code and sample results , I just want the first column of the table ignoring the rest. There are similar question on Stackoverflow but they did not help.


JOHNSON
 2,014,470 
0.81
2

I want JOHNSON only, as it is the first child. My python code is :

import requests
  from bs4 import BeautifulSoup
 def find_raw():
      url = 'http://names.mongabay.com/most_common_surnames.htm'
      r = requests.get(url)
      html = r.content
      soup = BeautifulSoup(html)
      for n in soup.find_all('tr'):
          print n.text
  
  find_raw()

What I get:

SMITH 2,501,922 1.0061
JOHNSON 2,014,470 0.812

enrico.bacis · Accepted Answer

You can find all the tr tags with find_all, then for each tr you find (gives only the first) td. If it exists, you print it:

for tr in soup.find_all('tr'):
    td = tr.find('td')
    if td:
        print td

How to get first child table row from a table in BeautifulSoup ( Python )

Answers (2)

Related Questions