Using Beautiful soup to analyze table in python

Question

So I've got a table:


And I was simply trying to return a JSON string of the table pairs like so:
[["Pig A", "Straw"], ["Pig B", "Stick"], ["Pig C", "Brick"]]
However, with my code I can't seem to get rid of the HTML tags:
stable = soup.find('table')

cells = [ ]
rows = stable.findAll('tr')
for tr in rows[1:4]:
    # Process the body of the table
    row = []
    td = tr.findAll('td')
    #td = [el.text for el in soup.tr.finall('td')]
    row.append( td[0])
    row.append( td[1])
    cells.append( row )


return cells
#eventually, I'd like to do this:
    #h = json.dumps(cells)
    #return h
My output is this:
[[
, ], [, ], [, ]]

  
  
  
  

  Pig
  House Type


  Pig A
  Straw


  Pig B
  Stick


  Pig C
  Brick

















Pig A Straw Pig B Stick Pig C Brick

cvsguimaraes · Accepted Answer

Use the text property to get only the inner text of the element:

row.append(td[0].text)
row.append(td[1].text)

Using Beautiful soup to analyze table in python

Answers (2)

Related Questions