Web scraping - Python

Question

How can I extract the entire content within "td"?


    Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! 
    8 entire dolls per set! Octuple the presents!

I tried this:

desc = data.xpath("//td/text()") 
print desc

But, it returns the first sentence only:

Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"!

I would like to have the output in the following format:

Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! 8 entire dolls per set! Octuple the presents!

I also tried:

desc = data.xpath("//td//text()") 
    print desc

The output looks like this:

Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! 
8 entire dolls per set! Octuple the presents!

I prefer the following:

Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! 8 entire dolls per set! Octuple the presents!

kevin · Accepted Answer

This worked.

desc = data.xpath("//td") 
    print desc.text_content()

Web scraping - Python

Answers (1)

Related Questions