python3 to extract a html part from html with xpath

Question

I want to extract a part of html from the following html with python xpath. my question just want to extract the html part include tag and text, and this Get all text inside a tag in lxml question is to extract text part of html, so these two questions is different.

 
  
 
  
     first item
     second item
     third item 
     fourth item
     fifth item
  
  
  
  
  [url=http://]
     movie a
     movie b
     movie c
     movie d

Actually, I just want to extract the following html from the above html.

      
   
     movie a
     movie b
     movie c
     movie d

My code imports requests

 page = requests.get('........html')
 tree = html.fromstring(page.content)
 body = tree.xpath('//div[contains(@title, "name")]')
 print('body:', body)

but the result is

I want to get all the elements in this part html, for example

please use the xpath method not other method.

hr_117 · Accepted Answer

I want to get all the elements in this part html, for example

Try to use:

  body = tree.xpath('//div[contains(@title, "name")]/ul')

or:

Update:(Thanks to @RafaelAlmeida) for all elements blow the div

  body = tree.xpath('//div[contains(@title, "name")]//*')

python3 to extract a html part from html with xpath

Answers (1)

Related Questions