Failing to get duration of youtube video using xpath

Question

I wanted to write something that would return me the video duration of a youtube link. So I found requests and lxml and started out following this guide.

Here's the setup:

import requests
from lxml import html

url = 'https://www.youtube.com/watch?v=EN8fNb6uhns'
page = requests.get(url)
tree = html.fromstring(page.content)

Then I try and use xpath to get the duration, but it doesn't work. Trying to get the duration:

tree.xpath('//span[@class="ytp-time-duration"]/text()')

returns an empty list. But when I try and get the title (as a test) with:

tree.xpath('//h1[@class="watch-title-container"]/span/text()')

it works. When I use inspect to copy the xpath of the duration element nothing is returned:

tree.xpath('/html/body/div[2]/div[4]/div/div[4]/div[2]/div[2]/div/div[24]/div[2]/div[1]/div/span[3]')

When I do the same for the title it works again.

What is going on?

宏杰李 · Accepted Answer

span[@class="ytp-time-duration"]

this span tag is generated by JavaScript, and it will not returned by requests, requests just return the HTML code

Failing to get duration of youtube video using xpath

Answers (2)

Related Questions