Python: How can I get text from a tag like this in BeautifulSoup

Question

I need to get the date and time from this link: 'https://www.pagina12.com.ar/225378-murio-cacho-castana-simbolo-del-macho-porteno', or from any article on the site 'https://www.pagina12.com.ar/'.

The structure is this:

Cultura y Espectáculos
15 de octubre de 2019 · Actualizado hace 3 hs

and I did this:

import requests
from bs4 import BeautifulSoup

cosa = requests.get('https://www.pagina12.com.ar/225378-murio-cacho-castana-simbolo-del-macho-porteno').text
parse = BeautifulSoup(cosa, 'html5lib')
info = parse.find_all('div', {'class': 'article-info'})

Then I try to get the text that says '3 hs', but I can't access it and don't know how to do it. Does anyone have an idea?

Thanks!

QHarr · Accepted Answer

You could calculate the elapsed time from the data-time attribute, which the code below reads as a Unix timestamp:

from bs4 import BeautifulSoup as bs
import requests, datetime
import dateutil.relativedelta

r = requests.get('https://www.pagina12.com.ar/225378-murio-cacho-castana-simbolo-del-macho-porteno')
soup = bs(r.content, 'lxml')
# Take the first element carrying a data-time attribute and parse its value as epoch seconds
dt1 = datetime.datetime.fromtimestamp(float(soup.select_one('[data-time]')['data-time']))
dt2 = datetime.datetime.now()
diff = dateutil.relativedelta.relativedelta(dt2, dt1)
# Hours component of the elapsed time
print(diff.hours)
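
As a rough sketch of the same calculation using only the standard library: the helper below takes the attribute value as a string and returns the elapsed hours and minutes. The name elapsed_since, the optional now parameter, and the sample timestamp are hypothetical, and the sketch assumes (as the code above does) that data-time holds Unix epoch seconds.

```python
from datetime import datetime, timezone


def elapsed_since(data_time, now=None):
    """Return (hours, minutes) elapsed since a Unix timestamp given as a string.

    Assumption: data_time is epoch seconds, e.g. the value of a data-time attribute.
    """
    published = datetime.fromtimestamp(float(data_time), tz=timezone.utc)
    if now is None:
        now = datetime.now(timezone.utc)
    total_seconds = int((now - published).total_seconds())
    hours, remainder = divmod(total_seconds, 3600)
    return hours, remainder // 60


# Hypothetical values: an attribute value and a fixed "now" 3 h 20 min later.
fixed_now = datetime.fromtimestamp(1571150000 + 3 * 3600 + 20 * 60, tz=timezone.utc)
print(elapsed_since("1571150000", now=fixed_now))  # (3, 20)
```

With a parsed page you would pass soup.select_one('[data-time]')['data-time'] as the first argument. Unlike diff.hours, which is only the hours component of a relativedelta and excludes whole days, this returns the total number of hours, which may matter for articles older than a day.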
