Problems scraping content from a news website

Question

I am trying to collect the headlines/titles and other elements from a news website. However, the tags I am using (that I have found using the gadget selector and inspecting the website code) seem not to be working.

For the headlines I've tried the tags '.article-h' and '.article-h-link' without no result. The same happen for the dates ('.date.right') and the leads ('.result-intro')

url_test <- read_html('https://www.semana.com/Buscador?query=proceso%20paz%20farc&post=semana&limit=10&offset=0&from=2012%2F08%2F26&to=2016%2F12%2F03')
titles <- html_text(html_nodes(url_test, '.article-h-link'))

I always get "character (0)". Interestingly, though, if a try to collect the information within the home page (www.semana.com), those same tags work without problem. What can be the problem?

Problems scraping content from a news website

Answers (1)

Related Questions