Extract the inner text of the anchor elements correctly using R

Question

I am using R to scrape the link titles in this link www.jamesaltucher.com/sitemap.xml

This is my code.

library(XML)
library(RCurl)
url.link <- 'http://www.jamesaltucher.com/sitemap.xml'
blog <- getURL(url.link)
blog          <- htmlParse(blog, encoding = "UTF-8")
titles  <- xpathSApply (blog ,"//a",xmlValue)             ## titles

My titles is an empty list.

See the screenshot:

enter image description here

Did I use the xpath incorrectly?

CHP · Accepted Answer

Yes. You are looking for loc element and not a element.

titles  <- xpathSApply (html ,"//loc",xmlValue)

Extract the inner text of the anchor elements correctly using R

Answers (2)

Related Questions