rvest function read_html_live() doesn't allow html_elements() to read correctly

Question

Although read_html_live() does return a nodeset that seems to contain all the relevant "bits", I can't then use html_elements() on it (even though the same website, and the same xpath, work perfectly using the more traditional read_html).

I have experience using various other libraries for webscraping, but I'm a relatively new convert to rvest, so entirely possible I'm missing something obvious.

Minimum working example below:

library(rvest)

x <- read_html("https://www.ngaarawhetu.org/news/")
y <- read_html_live("https://www.ngaarawhetu.org/news/")

x_ele <- html_elements(x, xpath = "//link[@rel = 'alternate']") # Just to demonstrate - doesn't seem to work with anything
y_ele <- html_elements(y, xpath = "//link[@rel = 'alternate']")

print(x_ele)

print(y_ele)

The 'x' version, using read_html(), returns the expected values:

{xml_nodeset (5)}
[1] 

[4]

Brett · Accepted Answer

As per margusl's comment above, the answer was to swap the quotation marks from html_elements(y, xpath = "//link[@rel = 'alternate']") to
html_elements(y, xpath = '//link[@rel = "alternate"]').

rvest function read_html_live() doesn't allow html_elements() to read correctly

Answers (1)

Related Questions

rvest function read_html_live() doesn&#39;t allow html_elements() to read correctly

Answers (1)

Related Questions

rvest function read_html_live() doesn't allow html_elements() to read correctly