Scraping PDFs of all linked websites

Question

I would like to scrape official laws from websites (here is an example). The documents are accessible within a menu in the html website. I managed to extract links from websites such as github and download PDFs, however, I have difficulties extracting from this type of website. I tried the following code:

library(rvest)

# read html 
page <- read_html("https://bl.clex.ch/app/de/texts_of_law/780")

# from nodes I would like to get the links where the PDFs are stored
raw_list <- page %>%   # takes the page above for which we've read the html
  html_nodes("a") %>%  # find all links in the page
  html_attr("href")

No links can be found on this website as the result is an empty character string

character(0)

The Questions that I have:

What is different about the menu on the linked website compared to for example PDFs stored on github accessible through the links on the main page of github project?
How can I access the links and download all PDFs stored in this menu?

Scraping PDFs of all linked websites

Answers (1)

Output

Related Questions