Scrape all div tags id (not their value) with similar format

Question

I have a internal company html webpage with a div html tag having the following format:

I would like to extract all the id names so the end result would be B4_6_2019 B3_6_2019

How would I do that? (the id names are all dates)

Ronak Shah · Accepted Answer

Try doing

library(dplyr)
library(rvest)

url %>%
  read_html() %>%
  html_nodes("div") %>%
  html_attr("id") %>%
  grep("^B\d+_\d+_\d+", ., value = TRUE)

Answers (2)