How to extract XML attributes when unnesting XML that has been converted to a dataframe in R?

Question

I have an XML file that looks like this:

xml_reprex <- '
  
    
      354
        
          301
          53
        
  
  
    
      154
        
          142
          12
        
  
'

I would like to turn this into a tidy data frame with columns AthleteIdentifier, TotalGames, GameType and Games.

Here is my current attempt:

df <- read_xml(xml_reprex) 

as_tibble(as_list(df)) %>% 
  unnest_wider(Athletes, names_repair = "universal") %>% 
  unnest(TotalGames) %>% 
  unnest(TotalGames) %>% 
  unnest(GamesByType) %>% 
  unnest(GamesByType) %>% 
  unnest(GamesByType)

However, I lose the AthleteIdentifier values as they are listed as an attribute, similarly I lose the Games type as it is an attribute as well.

Is this because I have converted it to a tibble and therefore I lose all those attributes? Is there a way I can retain that information in the conversion to a tibble for for manipulation?

Thanks.

How to extract XML attributes when unnesting XML that has been converted to a dataframe in R?

Answers (1)

Related Questions