r - extracting nodes from xml file using xml2 while keeping original sequence of nodes

Question

I have this xml file:

txt <- read_xml(
  "
    
     
     
      text1
     
    
    
     
     
      text2
     
    
    
     
    
   "
)

I'm trying to extract all nodes with "element" while keeping the original order of the nodes from the XML file. Tried using the xml2 package:

> txt %>% xml2::xml_find_all("mes") %>% xml_find_all("element")
{xml_nodeset (5)}
[1] 
[2] 
  text1

[3] 
[4] 
  text2

[5]

Here I get all nodes but I don't get the sequence from the file.

Finally I would like to get something like this:

data.frame(
  sequence = c(1, 1, 2, 2, 3),
  element_id = c(159, 183, 159, 183, 159),
  error = c("info1", "NA", "info2", "NA", "info3"),
  text = c("NA", "text1", "NA", "text2", "NA")
)

where sequence is the sequence of the node in the XML.

Is this possible ?

r - extracting nodes from xml file using xml2 while keeping original sequence of nodes

Answers (1)

Related Questions