How to get siblings' child according to specific defined sibling content

Question

I need to find the best method to gather writer and artist information from the following XML data. The comic node appears multiple times and includes data for a single comic book.

I can't select the right person by job function (writer, artist, etc.). A comic book sometimes has multiple writers and artists. My plan is to append each name to a list.

So, for this single comic book, I need to get the display names of all the writers and artists, but the job function (e.g. writer) is a sibling of the person's name.

Here is what I have, but it doesn't work:

writer = []
penciler = []
doc.xpath('//comic').each do |main_element|
  main_element.xpath("mainsection/credits/credit/role[@id='dfWriter']").each do |n|
    writer << n.xpath('person/displayname').text
  end
  main_element.xpath("mainsection/credits/credit/role[@id='dfPenciler']").each do |n|
    penciler << n.xpath('person/displayname').text
  end
end

p "Writer(s): ",writer
p "Penciler(s): ",penciler

This is the XML file/data:


  3398
  195
  
    Mind Games
    0
    
      0
      0
    
    32
    
      
        Writer
        dfWriter
        
          Will Pfeifer
          Pfeifer, Will
          Pfeifer
          Will
        
      
      
        Writer
        dfWriter
        
          John Byrne
          Byrne, John
          Byrne
          John
        
      
      
        Penciller
        dfPenciler
        
          John Byrne
          Byrne, John
          Byrne
          John

The code I have does not give me the desired results. I found "Getting the siblings of a node with Nokogiri" but I need to iterate and grab each sibling.

I can search by either dfWriter or Writer, since they identify the same role.

My expected output would be:

Writer(s): Will Pfeifer, John Byrne 
Penciler(s): John Byrne

har07 · Accepted Answer

Your XPath looks for person as a child of role, but person is a sibling of role, so nothing matches. You can use the XPath following-sibling axis instead, assuming the target element is always located after role:

doc.xpath('//comic').each do |main_element|
  main_element.xpath("mainsection/credits/credit/role[@id='dfWriter']").each do |n|
    writer << n.xpath('following-sibling::person/displayname').text
  end
  main_element.xpath("mainsection/credits/credit/role[@id='dfPenciler']").each do |n|
    penciler << n.xpath('following-sibling::person/displayname').text
  end
end

Or you can iterate over credit instead of role in the first place, filtering each credit by its role child:

doc.xpath('//comic').each do |main_element|
  main_element.xpath("mainsection/credits/credit[role/@id='dfWriter']").each do |n|
    writer << n.xpath('person/displayname').text
  end
  main_element.xpath("mainsection/credits/credit[role/@id='dfPenciler']").each do |n|
    penciler << n.xpath('person/displayname').text
  end
end
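
For reference, here is a minimal self-contained sketch of the second approach. It makes a few assumptions: the XML below is a hypothetical reconstruction whose element names are inferred from the XPath in the question (the real file has more fields), `names_for` is a helper name invented for this example, and it uses REXML, which ships with Ruby, rather than Nokogiri so it runs without installing a gem. The same XPath string should work with Nokogiri's `xpath`.

```ruby
require 'rexml/document'

# Hypothetical sample data: structure inferred from the XPath in the question,
# not copied from the real file.
COMIC_XML = <<~XML_DOC
  <comics>
    <comic>
      <mainsection>
        <credits>
          <credit>
            <role id="dfWriter">Writer</role>
            <person><displayname>Will Pfeifer</displayname></person>
          </credit>
          <credit>
            <role id="dfWriter">Writer</role>
            <person><displayname>John Byrne</displayname></person>
          </credit>
          <credit>
            <role id="dfPenciler">Penciller</role>
            <person><displayname>John Byrne</displayname></person>
          </credit>
        </credits>
      </mainsection>
    </comic>
  </comics>
XML_DOC

# Collect the display names of everyone credited under the given role id.
# The predicate selects <credit> elements by their <role> child, then the
# path steps into the sibling <person> to reach <displayname>.
def names_for(node, role_id)
  path = "//credit[role/@id='#{role_id}']/person/displayname"
  REXML::XPath.match(node, path).map(&:text)
end

doc = REXML::Document.new(COMIC_XML)
writers = names_for(doc, 'dfWriter')
pencilers = names_for(doc, 'dfPenciler')

puts "Writer(s): #{writers.join(', ')}"
puts "Penciler(s): #{pencilers.join(', ')}"
```

Mapping each matched displayname to its text gives one array entry per person, which avoids calling `.text` on a whole node set (that would concatenate the names if a credit ever held more than one person).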
