How to use XQuery to extract specific XML records and output in comma delimited format?

Question

I'm trying to extract only data along with their book IDs using a XQuery (I'm new to this).

Here is the input data:

  
    
        
            72771KAM3
            US72771KAM36
        
    
    
        24.95
        2000-10-01
        An in-depth look at creating applications with XML.
    
  
  
    
        
            070185UL5
            US070185UL50
        
    
    
        19.25
        2002-11-01
        A former architect battles corporate zombies, 
  an evil sorceress, and her own childhood to become queen 
  of the world.
    
  
  
  
        
            070185UK7
            US070185UK77
        
    
    
        5.95
        2004-05-01
        After the collapse of a nanotechnology 
  society in England, the young survivors lay the 
  foundation for a new society.
    
  
  
    
        
            070185UJ0
            US070185UJ05
        
    
    
        4.95
        2000-09-02
        When Carla meets Paul at an ornithology 
  conference, tempers fly as feathers get ruffled.

Expected output format 1:

  
    
        
            72771KAM3
            US72771KAM36

XQuery that I'm using for format 1:

  for$x in //book_xref/xref
  return $x

Question for format 1: I tried including book id separately so that it's included in the output but it doesn't match the expected format as I mentioned above. How do I get the book id to be fetched as well in the output per the format?

Expected output format 2 (comma delimited):

  book_id, xref_type, xref_type_id, xref
  6636551, Fiction, 1, 72771KAM3
  6636551, Non_Fiction, 2, US72771KAM36
  119818569, Fiction, 1, 070185UL5
  119818569, Non_Fiction, 2, US070185UL50
  etc.

Question for format 2: How can I get output in comma delimited format through XQuery? Do I need to stick to XSLT for that?

I appreciate your response.

Martin Honnen · Accepted Answer

For the CSV you can use string-join i.e. for those four values you can use

//book//book_xref/xref/string-join((ancestor::book/@id, @type, @type_id, .), ',')

which would give a sequence of strings with the record data; if you want a single string with the header line and those data lines you can use another string-join:

string-join(('book_id,xref_type,xref_type_id,xref', //book//book_xref/xref/string-join((ancestor::book/@id, @type, @type_id, .), ',')), '
')

For the transformation/XML extraction reconstruct the book elements with the xref descendants and add the master_information e.g.

//book[.//book_xref/xref]/{master_information}

How to use XQuery to extract specific XML records and output in comma delimited format?

Answers (2)

Related Questions