Get Text between two tags using nokogiri

Question

My HTML structure is


    Header
    Mailing Address
    2349 Glorem ipsun lorem ipsum  CA 95833

    
    

    Phone: 111-111-2111    Fax: 111-511-1111

    some text

    some address

          

    Contact(s)

The HTML page contains several

elements. For each div i need to extract Phone and Fax in a array with other data. I tried using

doc.css("div#ctl00_cphContent_divBrowseByMember").each do |div|
  div.css("div.line").each do |line|
    line.xpath('//text()[preceding-sibling::br and following-sibling::a]').text.strip
  end
end

It returns nothing and returns time out error. If I try as line.xpath('//text()[preceding-sibling::br and following-sibling::a]')[0].text.strip will return same Phone and fax for all other divs. Please suggest any other solution that will help me.

pguardiario · Accepted Answer

The easy way:

phone, fax = line.text.scan /\d{3}-\d{3}-\d{4}/

Get Text between two tags using nokogiri

Answers (1)

Related Questions