Is it possible to find the .. text, when any of the .. value is known?

Question

I have an webpage which has the similar kind of html format as below:



 .... 
  .
  .
  .
 alo 
 foo 
 bla bla

Now, I know only the value bla bla, base on the value can we track or find the 3rd last .. value(which is here alo)? I can track those,with the help of HREF values,but the HREF values are not fixed always, they can be anything anytime.

the Tin Man · Accepted Answer

Extracting every from an HTML document is easy, but it's not a foolproof way to navigate the DOM. However, given the limitations of the sample HTML, here's a solution. I doubt it'll work in a real-world situation though.

Mechanize uses Nokogiri internally for its heavy lifting so doing require 'nokogiri' isn't necessary if you've already required Mechanize.

require 'nokogiri'

doc = Nokogiri::HTML::DocumentFragment.parse(< alo 
 foo 
 bla bla 
EOT

doc.search('td')[-3].at('a')['href']
=> "http://www.edu/st/file.html"

How to get the Nokogiri document from the Mechanize "agent" is left as an exercise for the user.

Is it possible to find the <td> .. </td> text, when any of the <td>..</td> value is known?

Answers (2)

Related Questions

Is it possible to find the &lt;td&gt; .. &lt;/td&gt; text, when any of the &lt;td&gt;..&lt;/td&gt; value is known?

Answers (2)

Related Questions

Is it possible to find the <td> .. </td> text, when any of the <td>..</td> value is known?