Get list of files from Apache index page using Ruby/Rails

Question

I am attempting to create a radar animation using data from the National Weather Service. For static images they make it easy by always having the same filename. However, for the historical images, they are timestamped, and always change. Thus, to get the previous N images, you would have to know the filenames beforehand. They do, however, provide a directory which provides a listing for each site. See the example here: http://radar.weather.gov/ridge/RadarImg/N0R/FWS/

What I need is from my Rails app to extract the last N images from that directory. Is that possible? I could imagine one option would be to download and then scrape that page, but I am assuming there is a better way?

Thanks!

stew · Accepted Answer

Following on from the above you could try something like I just tried in the console..

require 'open-uri'
require 'nokogiri'    
doc = Nokogiri::HTML(open('http://radar.weather.gov/ridge/RadarImg/N0R/FWS/'))
doc.xpath('//table/tr/td').each do |tabrow|
  puts tabrow.content
end

That's a pretty basic stab in the dark but should give you food for thought to get you on the way

Get list of files from Apache index page using Ruby/Rails

Answers (2)

Related Questions