Nokugiri web scrape issue

Question

Seeming as i didn't ask this very well the first time. Heres another go.

I'm trying to follow this tutorial here: http://railscasts.com/episodes/190-screen-scraping-with-nokogiri

I'm currently also trying to scrape the price from this website link here: http://www.ticketmaster.co.uk/derren-brown-miracle-glasgow-04-07-2016/event/370050789149169E?artistid=1408737&majorcatid=10002&minorcatid=53&tpab=-1

What i'm wanting to achieve is to have all three of these ticket (name and price hopefully as much information about the tickets/prices as possible) and use them in my web application.

I can't show you the result, Its stupidly big in size, But i can tell you that i don't hit the second byebug, Heres my code.

  url = "http://www.ticketmaster.co.uk/derren-brown-miracle-glasgow-04-07-2016/event/370050789149169E?artistid=1408737&majorcatid=10002&minorcatid=53&tpab=-1"
    doc = Nokogiri::HTML(open(url))
    byebug
    doc.css(".item").each do |item|
      title = item.at_css(".fru").text
      byebug
    end

Unfortunately to help you'll ideally have to try this yourself to see the horrible page size! haha!

Edit, Ok baring in mind my screen is 27 inches, The text FILLS the screen

Heres an image of what i got in from the first image.

Further to this i believe that this image here is all i need? its just getting it out.

Thanks Sam

Volodymyr Balytskyy · Accepted Answer

The main issue here is that the price is written inside javascript and not html itslef. Nokogiri only parse XML and HTML, therefore you need help of awesome REGEX. Before you read the full code, here a few tips to undestand it.

First I search for all tags named

Nokugiri web scrape issue

Answers (1)

Related Questions