How do I extract text with the Xpath in Scrapy?

Question

Alright, it seems like I've tried everything and just can't figure out what is going on here. The direct link to the site I am trying to scrape is http://www.ammofast.com/collections/pistol/products/aguila-25-auto-acp-full-metal-jacket-50-gr-1000-round-case

And I want to extract the title and the price. My best attempts just end up returning an empty []

User-Ps-MacBook-Pro:ammo user$ scrapy shell "http://www.ammofast.com/collections/pistolproducts/aguila-25-auto-acp-full-metal-jacket-50-gr-1000-round-case"
...
>>> response.xpath('//div[@id="product-header-title"]/h1[@id="product-title"]/text()').extract

I have tried a handfull a different variations but all to no avail. I guess I don't quite fully understand the XPath structure just yet, but that's why I'm practicing :) Thanks for the help! I am on Scrapy 1.1.0 as well.

Padraic Cunningham · Accepted Answer

The price is loaded using Javascript, in the source you can see:

window.ShopifyAnalytics.lib.track(
          "Viewed Product",
          {"id":705964349,"name":"Aguila .25 Auto ACP Full Metal Jacket 50 Gr 1000 Round Case - 1000 \/ 50","price":"329.99","currency":"USD","sku":"","brand":"Aguila","category":".25 ACP","nonInteraction":true}
        );

            });

You can see if you look at the source returned that there is actually no text in p class="price":

What you could do is parse that function code or get the price from the option tag:

In [15]: response.xpath("//*[@id='product-title']/text()")
Out[15]: []

In [16]: response.xpath("//option/text()")
Out[16]:  []

How do I extract text with the Xpath in Scrapy?

Answers (1)

Related Questions