Scrapy web crawler not returning anything

Question

I'm working on Scrapy for the first time and I can't get this to return anything. Can someone help me understand what I'm doing wrong?

from scrapy.spider import BaseSpider
from scrapy.selector import HtmlXPathSelector

from idcode.items import StatuteItem

class IdCodeSpider(BaseSpider):
  name = "idcode"
  allowed_domains = ["idaho.gov"]
  start_urls = ["http://legislature.idaho.gov/idstat/Title1/T1CH1SECT1-101.htm"]

  def parse(self, response):
    hxs = HtmlXPathSelector(response)
    item = StatuteItem()
    item['title'] = hxs.select("//table/tbody/tr[1]/td[2]/div[2]/div[1]/div[1]/text()").extract()
    return item

I know everything else in my project is working because if I add item['title'] = "test" above return item it returns "test". So I must have something wrong with my XPath, but I tested that in the Chrome Developer Console and it's working there.

Splendor · Accepted Answer

Removing tbody resolved the issue.

item['title'] = hxs.select("//table/tr[1]/td[2]/div[2]/div[1]/div[1]/text()").extract()

Scrapy web crawler not returning anything

Answers (2)

Related Questions