Scrapy: Selector returns full element with .extract (but assigns data correctly)

Question

I have recently started learning Scrapy (and Python for that matter) but have encountered a peculiar issue that so far I have not been able to find an explanation for. I managed to find a workaround (see below), but am curious to understand the reason behind the .extract() behavior.

Running the following in my parse function

item['stops'] = response.xpath('//td[@class="station"]/a[@href]/text()').extract

results in Scrapy writing the full string(?) to the defined output CSV instead of the data, like so:

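<bound method SelectorList.extract of [<Selector xpath=... data=...>,
 <Selector xpath=... data=...>,
 ...
 <Selector xpath=... data=...>]>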

The data is correctly assigned, but it doesn't get passed through as such to the element. Other queries that use .re() instead of .extract() work fine. Surprisingly, the above query also works fine if I run it as follows:

item['stops'] = response.xpath('//td[@class="station"]/a[@href]/text()').re('.*')


Answers (1)
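
A likely explanation, going only by the code in the question: the line ends in .extract without parentheses, so the method is never called and the item field receives the bound method object itself. The .re('.*') variant works because it is actually called. The sketch below shows the same Python behavior without Scrapy; StopList and the stop names are hypothetical stand-ins made up for illustration, not Scrapy's real SelectorList or the asker's data.

```python
class StopList(list):
    """Hypothetical stand-in for a selector list (NOT Scrapy's real class)."""

    def extract(self):
        # Return the contents as a plain list of strings.
        return [str(s) for s in self]


# Made-up sample values, used only for illustration.
stops = StopList(["Stop A", "Stop B"])

not_called = stops.extract    # no parentheses: a reference to the bound method
called = stops.extract()      # parentheses: the method runs and returns a list

# The uncalled version is a method object; its repr starts with "<bound method".
print(callable(not_called), repr(not_called).startswith("<bound method"))

# The called version is an ordinary list of strings.
print(called)
```

If that is what is happening here, adding the parentheses in the parse function, i.e. ending the line with .extract() instead of .extract, should give the list of strings.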
