Scrapy - use "normalize-space" with a list of items

Question

Trying to remove escape symbols and spaces from html list. I'm using normalize-space() but was not able to apply it to whole list. I'm testing my code using scrapy shell

scrapy shell https://universalmotors.ru/boardmotors/suzuki/suzuki-df-4-s/


              Мощность двигателя (л.с.)
              
                4
              
            

              Тип масла в двигателе
              
                10W-30 10W-40

Here what I tried

[item.normalize-space() for item in response.xpath('//tr[@itemprop="additionalProperty"]').extract()]

But i'm getting an error

Traceback (most recent call last):
  File "", line 1, in 
  File "", line 1, in 
AttributeError: 'str' object has no attribute 'normalize'

It is only working for

[item.strip() for item in response.xpath('//tr[@itemprop="additionalProperty"]').extract()]

then I get folowing

['
              Мощность двигателя (л.с.)
              
                4
              
            ', '
              Тип масла в двигателе

My goal is to get flowing:

Мощность двигателя (л.с.) 4
Тип масла в двигателе 10W-30 10W-40
Объем масла в двигателе 700

Tom&#225;š Linhart · Accepted Answer

normalize-space is a XPath function, not a Python function or a method of a Python object. So you need to use it in the XPath expression like this:

for item in response.xpath('//tr[@itemprop="additionalProperty"]'):
    yield {
        'name': item.xpath('normalize-space(./*[@itemprop="name"])').extract_first(),
        'value': item.xpath('normalize-space(./*[@itemprop="value"])').extract_first()
    }

Scrapy - use "normalize-space" with a list of items

Answers (2)

Related Questions

Scrapy - use &quot;normalize-space&quot; with a list of items

Answers (2)

Related Questions

Scrapy - use "normalize-space" with a list of items