How to scrape only text?

Question

Code:

import scrapy

class BlogSpider(scrapy.Spider):
    name = 'bijouterie'
    start_urls = ['https://www.example.com']

    def parse(self, response):
        for post in response.css('#engine-results .drs'):
            yield {'title': post.css('a.moodalbox.response').get()}

Run command (Windows 10):

scrapy runspider C:\Users\DELL\Desktop\icscrape\bijouterie.py -o posts.csv

The CSV file: https://pastebin.com/qEQTKEcC

I want to scrape only the text of each link, not the entire HTML element.

Ayush · Accepted Answer

Just add ::text to the end of your CSS selector, like this:

{'title': post.css('a.moodalbox.response::text').get()}
