Scrapy: How to get all content HTML without "\n"

Question

Hi, I have a problem getting the HTML code without "\n". I tried the normalize-space function, but it seems to return only the first paragraph (not the whole message).

Here's the code that I am using

response.xpath("normalize-space(//div[@class = 'messageContent'])").extract_first()

URL: https://teslamotorsclub.com/tmc/threads/tesla-tsla-the-investment-world-the-2019-investors-roundtable.139047/

Without Normalize-space

 class="sample">

Sample Message

With Normalize-space

Sample Message

What I want is to also keep the HTML code, but without "\n":

 class="sample">
Sample Message

Tomáš Linhart · Accepted Answer

If all you want is to remove the newline characters from the output, call replace on the extracted string:

response.xpath("//div[@class = 'messageContent']").extract_first().replace('\n', '')
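For context, normalize-space() takes the string value of only the first matching node and returns text without markup, which is why the HTML was lost in the question. The one-liner above also raises an error if nothing matches, because extract_first() returns None in that case. Here is a minimal sketch of a safer variant. The helper names strip_newlines and collapse_whitespace and the sample string are hypothetical, and the sample only stands in for what extract_first() might return:

```python
import re


def strip_newlines(html):
    """Remove newline characters from an extracted HTML string.

    Returns None unchanged, since extract_first() yields None on no match.
    """
    if html is None:
        return None
    return html.replace("\r\n", "").replace("\n", "")


def collapse_whitespace(html):
    """Alternative: squeeze every run of whitespace into a single space."""
    if html is None:
        return None
    return re.sub(r"\s+", " ", html).strip()


# Hypothetical fragment standing in for the result of
# response.xpath("//div[@class = 'messageContent']").extract_first()
sample = '<div class="sample">\n\nSample Message\n</div>'

cleaned = strip_newlines(sample)
collapsed = collapse_whitespace(sample)
```

In a spider this would be used as strip_newlines(response.xpath("//div[@class = 'messageContent']").extract_first()). Removing newlines outright can glue together words that were separated only by a line break, so collapse_whitespace may be the better choice when the text is going to be read.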
