Get all text including html in a single node scrapy xpath

Question

response.xpath('//*[@id="blah"]//text()')

Suppose my html is

This is a simple text foo and this is after tag.

What happens is that I get a list of strings even though it is a single tag, such as:

[u'This is a simple text', u' and this is after tag.']

My actual HTML content is huge, and I have to join the pieces to get a single string. I also lose foo when I join. Is there a specific XPath or Scrapy mechanism for doing this?

I want to get the result: This is a simple text foo and this is after tag.

Please note that foo is included here too.

Thanks

Andersson · Accepted Answer

You can get all text nodes as a single string as below:

response.xpath('string(//*[@id="blah"])').get()

Output:

'This is a simple text foo and this is after tag. '
