How to get text of selected elements in XPath?

Question

I try to extract several forum posts by using the standard XPath method:

response.xpath('.//div[contains(@class, "Message userContent")]')

That one returns a complete list of comments as wished.

But once I include //text() or string(...) the length of the list jumps up to 100 or 150 items, which makes it impossible to grasp or to iterate over the list and join it with other data like author or the date...

normalize-space(...) only returns the first comment.

It has to do something with all the new lines and breaks in the html code but at this stage I have no idea how to handle these.

Would string-join(...[normalize-space()]) be an option here?

How to get text of selected elements in XPath?

Answers (1)

Related Questions