How to get multiple occurences of an element with XPath under usage of normalize-space and substring-before

Question

I have an element with three occurences on the page. If i match it with Xpath expression //div[@class='col-md-9 col-xs-12'], i get all three occurences as expected.

Now i try to rework the matching element on the fly with

substring-before(//div[@class='col-md-9 col-xs-12'], 'Bewertungen'), to get the string before the word "Bewertungen",
normalize-space(//div[@class='col-md-9 col-xs-12']), to clean up redundant whitespaces,
normalize-space(substring-before(//div[@class='col-md-9 col-xs-12'] - both actions.

The problem with last three expressions is, that they extract only the first occurence of the element. It makes no difference, whether i add /text() after matching definition.

I don't understand, how an addition of normalize-space and/or substring-before influences the "main" expression in the way it stops to recognize multiple occurences of targeted element and gets only the first. Without an addition it matches everything as it should.

How is it possible to adjust the Xpath expression nr. 3 to get all occurences of an element?

Example url is https://www.provenexpert.com/de-de/jazzyshirt/

Jack Fleeting · Accepted Answer

The problem is that both normalize-space() and substring-before() have a required cardinality of 1, meaning can only accept one occurrence of the element you are trying to normalize or find a substring of. Each of your expressions results in 3 sequences which these two functions cannot process. (I probably didn't express the problem properly, but I think this is the general idea).

In light of that, try:

//div[@class='col-md-9 col-xs-12']/substring-before(normalize-space(.), 'Bewertung')

How to get multiple occurences of an element with XPath under usage of normalize-space and substring-before

Answers (2)

Related Questions