Searching for text in all nodes with xpath

Question

I'm trying to find words in a fragment of html to replace them with a href. Somehow can't find the right path to use for Xpath. Example:

require 'nokogiri'

html = '
A paragraph Apple

Apple

  Item 1
  Apple Apple
  Apple
  Orange

AppleApple'

doc = Nokogiri::HTML.fragment(html)
doc.xpath('.//*[text()="Apple"]').each do |node|
  puts "
"
  puts node.name
  puts node.content
  puts node.replace('REPLACED')
end

puts doc.to_html

Result:

span
Apple
REPLACED

strong
Apple
REPLACED

li
Apple
REPLACED

i
Apple
REPLACED
A paragraph Apple

REPLACED


  Item 1
  Apple REPLACED
  REPLACED
  Orange

REPLACEDApple

So the words in the root p elements are not replaced and one in the li is left. Which path should i use in this case to search in root and all children? Reading on a page like this .//* should be the path used to search in root and child nodes. Any ideas on how to handle this correctly with nokogiri or xpath?

Thanks in advance!

Eric Duminil · Accepted Answer

You're looking for nodes where the whole text is equal to "Apple", not nodes which contain "Apple"

html = '
A paragraph Apple

Apple

  Item 1
  Apple Apple
  Apple
  Orange

AppleApple
Dont replace!
'

doc = Nokogiri::HTML.fragment(html)

doc.traverse do |node|
  if node.text?
    node.content = node.content.gsub('Apple', 'REPLACED')
  end
end

puts doc.to_html

It outputs :

A paragraph REPLACED

REPLACED

  Item 1
  REPLACED REPLACED

  REPLACED
  Orange

REPLACEDREPLACED
Dont replace!

Searching for text in all nodes with xpath

Answers (1)

Related Questions