Reputation: 2144
I'm trying to use Goutte to scrape a web page and I can't find a DOMCrawler
method to search for actual text. Let's say there's a td
, but it has no class or ID. So, I need to search for let's say "Title" then get that td
s next sibling.
Upvotes: 0
Views: 2166
Reputation: 2047
First I would like to refer you to the DomCrawlers documentation on Node Filtering, specifically the DomCrawler::reduce() method and Accessing Node Values
If you aren't happy with accessing these then you can always drop down and use DOMNode properties
Of course you might have better luck with using XPath queries which are extremely powerful.
Upvotes: 1