Reputation: 2961
Now that the Google search API has been discontinued - what is the best way to retrieve search results programmatically?
I need to get a list of files that have been indexed by google in my web site, so that I can write a script using that data.
What I want to do is delete everything cached under http://mysite.com/mypdfs/
i.e.
search for "site:mysite.com/mypdfs",
to get back a list of pdfs on mysite.com:
http://mysite.com/pdf/1.pdf
http://mysite.com/pdf/1.pdf
...
http://mysite.com/pdf/1000000.pdf
etc
Then use WebDriver to push them through the webmaster removal tool.
Happy to pay for the privilege if required...
Upvotes: 1
Views: 574
Reputation: 143946
You'll have to setup a Custom Search and use the new Custom Search API. It's similar to the old deprecated search API and does JSON or Atoms.
Upvotes: 1