laura
laura

Reputation: 2961

How to programmatically retrieve google results?

Now that the Google search API has been discontinued - what is the best way to retrieve search results programmatically?

I need to get a list of files that have been indexed by google in my web site, so that I can write a script using that data.

What I want to do is delete everything cached under http://mysite.com/mypdfs/

i.e.
search for "site:mysite.com/mypdfs", to get back a list of pdfs on mysite.com:
http://mysite.com/pdf/1.pdf
http://mysite.com/pdf/1.pdf
...
http://mysite.com/pdf/1000000.pdf etc

Then use WebDriver to push them through the webmaster removal tool.

Happy to pay for the privilege if required...

Upvotes: 1

Views: 574

Answers (1)

Jon Lin
Jon Lin

Reputation: 143946

You'll have to setup a Custom Search and use the new Custom Search API. It's similar to the old deprecated search API and does JSON or Atoms.

Upvotes: 1

Related Questions