Reputation: 1701
I'm a webmaster and I'm trying out Watir ruby gem that controls actions of a browser.
I know that a lot of developers use Watir for testing but I also see the use of Watir to scrape content from websites.
I wonder as a webmaster, can I detect such usage? Can I also detect that the scraper is using Watir?
Also how can I stop Watir?
Upvotes: 2
Views: 456
Reputation: 1905
You might be detect if someone scrapes your site if you monitor the usage of that user - if the user is doing things very fast then it's probably a robot.
You cannot detect however if the scraper is using Watir or any other scraping tool.
Upvotes: 1
Reputation: 57312
I am not sure if you could detect if there is a human or a Watir script behind the browser visiting your site. Watir drives real browsers, so filtering by user agent would not help.
If you have a tool that allows you to monitor traffic in real time, you could detect screen scraping by a lot of traffic to the same IP. You could then (temporarily) block the IP.
At the moment I can not think about any other way to block screen scraping. Files like robots.txt
are just a convention, the script can ignore it.
Upvotes: 3