Reputation: 30034
I'm working on information extraction and need a tool to crawl data from web pages. Is there a popular one for Windows?
Upvotes: 0
Views: 1558
Reputation: 188014
From: http://en.wikipedia.org/wiki/Web_crawler:
And some reading, Spidering Hacks: 100 Industrial-Strength Tips & Tools:
Written for developers, researchers, technical assistants, librarians, and power users, Spidering Hacks provides expert tips on spidering and scraping methodologies. You'll begin with a crash course in spidering concepts, tools (Perl, LWP, out-of-the-box utilities), and ethics (how to know when you've gone too far: what's acceptable and unacceptable). Next, you'll collect media files and data from databases. Then you'll learn how to interpret and understand the data, repurpose it for use in other applications, and even build authorized interfaces to integrate the data into your own content.
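If you would rather see the basic idea before picking a tool, here is a minimal sketch of a breadth-first crawler using only the Python standard library, which runs on Windows. It is not taken from the book or the Wikipedia page; the names `LinkExtractor`, `extract_links`, `fetch` and `crawl` are made up for illustration, and it assumes pages are HTML that can be decoded as UTF-8.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html, base_url):
    """Return absolute URLs (fragments removed) for the links in html."""
    parser = LinkExtractor()
    parser.feed(html)
    return [urldefrag(urljoin(base_url, h)).url for h in parser.hrefs]


def fetch(url):
    """Download a page as text; hypothetical helper with no retry logic."""
    with urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(start_url, fetch_page=fetch, max_pages=10):
    """Breadth-first crawl; returns {url: html} for the pages visited."""
    seen = {start_url}
    queue = deque([start_url])
    pages = {}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        try:
            html = fetch_page(url)
        except OSError:
            continue  # skip pages that fail to download
        pages[url] = html
        for link in extract_links(html, url):
            # Only follow web links; this drops mailto:, javascript:, etc.
            if link.startswith(("http://", "https://")) and link not in seen:
                seen.add(link)
                queue.append(link)
    return pages
```

Calling `crawl("http://example.com/", max_pages=5)` would return a dict mapping each visited URL to its HTML. The sketch does not check robots.txt, rate-limit requests, or stay on one domain, so a real crawl needs those added (the standard library's `urllib.robotparser` can help with the first).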
Upvotes: 9