İsmet Alkan
İsmet Alkan

Reputation: 5447

Nutch Raw Html Saving

I'm trying to get raw html of crawled pages in different files, named as url of the page. Is it possible with Nutch to save the raw html pages in different files by ruling out the indexing part?

Upvotes: 2

Views: 2596

Answers (1)

Tejas Patil
Tejas Patil

Reputation: 6169

The is no direct way to do that. You will have to do few code modifications. See this and this.

Upvotes: 2

Related Questions