Reputation: 3819
In nutch, when I crawl and then re-crawl, duplicated segments are created. how can I delete the old ones?
I can't know for sure that only the segments that were created in the latest crawl are used and all the others can be deleted, can I?
Upvotes: 0
Views: 676
Reputation: 6169
I can't know for sure that only the segments that were created in the latest crawl are used and all the others can be deleted, can I?
The segments created in the last crawl are useful and others can be deleted.
Upvotes: 1