Article:  Introduction to Nutch, Part 2: Searching
Subject:  Index wasn't updated after running recrawl script
Date:  2006-02-17 11:16:27
From:  nutchnewbie


First of all, thank you for your instructional article.

I tried to update my index using the recrawl script from your article, but the index was not updated. Here is what I did:

1. bin/nutch crawl urls -dir crawl-tinysite -depth 3
2. nutch search works fine.
3. I added a new link to tinysite/A.html
4. ./recrawl crawl.tinysite 3
After the recrawl, the index was not updated. Judging from the output below, the fetcher did not generate any new entries:
parsing file:nutch-0.7.1/conf/nutch-default.xml
parsing file:nutch-0.7.1/conf/nutch-site.xml
No FS indicated, using default:local
FetchListTool started
Overall processing: Sorted 0 entries in 0.0 seconds. <---
Overall processing: Sorted NaN entries/second <---
FetchListTool completed

Could you please tell me what caused the problem? Thank you.

Fan
