I am planning on starting a search engine that will crawl a handful of sites (100 max). It will crawl nightly.
I have reviewed multiple search engine platforms and I think that Nutch is what I want to use. I don't, however, know how to work with Java based applications.
You are bidding on installation of Nutch ([url removed, login to view]) on a webserver. The current development box will be on a Windows 2003 server box running apache 2.x.
You must also provide me with some basic instructions on modifying the Nutch templates to reflect my search engine and adding advertisements (google, yahoo, etc.). While Nutch is a java application, php must be on the server for future addons I will code myself. The basic search page must be in PHP and not jsp pages.
I also need basic instructions on how to index the different websites and where to point the crontab in order for it to index nightly.
I'm essentially just asking for an installation and consultation. If you've worked with Nutch before this will be a VERY easy project for you.