Solr + Nutch + Hadoop integration

This project is strictly for people who are highly skilled in Nutch, Hadoop and Solr; integrating the three should not take more than an hour for someone who knows the job. After this, I will have more work related to search engine development, as I plan to run large-scale searches.

For now, I need a Nutch, Solr and Hadoop integration such that:

1. Hadoop will be configured on more than 2 machines, and it should be easy to add another machine to scale out the existing configuration.

2. Nutch will be used for indexing: it will pick up URLs from a flat file, read its configuration from a central settings file, and start indexing. It will use Hadoop to spread the indexing work across the other machines in the cluster. URLs that are already indexed should not be followed again unless a reindex flag is set in the settings file.

3. Nutch's output will go to Solr, and I should be able to search the indexed websites using Solr. Solr will also be integrated with Hadoop to run clustered searches.
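To make the "skip already indexed URLs unless the reindex flag is set" rule in point 2 concrete, here is a minimal stand-alone sketch in Python. It is not Nutch code: the settings section name `crawl`, the key `reindex`, and both function names are assumptions made up for illustration. In a real setup Nutch keeps its own record of fetched URLs in its crawl database, so this only shows the intended behaviour.

```python
import configparser


def load_reindex_flag(settings_text):
    """Read a hypothetical 'reindex' flag from INI-style settings text."""
    parser = configparser.ConfigParser()
    parser.read_string(settings_text)
    # Assumed layout: a [crawl] section with a boolean 'reindex' key.
    return parser.getboolean("crawl", "reindex", fallback=False)


def select_urls(seed_lines, already_indexed, reindex):
    """Return the seed URLs that should be crawled in this run.

    seed_lines      -- lines of the flat URL file (one URL per line)
    already_indexed -- set of URLs known to be indexed already
    reindex         -- when True, already indexed URLs are crawled again
    """
    selected = []
    seen = set()
    for line in seed_lines:
        url = line.strip()
        # Skip blank lines and comment lines in the flat file.
        if not url or url.startswith("#"):
            continue
        # Skip duplicates within the seed file itself.
        if url in seen:
            continue
        seen.add(url)
        if reindex or url not in already_indexed:
            selected.append(url)
    return selected
```

With the flag off, only URLs missing from the indexed set are returned; with the flag on, every unique seed URL is returned in file order.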

Initially, we will have a central server and 2 sub servers on which we can distribute search and indexing.
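One common way to spread a Solr search over a central server and sub servers is Solr's distributed search, where the request carries a `shards` parameter listing the cores to query. The sketch below only builds such a request URL and does not contact any server. The host names, the port 8983, and the `/solr/select` path are assumptions for illustration and depend on how Solr is actually deployed.

```python
from urllib.parse import urlencode


def build_distributed_query(coordinator, shard_hosts, query, rows=10):
    """Build a Solr select URL that fans the query out to several shards.

    coordinator -- host:port of the server receiving the request (assumed)
    shard_hosts -- host:port values of the servers holding index shards
    """
    params = {
        "q": query,
        "rows": rows,
        "wt": "json",
        # Solr expects a comma-separated list of shard addresses.
        "shards": ",".join(f"{host}/solr" for host in shard_hosts),
    }
    return f"http://{coordinator}/solr/select?{urlencode(params)}"


if __name__ == "__main__":
    print(build_distributed_query(
        "central.example:8983",
        ["sub1.example:8983", "sub2.example:8983"],
        "search engine",
    ))
```

Adding a third sub server then means appending one more entry to the shard list, provided that server holds its own part of the index.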

If you can also suggest ways to change ranking dynamically, I would be interested.
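One possible approach to changing ranking at query time, without reindexing, is Solr's extended DisMax parser, which accepts per-field weights (`qf`) and an optional boost query (`bq`) as request parameters. The sketch below only assembles those parameters. The field names `title` and `content` and the example weights are assumptions; they must match whatever fields the Solr schema actually defines.

```python
from urllib.parse import urlencode


def build_boosted_params(query, field_weights, boost_query=None):
    """Return Solr request parameters that weight fields at query time.

    field_weights -- mapping of field name to boost, e.g. {"title": 3}
    boost_query   -- optional extra query whose matches are ranked higher
    """
    params = {
        "q": query,
        "defType": "edismax",
        # qf takes space-separated "field^boost" entries.
        "qf": " ".join(f"{field}^{weight}" for field, weight in field_weights.items()),
    }
    if boost_query:
        params["bq"] = boost_query
    return params


if __name__ == "__main__":
    params = build_boosted_params("hadoop", {"title": 3, "content": 1})
    print(urlencode(params))
```

Because the weights travel with each request, they can be read from the central settings file and changed without touching the index.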

Let me know.


Skills: Apache Solr, PHP, Software Architecture


About the employer:
( 19 reviews ) Mumbai, India

Project ID: #4065263

3 freelancers are bidding an average of $283 for this job


We will do an excellent job for you.

$250 USD in 10 days
(251 reviews)

I have very good knowledge of Hadoop and can help you.

$100 USD in 10 days
(0 reviews)

Chris, I am a Cloudera-certified Hadoop professional.

$500 USD in 3 days
(0 reviews)