If you like interesting projects and challenges, and you are a very good php programmer, then this project is for you.
I have a database that has thousands of records that are related to a niche market. I would like to have a script that could run through these news and depending on its title and content, could group news which are related together.
I would provide you with the database that has such news so you could do all the tests you need to.
There are plenty of Wordpress plugins that do this (show related links, based on content), but I have seen that such scripts are not that precise when showing related content together. An example of such script is the following one,
[url removed, login to view]
but there are many more (any related links WP plugin is doing something similar to this). My site runs in Drupal, but i would need this to be a stand alone php script that could be running as a cron job or when called upon.
The database has many, many news, so I would need this script to be very precise in how it groups the news, so that when a person sees a page that displays such links, will see that content is very much related.
The challenges here are twofold:
1.- The script would need to decide, based on the titles of the news, what "groups" of related news to form... so the moment it sees taht for example there are at least 10 news which have a similar content, it should group them together, for example putting them a record in a table.
2.- It should be very precise to group related content together.
I have more projects for the future which i could assign you if you do a good job at this. Thanks!
Some research I did is the following:
So if your are able to understand those solutions suggested there and implement them according to my needs, this may be a good solution. I am open to any other good solution that may accomplish my goal.
14 freelancers are bidding on average $169 for this job
This look like an very interesting little project that I'd be happy to work on. I believe I can find a suitable algorithm to use for the required problem.