The main goal was given a gene name find all disease associated, also can be given a disease name find all gene associated in large data set.
I need you to regenerate the index files and itemsets for those large data set and implement the FP growth algorithm to mine the frequent pattern in the itemsets files, and do the evaluation.
Also need to build a website to show the frequent pattern result.
If you are good at java, mysql ,data mining, apache lucene, please contact me.
See the detail at attached document.
Please finish in 3 weeks.
9 freelancers are bidding on average $650 for this job
I was written an FIM apriori algorithm last month by C so I think I can extend it to FP-growth algorithm as well. But I am familiar with C so I will program by C. Thanks,