Annulleret

Build a hadoop program

As this is a continuous assignment i am including the description of the test 1 and solution file for the same but i want the solution for test 2. Also included the data file for this test. This is a hadoop program basically.

Test 1 - Python

Use data set from the files movie ratings 1 million records ([login to view URL], [login to view URL], [login to view URL]). Please make Python/Mapreduce code (mapper and reducer) to answer the following research question:

"What are the most popular movies for different age groups?"

Data set [login to view URL] has an information about age groups

* 1: "Under 18"

* 18: "18-24"

* 25: "25-34"

* 35: "35-44"

* 45: "45-49"

* 50: "50-55"

* 56: "56+"

Your code should be able to provide a movie ID for the movie that has the highest number of ratings and that number for each age group. If you want, you can also provide the name of the movie as well. However, this is optional.

To achieve the first task, you can join [login to view URL] and [login to view URL] and get most popular movies IDs.

For the optional task, you can produce two mapreduce programs (that is, mapper1, reducer1, mapper2, reducer2). The first one will join [login to view URL] and [login to view URL] and get most popular movies IDs. The second one will join your result with [login to view URL] and output movie titles. If you go this way, you should provide me an instruction what mapper/reducer use first and what data to load in each of them.

Your submission will include three files: mapper, reducer and result output from Hadoop (part-00000 file). If you decide to go with the optional task, then you will submit more files and an instruction how to use them. Either way - you don't need to submit data files.

Hadoop Test 2 - Pig

Your test 2 is to finish the optional task the same as in test 1, i.e., provide a movie name for the movie that has the highest number of ratings and that number for each age group.

The only difference - now you have to use Pig and PigLatin. This task requires "normal" programming logic: load three data sets, join first and second, then join resulted set with the third one, group, aggregate, probably group again to find maximum.

You have to submit two files - PigLatin script and Hadoop/MapReduce output with results.

Evner: Big Data Sales, Datasøgning, Hadoop, Java, Python

Se mere: how to run wordcount program in hadoop in ubuntu, how to run wordcount program in hadoop in windows, mapreduce example problems, mapreduce programming in java examples, word count mapreduce program in eclipse, mapreduce, hadoop architecture, hadoop wordcount example source code, build p2p program, build reward program, build ajax program interact handheld socket, build bookkeeping program, java build autoresponder program, build radian6 program, build a javascript library for mp3 and mp4 player, build a landing page for personal use, build a program, build a responsive website for limo service with online booking, build a wordpress template for my new blog, build a wordpress website for acne treatment & include the video i provide within the design

Om arbejdsgiveren:
( 0 bedømmelser ) Adelaide, Australia

Projekt ID: #19858406

9 freelancere byder i gennemsnit $81 på dette job

tausy

Hi, I'm a Hadoop developer with over 5 years of experience and expertise working on different tools and technologies including sqoop, flume, oozie, hive, pig and spark. I have delivered over 100 projects here on fre Flere

$80 AUD in 3 dage
(57 bedømmelser)
5.3
mikeitexpert

Dear Employer I have extensive experience in map reduce programming using hadoop and java. I can finish the work as per your requirements. Please let me know if you are interested.

$75 AUD in 3 dage
(53 bedømmelser)
5.1
iMonte555

Hello How are you? I've read carefully your job description. I have more than 2 years experience in these parts. Your satisfaction with the project is my top priority! If you give me a chance to work with you, the Flere

$80 AUD in 3 dage
(9 bedømmelser)
4.0
pradeepred

Have 10 years of IT experience with more than 4.5 years of experience in hadoop technologies like hive,pig,spark,sqoop,map reduce and [login to view URL] have very good experiemcence in Java,scala,Python and shell scripting. Flere

$88 AUD in 2 dage
(11 bedømmelser)
4.0
DataLamp

Hi, I am writing to you today as I would like to draw your attention towards my company Data Lamp. Our company is into Big Data, Spark, Flow Designing/Optimizations, Research & Development, Algorithms (Graph Theory, D Flere

$66 AUD in 3 dage
(1 bedømmelse)
2.6
rbagdiya

Hello I am good at hadoop ecosystem. I have gone through your problem statement and I can solve your second problem. Hadoop Test 2 - Pig. lets chat to explore more

$70 AUD in 4 dage
(1 bedømmelse)
1.4
akramamu08

Hi I have good experience in hadoop and map reduce programming . I have 4+ experience in Hive , Map Reduce and Pig . Please provide the opportunity to start the work . Thanks Akram

$88 AUD in 3 dage
(1 bedømmelse)
1.2
techlinesols6

Dear Prospect Hiring Manager. Thank you for giving me a chance to bid on your project. i am a serious bidder here and i have already worked on a similar project before and can deliver as u have mentioned "I can do th Flere

$72 AUD in 7 dage
(2 bedømmelser)
0.0
vedchauhan14

I have already worked on these movielen data set. I am fast, accurate and reliable, results oriented Virtual Assistant. Believe in delivering accurate results within the expected turnaround time.I have 8 years of work Flere

$111 AUD in 6 dage
(0 bedømmelser)
0.0