Closed

Build a Hadoop program -- 2

This is a continuing assignment, so I am including the description of Test 1 and its solution file, but I want the solution for Test 2. The data file for this test is also included. This is essentially a Hadoop program.

Test 1 - Python

Use the movie ratings data set of 1 million records from the files ([login to view URL], [login to view URL], [login to view URL]). Please write Python MapReduce code (a mapper and a reducer) to answer the following research question:

"What are the most popular movies for different age groups?"

The data set [login to view URL] contains information about age groups:

* 1: "Under 18"

* 18: "18-24"

* 25: "25-34"

* 35: "35-44"

* 45: "45-49"

* 50: "50-55"

* 56: "56+"

Your code should output, for each age group, the ID of the movie that has the highest number of ratings, together with that number. If you want, you can also provide the name of the movie, but this is optional.
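The counting step described above can be sketched as a streaming-style reducer. This is only a minimal sketch: it assumes the input has already been joined into tab-separated `age<TAB>movie_id` lines, one per rating, and the function name `top_movie_per_age` is hypothetical, not part of the assignment.

```python
from collections import Counter, defaultdict


def top_movie_per_age(lines):
    """Return {age_code: (movie_id, rating_count)} for the most-rated movie.

    Assumption: each input line is 'age<TAB>movie_id', one line per rating.
    """
    counts = defaultdict(Counter)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        age, movie_id = line.split("\t")
        counts[age][movie_id] += 1
    # most_common(1) gives the (movie_id, count) pair with the highest count
    return {age: counter.most_common(1)[0] for age, counter in counts.items()}
```

In a Hadoop Streaming reducer, `lines` would typically be `sys.stdin`, and each result would be printed as one tab-separated line.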

To achieve the first task, you can join [login to view URL] and [login to view URL] and get the IDs of the most popular movies.
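One possible shape for this join is a reduce-side join keyed on user ID. The sketch below assumes, without confirmation from the task text, that the two files use `::` as a field separator, with user records laid out as `UserID::Gender::Age::Occupation::Zip` and rating records as `UserID::MovieID::Rating::Timestamp`. The function names are hypothetical, and the sort stands in for Hadoop's shuffle phase.

```python
from itertools import groupby


def map_line(line):
    """Tag a raw record with its source and key it by user ID.

    Assumed layouts (not confirmed by the task text):
      users:   UserID::Gender::Age::Occupation::Zip   (5 fields)
      ratings: UserID::MovieID::Rating::Timestamp     (4 fields)
    """
    fields = line.strip().split("::")
    if len(fields) == 5:
        return (fields[0], "U", fields[2])  # user ID, tag, age code
    if len(fields) == 4:
        return (fields[0], "R", fields[1])  # user ID, tag, movie ID
    return None  # malformed line


def join_by_user(tagged):
    """Reduce-side join: yield (age, movie_id) for every rating."""
    # Sorting by user ID mimics the shuffle. "R" sorts before "U", so the
    # ratings are buffered until the user's age record has been seen.
    ordered = sorted(t for t in tagged if t is not None)
    for _, group in groupby(ordered, key=lambda t: t[0]):
        age, movies = None, []
        for _, tag, value in group:
            if tag == "U":
                age = value
            else:
                movies.append(value)
        if age is not None:  # drop ratings from unknown users
            for movie_id in movies:
                yield (age, movie_id)
```

Telling the two inputs apart by field count keeps the mapper independent of file names; if the real files do not differ in field count, another tagging rule would be needed.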

For the optional task, you can produce two MapReduce programs (that is, mapper1, reducer1, mapper2, reducer2). The first one joins [login to view URL] and [login to view URL] and gets the IDs of the most popular movies. The second one joins your result with [login to view URL] and outputs the movie titles. If you go this way, you should provide instructions stating which mapper and reducer to run first and what data to load into each of them.
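The second stage can be small, because the first job's output has only one row per age group. A hedged sketch follows, assuming the movie file uses the layout `MovieID::Title::Genres` and that the first job emits `(age, movie_id, count)` rows. Both are assumptions, as are the function names and the sample titles in the usage note.

```python
def load_titles(movie_lines):
    """Build {movie_id: title} from lines assumed to be MovieID::Title::Genres."""
    titles = {}
    for line in movie_lines:
        fields = line.strip().split("::")
        if len(fields) >= 2:
            titles[fields[0]] = fields[1]
    return titles


def attach_titles(top_rows, titles):
    """Append the title to each (age, movie_id, count) row from the first job."""
    return [
        (age, movie_id, titles.get(movie_id, "UNKNOWN"), count)
        for age, movie_id, count in top_rows
    ]
```

For example, `attach_titles([("25", "10", 2)], load_titles(["10::Sample A (2000)::Drama"]))` returns `[("25", "10", "Sample A (2000)", 2)]`.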

Your submission will include three files: the mapper, the reducer, and the result output from Hadoop (the part-00000 file). If you decide to do the optional task, you will submit more files plus instructions on how to use them. Either way, you don't need to submit the data files.

Hadoop Test 2 - Pig

Test 2 is to complete the same optional task as in Test 1, i.e., for each age group, provide the name of the movie that has the highest number of ratings, together with that number.

The only difference is that you now have to use Pig and Pig Latin. This task requires "normal" programming logic: load the three data sets, join the first and second, join the resulting set with the third, group, aggregate, and probably group again to find the maximum.
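The sequence of steps above can be illustrated in plain Python before it is written in Pig Latin. This is not a Pig script, only a sketch of the same dataflow. It assumes `::`-separated files with the layouts noted in the comments, and all function names are hypothetical.

```python
from collections import Counter


def load(lines, sep="::"):
    """LOAD: split each non-empty line into a tuple of fields."""
    return [tuple(line.strip().split(sep)) for line in lines if line.strip()]


def join(left, left_col, right, right_col):
    """JOIN: inner hash join that returns concatenated tuples."""
    index = {}
    for row in right:
        index.setdefault(row[right_col], []).append(row)
    return [l + r for l in left for r in index.get(l[left_col], [])]


def most_rated_by_age(user_lines, rating_lines, movie_lines):
    users = load(user_lines)      # assumed: (user_id, gender, age, occupation, zip)
    ratings = load(rating_lines)  # assumed: (user_id, movie_id, rating, timestamp)
    movies = load(movie_lines)    # assumed: (movie_id, title, genres)

    rated = join(ratings, 0, users, 0)  # age ends up in column 4 + 2 = 6
    named = join(rated, 1, movies, 0)   # title ends up in column 9 + 1 = 10

    # GROUP by (age, title) and COUNT the ratings in each group
    counts = Counter((row[6], row[10]) for row in named)

    # GROUP again by age and keep the title with the maximum count
    # (on a tie, the first title encountered is kept)
    best = {}
    for (age, title), n in counts.items():
        if age not in best or n > best[age][1]:
            best[age] = (title, n)
    return best
```

Each helper corresponds roughly to one relational operator, so the Pig Latin version would follow the same order: load, join, join, group and count, then group again to pick the maximum.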

You have to submit two files: the Pig Latin script and the Hadoop/MapReduce output with the results.

Skills: Big Data Sales, Data Mining, Hadoop, Java, Python


About the Employer:
(0 reviews) Adelaide, Australia

Project ID: #19875458

9 freelancers are bidding an average of $82 for this job

utkarshkatiyar19

Hi, I'm an expert in working with Hadoop. I'm sure that I can easily do this project. We can have a chat about it. Thanks.

$120 AUD in 3 days
(317 reviews)
7.2
mikeitexpert

Dear Employer, I have extensive experience in Java-based MapReduce models. Please let me know if you are interested. Regards

$65 AUD in 10 days
(53 reviews)
5.2
pradeepred

I have 10 years of IT experience, with more than 4.5 years of experience in Hadoop technologies like Hive, Pig, Spark, Sqoop, MapReduce and [login to view URL] have very good experience in Java, Scala, Python and shell scripting.

AUD in 1 day
(11 reviews)
4.0
vishalsha95570

I am an expert and can do it in [login to view URL] you want to implement your idea, then I am always ready for you. Being a professional developer means understanding all the requirements of the project and finding the best way to impl...

AUD in 1 day
(9 reviews)
3.6
DataLamp

Hi, I am writing to you today as I would like to draw your attention to my company, Data Lamp. Our company is into Big Data, Spark, Flow Designing/Optimizations, Research & Development, Algorithms (Graph Theory, D...

$66 AUD in 10 days
(1 review)
2.6
Snake4eva

Hi, I am skilled in Python and would like to try your project. I have only fooled around with Hadoop briefly, but once the description of the assignment is detailed I'm sure I'll be able to fill the gaps and complete the proje...

$70 AUD in 3 days
(1 review)
1.5
ambajiinfotech

I can work on this right away, but I will need some elaboration on Pig and Pig Latin. I have searched online but I am not sure. Please confirm and I can start right away.

$100 AUD in 2 days
(1 review)
1.5
twoandway

Hi. I am an expert front-end developer with experience across a wide variety of technologies. Excellent in both creating complex and highly scalable backend services and in designing complicated UI. My focus is on...

$45 AUD in 3 days
(0 reviews)
0.0
divyajain987

I am a certified Hadoop developer and HBase expert. I have worked on similar solutions in the past. I would be able to provide a solution for this in Java.

$100 AUD in 2 days
(0 reviews)
0.0