Skip to content

Understanding how to use partitioner class in MapReduce for better analysis.

Notifications You must be signed in to change notification settings

srjsunny/Employee-CTC-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Employee-CTC-Analysis

The objective of this project is to develop a MapReduce job and get details of employee, male and female seperately with highest CTC in each department.

Data Format:

It's tab seperated
empid name age gender dept CTC

Prerequisite:

  • Use Ubuntu or any flavour of Linux with following installed
    • Hadoop 2.x or above
    • JDK

Idea:

The idea here to write custom partitioner class to partition the data based on gender before sending it to the reducer.

Output:

  • We can see the partitioned records here.

  • Male records

  • Female records

About

Understanding how to use partitioner class in MapReduce for better analysis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages