Question
Write a MapReduce program to find the maximum salary of an employee for each department along with the employee Id.
Data is:
EMPNO,EMPNAME,JOB,MANAGERID,HIREDATE,SAL,COMMISSION,DEPTNO
7369,SMITH,CLERK,7902,13-06-13,800,0,20
7499,ALLEN,SALESMAN,7698,15-08-14,1600,300,30
7521,WARD,SALESMAN,7698,26-03-14,1250,500,30
7566,JONES,MANAGER,7839,31-10-14,2975,,20
7698,BLAKE,MANAGER,7839,11-06-14,2850,,30
7782,CLARK,MANAGER,7839,14-05-14,2450,,10
7788,SCOTT,ANALYST,7566,05-03-14,3000,,20
7839,KING,PRESIDENT,,09-06-14,5000,0,10
7844,TURNER,SALESMAN,7698,04-06-14,1500,0,30
7876,ADAMS,CLERK,7788,04-06-14,1100,,20
7900,JAMES,CLERK,7698,23-06-14,950,,30
7934,MILLER,CLERK,7782,21-01-14,1300,,10
7902,FORD,ANALYST,7566,05-12-14,3000,,20
7654,MARTIN,SALESMAN,7698,05-12-14,1250,1400,30
Dataset Description:
COLUMN NAME | NULL? | TYPE |
EMPNO | NOT NULL | NUMBER(4) |
EMPNAME | NOT NULL | STRING |
JOB | NOT NULL | STRING |
MANAGERID | | INT |
HIREDATE | | DATE |
SAL | | DOUBLE |
COMMISSION | | DOUBLE |
DEPTNO | NOT NULL | INT |
Output: It should include the columns - Dept, max(salary), EmpNo
Constraints:
- Use a single MapReduce job to complete the task
- Also, skip the header row while reading the file
Expected Solution: You need to paste the MR code, Hadoop commands & path of the final jar that is used to achieve this output.
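One possible way to approach the core logic is sketched below in plain Java, with no Hadoop classes involved, so it only illustrates what the map and reduce steps of a single job might do. The class name `MaxSalarySketch` and the methods `map`, `reduce`, and `run` are hypothetical names chosen for illustration. The sketch assumes the header row can be recognised by its leading `EMPNO` token, that rows without a salary are skipped, and that a tie on salary is broken by keeping the smaller EMPNO; the question does not specify any of these. In an actual Hadoop job the same logic would sit inside Mapper and Reducer subclasses, and the grouping done in `run` would be handled by the framework's shuffle.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java sketch of the map and reduce logic (hypothetical names, no Hadoop API).
class MaxSalarySketch {

    // Map step: turn one CSV line into (deptNo, "sal,empNo"), or null to skip it.
    static Map.Entry<Integer, String> map(String line) {
        // Assumption: the header is identified by content, not by position.
        if (line.isBlank() || line.startsWith("EMPNO")) {
            return null;
        }
        // A limit of -1 keeps empty fields such as a missing MANAGERID.
        String[] f = line.split(",", -1);
        if (f.length < 8 || f[5].trim().isEmpty()) {
            return null; // malformed row or missing salary
        }
        int deptNo = Integer.parseInt(f[7].trim());
        return Map.entry(deptNo, f[5].trim() + "," + f[0].trim());
    }

    // Reduce step: for one department, keep the value with the highest salary.
    static String reduce(List<String> values) {
        double bestSal = Double.NEGATIVE_INFINITY;
        int bestEmp = -1;
        for (String v : values) {
            String[] p = v.split(",");
            double sal = Double.parseDouble(p[0]);
            int emp = Integer.parseInt(p[1]);
            // Assumed tie-break: on equal salary keep the smaller EMPNO.
            if (sal > bestSal || (sal == bestSal && emp < bestEmp)) {
                bestSal = sal;
                bestEmp = emp;
            }
        }
        return bestSal + "," + bestEmp;
    }

    // Stand-in for the shuffle: group map output by department, then reduce each group.
    static Map<Integer, String> run(List<String> lines) {
        Map<Integer, List<String>> grouped = new TreeMap<>();
        for (String line : lines) {
            Map.Entry<Integer, String> kv = map(line);
            if (kv != null) {
                grouped.computeIfAbsent(kv.getKey(), k -> new ArrayList<>()).add(kv.getValue());
            }
        }
        Map<Integer, String> out = new TreeMap<>();
        grouped.forEach((dept, vals) -> out.put(dept, reduce(vals)));
        return out;
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
            "EMPNO,EMPNAME,JOB,MANAGERID,HIREDATE,SAL,COMMISSION,DEPTNO",
            "7369,SMITH,CLERK,7902,13-06-13,800,0,20",
            "7698,BLAKE,MANAGER,7839,11-06-14,2850,,30",
            "7788,SCOTT,ANALYST,7566,05-03-14,3000,,20",
            "7839,KING,PRESIDENT,,09-06-14,5000,0,10",
            "7902,FORD,ANALYST,7566,05-12-14,3000,,20",
            "7654,MARTIN,SALESMAN,7698,05-12-14,1250,1400,30");
        // Prints one line per department: Dept, max(salary), EmpNo.
        run(lines).forEach((dept, v) -> System.out.println(dept + "\t" + v.replace(",", "\t")));
    }
}
```

Emitting the salary and employee number together as the map value is what lets one job produce all three output columns: the reducer sees every (salary, EmpNo) pair for a department and only has to keep the best one. Department 20 has two employees at 3000 (SCOTT and FORD), so some explicit tie-break is worth deciding on, since the order in which values reach a reducer should not be relied upon.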