Question
Create an Oozie Workflow that includes a MapReduce Action.
In this MapReduce Action, write a MapReduce program that finds, for each department, the maximum salary and the ID of the employee who earns it.
Input Dataset:
EMPNO,EMPNAME,JOB,MANAGERID,HIREDATE,SAL,COMMISSION,DEPTNO
7369,SMITH,CLERK,7902,13-06-13,800,0,20
7499,ALLEN,SALESMAN,7698,15-08-14,1600,300,30
7521,WARD,SALESMAN,7698,26-03-14,1250,500,30
7566,JONES,MANAGER,7839,31-10-14,2975,,20
7698,BLAKE,MANAGER,7839,11-06-14,2850,,30
7782,CLARK,MANAGER,7839,14-05-14,2450,,10
7788,SCOTT,ANALYST,7566,05-03-14,3000,,20
7839,KING,PRESIDENT,,09-06-14,5000,0,10
7844,TURNER,SALESMAN,7698,04-06-14,1500,0,30
7876,ADAMS,CLERK,7788,04-06-14,1100,,20
7900,JAMES,CLERK,7698,23-06-14,950,,30
7934,MILLER,CLERK,7782,21-01-14,1300,,10
7902,FORD,ANALYST,7566,05-12-14,3000,,20
7654,MARTIN,SALESMAN,7698,05-12-14,1250,1400,30
Dataset Description:
COLUMN NAME   NULL?      TYPE
EMPNO         NOT NULL   NUMBER(4)
EMPNAME       NOT NULL   STRING
JOB           NOT NULL   STRING
MANAGERID                INT
HIREDATE                 DATE
SAL                      DOUBLE
COMMISSION               DOUBLE
DEPTNO        NOT NULL   INT
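Because MANAGERID, HIREDATE, SAL and COMMISSION are nullable, a parser has to tolerate empty fields. The following is a minimal sketch in plain Python of how one row could be turned into a typed record. The names `Employee` and `parse_employee` are hypothetical, and the `DD-MM-YY` date layout is an assumption inferred from the sample rows, not something the question states.

```python
from dataclasses import dataclass
from datetime import date, datetime
from typing import Optional


@dataclass
class Employee:
    empno: int
    empname: str
    job: str
    managerid: Optional[int]      # nullable per the dataset description
    hiredate: Optional[date]      # nullable
    sal: Optional[float]          # nullable
    commission: Optional[float]   # nullable
    deptno: int


def parse_employee(line: str) -> Employee:
    """Parse one comma-separated data row (not the header) into an Employee."""
    fields = [f.strip() for f in line.split(",")]
    if len(fields) != 8:
        raise ValueError(f"expected 8 fields, got {len(fields)}: {line!r}")
    empno, empname, job, managerid, hiredate, sal, commission, deptno = fields
    return Employee(
        empno=int(empno),
        empname=empname,
        job=job,
        # Empty strings stand for NULL in the nullable columns.
        managerid=int(managerid) if managerid else None,
        # Assumption: dates are DD-MM-YY, e.g. 31-10-14 is 31 October 2014.
        hiredate=datetime.strptime(hiredate, "%d-%m-%y").date() if hiredate else None,
        sal=float(sal) if sal else None,
        commission=float(commission) if commission else None,
        deptno=int(deptno),
    )


if __name__ == "__main__":
    print(parse_employee("7839,KING,PRESIDENT,,09-06-14,5000,0,10"))
```

Only EMPNO, SAL and DEPTNO are needed for the task itself; the full record is shown to make the nullable columns explicit.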
Output: The result should include the columns Dept, max(salary), and EmpNo.
Constraints:
Use a single MapReduce job to complete the task.
Skip the header row while reading the file.
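One way to satisfy both constraints is to have the map step emit `(DEPTNO, (SAL, EMPNO))` and let the reduce step keep the largest pair per department, so a single pass is enough. The sketch below models that logic in plain Python; it is not Hadoop API code, and `map_line`, `reduce_dept` and `run_job` are hypothetical names. Two assumptions are made: the header is detected by its `EMPNO,` prefix, and salary ties (SCOTT and FORD in department 20) are broken by the lowest EMPNO, which the question does not specify.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

HEADER_PREFIX = "EMPNO,"  # assumption: the header row starts with this text

SAMPLE = """EMPNO,EMPNAME,JOB,MANAGERID,HIREDATE,SAL,COMMISSION,DEPTNO
7369,SMITH,CLERK,7902,13-06-13,800,0,20
7499,ALLEN,SALESMAN,7698,15-08-14,1600,300,30
7521,WARD,SALESMAN,7698,26-03-14,1250,500,30
7566,JONES,MANAGER,7839,31-10-14,2975,,20
7698,BLAKE,MANAGER,7839,11-06-14,2850,,30
7782,CLARK,MANAGER,7839,14-05-14,2450,,10
7788,SCOTT,ANALYST,7566,05-03-14,3000,,20
7839,KING,PRESIDENT,,09-06-14,5000,0,10
7844,TURNER,SALESMAN,7698,04-06-14,1500,0,30
7876,ADAMS,CLERK,7788,04-06-14,1100,,20
7900,JAMES,CLERK,7698,23-06-14,950,,30
7934,MILLER,CLERK,7782,21-01-14,1300,,10
7902,FORD,ANALYST,7566,05-12-14,3000,,20
7654,MARTIN,SALESMAN,7698,05-12-14,1250,1400,30""".splitlines()


def map_line(line: str) -> Iterator[Tuple[int, Tuple[float, int]]]:
    """Map step: emit (deptno, (sal, empno)); skip header, blanks, missing SAL."""
    line = line.strip()
    if not line or line.startswith(HEADER_PREFIX):
        return
    fields = line.split(",")
    if len(fields) != 8 or not fields[5]:
        return  # malformed row, or SAL is NULL
    yield int(fields[7]), (float(fields[5]), int(fields[0]))


def reduce_dept(deptno: int, values: Iterable[Tuple[float, int]]) -> Tuple[int, float, int]:
    """Reduce step: highest salary wins; on a tie, the lowest EMPNO (assumption)."""
    best_sal, best_empno = max(values, key=lambda v: (v[0], -v[1]))
    return deptno, best_sal, best_empno


def run_job(lines: Iterable[str]) -> List[Tuple[int, float, int]]:
    """Stand-in for the shuffle: group mapper output by key, then reduce each key."""
    grouped: Dict[int, List[Tuple[float, int]]] = defaultdict(list)
    for line in lines:
        for deptno, value in map_line(line):
            grouped[deptno].append(value)
    return [reduce_dept(deptno, grouped[deptno]) for deptno in sorted(grouped)]


if __name__ == "__main__":
    for dept, sal, empno in run_job(SAMPLE):
        print(f"{dept}\t{sal}\t{empno}")
    # prints:
    # 10	5000.0	7839
    # 20	3000.0	7788
    # 30	2850.0	7698
```

Checking the line content rather than the line position means the header is skipped no matter which input split it lands in. Because taking a maximum is associative, the same reduce function could also serve as a combiner in a real job.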
Expected Solution: You need to paste