Question
Referencing Styles : Harvard I need help by providing me with instructions and hadoop command to carry out the following tasks in Hadoop: 1. Upload
Referencing Styles : Harvard
I need help by providing me with instructions and hadoop command to carry out the following tasks in Hadoop:
1. Upload the dataset CIS_FacultyList.csv into HDFS storage on the cluster to your designated storage space.
2. Use Pig to create new datasets from the source file that categorises the instructors using the following criteria:
a. The degree level Bachelors, Masters or Doctorate
b. Number of years of teaching less than 5 years, or more than 5 years
c. Whether the last degree was obtained from North America, Europe or elsewhere HINT: Consider using the Pig Latin Split (Partition), For each and Group statement constructs.
3. Move the datasets from HDFS back to local file system storage.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started