Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Problem statement: In this case study, we are giving a real world example of how to use HIVE on top of the HADOOP for different
Problem statement:
In this case study, we are giving a real world example of how to use HIVE on top of the HADOOP for different exploratory data analysis. In here, we have a predefined dataset (2018_Yellow_Taxi_Trip_Data.csv) having more than 15 columns and more than 100000 records in it. The dataset has different attributes like
- vendor_id string,
- pickup_datetime string,
- dropoff_datetime string,
- passenger_count int,
- trip_distance DECIMAL(9,6),
- pickup_longitude DECIMAL(9,6),
- pickup_latitude DECIMAL(9,6),
- rate_code int,
- store_and_fwd_flag string,
- dropoff_longitude DECIMAL(9,6),
- dropoff_latitude DECIMAL(9,6),
- payment_type string,
- fare_amount DECIMAL(9,6),
- extra DECIMAL(9,6),
- mta_tax DECIMAL(9,6),
- tip_amount DECIMAL(9,6),
- tolls_amount DECIMAL(9,6),
- total_amount DECIMAL(9,6),
- trip_time_in_secs int
Perform taxi trip analysis by solving the questions below:
- What is the total Number of trips ( equal to the number of rows)?
- What is the total revenue generated by all the trips? The fare is stored in the columntotal_amount.
- What fraction of the total is paid for tolls? The toll is stored in tolls_amount.
- What fraction of it is driver tips? The tip is stored in tip_amount.
- What is the average trip amount?
- What is the average distance of the trips? Distance is stored in the column trip_distance.
- How many different payment types are used?
- For each payment type, display the following details:
- Average fare generated
- Average tip
- Average tax - tax is stored in column mta_tax
- On average which hour of the day generates the highest revenue?
Note:The information about the dataset is given to you in the data information file. Get the Cloud lab access from Cloudxlab before starting this project.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started