Answered step by step
Verified Expert Solution
Question
1 Approved Answer
You are tasked with improving the data quality and observability of a dataset containing customer orders for an e - commerce website. The dataset contains
You are tasked with improving the data quality and observability of a dataset containing customer
orders for an ecommerce website. The dataset contains records, and you've identified several data
quality issues:
Duplicate Orders: There are duplicate orders in the dataset.
Missing Values: In the "Shipping Address" field, records have missing or incomplete addresses
Inconsistent Date Formats: The "Order Date" field contains dates in both MMDDYYYY and
YYYYMMDD formats. There are records with inconsistent date formats.
Outliers in Order Amount: There are orders with an unusually high order amount that seems like
outliers.
a Explain why data quality and observability are essential in this context.
b Design and compute four data quality metrics based on the given data.
c Propose a stepbystep plan to address these data quality issues and enhance data observability.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started