Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

There will always be anomalies in data that can create gaps in our analytics. We call these outliers, as they tend to be well outside

There will always be anomalies in data that can create gaps in our analytics. We call these outliers, as they tend to be well outside of the normal distribution of the data.
 

What steps can we take to smooth this data over when we see it? Should we simply delete the outlier, or are there other tactics we can take in order to normalize for such a huge dispersion? 

 

For example, in a Netflix data set, if it showed that someone was 185 years old, we can safely conclude this is an error. Should we get rid of that entry entirely, or are there ways to preserve the data?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Handling outliers in data is an important step in data preprocessing and analysis While outliers can be disruptive to statistical analyses and models it is generally not advisable to simply delete the... blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Systems analysis and design

Authors: Alan Dennis, Barbara Haley Wixom, Roberta m. Roth

5th edition

978-1118057629, 1118057627, 978-111880817

More Books

Students also viewed these General Management questions

Question

14. Let X be uniform over (0, 1). Find E[X|X Answered: 1 week ago

Answered: 1 week ago