Answered step by step
Verified Expert Solution
Question
1 Approved Answer
table [ [ Age , Salary, table [ [ Education ] , [ level ] ] , Class,Prob ( yes ) ] ,
tableAgeSalary,tableEducationlevelClass,ProbyesYoungKPhDYes,OldKBScNoYoungKMScYes,OldKBScNoYoungKMScYes,OldKMScNoYoungKPhDYes,OldKBScNoOldKPhDYes,YoungKBScNoOldKBScYes,YoungKBScNoa Consider the following dataset where each row instance represents an
applicant to a loan, described by three predictive attributes Age Salary,
Education level and the Class variable, indicating whether or not an applicant
was granted a loan class labels yes and no The column Probyes shows
the probability that each applicant belongs to the class yes computed by a
certain classification algorithm.
Age Salary Education
level
Class Probyes
Young K PhD Yes
Old K BSc No
Young K MSc Yes
Old K BSc No
Young K MSc Yes
Old K MSc No
Young K PhD Yes
Old K BSc No
Old K PhD Yes
Young K BSc No
Old K BSc Yes
Young K BSc No
Assume that Age is a sensitive protected attribute, ie the distribution of the
class labels yes and no should not depend on the applicants age. Note
that in this dataset the distribution of the label yes is unfair with respect to
the attribute Age, since out of of young applicants have the label
yes whilst only out of of old applicants have the label yes
Consider the use of the data massaging technique to modify the class labels
of two applicants instances of this dataset, in order to make this dataset
fairer with respect to Age. Describe the application of this technique to this
dataset. Your answer must show all the steps that you performed, as well as
the final result which applicants had their class labels changed
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started