Question:
Suppose we are running a fraud classification model, with a training set of 10,000 records of which only 400 are fraudulent. How many fraudulent records need to be resampled if we would like the proportion of fraudulent records in the balanced data set to be 20%?
Step by Step Answer:
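One way to work this out, assuming "resampled" means adding x duplicated fraudulent records (sampling with replacement) on top of the original 10,000 records: the balanced set then holds 400 + x fraudulent records out of 10,000 + x in total, so we need (400 + x) / (10,000 + x) = 0.20. Solving gives 400 + x = 2,000 + 0.2x, so 0.8x = 1,600 and x = 2,000. The sketch below checks that arithmetic; the function name and parameters are illustrative, not taken from the book.

```python
from fractions import Fraction
import math


def resampled_needed(total_records, rare_records, target_fraction):
    """Return how many rare records to add by resampling so that the rare
    class makes up target_fraction of the enlarged data set.

    Assumes resampled records are added to the original data (oversampling),
    so both the rare count and the total grow by the same amount x:
        (rare + x) / (total + x) = p   =>   x = (p * total - rare) / (1 - p)
    """
    p = Fraction(str(target_fraction))  # exact arithmetic, avoids float rounding
    if not 0 < p < 1:
        raise ValueError("target_fraction must be strictly between 0 and 1")
    x = (p * total_records - rare_records) / (1 - p)
    # Round up to a whole record; never return a negative count.
    return max(0, math.ceil(x))


x = resampled_needed(10_000, 400, 0.20)
print(x)  # 2000
```

With x = 2,000 the balanced set has 2,400 fraudulent records out of 12,000, which is exactly 20%.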
Related Book For
Discovering Knowledge In Data An Introduction To Data Mining
ISBN: 9780470908747
2nd Edition
Authors: Daniel T. Larose, Chantal D. Larose