Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Predict if a transaction is fraudulent or not. The file transactions.csv contains all transactions. The file solely contains numerical input variables resulting from a
Predict if a transaction is fraudulent or not. The file "transactions.csv contains all transactions. The file solely contains numerical input variables resulting from a PCA transformation. Original features cannot be supplied due to privacy concerns. The main components derived with PCA are features V V V; the only features that have not been changed with PCA are 'Time' and 'Amount'. The 'Time' feature contains the number of seconds that have passed between each transaction and the first transaction in the dataset. The feature 'Amount' represents the transaction Amount; this feature can be utilised for costsensitive learning, for example. The answer variable is called 'Class, and it has a value of if there is fraud and otherwise.
Examine the dataset. Do you believe the dataset is balanced? Evaluate and compare the XGBoost and RandomForest algorithms without and after data balancing with SMOTE. For hyperparameter optimisation, you can compare performance using the F score and apply fold crossvalidation.
Which of the four situations Random Forest, XGBoost, Random Forest SMOTE, XGBoost SMOTE performs the best? Discuss.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started