Question
Task 1 Apply suitable data pre-processing operations on the data: 1. Identify in your report (not just in your R code
Task 1
Apply suitable data preprocessing operations on the data:
Identify in your report (not just in your R code) what preprocessing operations
you performed.
points
Produce R code in RStudio that implements these preprocessing operations.
points
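
The assignment does not name its dataset, so the following is only a minimal sketch of what the preprocessing code might look like. It assumes the built-in `iris` data as a stand-in and injects a few missing values purely for illustration; the chosen steps (duplicate removal, median imputation, factor conversion, standardisation) are common options, not necessarily the ones the real data calls for.

```r
# Task 1 sketch: generic preprocessing on a stand-in dataset.
# ASSUMPTION: `iris` replaces the assignment's (unspecified) dataset.
data(iris)
raw <- iris

# Inject a few missing values so the imputation step has something to do
# (illustration only; real data would come with its own gaps, if any).
set.seed(42)
raw$Sepal.Length[sample(nrow(raw), 5)] <- NA

# 1. Remove exact duplicate rows.
clean <- raw[!duplicated(raw), ]

# 2. Impute missing numeric values with the column median.
num_cols <- sapply(clean, is.numeric)
for (col in names(clean)[num_cols]) {
  col_median <- median(clean[[col]], na.rm = TRUE)
  clean[[col]][is.na(clean[[col]])] <- col_median
}

# 3. Make sure the class label is stored as a factor.
clean$Species <- factor(clean$Species)

# 4. Standardise numeric features to mean 0 and standard deviation 1.
clean[num_cols] <- lapply(clean[num_cols], function(x) as.numeric(scale(x)))

print(paste("Rows before:", nrow(raw), "- rows after:", nrow(clean)))
print(paste("Missing values remaining:", sum(is.na(clean))))
print(summary(clean))
```

Standardisation matters little for tree-based classifiers, so step 4 could reasonably be dropped if only such models are used; whichever steps are kept should also be listed in the report.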
Task 2
Produce R code for implementing the classifiers in the preprocessed dataset.
points
Produce rule bases for your classifiers. Save these rule bases into your pdf
report document and present them under the heading Task
points
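
As a rough illustration of producing a rule base, the sketch below fits decision trees with `rpart` and turns each leaf into an IF-THEN rule. The dataset (`iris`), the choice of two trees with different split criteria, and the helper `print_rules` are all assumptions made for this example; the assignment's actual classifiers are not specified in the text.

```r
# Task 2 sketch: decision-tree classifiers and their rule bases.
library(rpart)

data(iris)  # ASSUMPTION: stand-in for the preprocessed assignment dataset

# ASSUMPTION: two illustrative classifiers (trees with different split criteria).
fit_gini <- rpart(Species ~ ., data = iris, method = "class",
                  parms = list(split = "gini"))
fit_info <- rpart(Species ~ ., data = iris, method = "class",
                  parms = list(split = "information"))

# Hypothetical helper: prints one IF-THEN rule per leaf of an rpart tree
# and returns the number of rules invisibly.
print_rules <- function(fit, label) {
  print(paste("Rule base for", label))
  is_leaf <- fit$frame$var == "<leaf>"
  leaves  <- as.numeric(rownames(fit$frame)[is_leaf])
  # Predicted class of each leaf, in the same order as `leaves`.
  classes <- attr(fit, "ylevels")[fit$frame$yval[is_leaf]]
  # path.rpart() returns the split conditions from the root to each node.
  paths <- path.rpart(fit, nodes = leaves, print.it = FALSE)
  for (i in seq_along(paths)) {
    conditions <- paths[[i]][-1]  # drop the leading "root" entry
    print(paste("IF", paste(conditions, collapse = " AND "),
                "THEN class =", classes[i]))
  }
  invisible(length(paths))
}

print_rules(fit_gini, "rpart tree (Gini split)")
print_rules(fit_info, "rpart tree (information split)")
```

The printed rules can be copied into the report as they are. Other rule learners would need their own extraction step.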
Task 3
In this task we will use fold cross-validation (cv) with repeats to obtain an
accurate assessment of a classifier's accuracy. The cv strategy takes into account
the variation of data across a dataset and gives a more reliable assessment of accuracy.
Once all the folds are in place, you will carry out an F test to compare the types
of classifiers.
Collect the accuracy across the classifiers into vectors from the different folds.
Carry out an F test in R (see "One-way ANOVA: When and How to Use It (With Examples)",
scribbr.com) on the vectors.
Results of the F test may reveal that the classifiers have significantly different
accuracies at a level of confidence, or that they have no significant difference. If
the F test reveals a significant difference, then the next step is to determine
which one of them has the highest accuracy. This can be done with the help of the
Tukey post hoc test, which is covered on the same website as the one given for
the one-way ANOVA.
Produce the R code for this task.
points
Copy and paste the ANOVA table into your pdf report.
points
Identify which classifier, if any, has the highest accuracy. If the
F test reveals that they are significantly different, use the Tukey test here.
Note that this part is not needed if all classifiers are flagged by the F test as
having the same level of accuracy.
points
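
The procedure described in this task could be sketched roughly as follows. The fold count (10), repeat count (3), significance level (0.05), the `iris` stand-in dataset and the three `rpart`-based classifiers are all assumptions, since the concrete values are not given in the text above.

```r
# Task 3 sketch: repeated k-fold cross-validation, one-way ANOVA, Tukey test.
library(rpart)

data(iris)       # ASSUMPTION: stand-in for the preprocessed assignment dataset
set.seed(123)    # reproducible fold assignment
k       <- 10    # ASSUMPTION: number of folds
repeats <- 3     # ASSUMPTION: number of repeats
alpha   <- 0.05  # ASSUMPTION: significance level for the F test

# ASSUMPTION: three illustrative classifiers, each a function that fits a model.
fitters <- list(
  tree_gini = function(d) rpart(Species ~ ., data = d, method = "class"),
  tree_info = function(d) rpart(Species ~ ., data = d, method = "class",
                                parms = list(split = "information")),
  stump     = function(d) rpart(Species ~ ., data = d, method = "class",
                                control = rpart.control(maxdepth = 1))
)

# Collect one accuracy value per classifier, per fold, per repeat.
results <- data.frame()
for (r in seq_len(repeats)) {
  # Randomly assign every row to one of k folds of near-equal size.
  folds <- sample(rep(seq_len(k), length.out = nrow(iris)))
  for (f in seq_len(k)) {
    train <- iris[folds != f, ]
    test  <- iris[folds == f, ]
    for (name in names(fitters)) {
      fit  <- fitters[[name]](train)
      pred <- predict(fit, newdata = test, type = "class")
      results <- rbind(results,
                       data.frame(classifier = name,
                                  accuracy = mean(pred == test$Species)))
    }
  }
}
results$classifier <- factor(results$classifier)

# One-way ANOVA (F test) on accuracy grouped by classifier.
anova_model <- aov(accuracy ~ classifier, data = results)
anova_table <- summary(anova_model)
print("ANOVA table:")
print(anova_table)

# Run the Tukey post hoc test only if the F test is significant.
p_value <- anova_table[[1]][["Pr(>F)"]][1]
if (p_value < alpha) {
  print("Significant difference found; Tukey HSD comparisons:")
  print(TukeyHSD(anova_model))
  print("Mean accuracy per classifier:")
  print(aggregate(accuracy ~ classifier, data = results, FUN = mean))
} else {
  print("No significant difference between the classifiers.")
}
```

Fold accuracies from repeated cv are not fully independent, because the training sets overlap, so the ANOVA result is best read as an approximate comparison.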
For this submission, your R code should be clearly commented, and each task should be labeled. Add
print statements to your code to clearly demonstrate it working. Your code should be able to run
directly in RStudio without any compilation errors or warnings. This task should not require any user
input.