Question
You can read about the ECML/PAKDD discovery challenge 2006 which dealt with email spam detection here: http://www.ecmlpkdd2006.org/challenge.html. Your task is to download the dataset for
You can read about the ECML/PAKDD discovery challenge 2006 which dealt with email spam detection here: http://www.ecmlpkdd2006.org/challenge.html. Your task is to download the dataset for task A from http://www.ecmlpkdd2006.org/data_task_a.zip.
Train a Naive Bayes classifier using the data found in task_a_labeled_train.tf file. Divide the data into a training (70%) and a test sample (30%) and test the performance of your classifier on the test sample.
Also test the performance of your classifier on the data found in task_a_u00_tune.tf and comment on your finding.
In your report, briefly describe how you approached the problem, what results you obtained, what practical difficulties you faced, and how you overcame these difficulties.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started