Question: Assignment 1: Duta exploration and preparation Dataset Description: You will work on Credit dataset. The dataset classifies people by a set of attributes as good

Assignment 1: "Duta exploration and preparation" Dataset Description: You will work on Credit dataset. The dataset classifies people by a set of attributes as good or bad credit risks. The dataset includes 750 examples and cach example is described by 20 attributes and a class label Data sets: Each group is assigned an individual dataset. Please, download the "Credit-Dataset" that is linked to your group name (es. group is assigned Credit Dataset().csv") Tasks: Your tasks include: A. Initial data exploration Al. Identify the type of each attribute (nominal, ordinal, asymmetric binary, symmetric binary, interval or ratio). A2. Using Weka, explore your data set and identify any outliers A3. Using Weka, explore your data set and identify any patterns. Hint: please consider scatter plots B. Data pre-processing BI. Use the following binning techniques to smooth the values of the duration" attribute: equi-width binning (3 bins). equi-depth binning (3 bins). B2. Use the following techniques to normalise the credit_amount" attribute min-max normalization to transform the values onto the range (0.0-1.0). 2-score normalization to transform the values B3. Discretise the "Age" attribute into the following categories: Teenager = 1-16 Young - 17-35: Mid Age = 36-55: Mature - 56-70; Old - 71 Provide the frequency of each category in your data set. C. Association Rules Mining Use Association rule techniques to CI. Extract and evaluate possible associations C2 Explain three selected rules The delivery for this assignment is a report. In the report include a section (starting with a section title) for cach of the tasks in this assignment including tasks A, B and C. Assignment 1: "Duta exploration and preparation" Dataset Description: You will work on Credit dataset. The dataset classifies people by a set of attributes as good or bad credit risks. The dataset includes 750 examples and cach example is described by 20 attributes and a class label Data sets: Each group is assigned an individual dataset. Please, download the "Credit-Dataset" that is linked to your group name (es. group is assigned Credit Dataset().csv") Tasks: Your tasks include: A. Initial data exploration Al. Identify the type of each attribute (nominal, ordinal, asymmetric binary, symmetric binary, interval or ratio). A2. Using Weka, explore your data set and identify any outliers A3. Using Weka, explore your data set and identify any patterns. Hint: please consider scatter plots B. Data pre-processing BI. Use the following binning techniques to smooth the values of the duration" attribute: equi-width binning (3 bins). equi-depth binning (3 bins). B2. Use the following techniques to normalise the credit_amount" attribute min-max normalization to transform the values onto the range (0.0-1.0). 2-score normalization to transform the values B3. Discretise the "Age" attribute into the following categories: Teenager = 1-16 Young - 17-35: Mid Age = 36-55: Mature - 56-70; Old - 71 Provide the frequency of each category in your data set. C. Association Rules Mining Use Association rule techniques to CI. Extract and evaluate possible associations C2 Explain three selected rules The delivery for this assignment is a report. In the report include a section (starting with a section title) for cach of the tasks in this assignment including tasks A, B and C

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Study Guide Strategic Business Management By A. J. Cataldo About the Author A. J. Cataldo is currently a professor of accounting at West Chester University, in West Chester, Pennsylvania. He holds a...

Analytics Report Overview The purpose of this task is to provide students with practical experience in writing a data analytical report to provide useful insights, pattern and trends in a chosen...

Background Lois Griffon, 36 years old, is a senior IT programmer with the Calgary Board of Education at its head office, downtown, at Macleod Trail and 5" Ave SE. Her husband, Peter, is 33 years of...

ISSUES IN ACCOUNTING EDUCATION Vol. 26, No. 3 2011 pp. 521-545 American Accounting Association DOI: 10.2308/iace-50031 Breach of Data at TJX: An Instructional Case Used to Study COSO and COBIT, with...

DATA REPORT published: 04 November 2015 doi: 10.3389/fpsyg.2015.01694 A trait prole of top and middle managers Anna K. Baczynska 1* and Tomasz Rowinski 2 1 Department of Management, Kozminski...

Kindly provide answer and supporting workings for Qn 1b & Qn 3a,3b for ACC217. ACC217 Accounting Information Systems Assignment 2 - Group-based Assignment January 2017 Presentation ACC217 Assignment...

Follow the steps given in Machine Learning With R , Chapter 5, section "Example Identifying Risky Bank Loans Using C5.0 Decision Trees." download the credit. csv file from Packt Publishing's website...

I have attached all documents, including the homework question. I can answer question 1, but I need help on 2 and 3. ACT 5140 ? Accounting for Decision Makers HW #2 ? Chapter 9 Question #1 List and...

See attached for instructions and powerpoints. Directions: Answer all the questions. Please submit your work in Word or PDF formats only. You can submit an Excel file to support calculations, but...

Abstract This article describes CRISP-DM (Cross-Industry Sandand Process for Data Mining), a non-proprietary, documented, and freely available data mining model. Dezeloped by indias- try leaders...

A combination of explanation of fraud examination and the fraud triangle. You will explain this by including introduction, discussion on the fraud triangle and how you, as a fraud examiner, you might...

Let G = (V, E) be a bipartite graph with V partitioned as X Y, where X = {x1, x2, . . ., xm] and Y = {x1, x2, . . . , xn}- How many complete matchings of X into Y are there if (a) m = 2, n = 4, and...

Imagine that your job requires you to pair individuals needing a credit card with one fitting their income and a desired interest rate. What role are you fulfilling? wholesaler agent retailer broker

Current Attempt in Progress The controller for Clint Tamarisk Co . is attempting to determine the amount of cash to be reported on its December 3 1 , 2 0 2 5 , balance sheet. The following...

KEY QUESTION Identify and state the significance of each of the following: ( a ) WTO; ( b ) EU; ( c ) euro; ( d ) NAFTA. What commonality do they share?

KEY QUESTION What are the two characteristics of public goods? Explain the significance of each for public provision as opposed to private provision. What is the free-rider problem as it relates to...

KEY QUESTION What are the three major legal forms of business organization? Which form is the most prevalent in terms of numbers? Why do you think that is so? Which form is dominant in terms of total...