file C Users 1 4 0 9 3 OneDrive credit 2 pdf Use Rstudio and give 3 sentences explaining each step of code This assignment asks you to examine the Support Vector Machine for classification Provide your answers to the questions in a Word document named Assign 4 LastName doc along with your source code that should be saved as Assign 4 LastName, and click the title link to upload and submit them The attached dataset contains German credit data To apply Support Vector Machines, the data requires preprocessing, such as data type transformation and normalization For data type transformations, we mainly perform factoring of the categorical variables, where we transform the data type of the categorical features from numeric to factor For example, code line 5 data preprocessing code line 6 data type transformation factoring coding line 7 to Factor function ( df , variables ) coding line 8 for ( variable in variables ) coding line 9 df variable as factor ( df variable ) coding line 1 0 coding line 1 1 return ( df ) coding line 1 2 There are several numeric variables, which include credit amount, age, and credit duration months Please check their distributions using histogram If they are skewed distributions, please normalize the data One possible way is using z normalization as follows coding line 1 4 Normalization scaling coding line 1 5 scales features function ( df , variables ) coding line 1 6 for ( variable in variables ) coding line 1 7 df variable scale ( df variable , center T , scale T ) coding line 1 8 coding line 1 9 return ( df ) coding line 2 0 You can pass some variables to the above functions to transform and normalize the data, such as the following example code Normalize variable numeric var c ( your variables 0 yourData scale features ( yourData , numeric var ) You can apply a similar method to transform your data Once the preprocessing is completed, partition the data randomly into training and testing sets using a 6 4 ratio Use the training set to fit a model and the testing set to assess the model performance Develop a model using various techniques you ve learned from the lecture, and propose and explain the best model There are three numeric variables credit duration month, amount, and age Plot the histogram of the variables ( 2 0 pts ) Normalize the above variables, plot the histogram of the variables again ( 2 0 pts ) Normalize and factorize the appropriate variables, and split in a 6 4 ratio for train test Use the training set, and create svm model using e 1 0 7 1 package svm Test the generated model on the test dataset, and explain the results of trained model and the comparison of the original dataset and tested model using pred ( 2 0 pts ) Repeat ( c ) but with linear kernel and non linear kernel ( 4 0 pts )

The Answer is in the image, click to view ...

Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 22, 2024

file: / / / C: / Users / 1 4 0 9 3 / OneDrive / credit 2 . pdf Use Rstudio and give 3

file:

/ / /

/

Users

/ 14093 /

OneDrive

/

credit

2 .

pdf Use Rstudio and give

3

sentences explaining each step of code. This assignment asks you to examine the Support Vector Machine for classification. Provide your answers to the questions in a Word document named Assign

4_

LastName.doc along with your source code that should be saved as Assign

4_

LastName, and click the title link to upload and submit them.

The attached dataset contains German credit data. To apply Support Vector Machines, the data requires preprocessing, such as data type transformation and normalization. For data type transformations, we mainly perform factoring of the categorical variables, where we transform the data type of the categorical features from numeric to factor.For example,

code line

5

: #data preprocessing; code line

6

: #data type transformation

-

factoring; coding line

7

: to

.

Factor

< -

function

(

,

variables

) {

; coding line

8

: for

(

variable in variables

) {

; coding line

9

: df

[[

variable

]] <_

.

factor

(

[[

variable

]])

; coding line

10

}

; coding line

11

: return

(

)

; coding line

12

}

There are several numeric variables, which include credit.amount, age, and credit.duration.months. Please check their distributions using histogram. If they are skewed distributions, please normalize the data. One possible way is using z

-

normalization as follows:

coding line

14

: #Normalization

-

scaling; coding line

15

: scales.features

<_

function

(

,

variables

) {

; coding line

16

: for

(

variable in variables

) {

; coding line

17

: df

[[

variable

]] <_

scale

(

[[

variable

]],

center

=

,

scale

=

)

; coding line

18

}

; coding line

19

: return

(

)

; coding line

20

}

You can pass some variables to the above functions to transform and normalize the data, such as the following example:

code: #Normalize variable numeric.var

< -

("

your variables"

0

yourData

< -

scale.features

(

yourData

,

numeric.var

)

You can apply a similar method to transform your data.

Once the preprocessing is completed, partition the data randomly into training and testing sets using a

6

4

ratio. Use the training set to fit a model and the testing set to assess the model performance.

Develop a model using various techniques you

ve learned from the lecture, and propose and explain the best model.

There are three numeric variables: credit duration month, amount, and age. Plot the histogram of the variables.

(20

pts

)

Normalize the above variables, plot the histogram of the variables again.

(20

pts

)

Normalize and factorize the appropriate variables, and split in a

6

4

ratio for train:test. Use the training set, and create svm model using e

1071

package svm

.

Test the generated model on the test dataset, and explain the results of trained model and the comparison of the original dataset and tested model using pred

(20

pts

) .

Repeat

(

)

but with

linear kernel

and

non

-

linear kernel

(40

pts

) .

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Sql Practice Problems 57 Beginning Intermediate And Advanced Challenges For You To Solve Using A Learn By Doing Approach

Authors: Sylvia Moestl Vasilik

1st Edition

★★★★★

e. Based on this experience, identify some characteristics that may be important for successful intercultural communication.

Answered: 1 week ago

Previous Question Next Question