Answered step by step
Verified Expert Solution
Question
00
1 Approved Answer
( c ) A clasification tree, a SVM clnssifier with GRBF kernel, and a randen farest dasifier are compared and tuned using the caret R
c A clasification tree, a SVM clnssifier with GRBF kernel, and a randen farest dasifier are compared and tuned using the caret porknge. The classifiers are compared using a fold crossvalidation procedure replicsated times, over approprintely specified grids of vulues of their associnted hyperparame ters. The following figures report the avernge validation nocunacy as a function of the hyperparameters of the classification tree, of the SVM classifier, and of the randoen forest clnssifier. Lampataxy renameter Random forest i How many clossifers have been trained in total? Justify your answer. ii Which is the best clnsifier and what are its optimal hyperparameters? iii Are there any issues with the tuning procedure of any of the clasifiers? If yes, what solution would you suggest? d A team of engineers gnthered data on different material characteristics of a specific mechanical component usad in the construction of wind turbines, with the purpose of developing a machine learning system capable of detecting falty components, so to avoid the use of a potentially defective camponent in the canstruction of a turbine. The percentage of faulty components in the dnta is The team implements two logistic regression models, and defined using different sets of input variables. These models are evulunted and compared bosed on their ahility to detect fanlts in this specific mochanical camponemt of wind turbines. The models produce the following performance metrics The team af engineers cansiders it more costly to empley a malfunctioning component in the canstruction of the wind turbine than to wrongly discard a nondefective component. Using the information nvailable, which of the two models would they prefer? Justify your nnswer.Provide your answer and a concise explanation for soch of the following questions. a Association rule analysis is npplied to a large dntaset concerning food and beverage purchness of n cafe in Dublin. The rule Cof fee Croissant is considered for nnalysis. The rule hns a support meseure of and it wns mined using the npriori algorithm with support threshold set to and confidence threshold set to In the dntn there are transactions involving Coffee, and transacticns involving Croissant. Campute the confidence and lift mensures of this rule. b Data on the chemical analysis of different types of glass of differemt socurce: wndov, vehicle, contaner, tableware, nnd lamp. A clossification tree is trained an these: data with the purpose of predicting the type of glass on the busis of the chemical camponents far criminologiosal investigation. The output from the tree is reported below. node dosotes torainal node root uindow Bar uindow vindow g Cac vindow
c A clasification tree, a SVM clnssifier with GRBF kernel, and a randen farest
dasifier are compared and tuned using the caret porknge. The classifiers
are compared using a fold crossvalidation procedure replicsated times,
over approprintely specified grids of vulues of their associnted hyperparame
ters. The following figures report the avernge validation nocunacy as a function
of the hyperparameters of the classification tree, of the SVM classifier, and of
the randoen forest clnssifier.
Lampataxy renameter
Random forest
i How many clossifers have been trained in total? Justify your answer.
ii Which is the best clnsifier and what are its optimal hyperparameters?
iii Are there any issues with the tuning procedure of any of the clasifiers?
If yes, what solution would you suggest?
d A team of engineers gnthered data on different material characteristics of a
specific mechanical component usad in the construction of wind turbines, with
the purpose of developing a machine learning system capable of detecting
falty components, so to avoid the use of a potentially defective camponent
in the canstruction of a turbine. The percentage of faulty components in the
dnta is The team implements two logistic regression models, and defined using
different sets of input variables. These models are evulunted and compared
bosed on their ahility to detect fanlts in this specific mochanical camponemt
of wind turbines. The models produce the following performance metrics
The team af engineers cansiders it more costly to empley a malfunctioning
component in the canstruction of the wind turbine than to wrongly discard a
nondefective component.
Using the information nvailable, which of the two models would they prefer?
Justify your nnswer.Provide your answer and a concise explanation for soch of the following questions.
a Association rule analysis is npplied to a large dntaset concerning food
and beverage purchness of n cafe in Dublin. The rule Cof fee Croissant
is considered for nnalysis. The rule hns a support meseure of and
it wns mined using the npriori algorithm with support threshold set to
and confidence threshold set to In the dntn there are transactions
involving Coffee, and transacticns involving Croissant. Campute the
confidence and lift mensures of this rule.
b Data on the chemical analysis of different types of glass of differemt socurce:
wndov, vehicle, contaner, tableware, nnd lamp. A clossification tree is
trained an these: data with the purpose of predicting the type of glass on the
busis of the chemical camponents far criminologiosal investigation. The output
from the tree is reported below.
node
dosotes torainal node
root uindow
Bar uindow
vindow
g Cac vindow
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access with AI-Powered Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started