Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

True False ( Each question is 5 points ) . 1 . True or False? Classification algorithms work by examining an input training set of

True False (Each question is 5 points).
1. True or False? Classification algorithms work by examining an input training set of data to learn how the data values combine to create a result. True
2. True or False? Within a machine-learning program, the code will specify, for example, that 70% of the
data will be training data and 30% will be used for testing.
3. True or False? Classification techniques are optimal for binary classifications, which use two target
classes, such as a tumor being malignant or benign, or multiclass classifications, which have multiple
target classes, such as a pet being a dog, cat, or horse.
4. True or False? To create the training and test data sets, the data must contain not only the predictor
data values, but also the correct class assignments for that data.
5. True or False? After you test a model, you can examine the resulting mistake matrix to determine
which classification assignments the model got wrong.
6. True or False? The logarithmic regression classifier, which is best suited for binary classification,
assigns data to classes based upon the function (called a logit) that determines the probability that the
data belongs to the class.
7. True or False? A decision tree is a graph-based data structure that specifies a collection of decision
points. By following a path through the decision points, a decision-tree classifier assigns data to specific
classes.
8. True or False? When you overfit the model, the model may start to treat noise or errant data as valid
training data.
9. True or False? The Nave Bayes classification algorithm is called nave in that it treats the different
data-set attributes as independent and calculates a probability for each.
10. True or False? Regardless of the type of data-analytic project you are considering, the steps you will
perform to get started will be similar.
11. True or False? Data-analytic projects should begin with the question: What do we want to show?
12. True or False? Today, a cost-effective and fast way to get up and running with a scalable data store is to
leverage cloud-based managed services, in a pay-as-you-go environment.
13. True or False? A random-forest classification model creates many different decision trees for a data
set and then, based on each trees prediction, the trees essentially vote to select the tree that produces
the best result.
14. True or False? The support vector machine (SVM), often called SVC (support vector classifier),
classifies data by separating values with a box called a hypersquare.
15. True or False? In data classification, the dependent variable is the class to which the algorithm will
assign the data.
16. True or False? In classification, the accuracy score is the sum of the correct category assignments.
17. Neural networks are at the heart of machine learning and are used for a wide range of applications,
including classification.
18. True or False? Python is one of the worlds most popular programming languages and is used to
create solutions that range from websites, data mining, machine learning, visualization, and more.
19. True or False? Python is an interpreted language, as opposed to a compiled language, for which the
Python interpreter executes one statement at a time.
20. True or False? Unlike other programming languages that use braces {} to group related statements,
Python instead relies on statement indentation to group statements.
21. True or False? Data cleansing is normally a one-time event.
22. True or False? The goal of the data-governance board is to continually improve the quality of data.
23. True or False? Database developers refer to the process of correcting such data inconsistences as
normalizing the data.
24. True or False? The data-quality assessment framework provides eight standard measurements based
on quality dimensions.
25. True or False? Developers often refer to the process of transforming and cleansing data as data
wrangling, or, depending upon with whom you are speaking, data munging.
Multiple Choice (each question is 5 points)
1.
To use a Python library, the library code must exist on your system. To download and install a Python
library, developers use the ____ command. Then, you use the import statement to include the library
code within your script.
A) QLP
B) ELT
C) PIP
D) LOAD
2. The _____ library defines Python data structures and functions that support machine-learning and
data-mining operations such as clustering, classification, and regression.
A) pandas
B) sklearn
C) matplotlib
D) numpy
3. What values will the following statements display?
for i in range(1,11)
print(i)
A)1 to 10
B)1 to 11
C)1 and 11
D) None of these is correct.
4._____ is the process of detecting, correcting, and removing errors and inconsistences from data.
A) Data mining
B) Data cleansing
C) Data washing
D) Data reduction
5.______ is a measure of the dat

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

More Books

Students also viewed these Databases questions