
Question


Posted on Jul 30, 2024


I have created a function using these instructions:

Using the Gower distances in the matrix D computed by the provided function, find the most and least similar pairs of colleges in the dataset (15 points).

Note that an item is most similar to itself (distance = 0), but you need to disallow this case, since we actually care about finding two distinct items, not along the diagonal, that are most similar. One quick way to accomplish this is to replace the zeros along the diagonal of the distance matrix D returned by the gower_distances function with a very large number (e.g. 1000) that wouldn't occur as a distance in practice.
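The diagonal replacement described above can be sketched as follows. This is only an illustration on a made-up 3 x 3 matrix, not the assignment's data; the names toy_D and masked are assumptions introduced here.

```python
import numpy as np

# Toy symmetric distance matrix (invented values, zeros on the diagonal).
toy_D = np.array([
    [0.0, 0.3, 0.8],
    [0.3, 0.0, 0.5],
    [0.8, 0.5, 0.0],
])

# np.fill_diagonal works in place, so mask a copy and keep the original intact.
masked = toy_D.copy()
np.fill_diagonal(masked, 1000)

# The smallest entry is now an off-diagonal distance instead of a self-distance.
smallest_off_diagonal = masked.min()
print(smallest_off_diagonal)
```

One consequence worth keeping in mind: after the replacement, the largest entry of the masked matrix is the placeholder value itself, so a search for the maximum distance is safer on the unmasked matrix.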

You may also find numpy's unravel_index function, in combination with argmax or argmin, useful for finding min/max elements in an array. Remember that the least similar elements will have maximum distance from each other, and the most similar will have minimum distance.
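As a small illustration of the hint above: argmin and argmax return a position in the flattened array, and unravel_index converts that position back to a (row, column) pair. The array A below is invented for the example.

```python
import numpy as np

# Small invented array, 2 rows by 3 columns.
A = np.array([
    [4, 9, 2],
    [7, 1, 8],
])

# Flat position of the smallest and largest elements.
flat_min = np.argmin(A)
flat_max = np.argmax(A)

# Convert the flat positions into (row, column) coordinates.
row_col_min = np.unravel_index(flat_min, A.shape)
row_col_max = np.unravel_index(flat_max, A.shape)

print(row_col_min, row_col_max)
```

Applied to a distance matrix, the coordinates from argmax point at the least similar pair and the coordinates from argmin at the most similar pair.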

Your function should accept as an argument the college dataframe provided above. The gower_distances() function will also utilize this dataframe. See the gower_distances() function definition above.

Your function should return a 2-element tuple, consisting itself of two tuples: the first tuple should be the names (via the College.Name field) of the two colleges that are least similar according to the Gower distance. The second tuple should name the most similar colleges.

My function returns the result below. Why are the elements in the second tuple not distinct?

(('Augustana College IL', 'Hope College'), ('Abilene Christian University', 'Abilene Christian University'))

import numpy as np

def answer_mixed_features_(df):
    # df stands in for the college dataframe; the parameter name did not survive in the post
    D = gower_distances(df)  # a provided function that computes gower distances
    np.fill_diagonal(D, 1000)

    # Find indices of least and most similar pairs
    least_similar_idx = np.unravel_index(np.argmin(D), D.shape)
    most_similar_idx = np.unravel_index(np.argmax(D), D.shape)

    # Get names of least and most similar colleges
    least_similar_colleges = (
        df.iloc[least_similar_idx[0]]["College.Name"],
        df.iloc[least_similar_idx[1]]["College.Name"],
    )
    most_similar_colleges = (
        df.iloc[most_similar_idx[0]]["College.Name"],
        df.iloc[most_similar_idx[1]]["College.Name"],
    )

    return (least_similar_colleges, most_similar_colleges)
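One plausible reading of the output, offered as a sketch and not as a verified solution: once the diagonal holds 1000, the largest value in D is on the diagonal, so argmax returns the first diagonal cell (0, 0) and the same college is named twice. The posted code also assigns argmin to the "least similar" pair and argmax to the "most similar" pair, which is the reverse of the assignment's note that least similar means maximum distance. The helper below, most_and_least_similar, with its parameters D and names and its toy matrix, is hypothetical and stands in for the real gower_distances output and the College.Name column.

```python
import numpy as np


def most_and_least_similar(D, names):
    """Return ((least similar pair), (most similar pair)) from a square distance matrix."""
    D = np.asarray(D, dtype=float)

    # Least similar = maximum distance. Search the unmasked matrix, whose
    # diagonal is 0, so a self-pair cannot win unless every distance is 0.
    i_max, j_max = np.unravel_index(np.argmax(D), D.shape)

    # Most similar = minimum distance. Mask the diagonal on a copy so the
    # zero self-distances are ignored.
    masked = D.copy()
    np.fill_diagonal(masked, 1000)
    i_min, j_min = np.unravel_index(np.argmin(masked), masked.shape)

    least_similar = (names[i_max], names[j_max])
    most_similar = (names[i_min], names[j_min])
    return least_similar, most_similar


# Toy data (invented): a symmetric distance matrix and matching labels.
toy_names = ["A", "B", "C", "D"]
toy_D = np.array([
    [0.0, 0.2, 0.9, 0.5],
    [0.2, 0.0, 0.4, 0.7],
    [0.9, 0.4, 0.0, 0.3],
    [0.5, 0.7, 0.3, 0.0],
])

result = most_and_least_similar(toy_D, toy_names)
print(result)
```

In the posted function, the equivalent change would be to take argmax before calling fill_diagonal (or on an unmasked copy) for the least similar pair, and argmin on the masked matrix for the most similar pair.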
