Question

Posted on Sep 23, 2024

4.2 Classify

To determine the accuracy of the model, you will use the test set that was configured earlier. While in training you used only positive examples, the test data, Q1_test, Q2_test and y_test, is set up as pairs of questions, some of which are duplicates and some are not. This routine will run all the test question pairs through the model, compute the cosine similarity of each pair, threshold it and compare the result to y_test, the correct response from the data set. The results are accumulated to produce an accuracy; the confusion matrix is also computed to give a better understanding of the errors.
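The scoring described above can be illustrated without any model at all. The following is a minimal pure-Python sketch, assuming made-up 2-D vectors in place of real embeddings; the labels, the threshold value and the helper name `cosine_similarity` are hypothetical and chosen only for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy vectors standing in for the model output (illustrative values only).
pairs = [
    ([1.0, 0.0], [1.0, 0.0]),  # same direction
    ([1.0, 0.0], [0.0, 1.0]),  # orthogonal
    ([1.0, 1.0], [1.0, 0.0]),  # 45 degrees apart
]
labels = [1, 0, 1]   # hypothetical y_test entries (1 = duplicate)
threshold = 0.8      # hypothetical threshold

similarities = [cosine_similarity(a, b) for a, b in pairs]
predictions = [1 if d > threshold else 0 for d in similarities]

# Accuracy is the fraction of predictions that match the labels.
accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)

# 2x2 confusion matrix: rows are true labels, columns are predictions.
confusion = [[0, 0], [0, 0]]
for y, p in zip(labels, predictions):
    confusion[y][p] += 1
```

In this toy data the third pair is labelled a duplicate but falls below the threshold, so it shows up in the confusion matrix as a missed duplicate rather than only as a lower accuracy figure.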

Exercise 04

Instructions

- Loop through the incoming data in batch_size chunks. You will again define a tensorflow.data.Dataset to do so. This time you don't need the labels, so you can just replace them with None.
- Compute v1 and v2 using the model output.
- For each element of the batch:
    - compute the cosine similarity d of each pair of entries of v1 and v2
    - determine if d > threshold
    - increment accuracy if that result matches the expected result (the corresponding entry of y_test)

  Instead of running a for loop, you will vectorize all these operations to make things more efficient.
- Compute the final accuracy and confusion matrix and return them. For the confusion matrix you can use the tf.math.confusion_matrix function.
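To make the "vectorize instead of looping" instruction concrete, here is a small sketch that computes the same accuracy both ways. It assumes the rows of v1 and v2 are already unit length, so that a row-wise dot product equals the cosine similarity; the tensors, labels and threshold are made-up values, not data from the exercise.

```python
import tensorflow as tf

# Illustrative unit-length embeddings, one row per question (made-up values).
v1 = tf.constant([[1.0, 0.0], [1.0, 0.0], [0.6, 0.8]], dtype=tf.float64)
v2 = tf.constant([[1.0, 0.0], [0.0, 1.0], [0.8, 0.6]], dtype=tf.float64)
y_true = tf.constant([1, 0, 0], dtype=tf.int32)  # hypothetical labels
threshold = 0.7                                   # hypothetical threshold

# Loop version: handle one pair at a time.
correct = 0
for j in range(v1.shape[0]):
    d_j = float(tf.tensordot(v1[j], v2[j], axes=1))
    label_j = int(y_true[j].numpy())
    correct += int((d_j > threshold) == (label_j == 1))
loop_accuracy = correct / v1.shape[0]

# Vectorized version: all pairs at once.
# axis=1 sums over the feature dimension, leaving one similarity per pair.
d = tf.math.reduce_sum(v1 * v2, axis=1)
y_pred = tf.cast(d > threshold, tf.int32)
accuracy = tf.math.reduce_mean(tf.cast(tf.equal(y_pred, y_true), tf.float64))

# Rows are true labels, columns are predicted labels.
cm = tf.math.confusion_matrix(y_true, y_pred, num_classes=2)
```

Both versions should agree on the accuracy; the vectorized one simply replaces the per-pair dot product with an element-wise product followed by a sum along axis 1.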

import tensorflow as tf

# GRADED FUNCTION: classify
def classify(test_Q1, test_Q2, y_test, threshold, model, batch_size=64, verbose=True):
    """Function to test the accuracy of the model.

    Args:
        test_Q1 (numpy.ndarray): Array of Q1 questions. Each element of the array would be a string.
        test_Q2 (numpy.ndarray): Array of Q2 questions. Each element of the array would be a string.
        y_test (numpy.ndarray): Array of actual target.
        threshold (float): Desired threshold
        model (tensorflow.keras.Model): The Siamese model.
        batch_size (int, optional): Size of the batches. Defaults to 64.

    Returns:
        float: Accuracy of the model
        numpy.array: confusion matrix
    """
    y_pred = []
    test_gen = tf.data.Dataset.from_tensor_slices(((test_Q1, test_Q2), None)).batch(batch_size=batch_size)

    ### START CODE HERE ###

    pred = None
    _, n_feat = None
    v1 = None
    v2 = None

    # Compute the cosine similarity. Using `tf.math.reduce_sum`.
    # Don't forget to use the appropriate axis argument.
    d = None
    # Check if d > threshold to make predictions
    y_pred = tf.cast(None, tf.float64)
    # take the average of correct predictions to get the accuracy
    accuracy = None
    # compute the confusion matrix using `tf.math.confusion_matrix`
    cm = None

    ### END CODE HERE ###

    return accuracy, cm
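One way the placeholders in the template above might be filled in is sketched below. This is not the reference solution. It assumes that model.predict returns the two embeddings concatenated along axis 1 (which the `_, n_feat` placeholder seems to suggest) and that each embedding is already L2-normalized, so the row-wise dot product is the cosine similarity. The names `classify_sketch` and `ToySiamese` are hypothetical; `ToySiamese` is only a stand-in that lets the sketch run without a trained model.

```python
import numpy as np
import tensorflow as tf

def classify_sketch(test_Q1, test_Q2, y_test, threshold, model, batch_size=64):
    """Hedged sketch; assumes model.predict returns [v1 | v2] joined on axis 1."""
    test_gen = tf.data.Dataset.from_tensor_slices(
        ((test_Q1, test_Q2), None)).batch(batch_size=batch_size)

    pred = model.predict(test_gen)
    _, n_feat = pred.shape
    half = n_feat // 2
    v1 = pred[:, :half]   # embeddings of the first questions
    v2 = pred[:, half:]   # embeddings of the second questions

    # Row-wise dot product; equals cosine similarity only for unit-length rows.
    d = tf.math.reduce_sum(v1 * v2, axis=1)
    y_pred = tf.cast(d > threshold, tf.float64)
    matches = tf.equal(y_pred, tf.cast(y_test, tf.float64))
    accuracy = tf.math.reduce_mean(tf.cast(matches, tf.float64))
    cm = tf.math.confusion_matrix(
        tf.cast(y_test, tf.int32), tf.cast(y_pred, tf.int32), num_classes=2)
    return accuracy, cm


class ToySiamese:
    """Hypothetical stand-in for the trained model, used only to run the sketch."""
    EMBED = {b"cat": [1.0, 0.0], b"feline": [0.96, 0.28], b"car": [0.0, 1.0]}

    def predict(self, dataset):
        rows = []
        for (q1, q2), _ in dataset:
            for a, b in zip(q1.numpy(), q2.numpy()):
                # Concatenate the two unit-length embeddings into one row.
                rows.append(self.EMBED[a] + self.EMBED[b])
        return np.array(rows, dtype=np.float64)


# Made-up test data: three question pairs with hypothetical labels.
q1 = np.array(["cat", "cat", "car"])
q2 = np.array(["feline", "car", "feline"])
y = np.array([1, 0, 1])
accuracy, cm = classify_sketch(q1, q2, y, threshold=0.7,
                               model=ToySiamese(), batch_size=2)
```

If the real model does not normalize its outputs, the dot product would need to be divided by the product of the row norms before thresholding.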
