[Solved] Write a function score_document(document,

Question

Posted on Sep 25, 2024

Write a function score_document(document, lang_counts=default_lang_counts) that takes a document name (a string) and lang_counts, a dictionary of dictionaries containing normalised language counts. It should return a dictionary holding one score for each language in lang_counts, obtained by taking a 'dot product' of the document's trigram counts with that language's normalised counts. That is, for each language, multiply each trigram's count in the document by the same trigram's count in lang_counts and add up all the products. If a trigram from the document is not in the dictionary for a given language, treat that language's count for it as zero.
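To make the 'dot product' concrete, here is a small sketch on made-up numbers. The helper name dot_product, the toy trigrams, and the toy_lang_counts values are all invented for illustration; they are not part of the exercise or its hidden library.

```python
def dot_product(doc_counts, lang_trigram_counts):
    """Sum of (document count * language count) over the document's trigrams."""
    # dict.get(trigram, 0) treats a trigram missing from the language as zero.
    return sum(count * lang_trigram_counts.get(trigram, 0)
               for trigram, count in doc_counts.items())


# Made-up counts, purely for illustration.
doc_counts = {"the": 3, "he ": 2, "xyz": 1}
toy_lang_counts = {
    "English": {"the": 0.5, "he ": 0.25},
    "German": {"der": 0.5, "the": 0.125},
}

scores = {lang: dot_product(doc_counts, counts)
          for lang, counts in toy_lang_counts.items()}
print(scores)  # {'English': 2.0, 'German': 0.375}
```

For English the sum is 3 * 0.5 + 2 * 0.25 + 1 * 0 = 2.0; for German only "the" matches, giving 3 * 0.125 = 0.375.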

We have provided a stub of code which trains the classifier for you. We have also provided train_classifier(training_set) in a hidden library.

There are also two files included, visible in the tabs at top right. These are en_163083.txt, written in English, and de_1231811.txt, written in German, and can be loaded and used to test your function, which should behave as follows:

>>> test1 = 'en_163083.txt'
>>> d = score_document(test1)
>>> d['Vietnamese']
9.427325768357315
>>> max([(v, n) for (n, v) in d.items()])
(21.428216914833023, 'English')
>>> test2 = 'de_1231811.txt'
>>> d = score_document(test2)
>>> d['Polish']
7.710346556417009
>>> max([(v, n) for (n, v) in d.items()])
(53.12937809633241, 'German')

How can I code this in Python?
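One possible approach is sketched below; treat it as a starting point under stated assumptions, not as the reference solution. It assumes trigrams are overlapping three-character windows over the raw file text and that the document counts are left unnormalised. The hidden library may extract or scale trigrams differently, so this sketch will not necessarily reproduce the exact figures shown above. The helper count_trigrams is a hypothetical name, and the small default_lang_counts here is only a stand-in for the dictionary that the provided stub builds with train_classifier(training_set).

```python
from collections import Counter

# Stand-in only: in the exercise, default_lang_counts is produced by the
# provided stub via train_classifier(training_set), which is not shown here.
default_lang_counts = {
    "English": {"the": 0.5, "he ": 0.25},
    "German": {"der": 0.5, "er ": 0.25},
}


def count_trigrams(text):
    """Count overlapping character trigrams (an assumed extraction scheme)."""
    return Counter(text[i:i + 3] for i in range(len(text) - 2))


def score_document(document, lang_counts=default_lang_counts):
    """Return {language: dot product of document and language trigram counts}."""
    with open(document, encoding="utf-8") as f:
        doc_counts = count_trigrams(f.read())

    scores = {}
    for lang, lang_trigrams in lang_counts.items():
        # Trigrams absent from this language contribute zero to the sum.
        scores[lang] = sum(count * lang_trigrams.get(trigram, 0)
                           for trigram, count in doc_counts.items())
    return scores
```

With the real default_lang_counts in place, the best-scoring language can be read off with max(d, key=d.get), which is equivalent to the max over (v, n) pairs used in the examples whenever there are no ties.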
