Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Determining the Decile Thresholds We need a python function determine_decile_thresholds(frequencies) that splits the languages tokens into ten parts of equal frequency. The deciles or 10%-level

Determining the Decile Thresholds

We need a python function determine_decile_thresholds(frequencies) that splits the languages tokens into ten parts of equal frequency. The deciles or 10%-level thresholds are the number of word forms that are needed to cover 10% of a text, 20%, 30%, and so on to 90%. The correct result on the English data should imply that 62 word forms are enough to know 50% of all tokens in an English text, whereas to reach 90% of all tokens (a threshold frequently quoted as the threshold to pleasant reading in the literature), passive knowledge of 2,617 English word forms is necessary:

>>> thresholds = determine_decile_thresholds ( frequencies )

>>> thresholds [4:9] [62 , 115 , 252 , 662 , 2617]

def determine_decile_thresholds(frequencies): """ Returns a vector of 9 thresholds delineating 10% of overall token coverage, i.e. the number of tokens that are needed to cover 10%, 20%, 30%, etc. of all tokens. """

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Building Database Driven Catalogs

Authors: Sherif Danish

1st Edition

0070153078, 978-0070153073

More Books

Students also viewed these Databases questions

Question

LO5.2 Discuss government failure and explain why it happens.

Answered: 1 week ago