Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Aug 07, 2024

import string from collections import Counter # Computed from A Tale of Two Cities. Compare Table 1 . 3 in Hoffstein, Pipher, Silverman. english _

import string

from collections import Counter

# Computed from "A Tale of Two Cities". Compare Table

1.3

in Hoffstein, Pipher, Silverman.

english

_

freq

= {

'

'

0.0803,

'

'

0.014,

'

'

0.0232,

'

'

0.0467,

'

'

0.1247,

'

'

0.0226,

'

'

0.0209,

'

'

0.065,

'

'

0.0683,

'

'

0.0012,

'

'

0.008,

'

'

0.0367,

'

'

0.0255,

'

'

0.0706,

'

'

0.0776,

'

'

0.0166,

'

'

0.0011,

'

'

0.0621,

'

'

0.0626,

'

'

0.0902,

'

'

0.0279,

'

'

0.0087,

'

'

0.0236,

'

'

0.0012,

'

'

0.0203,

'

'

0.0004

}

def only

_

letters

(

,

case

=

None

)

'''

Returns the string obtained from X by removing everything but the letters.

If case

=

"upper" or case

=

"lower", then the letters are all

converted to the same case.

'''

='' .

join

(

c for c in X if c in string.ascii

_

letters

)

if len

(

) = = 0

return None

if case is None:

return X

elif case

= =

"lower":

return X

.

lower

()

elif case

= =

"upper":

return X

.

upper

()

def shift

_

char

(

,

shift

_

amt

)

'''

Shifts a specific character by shift

_

amt.

Example:

shift

_

char

("

", 3)

returns

"

"

'''

if ch in string.ascii

_

lowercase:

base

='

'

elif ch in string.ascii

_

uppercase:

base

='

'

# It's not clear what shifting should mean in other cases

# so if the character is not upper or lower

-

case, we leave it unchanged

else:

return ch

return chr

((

ord

(

) -

ord

(

base

) +

shift

_

amt

) % 26 +

ord

(

base

))

def shift

_

string

(

,

shift

_

amt

)

'''

Shifts all characters in X by the same amount.

'''

return

'' .

join

(

shift

_

char

(

,

shift

_

amt

)

for ch in X

)

def count

_

substrings

(

,

)

'''

Returns a Python Counter object of all n

-

grams in X

.'''

if not X:

return

{}

=

only

_

letters

(

)

shifts

= [

[

]

for i in range

(

)]

grams

= ['' .

join

(

chrs

)

for chrs in zip

(*

shifts

)]

return Counter

(

grams

)

def get

_

freq

(

)

'''

Returns the proportion that each letter occurs in

"

" .

I might change this later, but for now, it converts everything to lowercase.

The reason is to match what appears in the english

_

freq dictionary.

'''

=

only

_

letters

(

,

case

=

"lower"

)

=

len

(

)

ctr

=

count

_

substrings

(

, 1)

output

= {}

for char in string.ascii

_

lowercase:

output

[

char

] =

ctr

[

char

] /

return output

def mut

_

ind

_

(

1,

2)

'''

For letter frequency dictionaries d

1

and d

2,

return the Mutual Index of Coincidence.

See Equation

(5.9)

on page

222

in Hoffstein, Pipher, Silverman.

'''

= 0

for k in d

1 .

keys

()

+ =

1 .

get

(

, 0) *

2 .

get

(

, 0)

return s

## Import a Python helper file

*

Download the file

`

Lab

0_

Helper.py

` .

*

Import several functions defined in that uploaded file by evaluating the following code.

(

Copy and paste it into a new cell, and then evaluate by holding down Shift and hitting Enter.

)

` ` `

from Lab

0_

Helper import shift

_

string, only

_

letters, english

_

freq, get

_

freq, mut

_

ind

_

` ` `

*

Check that

`

shift

_

string

`

is working by evaluating the following.

` ` `

=

"yeah"

shift

_

string

(

, 3)

` ` `

*

Check that

`

only

_

letters

`

is working by evaluating the following code.

(

Notice that the number disappears, the same as the punctuation and spaces.

)

` ` `

=

"Hello, and welcome to Math

173

! "

only

_

letters

(

,

case

=

"lower"

)

` ` `

*

Check that

`

get

_

freq

`

is working by evaluating the following code.

` ` `

get

_

freq

("

Hello there"

)

` ` `

The result should be a Python dictionary whose keys are the lowercase letters, and whose values are the proportions which which those letters appear in the string. For example, the letter

"

"

appears three times and there are ten total letters, so the value for

` "

" `

is the real number

` 0.3 ` .

*

If you evaluate

`

english

_

freq

`,

you should see a similar dictionary, with estimated proportions for "average" English text.

(

These were calculated by computing the proportions from the book

*

A Tale of Two Cities

* .)

*

The function

`

mut

_

ind

_

`

is meant to be used with frequencies for two different pieces of text, but you can also use it by repeating the frequency for a single piece of text. This should produce a real number which is an estimate for the probability that two randomly chosen English letters are equal. The following computes the Mutual Index of Coincidence between our "average" English text and itself.

` ` `

mut

_

ind

_

(

english

_

freq, english

_

freq

)

` ` `

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Processing

Authors: David Kroenke

11th Edition

★★★★★

Discuss how the particular characteristics of a job, an organization, or an applicant affect the utility of any test. page 246

Answered: 1 week ago

Previous Question Next Question