Question

Complete the cell below to fill out the word_counts dictionary variable. word_counts keeps track of how many times a word type appears across the corpus. For instance, word_counts["movie"] should store the number 61492 -- the count of how many times the word movie appears in the corpus.

import glob
import codecs
from collections import Counter

word_counts = Counter()  # Counters are often useful for NLP in Python

# POS_LABEL, NEG_LABEL, TRAIN_DIR and TEST_DIR are expected to be defined in an earlier cell
for label in [POS_LABEL, NEG_LABEL]:
    for directory in [TRAIN_DIR, TEST_DIR]:
        for fn in glob.glob(directory + "/" + label + "/*txt"):
            doc = codecs.open(fn, 'r', 'utf8')  # Open the file with UTF-8 encoding
            ###### IMPLEMENT ######
            pass

if word_counts["movie"] == 61492:
    print("yay! there are {} total instances of the word type movie in the corpus".format(word_counts["movie"]))
else:
    print("hmm. Something seems off. Double check your code")
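One possible way to fill in the IMPLEMENT step is sketched below. It is only a sketch under stated assumptions: the helper name count_words_in_files and its parameters paths and counts are hypothetical, and it assumes tokens are produced by lowercasing the text and splitting on whitespace. The question does not say which tokenizer yields 61492, so if the course supplies its own tokenizer, that should be used instead.

```python
import codecs
from collections import Counter


def count_words_in_files(paths, counts=None):
    """Add token counts from every file in `paths` to a Counter and return it.

    Hypothetical helper; `counts` lets the caller keep accumulating into an
    existing Counter such as word_counts.
    """
    counts = Counter() if counts is None else counts
    for path in paths:
        # The context manager closes the file even if reading fails.
        with codecs.open(path, 'r', 'utf8') as doc:
            text = doc.read()
        # Assumption: a token is a lowercased, whitespace-delimited string.
        counts.update(text.lower().split())
    return counts
```

Inside the original loop, the same idea amounts to replacing pass with word_counts.update(doc.read().lower().split()) followed by doc.close(). Counter.update adds to the existing counts rather than overwriting them, which is what lets the totals build up across both labels and both directories.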
