Answered step by step
Verified Expert Solution
Question
1 Approved Answer
NEED ANSWER FOR PART3 Part 1: Setup Familiarize yourself with the documentation available at https://www.nltk.org/ Install NLTK with pip Install pyPDF2 via pip In IDLE
NEED ANSWER FOR PART3
Part 1: Setup Familiarize yourself with the documentation available at https://www.nltk.org/ Install NLTK with pip Install pyPDF2 via pip In IDLE o Import nitk o Use nitk.download() to get the data. Download all packages, all corpora Part 2: Removing stopwords and Frequency Counts Import the Gutenberg collection and the stopwords for the English language as part of a program that counts the frequencies of the words in Shakespeare's Macbeth. The steps are as follows: Import the necessary modules Read in the words in Macbeth. This will include all stopwords Step though the list of words in Macbeth, appending those that are not stopwords to a list For the resulting list, you can obtain the frequencies using one of the nitk functions Submit a screenshot of the most common words in that list. Part 3: Removing Punctuation Improve the previous program to remove any punctuation as well. For that, you can create your own list of punctuations. Expand your program to calculate the frequencies of multiple works in the same collection. Submit a screenshot of the most common words in a collection of at least 2 works from the Gutenberg collectionStep by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started