Question
Problem 2:
In this project, we will work with the inaugural corpus from NLTK in Python. We will look at the following speeches by Presidents of the United States of America:
- President Franklin D. Roosevelt in 1941
- President John F. Kennedy in 1961
- President Richard Nixon in 1973
(Hint: use .words(), .raw(), and .sents() to extract the counts.)
2.1 Find the number of characters, words, and sentences in each of the three speeches. - 3 Marks
2.2 Remove all the stopwords from all three speeches. - 3 Marks
2.3 Which word occurs most often in each president's inaugural address? Mention the top three words (after removing the stopwords). - 3 Marks
2.4 Plot the word cloud of each of the speeches (after removing the stopwords). - 3 Marks [refer to the End-to-End Case Study done in the Mentored Learning Session]
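One possible way to approach parts 2.1 to 2.4 is sketched below. It is a minimal sketch under some assumptions, not a reference solution: the file ids in SPEECHES are assumed to match what nltk.corpus.inaugural.fileids() returns, the helper names (count_units, remove_stopwords, top_words, plot_word_cloud, main) are made up for illustration, and the cleaning step also drops punctuation and lowercases tokens, which the question does not explicitly ask for. The word cloud step assumes the third-party wordcloud and matplotlib packages are installed.

```python
from collections import Counter

# Assumed file ids; check them against nltk.corpus.inaugural.fileids().
SPEECHES = ["1941-Roosevelt.txt", "1961-Kennedy.txt", "1973-Nixon.txt"]


def count_units(raw, words, sents):
    """2.1: counts taken from the outputs of .raw(), .words(), and .sents()."""
    return {"characters": len(raw), "words": len(words), "sentences": len(sents)}


def remove_stopwords(words, stop_words):
    """2.2: lowercase each token, then drop stopwords and non-alphabetic tokens."""
    stop_words = set(stop_words)
    lowered = (w.lower() for w in words)
    return [w for w in lowered if w.isalpha() and w not in stop_words]


def top_words(words, n=3):
    """2.3: the n most frequent words with their counts."""
    return Counter(words).most_common(n)


def plot_word_cloud(words, title):
    """2.4: draw a word cloud from the cleaned tokens."""
    # Imported lazily so the helpers above work without these packages.
    import matplotlib.pyplot as plt
    from wordcloud import WordCloud

    cloud = WordCloud(width=800, height=400, background_color="white")
    cloud.generate_from_frequencies(Counter(words))
    plt.figure(figsize=(10, 5))
    plt.imshow(cloud, interpolation="bilinear")
    plt.axis("off")
    plt.title(title)
    plt.show()


def main():
    # Imported lazily so the helpers above work without NLTK installed.
    import nltk
    from nltk.corpus import inaugural, stopwords

    nltk.download("inaugural", quiet=True)
    nltk.download("stopwords", quiet=True)
    # .sents() needs a sentence tokenizer model; which resource name is
    # required depends on the NLTK version, so both are requested here.
    nltk.download("punkt", quiet=True)
    nltk.download("punkt_tab", quiet=True)

    stop_words = stopwords.words("english")
    for fileid in SPEECHES:
        words = inaugural.words(fileid)
        counts = count_units(inaugural.raw(fileid), words, inaugural.sents(fileid))
        cleaned = remove_stopwords(words, stop_words)
        print(fileid, counts, top_words(cleaned))
        plot_word_cloud(cleaned, fileid)

# Call main() to run the sketch against the corpus.
```

Note that .words() returns punctuation marks as separate tokens, so the word count in 2.1 includes them unless they are filtered out first. Lowercasing before the stopword check matters because the NLTK English stopword list is lowercase, so a token such as "The" would otherwise survive the filter.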