Answered step by step
Verified Expert Solution
Question
1 Approved Answer
The next question works with data from the New York Times Annotated Corpus, which is a repository of newspaper articles available from the Penn Library.
The next question works with data from the New York Times Annotated Corpus, which is a repository of newspaper articles available from the Penn Library. This question focuses on string manipulation and data visualization.
- The New York Times Annotated Corpus contains over 1.8 million articles written and published by the New York Times between 1987 and 2007. In this dataset, each row of data contains the date the article was published (date) and the full text of the article (text). We will use the data from November 2000 to practice using string and date and time variables to examine how the media covered the 2000 election between Bush and Gore and the Florida recount.
- First, read in the data called nyt.csv. Change the date column to type
. - Lets explore if the candidates got equal news coverage. How many articles mention Bush? How many mention Gore? Did 3rd party candidate Nader get similar coverage?
- Graph the number of articles that mention of Gore per day over the month. What date had the highest number of articles mentioning him? Do some quick research into the 2000 election to explain why his mentions peaked after election day.
- How many times is recount mentioned in the same article as Florida? Graph the number of articles per day that mention both recount and Florida. On what date do they peak?
- First, read in the data called nyt.csv. Change the date column to type
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started