Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

QUESTION: Scan through a file and make a bar chart of the top 20 most common words(words of at least 4 letters, that way it

QUESTION: Scan through a file and make a bar chart of the top 20 most common words(words of at least 4 letters, that way it will exclude "the", "it", "a", "I", etc.)

-You'll have to remove the punctuation before you split a line into words.

-To count the number of occurrences of each word, there's a nice class called "Counter", This "Counter" class has a function called most_common(n), where "n" is the number of items you want. This function returns a collection of word/count pairs.

-You'll have to convert this collection of pairs (tuples) into 2 lists. You can iterate through the collection using a for loop

- Lastly, you'll have to make a bar chart. Use matplotlib plt.bar function.

I started the code for this problem but am having trouble finishing it to display the correct graph/have the correct code.

image text in transcribed

import re import matplotlib.pyplot as plt import string from collections import Counter file-open ( 'shortFile.txt 'r) fline - re.sub(+string.punctuation+']','', line) wordList = [] for line in file: line) lineNoPunct re.sub(T+string . punctuation-T, wordsList-lineNoPunct.split() if len (WordsL1st) >= 4: '', print(wordsList) for word in file: c counter(' ' ) x = c.most-common (20) print(x) from collections import Counter for item in x: print('word',item[], end-") print( count:, item[1]) file.close() plt.figure(figsize-(5,8)) x [3,4] plt.bar(x , 2) # for heiaths, insert your List of counts for each of the 20 words plt.xticks(x) # insert your List of words here, then rotate them so they don't overlap. See this Link: # https://matplot lib.org/devdocs/ga LLery/ticks_and_spines/ticklabels-rotation.htmL plt.show()

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Managing Your Information How To Design And Create A Textual Database On Your Microcomputer

Authors: Tenopir, Carol, Lundeen, Gerald

1st Edition

1555700233, 9781555700232

More Books

Students also viewed these Databases questions

Question

Are the shorelines of the continents the edges of the continents?

Answered: 1 week ago