Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Feb 29, 2024

I need to make text analyzer class using the following methods. I must comment on code. Here is part 1 completed. import requests, re

I need to make text analyzer class using the following methods. I must comment on code.

Here is part 1 completed.

import requests, re

from bs4 import BeautifulSoup

from collections import Counter

import statistics as stats

import string

#I must make my class here

import operator

import matplotlib.pyplot as plt; plt.rcdefaults()

class TextAnalyzer:

def __init__(self, src, src_type="discover"):

self._src_type = None

self._content = None

self._orig_content = None

# determine src_type if not specified

if src_type == "discover":

if src.startswith("http"):

src_type = "url"

elif src.endswith(".txt"):

src_type = "path"

else:

src_type = "text"

self._src_type = src_type

# load content based on src_type

if self._src_type == "url":

response = requests.get(src)

self._orig_content = response.text

elif self._src_type == "path":

with open(src, "r") as f:

self._orig_content = f.read()

elif self._src_type == "text":

self._orig_content = src

# preprocess content

self._content = self._preprocess(self._orig_content)

def _preprocess(self, text):

# remove punctuation

text = text.translate(str.maketrans("", "", string.punctuation))

# remove whitespace

text = re.sub(r"\s+", " ", text)

# convert to lowercase

text = text.lower()

return text

def __init__(self, src, src_type="discover"):

self._src_type = None

self._content = None

self._orig_content = None

# determine src_type if not specified

if src_type == "discover":

if src.startswith("http"):

src_type = "url"

elif src.endswith(".txt"):

src_type = "path"

else:

src_type = "text"

self._src_type = src_type

# load content based on src_type

if self._src_type == "url":

response = requests.get(src)

self._orig_content = response.text

elif self._src_type == "path":

with open(src, "r") as f:

self._orig_content = f.read()

elif self._src_type == "text":

self._orig_content = src

# preprocess content

self._content = self._preprocess(self._orig_content)

def _preprocess(self, text):

# remove punctuation

text = text.translate(str.maketrans("", "", string.punctuation))

# remove whitespace

text = re.sub(r"\s+", " ", text)

# convert to lowercase

text = text.lower()

return text

This is what i need to do for part 2.

set_content_to_tag(self, tag, tag_id=None)

Changes _content to the text within a specific element of an HTML document.

Keyword arguments:

tag (str) - Tag to read
tag_id (str) - ID of tag to read

It's possible the HTML does not contain the tag being searched. I should use exception handling to catch any errors.

This is what i should do for part 3

reset_content(self)

Resets _content to full text that was originally loaded. Useful after a call to set_content_to_tag().

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Income Tax Fundamentals 2013

Authors: Gerald E. Whittenburg, Martha Altus Buller, Steven L Gill

31st Edition

1111972516, 978-1285586618, 1285586611, 978-1285613109, 978-1111972516

More Books

Students also viewed these Programming questions

Question

★★★★★

Design a Java class that represents a cache with a fixed size. It should support operations like add, retrieve, and remove, and it should evict the least recently used item when it reaches capacity.

Answered: 1 week ago

Question

★★★★★

the Q question 1)Building a MultiThreaded Web Server In this lab we will develop a Web server in two steps. In the end, you will have built a multithreaded Web server that is capable of processing...

Answered: 1 week ago

Question

★★★★★

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

Answered: 1 week ago

Question

★★★★★

South Sea Baubles has the following (incomplete) balance sheet and income statement. a. What is shareholders' equity in 2013 and 2014? b. What is net working capital in 2013 and 2014? c. What are...

Answered: 1 week ago

Question

★★★★★

Tsang Corporation operates in a state with no sales tax. Record the following transactions in a general journal: DATE 2019 June 5Sold merchandise on account to Benson Company, issued Sales Slip 1200...

Answered: 1 week ago

Question

★★★★★

A small turbine, shown in Fig. P4.48, is operated at part load by throttling a 0.25-kg/s steam supply at 1.4 MPa and 250C down to 1.1 MPa before it enters the turbine, and the exhaust is at 10 kPa....

Answered: 1 week ago

Question

★★★★★

A water network has three connections, C1,C2,C3, as shown in Figure 1.18. For each connection, at the places marked 1, 2, 3, some switches have been put and at a particular instant, any switch can be...

Answered: 1 week ago

Question

★★★★★

STOCK SUBSCRIPTIONS AND TREASURY STOCK Rogers & Hart formed a corporation and had the following stock transactions during the year: June 30 Incurred the following costs of incorporation:...

Answered: 1 week ago

Question

★★★★★

The Polaris Company uses a job - order costing system. The following transactions occurred in October:

Answered: 1 week ago

Question

★★★★★

A retail outlet sells holiday candy for $10 per bag. The cost of the product is $8 per bag. All units not sold during the selling season prior to the holiday are sold for half the retail price in a...

Answered: 1 week ago

Question

★★★★★

is mentally challenged and most employees understand his limitations. Lori, a new employee, files suit for sexual harassment against your business for Daniel's actions. What forms of sexual...

Answered: 1 week ago

Question

★★★★★

State the functions that a household automatic dishwasher performs during its operation cycle. For each function, indicate the primary medium (signals, data, materials, or energy) involved. Describe...

Answered: 1 week ago

Question

★★★★★

Shaanxi mining Company (GH) Ltd. is a Chinese mining company operating in Ghana. Using relevant examples discuss four (4) factors that might propel Shaanxi to engage in Corporate Social...

Answered: 1 week ago

Question

★★★★★

Dying for a Paycheck, Jeffrey Pfeffer. The toll in both excess mortality and health-care costs from harmful workplace practice is exceedingly high in the U.S. Pfeffer demonstrates the severity of the...

Answered: 1 week ago

Question

★★★★★

GIS Introductions This class utilizes a 100% online environment as mode of instructions plus workshops. This short introduction will help others to welcome and get to know you. It also functions as...

Answered: 1 week ago

Question

★★★★★

Item 3 Determine the horizontal and vertical components of force that pin A exerts on the frame. Answered: 1 week ago

Answered: 1 week ago

Question

★★★★★

Create a single Jupyter / IPython notebook ( see the Artefacts section below for all the requirements read the whole task specification first! ) , where you perform what follows. Do not use numpy nor...

Answered: 1 week ago

Question

★★★★★

QUESTION 9 HC-O-C-R R-C-O-CH HC-O-P-O-CH-CH-NH3* O || O a. Phosphatidic acid, Serine O b. Lysophosphatidic acid, Serine, Free FA O c. Lysophosphatidylserine, Free FA O d. 2 Free FAs, Serine, Glycerol...

Answered: 1 week ago

Question

★★★★★

Olive Corporation was formed and began operations on January 1, 2012. The corporation's income statement for the year and the balance sheet at year-end are presented below. The corporation made...

Answered: 1 week ago

Question

★★★★★

On July 1, 2012, Ted, age 73 and single, sells his personal residence of the last 30 years for $365,000. Ted's basis in his residence is $35,000. The expenses associated with the sale of his home...

Answered: 1 week ago

Question

★★★★★

Phil and Linda are 25-year-old newlyweds and file a joint tax return. Linda is covered by a retirement plan at work, but Phil is not. a. Assuming Phil's wages were $27,000 and Linda's wages were...

Answered: 1 week ago

Question

★★★★★

Some schools teach reading using phonics (the sounds made by letters) and others using whole language (word recognition). Suppose a school district wants to know which method works better. Suggest a...

Answered: 1 week ago

Question

★★★★★

A running-shoe manufacturer wants to test the effect of its new sprinting shoe on 100-meter dash times. The company sponsors 5 athletes who are running the 100-meter dash in the 2012 Summer Olympic...

Answered: 1 week ago

Question

★★★★★

Exercise 33 discussed an experiment to see if the time it took athletes with hamstring injuries to be able to return to sports was different depending on which of two exercise programs they engaged...

Answered: 1 week ago

Previous Question Next Question