Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 05, 2024

Answer The 2nd Part Of Question Part 2 Solve Using Python Here Are The Data Structure Please Answer These And Explain Them Also using Comment

Answer The 2nd Part Of Question Part 2 Solve Using Python image text in transcribed Here Are The Data Structure Please Answer These And Explain Them Also using Comment

HashSet.py

from dataclasses import dataclass

from typing import List

@dataclass

class HashSet:

buckets: List[List] = None

size: int = 0

def init(self):

self.size = 0

self.buckets = [[] for i in range(8)]

# Computes hash value for a word (a string)

def get_hash(self, word):

pass # Placeholder code ==> to be replaced

# Doubles size of bucket list

def rehash(self):

pass # Placeholder code ==> to be replaced

# Adds a word to set if not already added

def add(self, word):

pass # Placeholder code ==> to be replaced

# Returns a string representation of the set content

def to_string(self):

pass # Placeholder code ==> to be replaced

# Returns current number of elements in set

def get_size(self):

pass # Placeholder code ==> to be replaced

# Returns True if word in set, otherwise False

def contains(self, word):

pass # Placeholder code ==> to be replaced

# Returns current size of bucket list

def bucket_list_size(self):

pass # Placeholder code ==> to be replaced

# Removes word from the set if there does nothing

# if word not inset

def remove(self, word):

pass # Placeholder code ==> to be replaced

# Returns the size of the bucket with most elements

def max_bucket_size(self):

pass # Placeholder code ==> to be replaced

2nd structure

hash_main.py

import HashSet as hset

# Program starts

# Initialize word set

words = hset.HashSet() # Create new empty HashSet

words.init() # Initialize with eight empty buckets

# Add names to word set. Notice: a) contains duplicate names,

# b) more than eight names ==> will trigger rehash

names = ["Ella", "Owen", "Fred", "Zoe", "Adam", "Ceve", "Adam", "Ceve", "Jonas", "Ola", "Morgan", "Fredrik", "Simon", "Albin", "Jonas", "Amer", "David"]

for name in names:

words.add(name)

print(" to_string():", words.to_string()) # { Adam David Amer Ceve Owen Ella Jonas Morgan Fredrik Zoe Fred Albin Ola Simon }

print("get_size():", words.get_size()) # 14

print("contains(Fred):", words.contains("Fred")) # True

print("contains(Bob):", words.contains("Bob")) # False

# Hash specific data

mx = words.max_bucket_size()

print(" max bucket:", mx) # 2

buckets = words.bucket_list_size()

print("bucket list size:", buckets) # 16

# Remove elements

delete = ["Ceve", "Adam", "Ceve", "Jonas", "Ola"]

for s in delete:

words.remove(s)

print(" get_size:", words.get_size()) # 10

print("to_string():", words.to_string()) # { David Amer Owen Ella Morgan Fredrik Zoe Fred Albin Simon }

Project Task The project is about understanding hashing and binary search trees. We will use the two large text files you used in Assignment 3 as input data. We strongly recommend that you look at Lecture 10 before you start to work on the project exercises. The problem can be divided in five parts: 1. Count unique words using Python's set and dictionary 2. Implement two data structures suitable for working with words as data: a) A hash based set, and b) a binary search tree (BST) based map (dictionary). 3. Use your two data structures to repeat Part 1 (counting unique words) 4. Present word related plots using matplotlib (VG Exercise) 5. Measure the time to perform certain operations on your map and set implementation. (VG Exercise) Part 1 - Count unique words 1 (G exercise) In Exercise 6 in Assignment 3 you saved all words from the two text files eng_news_100K-sentences.txt and holy grail.txt in two separate files. (Do it now if you haven't done this exercise already.) Your task here is to 1) use Python's set class to count the number of unique words in each file, and 2) use Python's dictionary class to produce a Top 10 list of the ten most frequently used words having a length larger than 4 in each file. In Part 3 you will repeat the same computations using your own hash and BST based implementations. Part 2 - Implementing data structures (G exercise) Lecture 10 outlines the basic ideas of two techniques suitable for implementing maps (dictionaries) and sets: 1) Binary search trees (BST), and 2) Hashing. Your task is to implement a set (suitable for words) based on hashing and a map based on binary search trees. Additional limitations: The BST based map is a linked implementation where each node has four fields (key, value, left-child, right-child). The hash-based set is built using a Python list to store the buckets where each bucket is another Python list. The initial bucket list size is 8 and rehashing (double the bucket list size) takes place when the number of elements equals the number of buckets. Furthermore, code skeletons outlining which methods we expect for each data structure are available here. They also contains an example program showing how the various methods can be used. Notice: You are not allowed to make any changes of the method signatures in the given skeletons. Also, the demo programs should work as outlined in the provided example programs once your implementations are complete. Your task is simply to complete the given code fragments in the skeletons. However, feel free to add additional methods. Part 3 - Count unique words 2 (G exercise)

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Making Databases Work The Pragmatic Wisdom Of Michael Stonebraker

Making Databases Work The Pragmatic Wisdom Of Michael Stonebraker

Authors: Michael L. Brodie

1st Edition

1947487167, 978-1947487161

Students also viewed these Databases questions

Question

★★★★★

Create an ER model to represent the data use by the library. The library provides books to borrowers. Each book is described by title, edition and year of publication and is uniquely identified using...

Answered: 1 week ago

Question

★★★★★

What is cognitive constructionism? What are some of its basic premises?

Answered: 1 week ago

Question

★★★★★

9.83 Breaststroke, continued Refer to Exercise 9.82. a. Construct a 99% confidence interval for the dif- ference in the average number of metres swum by breaststroke versus individual medley...

Answered: 1 week ago

Question

★★★★★

Following are typical questions that might appear on an internal control questionnaire for investments in marketable securities. 1. Is custody of investment securities maintained by an employee who...

Answered: 1 week ago

Question

★★★★★

You are working on a bid to build two city parks a year for the next three years. This project requires the purchase of $210,000 of equipment that will be depreciated using straight-line depreciation...

Answered: 1 week ago

Question

★★★★★

Robertson Resorts is considering whether to expand its Pagosa Springs Lodge. The expansion will create 2 4 additional rooms for rent. The following estimates are available: Robertson uses straight -...

Answered: 1 week ago

Question

★★★★★

A combustion unit is burning refused derived fuel (RDF) consisting of 88% organics, 7% water, and 5% inorganics (inerts) at a rate of 1000 kg h. Assume the heat value of the fuel to be 17,000 kJ/kg...

Answered: 1 week ago

Question

★★★★★

A 4-m long, 150-kg steel beam is attached to a wall with one end connected to a hinge that allows the beam to rotate up and down. The other end of the beam is held in a horizontal position with a...

Answered: 1 week ago

Question

★★★★★

DM & DL variances; journal entries Madzinga's Draperies manufactures curtains. Curtain #4571 requires the following: Direct material standard 10 square yards at $5 per yard Direct labor standard 5...

Answered: 1 week ago

Question

★★★★★

4. (a) A steel beam is 12 m long when installed at 32 C. Determine how much does its length change when it changes temperature from -23 C to 55 C. For steel = 1.1 105 C. [PHYS0030 MID-TERM...

Answered: 1 week ago

Question

★★★★★

5.2 Assuming the workstation currently has one hard drive installed, what additional hardware would be necessary to implement each of the two methods? 5.3 What additional fault tolerance method would...

Answered: 1 week ago

Question

★★★★★

What magazine and ads did you choose to examine?

Answered: 1 week ago

Question

★★★★★

2. Have you developed your thesis statement as a proposition of fact, value, or policy?

Answered: 1 week ago

Question

★★★★★

4. Have you worked to ensure that your speechand your deliverywill help the audience to engage in central processing?

Answered: 1 week ago

Previous Question Next Question