Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 22, 2024

Implement by C++ code. Huffman Codes Description Suppose that we have to store a sequence of symbols (a file) efficiently, namely we want to minimize

image text in transcribed

image text in transcribed

Implement by C++ code.

Huffman Codes Description Suppose that we have to store a sequence of symbols (a file) efficiently, namely we want to minimize the amount of memory needed. For the sake of simplicity we assume that the symbols are restricted to the first 6 letters of the alphabet. For example, let us assume that the frequency of different symbols that you have to store are the following: symbol frequency 1000 150 200 800 300 Total 2500 As we have to store 6 different symbols, the obvious way is to encode each of them in 3 bits, as with 3 bits it is possible to encode 23 different symbols. With this encoding, we need 2500 x 3 7500 bits to store the above symbols. A different way to address the problem is the following. Instead of assigning to each symbol a code with the same length (i.e., number of bits), we assign shorter codes to symbols that are more frequent, and longer codes to symbols that are less frequent. One possible encoding according to this sequence is the following symbol encoding 10101 1011 100 10100 According to this encoding the number of required bits is 1000 x 1 + 150 x 5 + 200 x 4 + 800 x 2 +300 x 3 + 50 x 5-5300 This idea is at the basis of the programs used to compress files. First they analyze the input, then they choose the codes, and then they recode the input according to the determined codes. While this idea brings benefits in terms of the space requirements, using variable length codes presents some problems. Once we have coded a file according to a variable length code, we must also be able to decode it in the original format (i.e., once we have compressed the file, we want to able to decompress it). The encoding works only if the codes assigned to different characters are such that no code is a prefix of any other code. If this property does not hold, there isa problem of ambiguity when trying to decompress the sequence. You can prove that in the depicted example no code is a prefix of any other code. For example no code starts with 0 except from the code of A. So while decompressing the file, if we findia symbol whose code starts with 0, we know it's A. If we find a character whose code starts with 11, we know it's D. It can't be any other symbol, as no code starts with 11 other than D's code. And so on. How do we assign codes? This is done through a greedy algorithm. We assign the shortest code to the most frequent character, the second longest one to the second most frequent character, and so on. The figure below illustrates the first few stages of the algorithm

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Data Infrastructure For Medical Research In Databases

Data Infrastructure For Medical Research In Databases

Authors: Thomas Heinis ,Anastasia Ailamaki

1st Edition

1680833480, 978-1680833485

More Books

Students also viewed these Databases questions

Question

★★★★★

Compare and contrast the relative advantages and disadvantages of sequential, block, group, alphabetic, and mnemonic codes.

Answered: 1 week ago

Question

★★★★★

Find the relative maxima or minima in Exercises. Maximum of (x, y) = 4xy, subject to x + y = 16

Answered: 1 week ago

Question

★★★★★

What could be some of the consequences of these perceptions?

Answered: 1 week ago

Question

★★★★★

1. What kinds of applications are described here? What business functions do they support? How do they improve operational efficiency and decision making? 2. Identify the problems that businesses in...

Answered: 1 week ago

Question

★★★★★

A typical Linux system will run 5 virtual consoles True False

Answered: 1 week ago

Question

★★★★★

Largest voluntary corporate sustainability initiative calling for companies to align strategies and operations with universal principles on human rights, labour, environment and anti-corruption, and...

Answered: 1 week ago

Question

★★★★★

Walmart's human resource management uses internal and external recruitment sources for various positions. With reference to the Case Study and theory, critically discuss the relevance of various...

Answered: 1 week ago

Question

★★★★★

5. Consider the vector field F(x, y, z) = = (a) Prove that F is not a gradient field. (b) Show that F curl(F) = 0. 2y2zi+4yzj+2y2k. (c) Find a function = (x) satisfying curl (pF) (c) Find a potential...

Answered: 1 week ago

Question

★★★★★

On January 1, 2025, Sanderson, Inc. acquired a machine for $1,040,000. The estimated useful life of the asset is five (5) years. Residual value at the end of five (5) years is estimated to be...

Answered: 1 week ago

Question

★★★★★

(10 points) Consider the following sensitivity analysis of EBIT and earnings per share (EPS) from the Hill Country case where $34,000 is the expected level of EBIT after the acquisition and $20,000...

Answered: 1 week ago

Question

★★★★★

Accept a word as input and determine if its letters are in alphabetical order. Some examples of words whose letters are in alphabetical order are biopsy, adept, chintz, and lost. See Fig.3.52. The...

Answered: 1 week ago

Question

★★★★★

6-16 What are the main advantages and disadvantages of having multiple databases in a distributed architecture? Explain. The Lego Group, which is headquartered in Billund, Denmark, is one of the...

Answered: 1 week ago

Question

★★★★★

5. Give some examples of problems that would have occurred at American Water if its data were not clean? American Water, founded in 1886, is the largest public water utility in the United States....

Answered: 1 week ago

Question

★★★★★

6-5 It has been said there is no bad data, just bad management. Discuss the implications of this statement.

Answered: 1 week ago

Previous Question Next Question