Question
We want to build a tokenizer for simple expressions such as xpr = "res = 3 + x_sum*11". Such expressions comprise only three kinds of tokens, as follows: (1) integer literals: one or more digits, e.g., 3, 11; (2) identifiers: strings starting with a letter or an underscore and followed by more letters, digits, or underscores, e.g., res, x_sum; (3) operators: =, +, *. Leading or trailing whitespace characters should be skipped.
(a) Write a regular expression pattern for the above grammar and use it with re.findall to split the expression into a list of lexemes. Using xpr above, the list returned should be:
['res', '=', '3', '+', 'x_sum', '*', '11']
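One possible sketch for part (a): a pattern with one alternative per token kind (the exact character classes are my choice). With no capturing groups, re.findall returns the full matched lexemes, and characters between matches, such as whitespace, are simply skipped:

```python
import re

xpr = "res = 3 + x_sum*11"

# One alternative per token kind: integer literal, identifier, operator.
pattern = r"\d+|[A-Za-z_][A-Za-z0-9_]*|[=+*]"

print(re.findall(pattern, xpr))
# ['res', '=', '3', '+', 'x_sum', '*', '11']
```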
(b) The problem with the above is that re.findall returns a list of all lexemes but not their respective tokens (as defined by the grammar). Modify your regex pattern so that each matched lexeme is in its own (regex) group. The list returned should now be as follows:
[('', 'res', ''), ('', '', '='), ('3', '', ''), ('', '', '+'),
('', 'x_sum', ''), ('', '', '*'), ('11', '', '')]
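For part (b), one sketch is to wrap each alternative of the previous pattern in its own capturing group; when a pattern has multiple groups, re.findall returns a tuple of all groups per match, with non-matching groups as empty strings:

```python
import re

xpr = "res = 3 + x_sum*11"

# Same alternatives as before, each in its own capturing group.
pattern = r"(\d+)|([A-Za-z_][A-Za-z0-9_]*)|([=+*])"

print(re.findall(pattern, xpr))
# [('', 'res', ''), ('', '', '='), ('3', '', ''), ('', '', '+'),
#  ('', 'x_sum', ''), ('', '', '*'), ('11', '', '')]
```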
(c) To find which token each lexeme is associated with, we only need to find the first non-empty item in each tuple. Write a tokenize generator (using re.findall and map) that returns all pairs (tuples) of lexemes and tokens. The output of list(tokenize(xpr)) should thus be:
[('res', 'id'), ('=', 'op'), ('3', 'int'), ('+', 'op'),
('x_sum', 'id'), ('*', 'op'), ('11', 'int')]
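A sketch for part (c). The token names ('int', 'id', 'op') come from the expected output; the names TOKENS, PATTERN, and the helper label are my own. Since map returns a lazy iterator, this satisfies the list(tokenize(xpr)) usage:

```python
import re

# Token names in the same order as the capturing groups in the pattern.
TOKENS = ('int', 'id', 'op')
PATTERN = re.compile(r"(\d+)|([A-Za-z_][A-Za-z0-9_]*)|([=+*])")

def tokenize(text):
    # Pair the first non-empty group of each tuple with its token name.
    def label(groups):
        return next((lexeme, token)
                    for lexeme, token in zip(groups, TOKENS) if lexeme)
    return map(label, PATTERN.findall(text))

xpr = "res = 3 + x_sum*11"
print(list(tokenize(xpr)))
# [('res', 'id'), ('=', 'op'), ('3', 'int'), ('+', 'op'),
#  ('x_sum', 'id'), ('*', 'op'), ('11', 'int')]
```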
(d) The above solution works fine if the number of tokens is small, but it will break down when the number increases. Better is to use a feature of the regular expression engine: whenever it completes a matching group, it assigns the group number to an attribute of the match object called lastindex. Rewrite tokenize to make use of this feature (using re.match repeatedly and inspecting each match object) and still produce the same output as before.
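A sketch for part (d), under the same assumed token names. Pattern.match accepts a start position, so we can advance through the string one match at a time; lastindex is the (1-based) number of the group that matched, which indexes directly into the token-name tuple:

```python
import re

TOKENS = ('int', 'id', 'op')
# Leading whitespace is consumed by \s*; the token alternatives are groups 1-3.
PATTERN = re.compile(r"\s*(?:(\d+)|([A-Za-z_][A-Za-z0-9_]*)|([=+*]))")

def tokenize(text):
    pos = 0
    while pos < len(text):
        m = PATTERN.match(text, pos)
        if m is None:          # trailing whitespace or an unrecognized character
            break
        # m.lastindex is the number of the group that matched this lexeme.
        yield m.group(m.lastindex), TOKENS[m.lastindex - 1]
        pos = m.end()

xpr = "res = 3 + x_sum*11"
print(list(tokenize(xpr)))
# [('res', 'id'), ('=', 'op'), ('3', 'int'), ('+', 'op'),
#  ('x_sum', 'id'), ('*', 'op'), ('11', 'int')]
```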
(e) We can improve the approach further, using another feature, the scanner() method of compiled regular expressions: it creates a scanner object attached to a string, keeps track of the current position, and moves forward after each successful match. Rewrite the tokenize generator to make use of this feature (calling scanner() once and then match() repeatedly, yielding lexeme and token pairs) and again produce the same output as before.
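A sketch for part (e). Note that scanner() is an undocumented but long-standing CPython feature of compiled patterns; the returned object remembers the current position and each successful match() advances past the matched text, so no manual position bookkeeping is needed:

```python
import re

TOKENS = ('int', 'id', 'op')
PATTERN = re.compile(r"\s*(?:(\d+)|([A-Za-z_][A-Za-z0-9_]*)|([=+*]))")

def tokenize(text):
    # The scanner object tracks the current position in `text` itself.
    scan = PATTERN.scanner(text)
    while True:
        m = scan.match()
        if m is None:          # end of input (or an unrecognized character)
            break
        yield m.group(m.lastindex), TOKENS[m.lastindex - 1]

xpr = "res = 3 + x_sum*11"
print(list(tokenize(xpr)))
# [('res', 'id'), ('=', 'op'), ('3', 'int'), ('+', 'op'),
#  ('x_sum', 'id'), ('*', 'op'), ('11', 'int')]
```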