Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

The following github repository has all the files needed to complete the instructions below. Implement all instructions into these files or use the files in

The following github repository has all the files needed to complete the instructions below. Implement all instructions into these files or use the files in some way.

https://github.com/jake-doss/Profiler

All instruction must be implemented and the problem must work as intended.

Problem: Build a program profiler. Construct a program to instrument C++ source code to support program profiling. It is often important to determine how many times a function or statement is executed. This is useful not only for debugging but for determining what parts of a program may need to be optimized. This process is called profiling. That is, a execution profile presents how many times each part of a program is executed using a given set of input data (or for some run time scenario). To compute a profile, specialized statements need to be inserted in to the code that keep track of how many times a function or statement is executed. The process of inserting these statements is called instrumenting the code. To implement a profiler one must first parse the source code and generate an Abstract Syntax Tree (AST) of the code. Each node of the AST describes the syntactic category of the code stored within it (function, statement, while-statement, etc.). So at the top level is a syntactic category corresponding to a program, class, or function (such as in the case of a main). Under that are sub-trees that further detail the syntactic categories of each part of the code. Such things as declarations, parameter lists, while-statement, and expression statements will describe the various parts of the program. After the AST is generated it can then be traversed and the appropriate syntactic structures can be found that need to be instrumented. Once a construct is found, say a function, new code can be inserted that keeps track of how many times that function is executed. The most difficult part of constructing a profiler is correctly parsing the source code. Unfortunately, C++ is notoriously difficult to parse. So here we will use a parsing tool called src2srcml. This tool reads in C++ code and marks up the code with XML tags (e.g., block, if, while, condition, name, etc). That is, the output is an AST in XML. The XML representation is called srcML (source code markup language). A number of srcML data files are provided for the project. However, you can use your own program as input. To run srcML on wasp or hornet you will first need to set a PATH variable so the command can be found. You need to execute the command: export PATH=/local/opt/srcml/bin:$PATH It is best if you insert this line into your .bash_profile file in your home directory on wasp/hornet. Then, to generate the srcML file for your own code, use the following: src2srcml main.cpp -o main.cpp.xml Use the following for a list of all options: src2srcml --help More information about srcML can be found at www.srcML.org including a list of all the tag names (see Documentation). You can also download srcML if you want it on your own machine. Tag names needed for the project:

- an identifier or function name

- a block { stmts }

- a function or method

- a constructor

- a destructor

- an expression statement (e.g., a = 5 + x; cin >> x; ++i; foo(30);)

- a return statement (e.g., return result;)

Your program will read in srcML files (in XML) and build an internal representation of the AST as a tree data structure. When using XML this internal representation is typically called a Document Object Model (DOM). A set of classes defining the data structure (DOM) is in shared (syntax_tree.hpp and syntax_tree.cpp). This part of the project is partially implemented (i.e., the read and write work). The given code compiles and runs. The program currently is able to read in a srcML file and construct the internal tree representation. It can also write the code back out (minus the XML) from the internal tree representation. Your job will be to complete the implementation and build the instrumenter for the profiler. The specification for the profiler is as follows:

The program will take as input a list of files that are to be profiled. The first file in the list will be the file with the main body.

The program will instrument all the files to keep track of each function and statement executed in the program.

The program will also instrument the file with the main body with two additional parts. The first is a set of global profiling variables. Second, at the end of the main body the results of the profiling data will be printed to the screen.

Profile how many times each function, method, constructor, and destructor in the supplied files is executed. Keep track of this by the filename, line number, and function name.

Profile how many times each statement is executed. Keep track of this by filename and line number. Include all statements ending with a semicolon (e.g., assignments, function calls) and all statements within loops, if/else, and case statement.

Do not count:

break, return, goto, or throw statements

declaration statements

statements in for-loop specifications, for example, don't count the i=0, i==n, or ++i in for (i=0; i==n; ++i)

conditions (but see challenge), that is, only count statements in the bodies of if-stmts or loop-stmts and not how many times the condition is executed

You do not need to worry about following any formating style (indentation, etc.) for the inserted code. The inserted code does not have to be in the tree format. You may insert the code as text nodes.

Output: For each file print out, ordered by line number, the counts for each line and function.

In short, your profiler program will read in source code and write out source code that has been instrumented. Then you will compile and run the instrumented version of the code to get the profile information. Below is a full example of the tool chain. First convert main.cpp into srcML. Then run the profiler on that XML file. This will produce a instrumented version of main.cpp called p-main.cpp. Finally, you compile and run this program. src2srcml main.cpp -o main.cpp.xml ./profiler main.cpp.xml clang++ p-main.cpp -o p-main ./p-main Requirements:

Milestone 1 - Implementation: (COMMENT WHERE YOU COMPLETED THIS MILESTONE PLEASE)

Name the folder for this project profiler.

Read the code given in syntax_tree.hpp/cpp, profile.hpp/cpp, and main.cpp under shared. You will be required to add code and comments to these files. You are free to modify them as you see fit.

The provided code works (compiles and runs) and the read and write are implemented.

Implement the necessary 1) destructor, 2) copy constructor, 3) swap, and 4) assignment operator for the syntax_tree class. There are comments pointing out which methods still need to be implemented.

The code uses STL and you may use STL in your solution, including std::string.

Milestone 2 - Implementation: (COMMENT WHERE YOU COMPLETED THIS MILESTONE PLEASE)

 //File main1.cpp #include  int main() { int i; i = 0; while (i < 10) { ++i; std::cout << i; } return 0; } 
 //File p-main1.cpp #include  #include "profile.hpp" profile main1_cpp("main1.cpp"); int main() { int i; i = 0; while (i < 10) { ++i; std::cout << i; } return 0; } 

Use the provided profile ADT (profile.hpp/cpp). This is a simple ADT that uses the STL map data type. It is a map of (string X int). With the string being the function name and line number and the integer being the count of times that line/function was executed.

To profile the program you will need to create a profile object for each file (as given on the command line) to keep track of statements, functions, etc. These will be inserted as code into the main file as global variable declarations. The first file given is the main file.

All profile objects need to be declared in the main file and for each additional file you will need to include profile.hpp and declare the file's profile object as external, for example extern foo_cpp;

Your profiler only needs to work on cpp files. You do not need to instrument header files (.h, .hpp).

Implement the functionality to insert profile declarations into the main file. You will need this for milestone 3. See example below.

Implement the functionality to insert extern profile declarations into the remaining files. You will need this for milestone 3. See milestone 3 for example.

The profiler will need to insert the following lines of code (in red):

Milestone 3 - Implementation: (COMMENT WHERE YOU COMPLETED THIS MILESTONE PLEASE)

 //File: p-main1.cpp #include  #include "profile.hpp" profile foo_cpp("foo.cpp"); profile main1_cpp("main1.cpp"); int main() { main1_cpp.count(__LINE__, "main"); int i; i = 0; main1_cpp.count(__LINE__); while (i < 10) { ++i; main1_cpp.count(__LINE__); std::cout << i; main1_cpp.count(__LINE__); } std::cout << foo_cpp << std::endl; std::cout << main1_cpp << std::endl; return 0; } 
 //File p-foo.cpp #include "profile.hpp" extern profile foo_cpp; int foo() { foo_cpp.count(__LINE__, "foo"); int i; i = 0; foo_cpp.count(__LINE__); return i; } 

For every function (i.e., free function, method, constructor, destructor. In srcML, these are just function, constructor, and destructor), insert a line of code to update the corresponding file profile object given the function name and line number. The line number is used to provide a unique map index for overloaded functions. This will add one to the corresponding map element. Insert this code as the first line in the function, right after the { is fine. You may assume there are no nested functions.

For every line of code, (i.e., expr_stmt) as defined above, add a line of code after it that updates the correct file profile object given the line number. Use the __LINE__ macro for this. This macro returns an integer and you will need to convert this value to a string for use in your profile ADT.

You may ASSUME all loops and conditionals have a block. That is, no single line bodies. For example, there will always be a block after a loop as in for (i=0; i==n; ++i) { x=x+1; } versus no block as in for (i=0; i==n; ++i) x=x+1;.

Lastly, you will need to add statements to the end of the main to print out the profile data. This should go just before the return. You may assume there is only one return in the main. This report should print each item in the map (function or line number) followed by the counts.

Your profiler only needs to work on cpp files. You do not need to instrument header files (.h, .hpp).

You should send the profiler output for each input file to a new file. Prefix a "p-" to the begining of the orginal file name. For example: p-main.cpp or p-syntax_tree.cpp

Given profiler main1.cpp foo.cpp the final code will look something like the following for file p-main1.cpp:

And p-foo.cpp

Challenge - implementation: (COMMENT WHERE YOU COMPLETED THIS MILESTONE PLEASE)

 i = 0; //Line 1 while (i < 10) { //Line 2 ++i; //Line 3 } 

Profile all conditionals.

Count the number of times the conditional in a while, if, for, repeat, or switch is executed.

For example, given the following code:

If this code is run only one time, it will have Line 1 executing 1 time, Line 2 (the conditional) executing 11 times, and Line 3 executing 10 times.

These should be stored by file and line number just as other statements.

Hint: use operator&& or the coma operator (operator,).

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Multidimensional Array Data Management In Databases

Authors: Florin Rusu

1st Edition

1638281483, 978-1638281481

More Books

Students also viewed these Databases questions