Question:

Instructions: In assignment 1, I had you provide me some details on the content of a data set. That assignment was intended to help you get acclimated to the R environment. Here, I want to give you the same opportunity using Python! Your instructions are to create the same outputs that you had using R, but using Python instead. As before, I have intentionally left a few minor details for you to either reason through or research. I am also aware that there are lots of ways to accomplish many of these tasks in Python! I do not really care which way you do them so long as your output answers the questions asked.

NOTE: For future assignments, I will ask that you pick either R or Python to complete your assignment.

Expectation: This assignment is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so.

1. Load the data file.
2. Provide to me the structure of the loaded data set (viz. something equivalent to the R command str).
3. Provide me a "summary" of the loaded data structure (viz. something equivalent to the R command summary).
4. Count of releases per year.
5. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc.).
6. Average number of Lines of Code (LOC) per release per year.
7. Average file size per year.
8. Create, through code, a single data structure that contains the year along with the average, median, and standard deviation for the LOC and tar file variables.
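The eight items above could be approached with pandas roughly as follows. This is a minimal sketch, not the instructor's expected solution: the question does not name the data file or its columns, so the inline CSV and the column names `year`, `loc`, and `tar_size` are hypothetical placeholders, and the two-year groups are assumed to start at the earliest year in the data.

```python
import io

import pandas as pd

# Hypothetical stand-in for the assignment's data file. The real file name and
# column names are not given in the question, so these are assumptions.
CSV_TEXT = """year,loc,tar_size
1992,1000,200
1992,3000,400
1993,2000,300
1994,4000,500
1995,6000,700
1995,8000,900
"""

# 1. Load the data file (pass a file path instead of StringIO for real data).
df = pd.read_csv(io.StringIO(CSV_TEXT))

# 2. Structure of the data set, roughly analogous to R's str().
df.info()

# 3. Summary statistics, roughly analogous to R's summary().
print(df.describe())

# 4. Count of releases per year.
releases_per_year = df.groupby("year").size()

# 5. Count of releases per two-year group (assumed to start at the first year).
first_year = df["year"].min()
bin_start = first_year + (df["year"] - first_year) // 2 * 2
releases_per_two_years = df.groupby(bin_start).size()

# 6. and 7. Average LOC and average tar file size per year.
avg_loc = df.groupby("year")["loc"].mean()
avg_tar = df.groupby("year")["tar_size"].mean()

# 8. One data structure holding mean, median, and standard deviation per year.
stats = df.groupby("year")[["loc", "tar_size"]].agg(["mean", "median", "std"])
print(stats)
```

Note that pandas computes `std` with the sample formula (n - 1 in the denominator), which matches R's `sd` and Excel's `STDEV.S`; a year with a single release therefore shows `NaN` for the standard deviation.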

Step by Step Solution


It appears that you have an assignment that requires using Python to perform data analysis tasks that you previously did with R. Unfortunately you have...
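Because the assignment expects the results to be validated independently, the per-year statistics can also be cross-checked without any third-party library. The sketch below uses only the Python standard library; the `(year, LOC)` pairs are hypothetical sample values, not rows from the actual data set.

```python
from collections import defaultdict
from statistics import mean, median, stdev

# Hypothetical (year, LOC) pairs standing in for rows of the real data set.
rows = [(1992, 1000), (1992, 3000), (1993, 2000), (1995, 6000), (1995, 8000)]

# Group the LOC values by release year.
loc_by_year = defaultdict(list)
for year, loc in rows:
    loc_by_year[year].append(loc)


def describe(values):
    """Return (mean, median, sample std); std is None for a single value."""
    sd = stdev(values) if len(values) > 1 else None
    return mean(values), median(values), sd


# Per-year statistics, keyed by year in ascending order.
check = {year: describe(values) for year, values in sorted(loc_by_year.items())}
for year, (avg, med, sd) in check.items():
    print(year, avg, med, sd)
```

`statistics.stdev` is the sample standard deviation, so its values should agree with Excel's `STDEV.S` for the same year.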
