Question
Instructions: In assignment 1, I had you provide me some details on the content of a data set. That assignment was intended to help you get acclimated to the R environment. Here, I want to give you the same opportunity using Python! Your instructions are to create the same outputs that you had using R, but using Python instead. As before, I have intentionally left a few minor details for you to either reason through or research. I am also aware that there are lots of ways to accomplish many of these tasks in Python. I do not really care which way you do them, so long as your output answers the questions asked.

NOTE: For future assignments, I will ask that you pick either R or Python to complete your assignment.

Expectation: This assignment is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so.

1. Load the data file.
2. Provide the structure of the loaded data set (viz. something equivalent to the R command str).
3. Provide a "summary" of the loaded data structure (viz. something equivalent to the R command summary).
4. Count of releases per year.
5. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc.).
6. Average number of lines of code (LOC) per release, per year.
7. Average file size per year.
8. Create a single data structure through code which contains the year along with the average, median, and standard deviation for the LOC and tar file variables.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
It appears that you have an assignment that requires using Python to perform data analysis tasks that you previously did with R. Unfortunately you have...
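For tasks 1 to 3 in the question (load the file, show its structure, summarize it), a minimal pandas sketch might look like the following. The question does not give the file name or column names, so the inline CSV and the columns `year`, `loc`, and `tar_size` are hypothetical stand-ins; with the real file you would pass its path to `load_releases` instead.

```python
import io

import pandas as pd

# Hypothetical stand-in for the assignment's data file. The real file name,
# column names, and values are not given in the question.
SAMPLE_CSV = """year,loc,tar_size
1992,1000,50
1992,1200,60
1993,1500,70
1994,2000,90
1995,2600,120
1995,3000,150
"""


def load_releases(source):
    """Task 1: read release data from a path or file-like object."""
    return pd.read_csv(source)


df = load_releases(io.StringIO(SAMPLE_CSV))

# Task 2: rough equivalent of R's str(): column names, dtypes, non-null counts.
df.info()

# Task 3: rough equivalent of R's summary(): count, mean, std, min,
# quartiles, and max for each numeric column.
summary = df.describe()
print(summary)
```

`DataFrame.info()` and `DataFrame.describe()` are not exact matches for `str` and `summary`, but they report the same kind of information and are easy to cross-check against Excel.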
Step: 2
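For tasks 4 and 5 in the question (releases per year, and releases per two-year group), one possible approach is sketched below. The column name `year` and the sample rows are assumptions made for illustration, and the two-year bins are assumed to start at the earliest year in the data, which matches the "1992 and 1993, 1994 and 1995" pattern only if the data begins in 1992.

```python
import pandas as pd

# Hypothetical sample: one row per release, with an assumed `year` column.
df = pd.DataFrame({"year": [1992, 1992, 1993, 1994, 1995, 1995, 1996]})

# Task 4: number of releases in each year.
releases_per_year = df.groupby("year").size()

# Task 5: number of releases in each two-year group.
# Each year is mapped to the first year of its bin, counting in steps of
# two from the earliest year present in the data.
first_year = int(df["year"].min())
bin_start = first_year + ((df["year"] - first_year) // 2) * 2
period = (bin_start.astype(str) + "-" + (bin_start + 1).astype(str)).rename("period")
releases_per_two_years = df.groupby(period).size()

print(releases_per_year)
print(releases_per_two_years)
```

Labeling each bin as a string such as `1992-1993` keeps the output readable when it is compared with an Excel pivot table.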
Step: 3
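For tasks 6 to 8 in the question (per-year averages and a single combined table of mean, median, and standard deviation), a sketch under assumed column names `year`, `loc`, and `tar_size` could be written as follows. The sample values are made up for illustration and are not from the assignment's data set.

```python
import pandas as pd

# Hypothetical sample data; the real column names are not given.
df = pd.DataFrame(
    {
        "year": [1992, 1992, 1993, 1993],
        "loc": [1000, 1200, 1500, 2500],
        "tar_size": [50, 60, 70, 90],
    }
)

# Task 6: average lines of code per release, for each year.
avg_loc_per_year = df.groupby("year")["loc"].mean()

# Task 7: average tar file size, for each year.
avg_size_per_year = df.groupby("year")["tar_size"].mean()

# Task 8: one table with year plus mean, median, and standard deviation
# for both variables. agg() yields two-level column names, which are
# flattened here to names such as "loc_mean".
stats = df.groupby("year")[["loc", "tar_size"]].agg(["mean", "median", "std"])
stats.columns = ["_".join(col) for col in stats.columns]
stats = stats.reset_index()

print(stats)
```

Note that pandas computes `std` as the sample standard deviation (dividing by n - 1) by default, which corresponds to R's `sd` and Excel's `STDEV.S` rather than `STDEV.P`; that matters when validating the numbers in Excel.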