Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

For your final project, imagine you re a data analyst at an organization that helps other institutions with their data. Your supervisor has provided you

For your final project, imagine youre a data analyst at an organization that helps other institutions with their data. Your supervisor has provided you with data and is asking you to take a preliminary look and give a brief report of your findings. Since you are new to the organization, they would also like to see your thought process and have asked for both the code used to run your analysis and screen captures of your work in addition to your informed conclusions.
Specifically, you must address the critical elements listed below. Most of the critical elements align with a particular course outcome (shown in brackets).
Data Assessment: Your first task is to review the Excel file and perform a preliminary data assessment. You will be sharing this work in the Appendices section only.
Excel Calculations: For two columns in each data set, calculate the minimum, maximum, and average.
Source Code Management: Include text box comments in the document to explain your work. Then, take a screenshot/screen capture of your work and include it in the Appendices section.
Data Validation and Discovery: In this section, youll be validating the information you discovered in the previous section. You will be sharing this work in the Appendices section only.
Prepare Data: Utilize the command line interface in Linux to copy files from the ~/workspace/SNHU/DAT-500/finalproject folder to the
~/workspace/Analysis folder.
From the supplied files in the ~/workspace/SNHU/DAT-500/finalproject folder, copy only the dat500_final_project_GBR_data.csv and dat500_final_project_USA_data.csv data set files to the Analysis folder in preparation for analysis.
Modify Files: Utilize Linux commands to rename files.
From the supplied files, rename them by removing the dat500_final_project_ portion of the file name.
Import Data: From the Integrated Development Environment in RStudio, import the data files into your workspace using Rscript.
Summary: Perform the summary function to get the descriptive statistics from both files for comparison. Describe your findings and include your rationale. Show your work and be sure to include inline comments.
Variables: Create variables for the minimum, maximum, and averages for both the columns in the previous section. Show your work and be sure to include inline comments.
Source Code Management: Include inline comments (denoted by # symbol) to explain why you are using certain code. Then, take a screenshot/screen capture of your work and include it in the Appendices section.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Systems For Advanced Applications 27th International Conference Dasfaa 2022 Virtual Event April 11 14 2022 Proceedings Part 2 Lncs 13246

Authors: Arnab Bhattacharya ,Janice Lee Mong Li ,Divyakant Agrawal ,P. Krishna Reddy ,Mukesh Mohania ,Anirban Mondal ,Vikram Goyal ,Rage Uday Kiran

1st Edition

ISBN: 3031001257, 978-3031001253

Students also viewed these Databases questions

Question

here) and other areas you consider relevant.

Answered: 1 week ago