Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

reate two boxplots based on sales data showing total sales by region and the second showing profits by region. A . Write pseudocode explaining what

reate two boxplots based on sales data showing total sales by region and the second showing profits by region.
A. Write pseudocode explaining what the MySQL and R script are doing at each step of the process in a Word document.
B. Create a database and table using MySQL Workbench. Note, if you execute the script more than one time within the same VM instance, read the two comments in the create_table script.
i. Run the create_table.sql file found in the lab environment.
ii. Populate the database with Sales data by running the populate_table.sql file found in the lab environment.
C. Use RStudio to create R script that displays two boxplot visualizations, one illustrating total sales per region and one total profits by region. Include comments in script as appropriate to explain each line
i. Using R, connect to MySQL and import the needed data fields from step B above.
ii. Transform data to show profit by region.
iii. Utilize the MySQL transformed data to create two boxplots with clear titles and labels.
1) Total profit by region in the first boxplot.
2) Total sales by region in the second boxplot.
iv. Analyze the scripts and debug if needed.
v. Take a screenshot or your completed scripts including the lines of code showing connection to MySQL database, comments, and two boxplots.
Write an analysis report including the following:
A. Summarize the visualizations that tell the story and include the population data located in Milestone One, with source cited, for the distribution of data and answer the following:
i. Explain how the boxplot visuals are useful to illustrate and communicate the data distribution of profit by region and total sales.
ii. Do any regions have outliers? Do any regions not have outliers? Explain your answers.
B. Compare and contrast the code needed to connect to your data source for the boxplot with the code used to connect to the SQL Server database. Many companies store their data in database systems, such as an SQL Server. The visualizations you created in this project are possible using these large databases. Review the following code and address the following as you compare and contrast these codes:
i. Explain how you would connect to and retrieve data stored within a SQL Server database.
ii. Include at least one reference to support your answer.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Refactoring Databases Evolutionary Database Design

Authors: Scott Ambler, Pramod Sadalage

1st Edition

0321774515, 978-0321774514

Students also viewed these Databases questions

Question

Write a mechanism for the following reaction. cat. H.SO OMe

Answered: 1 week ago

Question

=+a) Show that mixing implies ergodicity.

Answered: 1 week ago

Question

What is the cerebrum?

Answered: 1 week ago