Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

--- h1.title { font-size: 38px; color: DarkRed; text-align: center; } h4.author { /* Header 4 - and the author and data headers use this too

---

```{r setup, include = F} knitr::opts_chunk$set(echo = TRUE) ```

# Instructions for the Data Analytics (DA) Projects : {-}

a) If because of **technological issues or any other issues**, you cannot submit the project, then you will get 0. It is your responsibility to submit the project on time. Therefore, please take all necessary steps so that the project can be turned in on time. b) Please note that you **MUST** upload the `html` file on D2L (Not the `code` file); however, you **MUST** keep a copy of `code` file with you because you might be requested to turn in your code file. If you are requested to turn in `code` file and you fail to do so, you will get 0. c) This is RMarkdown file, which is different from R Script (we use R Script in class room practice). In order to write code in RMarkdown file, please press **ctrl + alt + I** for **Windows** OR **Cmd + Option + I** for **macOS** , then you will see space between two-set of 03 back tick (which is also called back quote and you will find it tilde (~) button on your keyboard). Please write your code between the space of two-set of 03 dots.

# Loading the Data Set {-} Please first install the `tidyverse` package by writing - `install.packages("tidyverse")` if you did not install it. If you already install the package, then load it by writing the code - `library (tidyverse)`. Also install and load `janitor` and `lubridate` package. If you need to install packages, please do not install them in this document; rather, install them in `console`.

```{r}

```

# Question # 1 (5 Points) Import a dataset directly into R from the link - . Name the dataset `df` and check whether it is a `tibble`. If the dataset is not `tibble`, please convert it into `tibble`. The necessary packages for data wrangling and visualization must be loaded before importing the data.

```{r}

```

# Question # 2 (5 Points) Please present some information about the `df` data (discuss about `metadata`). Also mention the number of rows (observations) and columns (variables) of the dataset. Additionally, mention the type of the variables, i.e., `ticker` and `year`. Please change the type of the variable `sic` into `character (chr)`. Finally, convert the names of all the columns into uppercase.

```{r}

```

# Question # 3 (10 Points) Please mention the number of companies included in the `df` dataset. Additionally, please mention the duration of the data (the number of years covered by the dataset). Furthermore, please mention the number of industries included in the dataset. Please draw a `bar` diagram of the `SIC` variable for the top five industries.

# Question # 4 (5 Points) Please subset the `df` dataset for the company `ALPHABET INC` whose `TICKER` symbol is `GOOGL` and name the dataset as `googl`. Please discuss the metadata of the `googl` dataset. Please also draw a `scatter plot` of this dataset for the variables `SALE` and `NETINCOME` and include comments about your observations.

# Question # 5 (5 Points) Create a new dataset by filtering two companies - `TARGET CORP` and `WALMART INC`. Please compare their total sales (variable - `SALE`) and net profit (variable - `NETINCOME`) for the years 2008-2017 and include comments on your observations.

# Question # 6 (10 Points) Create the following new variables - `GROSSPROFIT`, which is the difference between the variables `SALE` and `COSTOFGOODSSOLD`; `ROA`, which is `NETINCOME` divided by `TOTALASSETS`; `ROE`, which is `NETINCOME` divided by `COMMONEQUITY`; `GPMARGIN`, which is `GROSSPROFIT` divided by `SALE`; and `NPMARGIN`, which is `NETINCOME` divided by `SALE`. Please mention the company that has the highest `ROA` in 2008, one of the years of the recent financial crisis.

# Question # 7 (10 Points) For the transportation industry (`SIC` code 4512), calculate the net profits for each company in the industry over the period 2008-2017. Identify three companies with the highest profit in the period.

# Question # 8 (20 Points) Create a `boxplot` of the total assets (`TOTALASSETS`) and sales (`SALE`) of two companies, `PFIZER INC` and `JOHNSON & JOHNSON,` and note down your observations.

# Question # 9 (10 Points) Please verify if there is any relationship between the advertising expenses (`ADVERTISING`) and sales (`SALE`). Plot the relationship between the advertising expenses and sales and elaborate your observations.

# Question # 10 (20 Points) Please mention the state in which most of the US companies are incorporated.

**GOOD LUCK!**

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Modern Advanced Accounting In Canada

Authors: Murray Hilton

6th Edition

0070001537, 978-0070001534

More Books

Students also viewed these Accounting questions