Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Part of data analysis often includes pulling data together from different sources. Often data need to be cleaned up in order for effective analysis. For

Part of data analysis often includes pulling data together from different sources. Often data need to be cleaned up in order for effective analysis. For example, public companies file annual financial statements with the Securities and Exchange Commission (SEC) through 10-k filings. These are available for public download as Excel files on the SEC website called EDGAR (EDGAR Guide). This allows researchers to combine data from many companies into one table for analysis. Yet companies are not consistent with how they name certain accounts or the formatting of their financial statements. Data analysts have to clean up the data to make naming convention and formatting consistent in order to effectively combine and analyze the data.

VLOOKUP Practice

Go to the SEC website and download the FY18 10-k Excel files for Pfizer (PFE), Merck (MRK) and Johnson & Johnson (JNJ). In order to download the files you will need to search for each company and find the 10k filing for the year 2018. You can limit the file types you see by typing 10-k into filing type within the filter results option. Click on the interactive data option next to the 10-k filing information. Directly below the company name you will see an option to View Excel Document click here and download the 10-k. Save each of the three files to the same folder. Name the files based on company name, file type and year (e.g. MRK_10K_FY18). In order to combine data, we will use an Excel function called VLOOKUP (Microsoft Excel VLOOKUP).

Steps:

  1. Read through how the VLOOKUP function works (Microsoft Excel VLOOKUP).
  2. Create a new Excel file
  3. Make the following columns
    1. Company Name
    2. Year
    3. Sales
    4. Cost of Goods Sold
    5. Gross Profit
    6. Net Income
  4. Use VLOOKUP to populate the Company Name from your files.
    1. For example, if using the naming convention above, the following would populate the Pfizer name:

=VLOOKUP("Entity Registrant Name",'[PFE_10K_FY18.xlsx]Document and Entity Information'!$A:$D, 2, FALSE)

The formula looks for the cell in the first row of the array with the value Entry Registrant Name and returns the value in the second column of the array in the file PFE_10K_FY18.xlsx and in the tab Document and Entity Information. In this case the value in that cell is PFIZER INC.

You can see the power of VLOOKUP to quickly pull data from different sources. One could duplicate this formula and only change the file name to pull data from many sources, assuming the files are formatted the same that is the same array, columns, tab names and lookup value apply.

  1. Create nine total rows in your Excel file. Three for each company. Use VLOOKUP to populate the company name (3 rows for each company), and then fill in the years FY18, FY17 and FY16. Each company should have three rows and three years when completed. The Find and Replace feature in Excel (Control H) can help you quickly replace company names within the formula.
  2. We can use the same VLOOKUP function to pull data for Sales, Cost of Goods Sold and Net Income. Look through the excel files for the three companies and identify some barriers to using VLOOKUP. For example, VLOOKUP works best when the files are formatted the same, the tab names are the same and the lookup value are the same. Is that the case with these files? Identify at least three challenges you see with using VLOOKUP.
  3. This is the reality with many data sets such as 10-Ks. They are inconsistent between companies, and often inconsistent even within a company. It is often faster to keep the VLOOKUP formula consistent, and instead change the data sets to fit the VLOOKUP parameters. In this case we would have to:
    1. Make the Consolidated Statements of Income tab have a consistent name
    2. Make Sales, Cost of Goods Sold and Net Income consistent lookup values
    3. Make sure the data we want to pull is within the same column
  4. Use the attached Excel files with cleaned up data to practice VLOOKUP and populate the remaining values for Sales, Cost of Goods Sold, and Net Income for each company for years FY18, FY17 and FY16. (Note that the cleaned up files only include the Document and Entity Information and Consolidated Statements of Income tab, and that only those values needed for pulling data have been changed). Calculate Gross Profit based on sales and cost of goods sold. When you are complete you should have a data set that is 10 rows (with header) by 6 columns.
  5. A useful tool to analyze time series data for year over year comparisons are pivot tables (Pivot Table Excel). Create a new sheet and use pivot tables to analyze year over year changes in Sales, Cost of Goods Sold, Gross Profit and Net Income.
  6. Comment on some individual corporate and industry trends you see.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

The Lost Continent The BBCs Europe Editor On Europes Darkest Hour Since World War Two

Authors: Gavin Hewitt

1st Edition

1444764829, 9781444764826

More Books

Students also viewed these Accounting questions

Question

What do you see as your biggest strength/weakness?

Answered: 1 week ago

Question

Learning is a good thing for everyone. Discuss.

Answered: 1 week ago

Question

What should be the role of managers in HRD?

Answered: 1 week ago

Question

What should be the role of government in HRD?

Answered: 1 week ago