Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Noah has noted that the payroll expense in 2 0 1 6 was more than in 2 0 1 5 by approximately $ 2 6

Noah has noted that the payroll expense in 2016 was more than in 2015 by approximately $266,000 and he has asked you to explain why. Prepare a list of at least five questions you would like answered to address Noahs request. That is, you should think about possible reasons why payroll was higher in 2016 than in 2015 and the questions Noah is likely to want answered about the differences in payroll between the years.
The client provided you with three pipe-delimited text files. It is important to understand the characteristics of the data in each of these files. Read the description of the data in each file in the appendix on the following page. Use the framework of the four Vs of data (variety, velocity, veracity and volume) and prepare responses to the questions below. Note that this list is not exhaustive, and you may have other questions about your data to fully understand it before you begin your analysis.
Variety different forms and formats of the data
Are all of the data set formats the same? Do they need to be the same for your analysis?
Do all fields contain the same labels? Does the data with similarly titled labels contain the same type of data?
How are the files delimited? Are there any extra delimiters that may cause problems when importing? What strategies can you use to deal with any of these challenges?
Is the data structured or unstructured? What transformation would be needed to any unstructured data to make it possible to analyze it?
Is the data aggregated at the same level?
Velocity frequency of incoming data that needs processing
Is your analysis performed on live data or only on historical data?
How often will you be updating this analysis? How automated should the analysis be? What tool might make sense to use in automating this process?
Veracity trustworthiness of the data
Is the data you have complete? Do the data files you received contain all transactions? Are all of the data fields complete for each year and do the files contain all of the same data for each year?
Does the data contained in the data files accurately represent the economic transactions?
What human judgment went into establishing the data?
Volume the amount or scale of data
Should you include data for all years?
Should you include data from all entities?
Are all fields relevant to your analysis?
How many rows will you need to import? What tools can handle this quantity of data?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Modern Database Management

Authors: Jeff Hoffer, Ramesh Venkataraman, Heikki Topi

13th Edition Global Edition

1292263350, 978-1292263359

More Books

Students also viewed these Databases questions