Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Jul 10, 2024

Stock Exchange Data Analysis Project 1 DESCRIPTION Objective : To use hive features for data engineering or analysis and sharing the actionable insights Problem Statement:

Stock Exchange Data Analysis

Project 1

DESCRIPTION

Objective: To use hive features for data engineering or analysis and sharing the actionable insights

Problem Statement: NewYork stock exchange data of seven years, between 2010 to 2016, is captured for 500+ listed companies. The data set comprises of intra-day prices and volume traded for each listed company. The data serves both for machine learning and exploratory analysis projects, to automate the trading process and to predict the next trading-day winners or losers.. The scope of this project is limited to exploratory data analysis.

Domain: BFSI

Analysis to be done: Exploratory analysis to understand how MoM or YoY companies from different sectors or industries and states have progressed in a period of 7 years

Content: This data set contains prices.csv and securities.csv files having the following features:

Prices.csv:

Date: Trading date

Symbol: Ticker code or listed company code on NY stock exchange

Open: Intra-day opening price for each listed company

Close: Intra-day closing price for each listed company

Low: Intra-day lowest price for each listed company

High: Intra-day highest price for each listed company

Volume: Number of shares traded per day per company

Securities.csv:

Ticker_Symbol: Country to which the customer belongs

Security: Legal name of the listed company

Sector: Business vertical of the listed company

Sub_Industry: Business domain of the listed company within a Sector.

Headquarter: Headquarters address

Steps to perform:

1) Create a data pipeline using sqoop to pull the data from the table below from MYSQL server into Hive.

a. MYSQL DATABASE NAME: BDHS_PROJECT

i. Stock_prices ii. Stock_companies

Check the TABLE description: STOCK_PRICES

Column Name Datatype

Trading_date Date

Symbol String

Open double

Close double

Low double

High double

Volume int

TABLE: STOCK_COMPANIES

Column Name Datatype

Symbol String

Company_name String

Sector String

Sub_industry String

Headquarter String

2) Create a new hive table with the following fields by joining the above two hive tables. Please use appropriate Hive built-in functions for columns (a,b,e and h to l).

Trading_year: Should contain YYYY for each record

Trading_month: Should contain MM or MMM for each record

Symbol: Ticker code

CompanyName: Legal name of the listed company

State: State to be extracted from headquarters value.

Sector: Business vertical of the listed company

Sub_Industry: Business domain of the listed company within a sector

Open: Average of intra-day opening price by month and year for each listed company

Close: Average of intra-day closing price by month and year for each listed company

Low: Average of intra-day lowest price by month and year for each listed company

High: Average of intra-day highest price by month and year for each listed company

Volume: Average of number of shares traded by month and year for each listed company

DATA ANALYSIS USING HIVE

3) Find the top five companies that are good for investment 4) Show the best-growing industry by each state, having at least two or more industries mapped. 5) For each sector find the following.

Worst year

b. Best year

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

The Controllers Toolkit

Authors: Christine H. Doxey

1st Edition

1119700647, 9781119700647

More Books

Students also viewed these Accounting questions

Question

★★★★★

ABC wants to issue 18-year, zero coupon bonds that yield 11.35 percent. What price should they "charge for these bonds if they have a par value of $1,000? That is, solve for PV. Assume annual...

Answered: 1 week ago

Question

★★★★★

Write each radical as an exponential and simplify. Leave answers in exponential form. Assume that all variables represent positive real numbers. Vx x4

Answered: 1 week ago

Question

★★★★★

1. Show enthusiasm for the subject you teach.

Answered: 1 week ago

Question

★★★★★

Tim Madsen is the purchasing agent for Computer Center, a large discount computer store. He has recently added the hottest new computer, the Power model, to the stores stock of goods. Sales of this...

Answered: 1 week ago

Question

★★★★★

The tollowing information applies to the questions displayed below The general ledger of Red Storm Cleaners at January 1, 2018, includes the following account balances Debits Crecits 10,500 6,100...

Answered: 1 week ago

Question

★★★★★

cost accounting please show all computations Throughout the course we discussed 3 different methods of costing as they relate to what costs are traced versus what are assigned to the work-in-process...

Answered: 1 week ago

Question

★★★★★

(1) Suppose your supplier offered you credit terms of 1.5/10 net 40 days. Your firm is not taking discounts, but is paying after 35 days instead of waiting until Day 40. What is the effective annual...

Answered: 1 week ago

Question

★★★★★

Compare the differences in terminology encountered when creating a Linux server on each cloud provide?

Answered: 1 week ago

Question

★★★★★

1. Create a T-Account ledger for the following Balances, open the T-accounts with the balances from the Balance Sheet. 2. Input the transactions into each account on the T-Accounts page. There is no...

Answered: 1 week ago

Question

★★★★★

Which decision can be made with managerial accounting information? Deciding whether or not the company can repay its debts Deciding whether or not the company is providing useful products Deciding...

Answered: 1 week ago

Question

★★★★★

"The cash accounts of Ria on December 31,2021 has balance of P151,000 and it consist of the following: Bills and coins on hand P52,780 Traveler's check 22,400 Credit memo from supplier's for purchase...

Answered: 1 week ago

Question

★★★★★

What considerations would you make in preparing for a presentation in a German or Chinese company?

Answered: 1 week ago

Question

★★★★★

How do patients across cultures prefer to make medical decisions?

Answered: 1 week ago

Question

★★★★★

Do you prefer to handle conflict directly or to avoid it altogether?

Answered: 1 week ago

Previous Question Next Question

Column Name	Datatype
Trading_date	Date
Symbol	String
Open	double
Close	double
Low	double
High	double
Volume	int