Question

Posted on Sep 21, 2024

For the final project you will conduct analysis of the dividends of companies from the following two countries: Australia and the UK. Relevant financial data is available on Moodle in one Excel file: dividend_au_uk.xlsx. This file is in the "Final Project Documents" folder on Moodle. You will have to download the data, upload it to your Jupyter notebook and then conduct your analysis using Pandas and other Python libraries. You will have to submit the Jupyter notebook you used to do the analysis for this project.

You will have to write up your analysis in a report of up to words. Your report should also include tables and graphs from your analysis. These tables and graphs have to be produced using Python and you will have to submit all the relevant code in a Jupyter notebook.

The objectives of your analysis are as follows:

Document and discuss the distribution and trends in dividend payout ratio (dividend/net income) and the number and percentage of dividend payers (positive dividend) over time in each country:

o Use the dividend to net income ratio as the measure of dividend payout ratio.
o Dividend payout ratio attempts to measure what percentage of a firm's earnings is paid out in dividends.
o Dividend payout ratio is not a meaningful measure in the following two cases, so you need to deal with these cases in the data pre-processing step:
   1. When a firm has negative net income, the dividend payout ratio is not a meaningful measure. So exclude observations (rows) with negative net income from your sample.
   2. When a firm has a dividend payout ratio higher than 1 (dividend is higher than net income), the dividend payout ratio is not reliable, as a firm cannot pay out more dividend than net income in the long run. So cap the value of dividend payout ratio at 1.0: set any value higher than 1.0 to 1.0.
o You will conduct the analysis for Australia and the UK and you will discuss how the dividend payout ratios of the two countries compare with each other and whether they show similar or different trends over time.
o You should perform a similar analysis of dividend payers in the two countries. In two separate graphs, you should show the number and percentage of dividend-paying firms in the two countries and how these have changed over time.

(TFIN605 Data Analytics in Finance, Spring 2021)

o You will document the distribution of dividend payout ratio in each country in 2007 and 2017 to see if the distribution has changed over time. You can use histograms, boxplots, kernel density plots and percentile plots to show the distributions.

Correlation and Regression Analysis

Analyse the determinants of dividend payout ratio in each country. So you will have two sets of results.

o Initially, explore the relations between various firm characteristics (such as firm size, profitability, growth opportunity etc.) and dividend payout ratio using scatter plots.
o You will then conduct correlation analysis to determine if there are significant correlations between these characteristics and dividend payout ratio.
o Then use simple linear regressions to quantify the relation between dividend payout ratio and these characteristics one at a time. Here you will use regressions with one independent variable (see lecture 7).
o Finally, you will use multiple linear regression analysis to consider the effects of all the different firm characteristics on dividend payout ratio.
o You will compare and contrast the results you get from the above analysis for the two countries in your sample: Australia and the UK.

Machine Learning Analysis

Finally, you should estimate two Machine Learning models and evaluate the predictive performance of these models.

o The first model will try to predict the dividend payout ratio of a firm.
You can use the Boston House Price example as a template for this analysis and do a similar analysis on dividend payout ratio (instead of house price). As X (or independent) variables (features matrix), use the four firm characteristics we used in the group project: Firm size (Log of SALES_USD), Profitability, Tangibility and Market to book ratio. The y variable or dependent variable (the target vector) in your model would be the dividend payout ratio. You should do the train-test split, evaluate the model's performance on the test dataset and interpret the results.
o The second model will try to predict whether a firm pays dividends, that is, whether the dividend of a firm is positive. Create a variable in your dataframe called PAYER which should be 1 if a firm has positive dividend (and therefore positive dividend payout ratio) and 0 otherwise. This variable will be the categorical dependent variable in your supervised classification model. Same as in the first model, as X (or independent) variables, use the four firm characteristics we used in the group project: Firm size (Log of SALES_USD), Profitability, Tangibility and Market to book ratio. Use the K Nearest Neighbor (KNN) model for this analysis. You can use the Iris flower example (covered in lecture 8) as a template for this analysis and do a similar analysis on dividend PAYER (instead of Iris flower types).
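
The two pre-processing rules in the brief (drop negative net income, cap the payout ratio at 1.0) and the payer counts could be sketched as follows. This is only an illustration on a toy table: the column names COUNTRY, YEAR, DIVIDEND and NET_INCOME are assumptions, and the actual names in dividend_au_uk.xlsx may differ.

```python
import pandas as pd

# Hypothetical toy data standing in for dividend_au_uk.xlsx;
# the column names here are assumptions, not the real file layout.
raw = pd.DataFrame({
    "COUNTRY":    ["AU", "AU", "AU", "UK", "UK", "UK"],
    "YEAR":       [2007, 2007, 2017, 2007, 2017, 2017],
    "DIVIDEND":   [5.0, 0.0, 30.0, 2.0, 4.0, 0.0],
    "NET_INCOME": [10.0, 8.0, 20.0, -3.0, 16.0, 5.0],
})


def prepare(df):
    """Apply the two pre-processing rules and add PAYOUT and PAYER columns."""
    # Rule 1: exclude rows with negative net income.
    # Rows with exactly zero net income are kept here and would give an
    # infinite or missing ratio; how to treat them is a separate decision.
    out = df[df["NET_INCOME"] >= 0].copy()
    # Dividend payout ratio = dividend / net income.
    out["PAYOUT"] = out["DIVIDEND"] / out["NET_INCOME"]
    # Rule 2: cap the ratio at 1.0.
    out["PAYOUT"] = out["PAYOUT"].clip(upper=1.0)
    # PAYER is 1 for a positive dividend and 0 otherwise.
    out["PAYER"] = (out["DIVIDEND"] > 0).astype(int)
    return out


clean = prepare(raw)

# Number and percentage of dividend payers per country and year.
summary = (
    clean.groupby(["COUNTRY", "YEAR"])
    .agg(
        n_firms=("PAYER", "size"),
        n_payers=("PAYER", "sum"),
        mean_payout=("PAYOUT", "mean"),
    )
    .reset_index()
)
summary["pct_payers"] = 100 * summary["n_payers"] / summary["n_firms"]
print(summary)
```

Note that this sketch counts payers after the negative-income rows have been dropped. Whether the payer percentage should instead be computed on the full sample is a choice the brief leaves open.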
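
For the correlation and regression objectives, one possible shape of the analysis for a single country is sketched below. The data is synthetic, the column names SIZE, PROFITABILITY, TANGIBILITY, MTB and PAYOUT are assumptions, and the use of SciPy plus a least-squares fit is just one option; a dedicated statistics package would also report standard errors for the multiple regression.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Synthetic firm data for one country; names and coefficients are assumptions.
rng = np.random.default_rng(2)
n = 300
firms = pd.DataFrame({
    "SIZE": rng.normal(5.0, 1.0, n),            # stands in for log(SALES_USD)
    "PROFITABILITY": rng.normal(0.08, 0.05, n),
    "TANGIBILITY": rng.uniform(0.0, 1.0, n),
    "MTB": rng.lognormal(0.5, 0.4, n),          # market to book ratio
})
signal = 0.05 * firms["SIZE"] + 1.5 * firms["PROFITABILITY"]
firms["PAYOUT"] = (signal + rng.normal(0.0, 0.05, n)).clip(0.0, 1.0)

features = ["SIZE", "PROFITABILITY", "TANGIBILITY", "MTB"]

# Correlation test and simple regression, one characteristic at a time.
rows = []
for name in features:
    r, p_value = stats.pearsonr(firms[name], firms["PAYOUT"])
    fit = stats.linregress(firms[name], firms["PAYOUT"])
    rows.append({
        "feature": name,
        "corr": r,
        "corr_p": p_value,
        "slope": fit.slope,
        "intercept": fit.intercept,
        "r_squared": fit.rvalue ** 2,
    })
results = pd.DataFrame(rows).set_index("feature")
print(results.round(3))

# Multiple regression: ordinary least squares with an intercept column.
design = np.column_stack([np.ones(n), firms[features].to_numpy()])
coef, *_ = np.linalg.lstsq(design, firms["PAYOUT"].to_numpy(), rcond=None)
multi = pd.Series(coef, index=["const"] + features)
print(multi.round(3))
```

Running the same steps once on the Australian rows and once on the UK rows would give the two sets of results the brief asks to compare.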
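
The first machine learning model (predicting the payout ratio) might look roughly like the sketch below. It uses synthetic data and a plain linear regression from scikit-learn as a stand-in for whatever model the lecture template uses; apart from SALES_USD, the column names are assumptions.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic data; in the project these columns would come from the Excel file.
rng = np.random.default_rng(0)
n = 400
firms = pd.DataFrame({
    "SALES_USD": rng.lognormal(mean=5.0, sigma=1.0, size=n),
    "PROFITABILITY": rng.normal(0.08, 0.05, n),
    "TANGIBILITY": rng.uniform(0.0, 1.0, n),
    "MTB": rng.lognormal(0.5, 0.4, n),
})
firms["SIZE"] = np.log(firms["SALES_USD"])  # firm size = log of SALES_USD

# Made-up relation used only to generate a target for the illustration.
payout = (
    0.05 * firms["SIZE"]
    + 1.5 * firms["PROFITABILITY"]
    - 0.03 * firms["MTB"]
    + rng.normal(0.0, 0.05, n)
)
firms["PAYOUT"] = payout.clip(0.0, 1.0)

features = ["SIZE", "PROFITABILITY", "TANGIBILITY", "MTB"]
X = firms[features]   # features matrix
y = firms["PAYOUT"]   # target vector

# Hold out 25% of the rows for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

model = LinearRegression().fit(X_train, y_train)
y_pred = model.predict(X_test)

# Evaluate on the held-out test set only.
test_r2 = r2_score(y_test, y_pred)
test_rmse = np.sqrt(mean_squared_error(y_test, y_pred))
print(f"test R^2: {test_r2:.3f}, test RMSE: {test_rmse:.3f}")
print(dict(zip(features, model.coef_.round(3))))
```

The test-set R^2 and RMSE are the numbers to interpret in the report, since the training-set fit overstates how well the model predicts unseen firms.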
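
For the second model, a minimal KNN classification sketch on synthetic data is shown below. The feature names, the rule that generates PAYER, the choice of 5 neighbours and the scaling step are all assumptions made for illustration; the brief itself only specifies KNN and the four firm characteristics.

```python
import numpy as np
import pandas as pd
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic features; in the project these come from the cleaned dataframe.
rng = np.random.default_rng(1)
n = 400
X = pd.DataFrame({
    "SIZE": rng.normal(5.0, 1.0, n),            # stands in for log(SALES_USD)
    "PROFITABILITY": rng.normal(0.08, 0.05, n),
    "TANGIBILITY": rng.uniform(0.0, 1.0, n),
    "MTB": rng.lognormal(0.5, 0.4, n),
})

# Made-up rule: larger and more profitable firms are more likely to pay.
score = (
    (X["SIZE"] - 5.0)
    + 20.0 * (X["PROFITABILITY"] - 0.08)
    + rng.normal(0.0, 0.5, n)
)
y = (score > 0).astype(int).rename("PAYER")  # 1 = payer, 0 = non-payer

# Stratify so both classes appear in the train and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=1, stratify=y
)

# KNN is distance based, so the features are standardised first.
knn = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
knn.fit(X_train, y_train)
y_pred = knn.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)
cm = confusion_matrix(y_test, y_pred)  # rows: actual class, columns: predicted
print(f"test accuracy: {accuracy:.3f}")
print(cm)
```

Standardising the inputs is a design choice rather than a requirement of the brief: without it, the characteristic with the largest numeric range would dominate the distance calculation.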