- DOMAIN : Startup ecosystem - CONTEXT: Company X is a EU ontine publisher focusing on the startups industry. The company speci fically reports on the business related to technology news, analysis of emerging trends and pro filing of new tech businesses and products. Their event i.e. Startup Battle field is world's preeeminent startup competition. Startup Battle audience, present in person and online. field features 1530 top early stage startups pitching top judges in front of a vast live - DATA DESCRIPTION :CornpanyX_EU.csv - Each row in the dataset is a Start-up company and the columns describe the company. ATTRIB INFORMATION: - PROJECT OBJECTIVE : Analyse the data of the various companies from the given dataset and perform the tasks that are speci 1. Startup :Name of the company 2. Product : Actual product 3. Funding : Funds raised by the company in USD 4. Event : The event the company participated in 5. Result :Described by Contestant :Finalist ,Audience choice =Winner or Runner up 6. Operatingstate : Current status ofthe company, Operating ,Closed; Acquired or IPO 'Dataset has been downloaded from the internet .All the Hedi! for the damsel: goes to the original Hearty! of the data. below steps. Draw insights from the various attributes that are present in the dataset: plot distributions, state hypotheses and draw conclusions from the dataset. Steps and tasks :[ Total Score: 30 points] 1. 3. 4. 5. Data warehouse - Read the CSV file. Data exploration - Check the datatypes of each attribute ' Check for null values in the attributes Data preprocessing &. visualisation ' Drop the null values. ' Convert the 'Funding' features to a numerical value. - Plot box plot for funds in million. - Get the lower fence from the box plot. - Check number of outliers greater than upper fence . ' Drop the values that are greater than upper fence. - Plot the box plot aer dropping the values . ' Check frequency of the OperatingState features classes. ' Plot a distribution plot for Funds in million . ' Plot distribution plots for companies still operating and companies that closed . Statistical analysis Is there any signi ficant di 'erence between Funds raised by companies that are still operating vs companies that closed down? Write the null hypothesis and alternative hypothesis. Test for signi finance and conclusion - Make a copy of the original data frame. - Check 'equency distribution of Result variable . - Calculate percentage of winners that are still operating and percentage of contestants that are still operating - W'rite your hypothesis comparing the proportion of companies that are operating between winners and contestants Write the null hypothesis and alternative hypothesis. Test for signi ficance and conclusion ' Check distribution of the Event variable. ' Select only the Event that has disrupt keyword from 2013 onwards. ' Write and perform your hypothesis along with signi ficance test comparing the funds raised by companies across NY, SF and EU events from 2013 onwards. - Plot the distribution plot comparing the 3 city events . Write your observations on improvements or suggestions on quality, quantity, variety: velocity, veracity etc. on the data points collected to perform abetter data analysis . fied in