Homework 1: Structured Data

Due: 24 hours before the live session of Week 5

You can choose to complete this assignment by yourself or in a group of at most two participants. Each person must turn the assignment in for grading, and each person must contribute to the development of the program. Use the file Donors_Data.csv.

Structured Data Processing

For purposes of this write-up, we will use examples from the Donors data file. The main outline of your assignment is to write a program that reads in the data from a file, such as the .csv file saved from Excel. The file will be in a structured format in which each line of data represents one unit of a single type (i.e., one donor in the donors file). Your program will represent the data as Python data structures. You may choose for the overall structure to be one or both of the following:

- A list of dictionaries, or some combination of lists, dictionaries, and numpy arrays
- A pandas dataframe

You will do data exploration and cleaning on this data. The program will do some processing to convert the data to a form that will answer at least two questions, as described below, and write files with the data suitable for answering each question. Graphing is optional.

Data: You may choose a dataset to work with. As a guideline, choose a dataset that has between 500 and 4,000 lines of data and between 4 and 50 columns. These guidelines are not exact limits, just guidance for selecting data. If the data comes in an Excel spreadsheet with a lot of columns, it is okay to first edit the file to remove columns that you do not need for your processing. For example, in the Donors data, you might wish to create a separate spreadsheet with only a few columns of data.

Questions: For this assignment, at least one question that you choose to answer should look at the data in a different unit of analysis than is present in the data file. For example, instead of looking at individual donors, you could look at the donors of each of the 9 income or wealth types.

Simplest example question (you should do one more complex than this): For each wealth type, what is the average home value of all the donors of that type?
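The simplest example question above could be approached roughly as follows. This is only a sketch using the list-of-dictionaries option and the standard library. The column names `WEALTH` and `HOME_VALUE`, the output file name, and the helper function names are assumptions made for illustration; check the header row of Donors_Data.csv and adjust them. Rows with a missing or non-numeric value are simply skipped here, which is one possible cleaning choice, not the required one.

```python
import csv
from collections import defaultdict

# Hypothetical column names: replace with the actual headers in your file.
GROUP_COL = "WEALTH"
VALUE_COL = "HOME_VALUE"


def read_rows(path):
    """Read a CSV file into a list of dictionaries, one per data line."""
    with open(path, newline="") as f:
        return list(csv.DictReader(f))


def average_by_group(rows, group_col, value_col):
    """Return {group: mean of value_col}.

    Rows whose group or value is blank, or whose value is not numeric,
    are skipped.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        group = (row.get(group_col) or "").strip()
        raw = (row.get(value_col) or "").strip()
        if not group or not raw:
            continue
        try:
            value = float(raw)
        except ValueError:
            continue
        totals[group] += value
        counts[group] += 1
    return {g: totals[g] / counts[g] for g in totals}


def write_averages(averages, path, group_col, value_col):
    """Write one output line per group, changing the unit of analysis."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow([group_col, "avg_" + value_col])
        for group in sorted(averages):
            writer.writerow([group, round(averages[group], 2)])


# Example usage (file names are placeholders):
# rows = read_rows("Donors_Data.csv")
# averages = average_by_group(rows, GROUP_COL, VALUE_COL)
# write_averages(averages, "avg_home_value_by_wealth.csv", GROUP_COL, VALUE_COL)
```

The output file has one line per wealth type rather than one line per donor, which is the change in unit of analysis the assignment asks for. A pandas version would do the same grouping with `groupby` on the group column.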
