Question
Project description:
The assignment for this project is to pose an interesting statistical question that can be researched
or developed; collect a relevant data set (using the links below); and use the data, in conjunction
with the tools we have learned in this class, to answer the question you have posed.
Make sure to talk about any uncertainty that arises in answering your question. Also, address any
shortcomings in the answer provided by your data and analysis. You will be evaluated both on
the technical correctness and the overall intellectual quality of your approach and write-up.
Pre-Work:
1- What is your statistical question?
2- Are you conducting a survey or an experiment?
3- Define the variables and any anticipated confounding variables or bias.
Dive in:
1- Find/collect data.
2- Conduct data analysis.
3- Calculate.
4- Graph.
Reflect:
1- Interpret the results.
2- Did the results support your hypothesis, or was it rejected?
3- Why did you choose this topic?
Present:
In class: poster, May 5th.
We will have class meetings (peer circles) during every phase, to make sure you are on time
and your questions are answered. Peer-circles and phases' deadlines will be published on
Blackboard.
Links for suitable datasets:
1- http://www.lock5stat.com/datapage1e.html (dataset description and helpful tools can be
found here https://sites.google.com/site/lock5stat/home/dataset-documenation)
2- https://www.kaggle.com/datasets
3- https://data.census.gov/cedsci/
Project outline:
1- Introduction (1-2 paragraphs + a table):
The introduction should include the following:
a) Your statistical question- your question needs to be approved by me before you start with the
project. Define your cases and the variables you need to collect to answer your question.
b) The reason you chose that question- Why is this question important to you?
c) A glimpse of your dataset (6 rows only)- This needs a table or snapshot from your csv
document.
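One way to produce the six-row glimpse is sketched below in Python, using only the standard library. The column names and values in `hypothetical_csv` are invented placeholders, not real data; in practice you would open your own csv file instead of the in-memory text used here.

```python
import csv
import io


def preview_rows(csv_file, n_rows=6):
    """Return the header and the first n_rows data rows of an open CSV file."""
    reader = csv.reader(csv_file)
    header = next(reader)
    # zip stops after n_rows rows, or earlier if the file is shorter.
    rows = [row for _, row in zip(range(n_rows), reader)]
    return header, rows


def format_table(header, rows):
    """Format the header and rows as left-aligned, space-padded text columns."""
    widths = [max(len(cell) for cell in column) for column in zip(header, *rows)]
    lines = [
        "  ".join(cell.ljust(width) for cell, width in zip(line, widths))
        for line in [header, *rows]
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical placeholder data, standing in for a real csv file.
    hypothetical_csv = io.StringIO(
        "case,hours_sleep,year\n"
        "1,6.5,first\n2,7.0,second\n3,5.5,first\n4,8.0,third\n"
        "5,7.5,second\n6,6.0,first\n7,7.0,third\n"
    )
    header, rows = preview_rows(hypothetical_csv)
    print(format_table(header, rows))
```

With a real file, the call would look like `preview_rows(open("your_data.csv", newline=""))`, where the file name is whatever your dataset is called.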
2- Collecting Method (1-2 paragraphs)
In here, include:
a) How you collected your data.
b) Any challenges you faced in doing so.
c) Whether you believe your sample is random. Explain.
If you used an online dataset, state your website and describe what the website is about.
3- Procedure (3-5 paragraphs):
In here, include:
a) The sample statistic: its notation and value (mean, proportion, etc.).
b) Visual description of your dataset.
c) A 95% confidence interval: describe how you obtained it, include an image of the bootstrap
distribution you created, and interpret the interval. Make sure you identify the correct
population parameter.
d) A p-value: state your hypotheses (null and alternative), include an image of your
randomization distribution, and report your p-value.
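The bootstrap interval and randomization p-value asked for in parts c) and d) can be sketched in Python as follows, for the case of a single mean. This is only an illustration under stated assumptions: the values in `hypothetical_sample` are invented, the interval uses the percentile method, and the randomization distribution is built by shifting the sample so that its mean equals the null value and then resampling with replacement. Other parameters (a proportion, a difference in means) call for a different randomization scheme, and the function names here are hypothetical.

```python
import random
import statistics


def bootstrap_means(sample, n_resamples=5000, seed=0):
    """Return the means of n_resamples bootstrap samples drawn with replacement."""
    rng = random.Random(seed)
    n = len(sample)
    return [statistics.fmean(rng.choices(sample, k=n)) for _ in range(n_resamples)]


def percentile_interval(values, level=0.95):
    """Percentile interval: cut (1 - level) / 2 of the values off each tail."""
    ordered = sorted(values)
    tail = (1 - level) / 2
    lower = ordered[int(tail * len(ordered))]
    upper = ordered[int((1 - tail) * len(ordered)) - 1]
    return lower, upper


def randomization_p_value(sample, null_mean, n_resamples=5000, seed=0):
    """Two-sided p-value for H0: population mean equals null_mean.

    The sample is shifted so its mean equals null_mean, then resampled with
    replacement to simulate sample means in a world where H0 is true.
    """
    rng = random.Random(seed)
    n = len(sample)
    observed = statistics.fmean(sample)
    shifted = [x + (null_mean - observed) for x in sample]
    observed_gap = abs(observed - null_mean)
    extreme = 0
    for _ in range(n_resamples):
        simulated = statistics.fmean(rng.choices(shifted, k=n))
        # Count simulated means at least as far from null_mean as the observed mean.
        if abs(simulated - null_mean) >= observed_gap:
            extreme += 1
    return extreme / n_resamples


# Invented values for illustration only (e.g. hours of sleep); not real data.
hypothetical_sample = [6.5, 7.0, 5.5, 8.0, 7.5, 6.0, 7.0, 6.5, 8.5, 7.0, 6.0, 7.5]

if __name__ == "__main__":
    boot = bootstrap_means(hypothetical_sample)
    low, high = percentile_interval(boot)
    print("sample mean:", statistics.fmean(hypothetical_sample))
    print("95% bootstrap percentile interval:", (low, high))
    print("p-value for H0: mean = 7.5:", randomization_p_value(hypothetical_sample, 7.5))
```

The list returned by `bootstrap_means` is the bootstrap distribution, so a histogram of it is the image requested in part c). Fixing `seed` makes the result reproducible, which helps when the report quotes specific numbers.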
4- Conclusion (1-2 paragraphs):
In here, include:
a) Did the result match your expectation? Explain.
b) Were you able to conduct your inference as you wanted, or did you face any limitations (for
example, the sample was small or not random, or the bootstrap distribution was not symmetric)?
c) Are there other questions you wish to investigate further in the future?
To turn in:
1- A written project report. A reasonable length is 4-6 typewritten pages, including figures,
but treat this as a rough guideline rather than an absolute quota or limit. If you have lots
of figures and tables, you might easily go over 6 pages.
2- The dataset you used.
3- Include the citations of any resources you use.
4- All materials will be submitted on Blackboard under the "Project Folder."