Question
Here is my quetsion of the project: [ Overview and Rationale Data mining is used to reveal hard to see and hidden patterns and relationships
Here is my quetsion of the project: [Overview and Rationale
Data mining is used to reveal hard to see and hidden patterns and relationships in Big Data datasets. Data mining helps to classify data for further examination or create models to predict outcomes for a different set of data. As data miners, you should be able to explain how the code used to mine the data is functioning and be able to analyze and interpret the results of the mining. This allows you to summarize and clarify the results for stakeholders.
Assignment Description
Many people forage for mushrooms and sell them to restaurants or use them for their own consumption. These are experts who know their mushroom. However, as a novice, it is important to be able to spot a poisonous mushroom.
In this assignment, you will use the data set provided to mine the data using the methods presented in this module. You will document in a report the results of each step of the mining process, analyze and interpret the results. Suggest the characteristics to use when determining if a mushroom is safe to eat. Make recommendations for additional analysis and variables to examine to build other classifications such as use of the mushrooms that are not poisonous.
mushrooms.xlsx Download mushrooms.xlsx
Instructions
Your report should include the following:
- Code walk through: in this section provide a step by step explanation of how the code is interacting with and/or transforming the data. Provide examples from the output to support your explanations.
- Analysis: Based on the output, analyze the data and the relationships revealed about the variables of interest. Explains the insights provided by the output. Use visualizations to support your analysis.
- Interpretation and Recommendations: Interpret the results of your analysis and explain what the results mean for the data owner. Provide recommendations for actions to be taken based on your interpretation. Support those with the data. Explain why and what explicit variables you suggest incorporating. For example, median income by city and state from the census.gov website might be useful for examining home ownership.
] here is my code: [import pandas as pd
mushrooms = pd.read_excel('/content/mushrooms.xlsx', header=None)
] this is the output: [0 1 2 3 4 5 6 \ 0 class cap-shape cap-surface cap-color bruises odor gill-attachment 1 p x s n t p f 2 e x s y t a f 3 e b s w t l f 4 p x y w t p f 7 8 9 ... 13 \ 0 gill-spacing gill-size gill-color ... stalk-surface-below-ring 1 c n k ... s 2 c b k ... s 3 c b n ... s 4 c n n ... s 14 15 16 17 \ 0 stalk-color-above-ring stalk-color-below-ring veil-type veil-color 1 w w p w 2 w w p w 3 w w p w 4 w w p w 18 19 20 21 22 0 ring-number ring-type spore-print-color population habitat 1 o p k s u 2 o p n n g 3 o p n n m 4 o p k s u [5 rows x 23 columns]] can you please hep me how and what should I do? i should do it in python, please give me the step by step coding in python that what needed to do? Thanks
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started