Question
a) Study the "dataframe" below and answer the questions that follow.
Column1 Column2 Column3 Column4 Column5 Column6
Name1 Alpha 12 24 540 Alpha
Name2 Beta 16 32 511 Beta
Name3 Alpha 52 104 320 Gamma
Name4 Beta 36 72 841 Delta
Name5 Beta 45 90 320 Phi
Name6 Alpha 12 24 120 Zeta
Name7 Beta 32 64 641 Sigma
Name8 Alpha 42 84 540 Mu
Name9 Alpha 56 112 311 Eta
i. Give code in R or Python that calculates the correlation between "Column2" and "Column3" of the "dataframe". [1 Mark]
ii. The above dataset has been loaded for you in R or Python in a variable named "dataframe". Give code that selects only the rows for which the parameter is Alpha. [1 Mark]
iii. Most work in Python or R uses the system's internal memory, and with large datasets the Python or R workspace may not be able to hold all the data in memory. Removing unused objects is one solution. Give a command that removes the rows with the value "Beta". [1 Mark]
iv. State and explain the techniques and tools (R or Python packages) used to preprocess data so that it is ready for data mining. [5 Marks]
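One possible way to approach parts i to iv in Python with pandas is sketched below. This is not an official answer. The small table built here is a hypothetical stand-in for "dataframe": it assumes that Name1 to Name9 are the row index, that Column1 holds the Alpha/Beta label, and that Column2 and Column3 are the first two numeric fields of each row. The flattened table in the question does not make that layout certain, so adjust the column names if your data differs. The preprocessing steps shown for part iv (dropping missing values, one-hot encoding, min-max scaling) are only a few common examples of such techniques.

```python
import pandas as pd

# Hypothetical stand-in for the question's "dataframe" (layout is an assumption):
# Name1..Name9 as index, Column1 = Alpha/Beta label, Column2/Column3 numeric.
dataframe = pd.DataFrame(
    {
        "Column1": ["Alpha", "Beta", "Alpha", "Beta", "Beta",
                    "Alpha", "Beta", "Alpha", "Alpha"],
        "Column2": [12, 16, 52, 36, 45, 12, 32, 42, 56],
        "Column3": [24, 32, 104, 72, 90, 24, 64, 84, 112],
    },
    index=[f"Name{i}" for i in range(1, 10)],
)

# i. Pearson correlation between two numeric columns.
correlation = dataframe["Column2"].corr(dataframe["Column3"])

# ii. A boolean mask keeps only the rows whose label is "Alpha".
alpha_rows = dataframe[dataframe["Column1"] == "Alpha"]

# iii. The inverse mask drops every row whose label is "Beta".
without_beta = dataframe[dataframe["Column1"] != "Beta"]

# iv. A few common preprocessing steps:
cleaned = dataframe.dropna()                            # remove rows with missing values
encoded = pd.get_dummies(cleaned, columns=["Column1"])  # one-hot encode the label column
numeric = cleaned[["Column2", "Column3"]]
scaled = (numeric - numeric.min()) / (numeric.max() - numeric.min())  # min-max scale to [0, 1]

print(correlation)
print(alpha_rows)
print(without_beta)
print(encoded.head())
print(scaled.head())
```

For part iii, assigning the filtered result back to the same name (dataframe = dataframe[dataframe["Column1"] != "Beta"]) leaves the original table unreferenced, so Python can reclaim its memory. An object that is no longer needed at all can be released with del.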