Question

For the following questions, please use R commands to find solutions when
applicable. Please use Jupyter Notebook and submit Notebook File (.ipynb) of
your notebook, including all code and results.
Please follow the steps on slides to set up your anaconda. You should be able to
run the class Jupyter notebook successfully.
Try to resolve any issues you may have. I encourage you to discuss the
installation and help each other. Google is also a good resource.
It is your job to set up the environment on your computer. I am not familiar with
your computer so I won't do your homework for you.
You will take full responsibility for your dev environment during exams, i.e., if
your Jupyter doesn't work during exams, you need to know how to fix it quickly or
you will lose time.
1.20 The built-in data set islands contains the size of the world's land masses that
exceed 10,000 square miles. Use sort() with the argument decreasing=TRUE to
find the seven largest land masses.
For example, the expected solution is
> sort(islands, decreasing=TRUE)[1:7]
1.21 Load the data set primes (UsingR). This is the set of prime numbers in
[1, 2003]. How many are there? How many in the range [1, 100]? [100, 1000]?
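A minimal sketch of the counting approach for 1.21. It assumes UsingR may not be installed, so it builds a stand-in vector with a hypothetical helper sieve() rather than loading the real data set; with the package available, data(primes, package = "UsingR") would replace that step. Whether the range endpoints are inclusive is also an assumption.

```r
# Hypothetical helper (not part of UsingR): sieve of Eratosthenes up to `limit`.
sieve <- function(limit) {
  is_prime <- rep(TRUE, limit)
  is_prime[1] <- FALSE
  for (i in 2:floor(sqrt(limit))) {
    if (is_prime[i]) is_prime[seq(i * i, limit, by = i)] <- FALSE
  }
  which(is_prime)
}

# Stand-in for the UsingR data set: all primes in [1, 2003].
primes <- sieve(2003)

# Total number of primes in the vector.
n_total <- length(primes)

# A logical comparison gives TRUE/FALSE per element; sum() counts the TRUEs.
n_1_100 <- sum(primes >= 1 & primes <= 100)
n_100_1000 <- sum(primes >= 100 & primes <= 1000)

print(c(total = n_total, in_1_100 = n_1_100, in_100_1000 = n_100_1000))
```

The key idea is that sum() applied to a logical vector counts how many elements satisfy the condition.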
1.22 Load the data set primes (UsingR). We wish to find all the twin primes. These
are numbers p and p+2, where both are prime.
Explain what primes[-1] returns. If you set n = length(primes), explain what primes[-n] returns.
Why might primes[-1] - primes[-n] give clues as to what the twin primes
are?
How many twin primes are there in the data set?
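The negative-index idea in 1.22 can be sketched as follows. As a labeled assumption, the code again uses a stand-in vector built by a hypothetical sieve() helper instead of the UsingR data set, so it runs without the package.

```r
# Hypothetical helper (not part of UsingR): sieve of Eratosthenes up to `limit`.
sieve <- function(limit) {
  is_prime <- rep(TRUE, limit)
  is_prime[1] <- FALSE
  for (i in 2:floor(sqrt(limit))) {
    if (is_prime[i]) is_prime[seq(i * i, limit, by = i)] <- FALSE
  }
  which(is_prime)
}

primes <- sieve(2003)  # stand-in for data(primes, package = "UsingR")
n <- length(primes)

# primes[-1] drops the first element; primes[-n] drops the last.
# Subtracting them lines each prime up with its successor, giving the gaps.
gaps <- primes[-1] - primes[-n]

# A gap of exactly 2 marks a twin-prime pair (p, p + 2).
twin_start <- primes[-n][gaps == 2]
n_twins <- sum(gaps == 2)

print(head(twin_start))  # smaller member of the first few pairs
print(n_twins)
```

The subtraction is equivalent to diff(primes); writing it with negative indices just makes the alignment of neighbours explicit.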
1.23 For the data set treering, which contains tree-ring widths in dimensionless
units, use an R function to answer the following:
How many observations are there?
Find the smallest observation.
Find the largest observation.
How many are bigger than 1.5?
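One possible way to answer the four parts of 1.23, using only base R functions on the built-in treering data set (the variable names are illustrative):

```r
# treering ships with base R (package datasets), so no loading is needed.
n_obs <- length(treering)        # number of observations
smallest <- min(treering)        # smallest ring width
largest <- max(treering)         # largest ring width
n_big <- sum(treering > 1.5)     # count of widths greater than 1.5

print(c(n_obs = n_obs, smallest = smallest, largest = largest, n_big = n_big))
```

range(treering) would return the smallest and largest values in a single call.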
1.24 The data set mandms (UsingR) contains the targeted color distribution in a
bag of M&Ms as percentages for various types of packaging. Answer these
questions.
Which packaging is missing one of the six colors?
Which types of packaging have an equal distribution of colors?
Which packaging has a single color that is more likely than all the others? What
color is this?
1.25 The times variable in the data set nym.2002 (UsingR) contains the time to
finish for several participants in the 2002 New York City Marathon. Answer these
questions.
How many times are stored in the data set?
What was the fastest time in minutes? Convert this into hours and minutes using R.
What was the slowest time in minutes? Convert this into hours and minutes using R.
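The conversion asked for in 1.25 can be sketched with integer division (%/%) and the remainder operator (%%). The helper name to_hours_minutes() and the values in example_times are made up for illustration; they are not the marathon data. With UsingR installed, the real vector would come from the nym.2002 data frame instead.

```r
# Hypothetical helper: split a time in minutes into whole hours plus minutes.
to_hours_minutes <- function(minutes) {
  hours <- minutes %/% 60   # whole hours
  rem <- minutes %% 60      # leftover minutes
  sprintf("%d h %.1f min", as.integer(hours), rem)
}

# Made-up stand-in values, NOT the actual marathon results.
example_times <- c(147.5, 250.25, 566.75)

length(example_times)                         # how many times are stored
print(to_hours_minutes(min(example_times)))   # fastest time
print(to_hours_minutes(max(example_times)))   # slowest time
```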
1.26 For the data set rivers, which is the longest river? The shortest?
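A short sketch for 1.26. The built-in rivers vector stores lengths without river names, so this only reports the extreme lengths and their positions in the vector.

```r
# rivers ships with base R; it is an unnamed numeric vector of river lengths.
longest <- max(rivers)
shortest <- min(rivers)

# which.max() / which.min() give the position of the extreme value.
pos_longest <- which.max(rivers)
pos_shortest <- which.min(rivers)

print(c(longest = longest, position = pos_longest))
print(c(shortest = shortest, position = pos_shortest))
```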
1.27 The data set uspop contains decade-by-decade population figures for the
United States from 1790 to 1970.
Use names() and seq() to add the year names to the data vector.
Use diff() to find the inter-decade differences. Which decade had the greatest increase?
Explain why you could reasonably expect that the difference will always increase with each
decade. Is this the case with the data?
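The steps in 1.27 might look like the sketch below. As a design choice of this example (not something the exercise requires), the built-in time series is first copied into a plain numeric vector named pop so that names are carried through diff().

```r
# Copy the built-in uspop series into a plain numeric vector.
pop <- as.numeric(uspop)

# Label each value with its census year: 1790, 1800, ..., 1970.
names(pop) <- seq(1790, 1970, by = 10)

# Inter-decade differences; each is named after the decade's ending year.
increase <- diff(pop)
print(increase)

# Decade (by ending year) with the greatest increase.
print(names(increase)[which.max(increase)])

# TRUE only if every decade's increase is larger than the previous one.
always_growing <- all(diff(increase) > 0)
print(always_growing)
```

Applying diff() a second time is one convenient way to test whether the increases themselves keep increasing.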