Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 02, 2023

Determine what hours of the day most checkins occur. Create a variable hours_by_checkin_count. This should be a PySpark DataFrame The DataFrame should be ordered

In [31]: assert type (hours_by_checkin_count) == pyspark.sql.dataframe.DataFrame,

Determine what hours of the day most checkins occur. Create a variable hours_by_checkin_count. This should be a PySpark DataFrame The DataFrame should be ordered by count and contain 24 rows The DataFrame should have these columns (in this order): hour (the hour of the day as an integer, the hour after midnight being 0) count (the number of checkins that occurred in that hour) Note that the date column in the checkin data is a string with multiple date times in it. You'll need to split that string before parsing. In [33]: # YOUR CODE HERE from pyspark.sql.functions import hour, split, col #Split the date column in the checkin data and extract the hour from it. checkin "1 ").getItem(1)) checkin.withColumn ("hour", split(col ("date"), checkin= checkin.withColumn("hour", split(col ("hour"), ":").getItem(0).cast("int")) #Group by hour and count the number of checkins in each hour. hours_by_checkin_count = checkin.groupBy("hour").count().orderBy("count", ascending=False) hours_by_checkin_count.show() #raise NotImplementedError() [Stage 39:> +--- | hour count | +----+-----+ 1 19|13481| 23 13207| 22|13191| 18 13177 21 12960 20 12553 17|12304| 0|11577 | 16 10416 1 9803 | 2 7258 15 7000| 3 5225 14 4340 (0 + 1) / 1] In [31]: assert type (hours_by_checkin_count) == pyspark.sql.dataframe.DataFrame, \ "The hours_by_checkin_count variable should be a Spark DataFrame.' assert hours_by_checkin_count.columns == ["hour", "count"], \ "The columns are not in the correct order." submitted = AutograderHelper.parse_spark_dataframe (hours_by_checkin_count) In [32]: # Autograder cell. This cell is worth 1 point (out of 20). This cell does not contain hidden tests. assert len(submitted) == 24, \ "The hours_by_checkin_count DataFrame must have 24 rows." assert submitted [ "hour" ][0] == 1, \ 'The first row should have hour 1' AssertionError Cell In [32], line 6 1 # Autograder cell. This cell is 3 assert len (submitted) == 24, \ 4 11 Traceback (most recent call last) worth 1 point (out of 20). This cell does not contain hidden tests. "The hours_by_checkin_count DataFrame must have 24 rows." > 6 assert submitted [ "hour"][0] == 1, \ 7 'The first row should have hour 1' AssertionError: The first row should have hour 1 In [18] #Autograder cell. This cell is worth 4 points (out of 20). This cell contains hidden tests.

Step by Step Solution

★★★★★

3.50 Rating (157 Votes )

There are 3 Steps involved in it

Step: 1

appears that youre working with PySpark and trying to analyze checkin data to determine the ... blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Cost Management Measuring Monitoring And Motivating Performance

Authors: Leslie G. Eldenburg, Susan Wolcott, Liang Hsuan Chen, Gail Cook

2nd Canadian Edition

1118168879, 9781118168875

More Books

Students also viewed these Programming questions

Question

Graph the system of inequalities. Then find the coordinates of the vertices of the shaded region. 4y-x 4 y+5x -3 Use the graphing tool to graph the system. Click to enlarge graph What is/are the...

Answered: 1 week ago

Question

New products are being introduced to the marketplace at a rapid pace, and consumer trends seem to be changing faster. What are the most important factors that have shifted the demand for and supply...

Answered: 1 week ago

Question

★★★★★

Managing Scope Changes Case Study Scope changes on a project can occur regardless of how well the project is planned or executed. Scope changes can be the result of something that was omitted during...

Answered: 1 week ago

Question

★★★★★

Presented here are summarized data from the balance sheets and income statements of Wiper Inc.: WIPER INC. Condensed Balance Sheets December 31, 2020, 2019, 2018 (in millions) 2020 2019 Current...

Answered: 1 week ago

Question

★★★★★

Selected transactions for Rojas Company are presented below in journal form (without explanations). Post the transactions to Taccounts. Account Title Debit Credit Date May 5 Accounts Receivable...

Answered: 1 week ago

Question

★★★★★

Explain why it is important to understand consumer behavior.

Answered: 1 week ago

Question

★★★★★

Define self-discipline and cite its benefits.

Answered: 1 week ago

Question

★★★★★

When rental cars are sold on the used car market, they are sold for lower prices than cars of the same model and year that were owned by individual owners. Does this price difference reflect adverse...

Answered: 1 week ago

Question

★★★★★

A straight 5% coupon bond has two years remaining to maturity and is priced at $981.67. A callable bond that is the same in every respect as the straight bond, except for the call feature, is priced...

Answered: 1 week ago

Question

★★★★★

Using the Public MACRO BITCOIN scorecard spreadsheet (linked in its associated masterclass lesson - Long Term 32), create a COPY of it and perform a complete analysis for the date 22/2/2022....

Answered: 1 week ago

Question

★★★★★

Mortality risk is a speculative risk. pure risk. liability risk. comprehensive risk

Answered: 1 week ago

Question

★★★★★

Contrary to the strategy of maintaining a large inventory adopted by most American retailers, 7-11 is more selective in the products that they put on shelves. 7-11 chooses to shelve products that are...

Answered: 1 week ago

Question

★★★★★

AApproximate air as a mixture of 7 8 % Nitrogen and 2 2 % oxygen ( by moles ) . Calculate the followings for air using values you find in ( A . 5 ) . Be careful about units. ( a ) Specific gas...

Answered: 1 week ago

Question

★★★★★

(7) (2 pts) Location update is an important task to perform as a mobile unit moves around in a mobile IP network. If we use indirect-routing, new location update only needs to be done at the home...

Answered: 1 week ago

Question

★★★★★

A 8 B = 0.25, 0.50 105 0 The uniform slender bar has an ideal roller at its upper end A. Determine the minimum value of the angle e for which equilibrium is possible for us = 0.25 in degrees round...

Answered: 1 week ago

Question

★★★★★

Figures 2 show a statically determinate beam under distributed loading conditions where C is the constant. q(x) = C(2x- - x L Figure 2 A RA L RB a) Determine the resultant of the distributed load...

Answered: 1 week ago

Question

★★★★★

Write a skeleton for a "For loop" with decreasing counter.

Answered: 1 week ago

Question

★★★★★

Repeat Exercise 16.6 using the t-test of the coefficient of correlation. Is this result identical to the one you produced in Exercise 16.6?

Answered: 1 week ago

Question

★★★★★

Refer to the information from Problem 6.46. Information from Problem 6.46 Physical Units Beginning WIP (25% complete) ...................................................11,000 Started during January...

Answered: 1 week ago

Question

★★★★★

CICA Handbook Section 5135, "The Auditor's Responsibility to Consider Fraud," requires auditors to plan and perform an audit to obtain reasonable assurance about whether the financial statements are...

Answered: 1 week ago

Question

★★★★★

What factors need to be considered when setting a selling price?

Answered: 1 week ago

Question

★★★★★

T F Most people with multiple personalities had normal and uneventful childhoods. (p. 214)

Answered: 1 week ago

Question

★★★★★

T F The term hysteria derives from the Greek word for testicle. (p. 225)

Answered: 1 week ago

Question

★★★★★

1. Why should we not accept claims of recovered memories at face value? A high-level business executives comfortable life fell apart one day when his 19-year-old daughter accused him of having...

Answered: 1 week ago

Previous Question Next Question