Question

Posted on Sep 22, 2024

Consider the following Spark code, which constructs a DataFrame.

    a = [('Chris', 'Budweiser', 15), ('Chris', 'Becks', 5), ('Chris', 'Heineken', 2),
         ('Bob', 'Becks', 15), ('Bob', 'Budweiser', 10), ('Bob', 'Heineken', 2),
         ('Alice', 'Heineken', 8)]
    rdd = sc.parallelize(a)
    df = sqlContext.createDataFrame(rdd, ['drinker', 'beer', 'score'])
    sqlContext.registerDataFrameAsTable(df, "drinkers")

How can we get the total score of each beer brand?

We want to have the following answer from the above example:

(Your printout may differ; the values are what matter.)

    beer = 'Becks', total score = 20
    beer = 'Budweiser', total score = 25
    beer = 'Heineken', total score = 12
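
The expected totals can be checked by hand without Spark. The following is a minimal plain-Python sketch, not the Spark answer itself: the helper name total_score_per_beer is hypothetical, and the rows are copied from the list in the question.

```python
from collections import defaultdict

# Same rows as the list in the question: (drinker, beer, score).
rows = [
    ("Chris", "Budweiser", 15), ("Chris", "Becks", 5), ("Chris", "Heineken", 2),
    ("Bob", "Becks", 15), ("Bob", "Budweiser", 10), ("Bob", "Heineken", 2),
    ("Alice", "Heineken", 8),
]


def total_score_per_beer(records):
    """Sum the score for each distinct beer, ignoring the drinker column."""
    totals = defaultdict(int)
    for _drinker, beer, score in records:
        totals[beer] += score
    return dict(totals)


totals = total_score_per_beer(rows)
for beer in sorted(totals):
    print(f"beer = '{beer}', total score = {totals[beer]}")
```

Becks is 5 + 15, Budweiser is 15 + 10, and Heineken is 2 + 2 + 8, which matches the totals listed in the question.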

(Multiple choice, with negative scores for wrong answers.)

A. df.drop('drinker').groupByKey('beer').reduceByKey(add).collect()

B. df.drop('drinker').groupBy('beer').agg({'score': 'sum'}).collect()

C. df.drop('drinker').groupByKey('beer').map(lambda a, b: a + b).top()

D. df.filter('drinker').groupBy('beer').agg({'score': 'sum'}).collect()

E. df.filter('drinker').groupByKey('beer').reduceByKey(add).collect()

F. sqlContext.sql("SELECT beer, sum(score) from drinkers GROUP BY drinker").collect()

G. df.filter('drinker').groupByKey('beer').agg({'score': 'sum'}).collect()

H. df.drop('drinker').groupByKey('beer').map(lambda a, b: a + b).collect()

I. sqlContext.sql("SELECT beer, sum(score) from drinkers GROUP BY beer").collect()
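
The two SQL options differ only in the GROUP BY column. As a rough illustration of why that matters, here is a sketch that uses Python's built-in sqlite3 module as a stand-in for Spark SQL. That substitution is an assumption made so the example runs without a cluster, and SQLite does not behave identically to Spark in every respect. The second query selects drinker rather than beer so that it stays valid standard SQL.

```python
import sqlite3

# Same rows as the list in the question: (drinker, beer, score).
rows = [
    ("Chris", "Budweiser", 15), ("Chris", "Becks", 5), ("Chris", "Heineken", 2),
    ("Bob", "Becks", 15), ("Bob", "Budweiser", 10), ("Bob", "Heineken", 2),
    ("Alice", "Heineken", 8),
]

# In-memory table named like the one registered in the question.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE drinkers (drinker TEXT, beer TEXT, score INTEGER)")
conn.executemany("INSERT INTO drinkers VALUES (?, ?, ?)", rows)

# Grouping by beer gives one total per beer brand.
by_beer = conn.execute(
    "SELECT beer, SUM(score) FROM drinkers GROUP BY beer ORDER BY beer"
).fetchall()

# Grouping by drinker gives one total per person instead.
by_drinker = conn.execute(
    "SELECT drinker, SUM(score) FROM drinkers GROUP BY drinker ORDER BY drinker"
).fetchall()

conn.close()

print(by_beer)     # [('Becks', 20), ('Budweiser', 25), ('Heineken', 12)]
print(by_drinker)  # [('Alice', 8), ('Bob', 27), ('Chris', 22)]
```

Only the grouping on beer reproduces the totals the question asks for; grouping on drinker sums each person's scores across all beers.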
