Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

There is a collection of e - commerce transactions that occurred in NSW in 2 0 2 0 , which records transaction ID , seller

There is a collection of e-commerce transactions that occurred in NSW in 2020, which records transaction ID, seller ID, buyer ID, date of transaction, suburb of buyer, total price of the transaction. Now you are asked to write MapReduce programs in Python. The programs need to complete the following two tasks.
Task 1: count the total number of transactions in each suburb and in each month, and
Task 2: generate a sorted list of (month, suburb, # of all transactions in the suburb). This list is ordered first by chronological order of months, then by descending order of # of all transactions in each suburb.
A sample output can be found below:
December Marsfield 51,198
December Eastwood 50,799
December Epping 49,649
...
July Chatswood 61,348
July Lane Cove 51,002
July Blacktown 50,834
...
January Macquarie Park 38,146
January Epping 28,844
January Eastwood 25,491
(1) Introduce the steps in each task and explain where and how to use MapReduce model in each task.
(2) Draw a diagram to introduce the workflow of each task.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Readings In Database Systems

Authors: Michael Stonebraker

2nd Edition

0934613656, 9780934613651

More Books

Students also viewed these Databases questions

Question

2 6 .

Answered: 1 week ago