Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

PYTHON PANDAS QUESTION! The dataset has 50,000 rows and 12 columns: itemid, shopid, item_name, item_description, item_variation, price, stock, categor, cb_option, is_preferred, sold_count, item_creation_time Q1 (a):

PYTHON PANDAS QUESTION!

The dataset has 50,000 rows and 12 columns:

itemid, shopid, item_name, item_description, item_variation, price, stock, categor, cb_option, is_preferred, sold_count, item_creation_time

Q1 (a): How many products were created in the year 2018?

Q1 (b): Find the top 3 categories that have the greatest number of unique cross border products.

Q1 (c): Find the top 3 shops with highest revenue.

Q1 (d): Identify duplicated listings within each shop and mark those duplicated listings with True Otherwise False in a separate column called is_duplicated (If listing A and B in shop S have the exactly same product title, product detailed description, and price, both listing A and B are considered as duplicated listings)

Q1 (e): Find duplicated listings that have less than 2 sold_count and store in an excel file called duplicated_listings.xlsx

Q1 (f): Find the number of products that have more than 3 variations.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Introduction To Data Mining

Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar

1st Edition

321321367, 978-0321321367

Students also viewed these Databases questions