Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 26, 2024

I need help cleaning a dataset please provide the code it can be downloaded from here https://www.kaggle.com/tmdb/tmdb-movie-metadata/data what i have done so far below, dont

I need help cleaning a dataset please provide the code

it can be downloaded from here https://www.kaggle.com/tmdb/tmdb-movie-metadata/data

what i have done so far below, dont mind the importing because I will use the rest when I have a clean set.

from datetime import timedelta, date import datetime import numpy as np import pandas as pd import string import re import csv import requests import string

data from https://www.kaggle.com/tmdb/tmdb-movie-metadata/data df_movies = pd.read_csv('tmdb_5000_movies.csv', delimiter = ',', header = 0, skipinitialspace = True)

df_movies.drop(columns='homepage', inplace=True) df_movies.drop(columns='popularity', inplace=True) df_movies.drop(columns='overview', inplace=True) df_movies.drop(columns='status', inplace=True) df_movies.drop(columns='tagline', inplace=True) df_movies.drop(columns='vote_average', inplace=True) df_movies.drop(columns='vote_count', inplace=True) df_movies.drop(columns='id', inplace=True)

df_movies.drop(columns='id', inplace=True)

df_movies.head()

I want it so that the 'genres' column only says the genre whether it is action adventure and so on. Same goes for 'production_company' and 'production_country' and 'spoken_language'.

Then I need you to remove all rows where 'spoken_language is not english or en, and create a separate column with just the year of the movie's release, titled 'release_year' and order it by 'release-year' and then 'revenue'.

Thanks!

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Beginning Apache Cassandra Development

Beginning Apache Cassandra Development

Authors: Vivek Mishra

1st Edition

1484201426, 9781484201428

More Books

Students also viewed these Databases questions

Question

★★★★★

Why are sales letters normally longer than routine messages? What guidelines apply as to the recommended lengths for paragraphs?

Answered: 1 week ago

Question

★★★★★

Cole and Barker each own 50% of the shares of NRS Ltd., a Canadian-controlled private corporation. NRS had conducted a small active business, which was closed down two years ago, in late 20X0.The...

Answered: 1 week ago

Question

★★★★★

=+ Employers associations: What is the nature and role of the employers associations in each country?

Answered: 1 week ago

Question

★★★★★

On January 1, 2014, Ellen Greene Company makes the two following acquisitions. 1. Purchases land having a fair value of $200,000 by issuing a 5-year, zero-interest-bearing promissory note in the face...

Answered: 1 week ago

Question

★★★★★

I need help cleaning a dataset please provide the code it can be downloaded from here https://www.kaggle.com/tmdb/tmdb-movie-metadata/data what i have done so far below, dont mind the importing...

Answered: 1 week ago

Question

★★★★★

hello I need help with these 6 managerial finance questions. if someone could help me which each question step by step thank you. if more information is provided please let me know question 1...

Answered: 1 week ago

Question

★★★★★

Games Galore has provided its condensed financial statements for the year ended December 31, 2016. The Controller has asked you to calculate liquidity, solvency, and profitability ratios that...

Answered: 1 week ago

Question

★★★★★

Games Galore has provided its condensed financial statements for the year ended December 31, 2016. The Controller has asked you to calculate liquidity, solvency, and profitability ratios that...

Answered: 1 week ago

Question

★★★★★

4-82 The demand for water use in Phoenix in 2003 hit a high of about 442 milion gallons per day on June 27. Water use in the summer is normally distributed with a mean of 310 million gallons per day...

Answered: 1 week ago

Question

★★★★★

Required information Skip to question [The following information applies to the questions displayed below.] Lauder Company manufactures and distributes various fixtures used primarily in new building...

Answered: 1 week ago

Question

★★★★★

Overview Most business are forced to evaluate opportunities for capital investments to allocate scarce funds. While there are many ways to evaluate the wisdom of a capital investment, one such way is...

Answered: 1 week ago

Question

★★★★★

How many Tables Will Base HCMSs typically have? Why?

Answered: 1 week ago

Question

★★★★★

What is the process of normalization?

Answered: 1 week ago

Question

★★★★★

What is Notation in Data Modeling, and what is the most common Notation Type used?

Answered: 1 week ago

Previous Question Next Question