
Question


The answer should be relevant to the field of data science, where data repositories are datasets (such as a meteorite CSV). Please help choose relevant datasets and answer all parts of the question.

Project A: Search and Discover [points]

1. Find one or more data repositories that were not already discussed in class. [5]

[4] Describe it in a few sentences: what is the purpose of the repository, what clientele does it serve, how big is it, what time frame does it span, and how does it serve its clients? [1] If possible, what might it be lacking, or how could it be improved?

An ideal repository is very large, comprehensive, possibly spans a large time frame, may include multiple aspects, is public, and has downloadable data, but can also be queried via the internet.

Example: The website imdb.com is the world's largest movie database. It contains information about every movie made in every country. It also contains comprehensive information about actors and actresses, directors, producers, and other individuals who go into making a movie. It can be queried and reportedly can be used in software (add details).

2. Generate [2] five straightforward questions (= answered by a simple query) and [3] five meaningful analytical questions (= requiring more than one query) that have not been asked with regard to the data repository you reported above.

[3] For each complex question: why is the question novel (not on the internet)? Why is the answer to your question(s) useful or impactful? [2] Do one or more of your questions require collection of fresh data? If so, what? [2] How can this fresh data be collected, and [1] how should you go about finding the answer to your question?

Example: Is there a correlation between the number of Academy Award winning actors/actresses born in a certain state and the number of degree programs in theater and drama in that state?

3. Find (an) excellent, relevant visual communication of data analysis on the web. [2] Explain why it is effective.

Example: "Temperature Circle" communicates how warming is a real phenomenon and is "global," not localized to some parts of the world. This is achieved by animating global temperature data over 100 years.
You already installed Anaconda (or equivalent). Look for enterprise versions on the cloud (AWS, GCP, Azure); no need to submit.

Please don't use the movie dataset given as the example in the question. Programming is not required; a theoretical answer is enough. Thanks.


