Answered step by step
Verified Expert Solution
Question
1 Approved Answer
the answer should be relevant to the field of data science where data repositories are datasets (like meteorite csv etc.)thereby as per the question plz
the answer should be relevant to the field of data science where data repositories are datasets (like meteorite csv etc.)thereby as per the question plz help choose relevant datasets and answer all question parts
Project A: Search and Discover [points] 1. Find one or more data repositories that were not already discussed in class. [5] [4] Describe it in a few sentences: what is the purpose of the repository, the clientele it serves, how big is it, what time frame it spans, how it serves its clients and [1] (if possible) what it might be lacking or how it could be improved. An ideal repository is very large, comprehensive, possibly spans a large time frame, may include multiple aspects, is public, has downloadable data, but can also be queried via the internet. Example. The website imdb.com is the world's largest movie database. It contains information about every movie made in every country. It also contains comprehensive information about actors and actresses, the directors, producers, and other individuals that go into making a movie. It can be queried and reportedly can be used in software (add details). 2. Generate [2] five straightforward questions (= answered by a simple query) and [3] five meaningful analytical questions that (= require more than one query and) have not been asked with regard to the data repository you reported above. [3] For each complex question: why is the question novel (not on the internet). Why is the answer to your question(s) useful or impactful? [2] Does one or more of your questions require collection of fresh data? If so, what? [2] How can this fresh data be collected and [1] how should you go about finding the answer to your question? Example: is there a correlation between the number of Academy Award winning actors/actresses born in a certain state and the number of degree programs in theater and drama in that state. 3. Find (an) excellent, relevant visual communication of data analysis on the web. [2] Explain why it is effective. Example: Temperature Circle" communicates how warming is a real phenomenon and is "global, not localized to some parts of the world. This is achieved by animating global temperature data over a 100 years. You already installed Anaconda (or equivalent). Look for enterprise versions on the cloud (AWS, GCP, Azure) - no need to submit. don't take movies data set as said in question Programming is not required,just go upon theoretical bases. thanksStep by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started