Question
If the data set is diverse, data mining results may not be accurate. Is this a true or false statement? * 2 points 1. True
If the data set is diverse, data mining results may not be accurate. Is this a true or false statement? *
2 points
1. True
2. False
Which below is a Front-End tool used in a Data Warehouses top tier? *
2 points
1. Data Mining Tools
2. Analysis Tools
3. Reporting Tools
4. All of the above
What is the ultimate goal of big data collection? *
2 points
1. Create cool visualizations
2. Gain knowledge to make fact based decisions
3. Collect as much data as possible
4. All of the above
Which module was created to allow for interactive and stream processing in a Hadoop ecosystem? *
2 points
1. MapReduce
2. YARN
3. HDFS
4. None of the above
In a data warehouse, which of the below is responsible for organizing and putting the data in to a consistent format? *
2 points
1. Data Mart Databases
2. Operational Layer
3. External Data
4. None of the above
The computers at school use which type of storage system? *
2 points
1. Object
2. Block
3. File
4. All of the above
What turns in to a data swamp if its data integrity is compromised? *
2 points
1. Data mart
2. Data lake
3. Data warehouse
4. Data schema
When you pay your bills online, this would be an example of what type of data? *
2 points
1. Sensor
2. Unstructured
3. Transactional
4. Machine
Which company case study did the company put a strong emphasis on using big data to make real-time business decisions? *
2 points
1. Uber
2. Walmart
3. Netflix
4. Procter & Gamble
Data lakes are designed to be indiscriminate of data and allow for a schema-on-write data model. Is this a true or false statement? *
2 points
1. True
2. False
Which of the below is considered the majority of what people do when they are using the internet? *
2 points
1. Video streaming
2. Shopping habits
3. Social media
4. Music streaming
For whom it is important to work with unstructured data such as social media, video feeds and videos? *
2 points
1. Data Scientist
2. Big Data Professional
3. Data Analyst
4. None of the above
Which Data Mining Implementation Process will you likely spend the most time occupied with? *
2 points
1. Data transformation
2. Modelling
3. Deployment
4. Data preparation
Which is a tool used to make machines smarter, eliminating the human element? *
2 points
1. Data Learning
2. Sensor Learning
3. Machine Learning
4. None of the above
Which type of data does not have a predefined model? *
2 points
1. Sensor
2. Structured
3. Log
4. Unstructured
How many tiers did we discuss in class when it comes to data warehouse architecture? *
2 points
1. 4
2. 6
3. 3
4. 5
HDFS is a filesystem designed for storing very large files with streaming data access patterns, running on clusters of commodity hardware. Is this a true or false statement? *
2 points
1. True
2. False
Why is Data Visualization important? *
2 points
1. It helps people understand the significance of visualizations
2. Patterns, trends and correlations that might go undetected in text-based data can be exposed and recognized easier
3. Infographics of complex algorithms are generally easier to interpret than numerical output
4. All of the above
How many copies of each data block does Hadoop store? *
2 points
1. 3
2. 4
3. 2
4. 5
Which is very important to know or have if you wanted to be a Data Analyst? *
2 points
1. Communication and Data Visualization skills
2. Machine learning skills
3. Knowledge of programming languages R and Python
4. All of the above
Which type of data is considered unstructured? *
2 points
1. Social Data
2. Big Data
3. Transactional Data
4. All of the above
What does Hadoop provide? *
2 points
1. Ability to analyze storage
2. Data Storage and ability to analyze storage
3. Ability to analyze data and robust storage platform
4. Data and node facilities
Which type of storage is accessed over an ethernet network? *
2 points
1. SAN
2. NAS
3. LAN
4. None of the above
What percentage of data has been created in the last two years? *
2 points
1. 80%
2. 70%
3. 90%
4. 60%
Data Analytics is the science of examining raw data with the purpose of drawing conclusions about that information. Is this a true or false statement? *
2 points
1. True
2. False
Which type of data center provides large companies the ability to hold their IT infrastructure? *
2 points
1. Edge
2. Wholesale Colocation
3. Colocation
4. Hyperscale
Do we use information to create data from which we create information? *
2 points
1. Yes
2. No
Data Science encompasses of which of the below? *
2 points
1. Programming
2. Statistics
3. Problem Solving
4. All of the above
If a security camera records you walking through the metro, what type of data would this be considered? *
2 points
1. Structured & Human Generated
2. Unstructured & Human Generated
3. Structured & Machine Generated
4. Unstructured & Machine Generated
Where would you find File Level Storage in use? *
2 points
1. NAS
2. SAN
3. WAN
4. None of the above
Which type of machine learning would you use to identify data based on its characteristics? *
2 points
1. Supervised Learning
2. Unsupervised Learning
3. Reinforcement Learning
4. All of the above
During which industrial revolution did we begin the digitizing of information? *
2 points
1. 3rd
2. 6th
3. 4th
4. 2nd
Which V of Big Data do we look at the data and determine it is meaningful and useful? *
2 points
1. Volatility
2. Volume
3. Validity
4. Veracity
Which is not a cloud computing service model? *
2 points
1. SaaS
2. On-Premise
3. IaaS
4. PaaS
If you had a system which was capable of complex problem solving using multiple systems, this would be an example of what? *
2 points
1. Grid Computing
2. Edge Data Center
3. Distributed Computing
4. Data Warehouse
Which type of database has a dynamic schema? *
2 points
1. Relational
2. NoSQL
3. Star
4. Snowflake
Data mining can be performed on following types of data? *
2 points
1. Heterogeneous and legacy databases
2. Data warehouses
3. Object-oriented and object-relational databases
4. All of the above
When using Microsoft Excel which one allows you to extract significance from a large, detailed data set? *
2 points
1. Gauge Chart
2. Pivot Table
3. Histogram
4. Bar Chart
Which is a fundamental concept of cloud computing? *
2 points
1. Delivery of computing or storage as services
2. Allowing multiple users to utilize a shared computing resource
3. Access to services over a network connection
4. All of the above
Data stored outside of your standard databases such as documents on network shares which are important for organizations is known as? *
2 points
1. NAS
2. Unmanaged
3. Object
4. SAN
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started