All Matches
Solution Library
Expert Answer
Textbooks
Search Textbook questions, tutors and Books
Oops, something went wrong!
Change your search query and then try again
Toggle navigation
FREE Trial
S
Books
FREE
Tutors
Study Help
Expert Questions
Accounting
General Management
Mathematics
Finance
Organizational Behaviour
Law
Physics
Operating System
Management Leadership
Sociology
Programming
Marketing
Database
Computer Network
Economics
Textbooks Solutions
Accounting
Managerial Accounting
Management Leadership
Cost Accounting
Statistics
Business Law
Corporate Finance
Finance
Economics
Auditing
Hire a Tutor
AI Study Help
New
Search
Search
Sign In
Register
study help
computer science
fundamentals of database systems
Questions and Answers of
Fundamentals Of Database Systems
Give the chronological history of the development of MapReduce/Hadoop technology.
What is meant by veracity of data?
What are the four major characteristics of big data? Provide examples drawn from current practice of each characteristic.
What are the various types of analytics mentioned in the IBM (2014) book?
What are the important points made in the McKinsey Global Institute report of 2012?
What is data analytics and what is its role in science and industry?
Discuss the interfaces and distributed systems characteristics of Neo4j.
What is the query language for Neo4j?
What are the data modeling concepts used in the graph-oriented NOSQL system Neo4j?
Discuss the storage and distributed system methods used in Hbase.
What are the main CRUD operations in Hbase?
What are the data modeling concepts used in column-based NOSQL systems and Hbase?
Describe the consistent hashing schema for data distribution, replication, and sharding. How are consistency and versioning handled in Voldemort?
Discuss the data modeling concepts in DynamoDB.
Discuss how replication and sharding are done in MongoDB.
What are the data modeling concepts used in MongoDB? What are the main CRUD operations of MongoDB?
What are the similarities and differences between using consistency in CAP versus using consistency in ACID?
What is the CAP theorem? Which of the three properties (consistency, availability, partition tolerance) are most important in NOSQL systems?
What are the main characteristics of NOSQL systems in the areas related to distributed systems and distributed databases?
Compare the primary site method with the primary copy method for distributed concurrency control. How does the use of backup sites affect each?
When a failure occurs, the log is consulted and each operation is either undone or redone. This is a problem becausea. Searching the entire log is time consumingb. Many redos are unnecessaryc. Both
Suppose that the system crashes before the [write_item, T2, D, 25, 26] entry is written to the log in Figure 22.1(b). Will that make any difference in the recovery process?Figure 22.1(b) (b) A BCD 30
Suppose that the system crashes before the [read_item, T3, A] entry is written to the log in Figure 22.1(b). Will that make any difference in the recovery process?Figure 22.1(b) (b) A BCD 30 15 40
Discuss how disaster recovery from catastrophic failures is handled.
Describe the two-phase commit protocol for multidatabase transactions.
What do the terms steal/no-steal and force/no-force mean with regard to buffer management for transaction processing?
What are log sequence numbers (LSNs) in ARIES? How are they used? What information do the Dirty Page Table and Transaction Table contain? Describe how fuzzy checkpointing is used in ARIES.
Describe the three phases of the ARIES recovery method.
Describe the shadow paging recovery technique. Under what circumstance does it not require a log?
What is the difference between the UNDO/REDO and the UNDO/NO REDO algorithms for recovery with immediate update? Develop the outline for an UNDO/NO-REDO algorithm.
Explain the main differences between the database and IR systems mentioned in Table 27.1.Table 27.1 Table 27.1 A Comparison of Databases and IR Systems Databases IR Systems 1 Structured
Define recall and precision in IR systems.
Describe the main components of the IR system as shown in Figure 27.1.Figure 27.1 Document 3 Documont Document Corpus SEARCH INTENT Document 1 Information Need/Search Stopword
What are probabilistic and semantic models of IR?
Define the TF-IDF scheme of determining the weight of a keyword in a document. Why is it necessary to include IDF in the weight of a term?
What is the vector space model of IR? How does a vector get constructed to represent a document?
What is the Boolean model of IR? What are its limitations?
Give a brief history of IR and mention the landmark developments in this field.
Consider the COMPANY database described in Figure 5.6. Using the syntax of Oracle triggers, write active rules to do the following:a. Whenever an employee’s project assignments are changed, check
Name some digital libraries that you have accessed. What do they contain and how far back does the data go?
Consider the following rules:REACHABLE(X, Y) :– FLIGHT(X, Y)REACHABLE(X, Y) :– FLIGHT(X, Z), REACHABLE(Z, Y)where REACHABLE(X, Y) means that city Y can be reached from city X, and FLIGHT(X, Y)
What are digital libraries? What types of data are typically found in them?
Show how the following updates, if applied in sequence, would change the contents of the bitemporal EMP_BT relation in Figure 26.9. For each update, state whether it is a retroactive or proactive
What are the two main modes of interaction with an IR system? Describe and provide examples.
What is meant by navigational, informational, and transformational search?
Discuss the types of data and the types of users in today’s information retrieval systems.
Give a general definition of information retrieval (IR). What does information retrieval involve when we consider information on the Web?
What is structured data and what is unstructured data? Give an example of each from your experience.
Consider a deductive database with the following rules:ANCESTOR(X, Y) :– FATHER(X, Y)ANCESTOR(X, Y) :– FATHER(X, Z), ANCESTOR(Z, Y)Notice that FATHER(X, Y) means that Y is the father of X;
Consider the following set of facts for the relation PARENT(X, Y), where Y is the parent of X:PARENT(a, aa), PARENT(a, ab), PARENT(aa, aaa), PARENT(aa, aab),PARENT(aaa, aaaa), PARENT(aaa,
Discuss the inference mechanism based on relational operations that can be applied to evaluate nonrecursive Datalog queries.
Give examples of rules that can define relational operations SELECT, PROJECT, JOIN, and SET operations.
What is a safe rule?
What are fact-defined predicates and rule-defined predicates?
What is model-theoretic interpretation and how does it differ from proof-theoretic interpretation?
What is theorem proving, and what is proof-theoretic interpretation of rules?
Define the clausal form of formulas and Horn clauses.
Write sample rules in Prolog to define that courses with course number above CS5000 are graduate courses and that DBgrads are those graduate students who enroll in CS6400 and CS8803.
What are deductive databases?
What are the difficulties in analyzing audio sources?
How is semantic tagging of images used?
What are the different approaches to recognizing objects in images?
What important features of images are used to compare them?
How are multimedia sources indexed for content-based retrieval?
What are the different types of multimedia sources?
State the general form of a spatial association rule. Give an example of a spatial association rule.
What are the different types of spatial data mining?
Describe how a spatial join index between spatial objects can be constructed.
What are the properties of R-trees that act as an index for spatial data?
Name the main types of spatial operators and different classes of spatial queries.
What are the different types of spatial data?
How do spatial databases differ from regular databases?
What are the differences among valid time, transaction time, and bitemporal relations?
Discuss some applications of active databases.
Briefly discuss the consistency and termination problems when designing a set of active rules.
What are the differences among immediate, deferred, and detached execution of active rule actions?
What are the differences among immediate, deferred, and detached consideration of active rule conditions?
What are the differences between row-level and statement-level active rules?
Find out more about ongoing projects such as Spark, Mesos, Shark, and BlinkDB as they relate to the Berkeley Data Analysis Stack.
Discuss the concept of RDDs (resilient distributed datasets).
What are some of the challenges faced by big data technologies today?
What services can YARN offer beyond MapReduce?
What are the data locality issues related to big data applications in cloud storage?
Compare parallel relational DBMSs and the MapReduce/Hadoop systems.
In what way are big data and cloud technology complementary to one another?
Give the rationale in moving from Hadoop v1 to Hadoop v2 (YARN).
What are Apache Tez, Apache Giraph, and Hoya?
How does Resource Manager work in YARN?
What is the function of the NameNode and secondary NameNode in HDFS?
What does the Journal in HDFS refer to? What data is kept in it?
Describe the heartbeat mechanism in HDFS.
How are copies of data (replicas) managed in HDFS?
Shvachko (2012) reported on HDFS performance. What did he find? Can you list some of his results?
What other projects are included in the open source Hadoop ecosystem?
Describe the workings of the JobTracker and TaskTracker in MapReduce.
Describe the overall flow of the job in MapReduce.
What are the different ways in which MapReduce provides fault tolerance?
What is the Shuffle procedure in MapReduce?
Describe how the various job schedulers for MapReduce work.
What are the different types of joins that can be optimized using MapReduce?
Showing 100 - 200
of 606
1
2
3
4
5
6
7