Question

Do you agree or disagree with this feedback?

https://www.youtube.com/watch?v=f180l9_edsA

Pat O'Sullivan talk on Big Data Modelling

"A lot has changed, but a lot of it remains the same" is the basic theme of Pat O'Sullivan's talk on Big Data modelling. A single data model can now span Hadoop and a relational database. The considerations for creating such a model remain more or less the same as for building a relational data warehouse. Such a model is subject to physical constraints, but the logical model remains the same.

The talk takes us through how organizations are looking to Big Data to augment the traditional data warehouse. Big Data is new, its data models are still new, and organizations are still learning; the field is changing fast. Examples of big data being used to augment traditional data warehouse data are presented from the insurance, healthcare and banking industries.

Business issues are still the primary focus and have to be addressed by the models, which are now powered by additional information from Hadoop. The best practices, challenges and issues are the same as for a traditional relational data warehouse:

1. The need to enforce consistency across enterprise-wide data assets.
2. Business users engaged in data warehouse development.
3. Good governance.

An additional challenge in using big data, as opposed to a relational database, is striking a balance between the business user, the data architect and the data scientist. This balance varies between organizations and is an ongoing process. The model also has to be managed in multiple physical environments.

The presentation takes us through the types of models in a big data environment, the big data platform and modelling considerations. Some key aspects emphasized in the talk are the importance of business vocabulary in the design of the data models; the landing area, where structured and unstructured data are present; the level of schema and the level of processing applied to the data before it moves to Hadoop; information ingestion; the core warehouse; and data governance.
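The claim that one logical model can span both Hadoop and a relational database can be illustrated with a small sketch. This is not taken from the talk: the `Entity` and `Attribute` classes, the `policy` example, the type mappings and the choice of Hive with Parquet storage are all assumptions made for illustration. The sketch renders a single logical definition into two physical table definitions, and only the physical details (type names, key constraints, storage format) differ.

```python
from dataclasses import dataclass

# Hypothetical logical model: platform-neutral entity and attribute definitions.
@dataclass(frozen=True)
class Attribute:
    name: str
    logical_type: str  # one of: "string", "integer", "decimal", "date"
    is_key: bool = False

@dataclass(frozen=True)
class Entity:
    name: str
    attributes: tuple

# Assumed mappings from logical types to physical types on each platform.
RELATIONAL_TYPES = {
    "string": "VARCHAR(255)",
    "integer": "INTEGER",
    "decimal": "DECIMAL(18,2)",
    "date": "DATE",
}
HIVE_TYPES = {
    "string": "STRING",
    "integer": "INT",
    "decimal": "DECIMAL(18,2)",
    "date": "DATE",
}

def to_relational_ddl(entity: Entity) -> str:
    """Render the entity as a relational table with an enforced primary key."""
    cols = [
        f"  {a.name} {RELATIONAL_TYPES[a.logical_type]}"
        + (" NOT NULL" if a.is_key else "")
        for a in entity.attributes
    ]
    keys = [a.name for a in entity.attributes if a.is_key]
    if keys:
        cols.append(f"  PRIMARY KEY ({', '.join(keys)})")
    return f"CREATE TABLE {entity.name} (\n" + ",\n".join(cols) + "\n);"

def to_hive_ddl(entity: Entity) -> str:
    """Render the same entity as a Hive table; the key constraint is left out
    here as an example of a physical difference between the platforms."""
    cols = [f"  {a.name} {HIVE_TYPES[a.logical_type]}" for a in entity.attributes]
    return (
        f"CREATE TABLE IF NOT EXISTS {entity.name} (\n"
        + ",\n".join(cols)
        + "\n)\nSTORED AS PARQUET;"
    )

# Illustrative entity from the insurance domain (names are made up).
policy = Entity(
    "policy",
    (
        Attribute("policy_id", "integer", is_key=True),
        Attribute("holder_name", "string"),
        Attribute("premium", "decimal"),
        Attribute("start_date", "date"),
    ),
)

if __name__ == "__main__":
    print(to_relational_ddl(policy))
    print()
    print(to_hive_ddl(policy))
```

Keeping the business vocabulary (entity and attribute names) in one place and deriving the physical definitions from it is one way to enforce consistency when the model has to be managed in multiple physical environments.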
