1 LM 7 Data Warehouse Dr. Lei Li
Learning Objectives Describe the needs for data warehouse Describe the three levels of a data warehouse Explain the independent and dependent data mart Explain the basic concept of big data, NoSQL, OLAP, data visualization, and data mining. 2
Data Warehouse A relational database designed for query and analysis. Subject-oriented: e.g. customers, patients, students, products Integrated: consistent naming conventions, formats, encoding structures; from multiple data sources Time-variant: can study trends and changes Non-updatable: read-only, periodically refreshed Data Mart A data warehouse that is limited in scope 3
Why Data Warehouse? Integrated, company-wide view of high-quality information (from disparate databases) Separation of operational and informational systems and data (for improved performance) 4
Difference between OLTP & Data Warehouse 5
Three Tier Data Warehouse Architecture Image source:
Data Warehouse Architecture 7
Data Warehouse Architecture with Data Mart 8
Data Mart Stand-alone data mart Dependent data mart
Stand-alone data mart Image source:
Dependent Data Mart
Other Database Concepts Data mining – knowledge discovery in database Big Data No-SQL database 12