Data Warehousing A QUICK SUMMARY Sushanthan Premanath & Indrajith Premanath CSCI 4707
The Main Protagonists Bill Inmon Born 1945 Father of Data Warehousing Ralph Kimball Born 1944 Commercialized Data Warehousing
Big Data The data should be de-normalized to 2NF. This means you get data redundancy. This means you need more storage. The data can retrieved more quickly. Data Warehousing is to provide aggregate data which is in a suitable format for decision making.
ETL and Data Marts Extraction, Transformation and Loading (ETL) E – Extraction: Get the data. T – Transformation: Make it useful. L – Loading : Save it to the warehouse. Data Marts Don’t mess with the data. Keep it simple for the user. Small problems are easier to solve.