Download presentation
Presentation is loading. Please wait.
Published byElvin Fields Modified over 9 years ago
1
ETL and Metadata
2
The ETL Process Source Systems Extract Transform Staging Area Load Presentation System
3
Source Data Record the name location and data that exists in the TPS environment. File names and location Layout Attribute meaning Source Business Owner IS Owner Platform Location Data Source Description
4
Extraction Copy specific data directly from the source tables into a working dataset in the staging area. Targe t Table Target Column Da ta Ty pe Le n Target Column Description Sour ce Syst em Source Table / File Sourc e Col / Field Data Txform Notes
5
Transformation (Dimension Tables) Generate surrogate key in a primary- surrogate table. Make this permanent. Insert the surrogate key into the working dimension tables. Conduct any editing/cleaning operations you need (usually on the working table) Generate any derived attributes you need. Generate and retain process logs.
6
Transformation (Fact tables) Join all dimensions to the fact table (using original primary keys). Insert surrogate keys Generate derived facts Generate indicator flags Ch g Fla g Fact Gro up Derived Fact Name Derived Fact Description Typ e Ag g Rul e Formula Constra ints Transf or- matio ns
7
Target Data Describe the presentation data structure. Model Metadata Usage and constraints Table Name Column Name Dat a Typ e LenNull s? Column Description PK Or der FK
8
Flow Documentation DFD for the ETL process ERD for Source, Staging and Target databases. Metadata Usage notes.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.