December 4, 2008 1 Financial Data Model Overview Daniel Grieb Lori Silvestri.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Chapter 10: Designing Databases
Data Warehouse Overview (Financial Analysis) May 02, 2002.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
University of Missouri System PeopleSoft Financials Reporting for Today and Tomorrow.
Technical BI Project Lifecycle
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
May 21, Budget Activity Budget Roll Forward Relationship to Reserves Year End Training May 21, 2007.
Hyperion EPM Overview & Case Study.
Oracle General Ledger, Financial Reporting and Data Warehouse 6/22/2015 RIAS PHASE II Overview.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Recording / Financing Fixed Asset Acquisition Human Resources Purchasing Revenue Traditional files approach: separate systems (Legacy Systems) Expenditure.
1 Self Service Banner FOAP AND NAVIGATION Fall 2005 Ver 7.1 Presented by the Auburn University Business Office.
Data Warehousing at Notre Dame October 7, 2004 Dale Carter, Manager, Decision Support Jared Barnard, Database Administrator.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
1 Open Forum Presentation BI Tools Upgrade Project July 25, 2008.
UR Financials Project Company Level Reporting SIG February 18, 2014.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Jeremy Brinkman Director of Administrative Systems University of Northwestern Ohio Great Lakes Users’ Group Conference August 10-11,
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
1 CADE Finance and HR Reports Administrative Staff Leadership Conference Presenter: Mary Jo Kuffner, Assistant Director Administration.
Data Governance Data & Metadata Standards Antonio Amorin © 2011.
Module 3: Table Selection
Blue Coat Systems, Inc. Oracle Enterprise, Planning & Budgeting (EPB) April 8, 2005 Bob Verheecke Chief Financial Officer.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
© 2008 California State University, Fullerton Account Management & Reporting Tools Financial Services Division of Administration & Finance.
PO320: Reporting with the EPM Solution Keshav Puttaswamy Program Manager Lead Project Business Unit Microsoft Corporation.
Multi-Dimensional Databases & Online Analytical Processing This presentation uses some materials from: “ An Introduction to Multidimensional Database Technology,
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Methodology - Conceptual Database Design Transparencies
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
Using SAS® Information Map Studio
Enterprise Reporting Solution
Information Systems Today (©2006 Prentice Hall) 3-1 CS3754 Class Note 12 Summery of Relational Database.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
Methodology - Conceptual Database Design
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
December 5, Repository Metadata: Tips and Tricks Peggy Rodriguez, Kathy Kimball.
University of California, Berkeley New/Updated Reports 8/01/12* *Latest updates 11/19/12 BAIRS | New Reports Overview.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
DATA WAREHOUSING AND REPORTING
UNIT-II Principles of dimensional modeling
12/4/ OBIEE Technical Conference Getting Started with a Dashboard Development Project Theresa May.
Best Practices for Implementing
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
3 Copyright © 2013, Oracle and/or its affiliates. All rights reserved. PeopleSoft General Ledger 9.2 New Features 9.2 Release New Features.
Year-End Close: Reports and Finance Data warehouse
Operation Data Analysis Hints and Guidelines
Fundamentals & Ethics of Information Systems IS 201
Chapter 13 The Data Warehouse
Star Schema.
Applying Data Warehouse Techniques
Overview and Fundamentals
AFD Finance – Open Forum
PeopleSoft Financials Reports and Inquiry Training
PeopleSoft Financials Reports and Inquiry Training
Presentation transcript:

December 4, Financial Data Model Overview Daniel Grieb Lori Silvestri

December 4, Agenda ■ Reporting Solution ■ Star Schema Primer ■ Data Modeling Process ■ Finance Data Models ■ Design Challenges and Choices ■ Implementation ■ Conclusion

December 4, Finance Data Modeling Guidelines ■ Campus Solution must use CSU Finance Reporting Solution as Source ■ Replace Existing 1.Revenue and Expense (P & L) 2.Trial Balance Reporting 3.Drill from Summary to Transaction ■ Need daily refresh of large data sets ■ Anticipate analytical reporting

December 4, Levels of Reporting Transactional Transactional Reporting  Supports day to day transactional users  Requires knowledge of transactional data Enterprise Data Warehouse  Combined information from multiple source systems. Current and historical information  Much more sophisticated data structures to enable analysis: cubes and star schema Analytics Operational Reporting  Tactical data from production systems that address operational needs  Denormalized data structures with embedded business logic Operational

December 4, REPORTING SOLUTION

December 4, CSU Reporting Solution ■ Attribute Tables – one set for each Set ID – XXCMP, XXCSU, XXGAP ■ Transaction Tables – separate tables per Business Unit ■ Summary Table – XXCMP and XXCSU * Brothwell, Kist, and Yelland, “Finance 9.0 Reporting Solution Training” April, 2008

December 4, CSU Reporting Solution - Attributes ■ Attribute Tables – one set for each Set ID (XXCMP, XXCSU, XXGAP) – FundCSU_R_FUND_TBL – DepartmentCSU_R_DEPT_TBL – AccountCSU_R_ACCT_TBL – ProgramCSU_R_PRGM_TBL – ProjectCSU_R_PROJ_TBL – ClassCSU_R_CLASS_TBL ■ Can be joined to transaction and summary tables ■ Department table contains “flattened” version of the campus organization department tree * Brothwell, Kist, and Yelland, “Finance 9.0 Reporting Solution Training” April, 2008

December 4, CSU Reporting Solution - Transactions ■ Transaction Tables – separate tables per Business Unit ■ Campus Business Unit Transaction Tables – Actuals CSU_R_ACTDT_CMP – BudgetsCSU_R_BUDDT_CMP – EncumbrancesCSU_R_ENCDT_CMP – Pre-EncumbrancesCSU_R_PREDT_CMP ■ CSU Business Unit Transaction Tables ■ GAP Business Unit Transaction Tables * Brothwell, Kist, and Yelland, “Finance 9.0 Reporting Solution Training” April, 2008

December 4, CSU Reporting Solution - Summary ■ Summary Tables (XXCMP and XXCSU) ■ Campus Business Unit Summary Table – CSU_R_SUMBL_CMP ■ CSU Business Unit Summary Table – CSU_R_SUMBL_CSU * Brothwell, Kist, and Yelland, “Finance 9.0 Reporting Solution Training” April, 2008

December 4, Benefits of the Reporting Solution to the Dimensional Data Model ■ Validated independently – Reporting solution was validated between January and September 2008 – Finance was heavily invested in, helped design and trusted the reporting solution – Sped up data model validation because we could tie to the reporting solution » Finance validated within days, rather than weeks » Validated using the dashboards

December 4, Benefits of the Reporting Solution to the Dimensional Data Model Reporting solution now used in parallel by Finance for internal querying and to fill ad hoc requests – Phase one of the data models did not have to incorporate all of the reporting solution data – Helped constrain project scope

December 4, STAR SCHEMA PRIMER

December 4, What Is a Star Schema The star schema is perhaps the simplest data warehouse schema. It is called a star schema because the diagram of this schema resembles a star, with points radiating from a central table. The center of the star consists of a large fact table and the points of the star are the dimension tables.star schema

December 4, Dimension Table Fact Table Star Schema - a data model that consists of one fact table and one or more dimension tables Contains: facts and/or measures to be analyzed (i.e., amount, count, etc.) and foreign keys (keys to dimension tables) Dimension Table – Contains attributes describing a campus entity (i.e., department, account type, ledger, etc.) Star Schema Database Design

December 4, Star Schema Fact tables contain process activity located in the center (quantitative data) Some example facts are monetary amount, budget amount and statistics amount Dimensions tell the story and provide the detail to the facts. Which department’s budget? When was the last transaction posted for a given account? THE FACTS WHERE? WHO? WHAT? WHEN?

December 4, Star Schema Benefits ■ Data model is easy to understand – Based on business process ■ Easy to define hierarchies – City-State-Country – Day-Accounting Period-Fiscal Year ■ Easy to navigate – Number of table joins reduced – Star schema recognized by leading query tools ■ Maintainable and Scalable – Dimension tables shared between data models – Can add new fact tables which use existing dimensions

December 4, Why Star Schema for Cal Poly Finance? 1.Dimensions can easily be reused ■ across current and future finance models 2.Superior query performance for large datasets ■ i.e., over 5 million rows 3.Usability ■ Understandable for users ■ Better support unanticipated questions 4.Star schemas are extremely compatible with business intelligence query tools such as OBIEE.

December 4, DATA MODELING PROCESS

December 4, Data Modeling Process ■ Interactive/ Iterative Process ■ Requirements Gathering ■ Domain research ■ Data profiling ■ Modeling tool ■ Design sessions with data steward

December 4, Data Modeling Process: Requirements Gathering ■ Primarily Done by Reporting Solution Development ■ Our Requirement – Refashion Reporting Solution into a Dimensional Model – Performance – Accessibility

December 4, Data Modeling Process: Research ■ Domain research – Finance – Cal Poly Financials – Cal Poly Reports (nVision, Brio) – Industry Finance Models (Kimball) ■ Data profiling – Querying reporting solution – Correlating fields/ values – Matrix of Attributes Across Document Sources

December 4, Data Modeling Process: Design ■ Modeling tool – Needed a tool to support efficient design – Limitations of modeling tools like Visio – Embarcadero ER Studio ■ Design sessions with data steward – model reviews » Validated groupings of attributes into dimensions » New (non-reporting solution) sources (i.e., dept, prog and proj trees) – prototyping dashboards

December 4, FINANCE DATA MODELS

December 4, Cal Poly Finance Data Models ■ 4 data models implemented to date ■ 22 Dimensions – Reused across models – Chart fields, Business unit, Ledger, etc ■ 4 Fact tables – Actual Transactions – Budget Transactions – Encumbrance Transactions – Actual, Budget and Encumbrance Summary

Who (Dept ID, Vendor, etc) What (Account, Fund, etc) Where (Business Unit, etc) When (Acctg. Period, Fiscal Year, etc.) Actual Fact Summary Fact Budget Fact High Level Finance Data Model Diagram Encumbrance Fact

December 4, Model Overview – Actual, Budget and Encumbrance Summary

December 4, Model Overview – Actual Transactions

December 4, Model Overview – Budget Transactions

December 4, Model Overview – Encumbrance Transactions

December 4, Closer Look at a Dimension ■ Department – FINANCE_DEPARTMENT ■ Initial source was CSU Reporting Solution Department Attribute table – PS_CSU_R_DEPT_TBL

December 4, Closer Look at a Dimension ■ Source Department table – contains “flattened” version of campus organization department tree – Ragged hierarchy ■ Added additional source data – Cal Poly department tree – Non-ragged hierarchy – Robust hierarchy for data exploration – Supports reporting on department reorganization or renaming – Cal Poly users are accustomed to using this tree

December 4, Closer Look at Department Dimension ■ Department Budget Specialist and Manager – Reporting Solution provides a single manager field ■ Cal Poly Needs Primary and Secondary Budget Specialists and Managers – Available for querying and display in reports – Used for access control in Finance dashboards - filtering / ease of use ■ Source – Excel Spreadsheet – Provided by Finance – Updated weekly – Plan to create mini-web application to capture data in future

December 4, Department Dimension

December 4, Presentation of Data Models

December 4, Transactional vs. Summary Models ■ Dimensions in the summary model are a subset of those in the transactional models – Allows for drill-across from summary to transactional models – “Feels like” a drill-down

December 4, Design Challenges and Choices

December 4, Design Challenges Challenge ■ Reporting solution is denormalized – PolyData typically sources normalized data sources and manages denormalization Solution ■ Took us a little outside of our comfort zone ■ Deconstructed the reporting tables into unique combinations of elements

December 4, Design Challenges Challenge ■ Attributes are “overloaded” – For example, a document_id can represent an invoice number, a PO number, a journal identifier, etc. Solution ■ Preserved this concept in the dimensional models because it is familiar to Finance

December 4, Design Challenges Challenge ■ Uniqueness not enforced in the reporting solution Solution ■ Added an instance number for identical transactions

December 4, Design Challenges Challenge ■ Nightly rebuild of the reporting solution potentially deletes rows Solution ■ Effective-dated transactions in the fact

December 4, Design Challenges Challenge ■ Transactional and summary reporting tables may not tie – journal vs. ledger sources – summing the detail may give the wrong answer Solution ■ This is a known issue to which Finance is accustomed ■ Opportunity for a dashboard integrity report

December 4, Design Challenges - Naming Challenge ■ Reporting Solution names did not conform with PolyData Warehouse standards Solution ■ Data Warehouse standards – Field and table names use full English words when possible for usability – Codes precede corresponding description (Code, Descr) ■ Used reporting solution names with full spelling and adding ‘Code’ and ‘Descr’ where appropriate.

December 4, Design Choices – Slowly Changing Dimensions ■ Most dimensional attributes were determined by data steward to be slowly changing dimension Type 1 (SCD1). ■ Exception: Department Table – SCD1 attributes such as department description – SCD2 department tree data ■ *IF* you need to track historical changes to dimensions – You may need to source dimensions from source system(s) – Candidates include chart fields, vendors, customers

December 4, Design Choices – SCD Example ■ Cal Poly needs department tree history – Department tree data » Slowly Changing Dimension Type 2 - preserves history » Effective date rows (effective from and to dates) » Add new row for each change – All other department attributes » Slowly Changing Dimension Type 1 – overwrites history » Replace old/outdated data with current

December 4, Design Choices – New Sources ■ In design and prototyping sessions with end users, it became apparent that additional source data was needed ■ New non-reporting solution sources were needed to supplement existing source. – Department tree – Program tree – Project tree ■ Design change from using only reporting solution as source

December 4, IMPLEMENTATION

December 4, Time and Resources ■ Modeling/Domain familiarization – 2 data modelers – June through August 2008 ■ Source-to-Target analysis and documentation – 2 analysts – July through September 2008

December 4, Time and Resources ■ Coding and system integration – 4 ETL programmers – August through October 2008 ■ Total person-days – July through October 2008 – Approximately 140 person-days

December 4, Time and Resources ■ Caveats – Established documentation methods and coding standards – Slowly changing logic developed or provided by toolset – 3 transactional models implemented identically

December 4, Nightly Build JobMinutes (approximate) Source pull30 Reporting solution build70 Data model build140 End user table refresh60 TOTAL300

December 4, Performance Tuning: Nightly Build ■ Coordination with Finance on their builds – Nightly processing – Reporting solution (in transactional database) ■ Approximately one month to level out on timing – Tuning specific to the finance jobs – Coordination with other PolyData warehouse jobs

December 4, Performance Tuning: End-User Tables ■ Performance was reasonable prior to indexing – Largely due to the dimensional structure ■ Performance screamed after indexing – Indexes on fields used in selection criteria and drillable hierarchies – Bitmap indexes on foreign keys in facts

December 4, Implementation: Interface with Front End Developers ■ joins should be fully documented ■ front end developers may need some training in interpreting models ■ we still have not come up with an ideal method for documenting hierarchies ■ challenge - knowledge of hierarchies is shared – data steward – front end developers – modelers

December 4, CONCLUSION

December 4, Future Work ■ Labor Cost ■ GAAP Reporting ■ Management Dashboard/Analytics ■ Integration with HR and Student Data

December 4, Questions? Daniel Grieb Data Warehouse Architect, Analyst/Programmer Lori Silvestri Data Warehouse Analyst/Programmer

December 4, Contact ■ OBIEE Technical Conference: ■