Business Intelligence/ Decision Models Week 3 Data Preparation and Transformation.

Slides:



Advertisements
Similar presentations
Microsoft Dynamics AX 2009 Integration and Development with.NET Framework Business Intelligence: OLAP and Analytics.
Advertisements

Data Mining in Industry: Putting Theory into Practice Bhavani Raskutti.
Business Intelligence
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Business Intelligence/ Decision Models Week 4 Lifetime Value.
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Databases and Warehouses
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
Business Intelligence /Decision Models Dr. Richard Michon TRSM 1-040Ext. 7454
Developing A Strategy For The Internet Age The Five Forces Model
Chapter 3 Databases and Data Warehouses: Building Business Intelligence McGraw-Hill/Irwin Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Data Staging Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
McGraw-Hill © 2008 The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES.
Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence
Business Intelligence Andrew Davis Andria Zippler Jana Krinsky Tiffany Ferris.
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Introduction to Database Management
Database Processing for Business Intelligence Systems
Data Warehouse Concepts & Architecture.
Major Tasks in Data Preprocessing(Ref Chap 3) By Prof. Muhammad Amir Alam.
Business Intelligence Instructor: Bajuna Salehe Web:
Business Intelligence/ Decision Models Lifetime Value.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
CLV 2 Lab Richard MICHON Ted Rogers School of Management Ryerson University, Toronto.
Database Systems – Data Warehousing
Business Intelligence/ Decision Models
Ch 5. The Evolution of Analytic Processes
The Business Intelligence Side of Blue Mountain RAM Bill Lucas, IT Systems Architect and Senior Software Engineer.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Databases and Data Warehouses: Supporting the Analytics-Driven.
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.
Chapter 3 and Module C DATABASES AND DATA WAREHOUSES Building Business Intelligence.
Managing Knowledge in Business Intelligence Systems Dr. Jan Mrazek.
3-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Chapter 3 Databases and Data.
Business Intelligence Systems Appendix J DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Chapter 3: Databases and Data Warehouses Building Business Intelligence Management Information Systems for the Information Age.
Consul- ting Services Outsour- cing Services Techno- logy Services Local Profes- sional Services Competence Centers Business Intelligence WebTech SAP.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chapter 4 Marketing Intelligence and Database Research.
Part II Tools for Knowledge Discovery Ch 5. Knowledge Discovery in Databases Ch 6. The Data Warehouse Ch 7. Formal Evaluation Technique.
CLV Based Product Recommendation Integrating AHP and data mining for product recommendation based on customer lifetime value Shaheen Syed Department of.
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Business Intelligence/ Decision Models CRISP.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
© 2014 IBM Corporation IBM SPSS Modeler Gold on Cloud Jump Start Service.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Data Mining Copyright KEYSOFT Solutions.
MIS 451 Building Business Intelligence Systems Data Staging.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
1 Chapter The Impact of Database Customer centric approach - A highly personal approach Marketing databases are essential to the marketing process.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
SAP BI – The Solution at a Glance : SAP Business Intelligence is an enterprise-class, complete, open and integrated solution.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Integration - The ETL Process Module 4: BIC#4 – Data Integration Capability Populating Data Warehouse (Data Mart) 1.
نمايندگي استان يزد. نمايندگي استان يزد طراحی کسب و کار الکترونیکی ارائه کننده : محسن افسر قره باغ.
01-Business intelligence
Intro to MIS – MGS351 Databases and Data Warehouses
Chapter 8 Business Intelligence & ERP
Chapter 21: Customer Relationship Management (CRM)
Chapter 11 Building a Customer-Centric Organization – Customer Relationship Management 11-1.
19 MKTG CHAPTER Lamb, Hair, McDaniel
Databases and Data Warehouses Chapter 3
انباره داده Data Warehouse
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Chapter 14: Meta Data Repository Development
Data warehouse.
Customer lifetime value (CLV)
Presentation transcript:

Business Intelligence/ Decision Models Week 3 Data Preparation and Transformation

Last Week OLTP, data warehouse repository and data mart structures (flat and relational files) Data integrity and normalization DB interrogation (SQL) for: OLAP and Reporting Migration into data mining suites

Time/ Cost Cumulated Productivity

Learning by association or problem solving

This Week CRISP ( Cross Industry Standard Procedure for Data Mining) Data preparation (import, aggregate and merge) Data transformation (for analytics)

CRISP-DM Phases Source SPSS Inc. 2008

Case Study A large telecom (XYZ PHONE) has discovered that it is losing customers at a much higher rate than in previous years. Reporting through the corporate dashboard (OLAP)has shown churn rates growing by a large margin last year.

Source SPSS Inc Define Business Objectives Strategic objective definition Increase revenues by retaining more customers Related business goal identification Retain high value customers Identify process problems that need to be changed Clear success factor (metric) Decrease customer churn by 1% Cost-benefit analysis Increase revenues by $750,000 Actionable BI objectives XYZ wants to retain more customers by identifying likely churners 2 months prior and putting an action in place to retain them

Source SPSS Inc Timeline Example XYZ’s project: 13 weeks 8 weeks a) business understanding and b) data preparation Involved line of business manager and data expert Included better defining high-value and churner definition 2 weeks data understanding Heavy reliance on data expert and database administrator 2 weeks modeling and evaluation Models developed by data miner and results evaluated by line of business manager 1 week deployment ? Heavy involvement of database administrator Model deployment entailed setting up a data model for monthly scoring of customer base with resulting reports feeding a mail offer

Source PSS Inc Time Allocation Generally accepted industry timeline standards 50 to 70 percent data preparation 20 to 30 percent data understanding 10 to 20 percent modeling, evaluation, and business understanding 5 to 10 percent deployment

Data Import and Transformation

Lab Objectives Extract data from Customer file Transactional file Transform data into information Data preparation Aggregate data from transactional file Merge aggregate data & customer file

Data Import Step by Step Import files from Access or Excel Customer and Transaction files Document variables labels and value labels using the data dictionary Aggregate the transaction file by cust_id with summary data and key variables Merge Customer and aggregated transaction file using cust_id as a common key

Aggregating Transaction File Order _id DateCust_ id Prod_ num Amt / / / / / / / / / / / Cust_ id FreqDate1Date2Amt_ sum /2111/ /3011/ /0511/ /0511/12380

Lab Objectives (Cont) Data transformation Compute customers’ length on file Compute recency of last purchase Compute frequency of purchases Compute amount spent Compute customer status Purpose CLV (Week4) RFM (Week5)

Data Transformation Step by Step Revisit measurement variables (nominal, ord, scale) Define date formats Auto recode nominal string variables Define missing values Calculate length on file or tenure (Date last purchase – Date first purchase) tenure Calculate time since last purchase (Date of current file – Date last purchase) Define customer status (active or lapsed)

Merging Customer and Transaction Summary Files Cust_ id Na- me Add- ress TypeCC 1011JeanNY1Visa 2234JohnOH1MC 2876JanetCA2Visa 3454JaneNY3Amex FreqDate1Date2Amt_ sum 410/2111/ /3011/ /0511/ /0511/12380

Data Transformation Cust _ ids Na- me Add- ress TypeCC 1011Jean1/NY1/Res1/Visa 2234John2/OH1/Res2/MC 2876Janet3/CA2/Bus1/Visa 3454Jane1/NY3/DNK3/Amx FreqDte1Dte2AmtDaysRec- ency 410/2111/ /3011/ /0511/ /0511/

Purpose of this exercise? Prepare data for next two weeks: Lifetime Customer Value RFM Analysis …