Capabilities Apollo and SQL Server Data Mining Presented by Jeff Kaplan, Principal Client Services Paul Bradley, Ph.D., Principal Data Mining Technology.

Slides:



Advertisements
Similar presentations
Test Automation Success: Choosing the Right People & Process
Advertisements

Data Mining in Industry: Putting Theory into Practice Bhavani Raskutti.
Planning Metrics & Implementation Control
IBM SPSS Solutions A SELECT INTERNATIONAL COMPANY.
Why Market First  We Work with a Retailer’s Call Center or the Market First Preferred Call Center to Maximize their Effectiveness by Offering:  Scripting.
Advance Analytics Capabilities
Software Applications in Supply Chain Management Project Management and Implementation Implications Guest Lecture - February 26, 2002 Dominic Noce
Workload Management BMO Financial Group Case Study IRMAC, January 2008 Sorina Faur, Database Development Manager.
Chapter 3 Database Management
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
11.1 © 2006 by Prentice Hall 11 Chapter Enterprise Applications and Business Process Integration.
Data Mining By Archana Ketkar.
Driving Productivity with Microsoft Dynamics CRM Presenter Name Presenter Title Presenter Date.
Enterprise Applications and Business Process Integration
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Chapter 5 Data mining : A Closer Look.
Chapter 11 Building a Customer-Centric Organization – Customer Relationship Management 11-1.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Peter Myers Bitwise Solutions Pty Ltd. Predictive Analytics PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned.
Module 3: Business Information Systems Enterprise Systems.
Actionable Intelligence via Speech Analytics
Comparison of Classification Methods for Customer Attrition Analysis Xiaohua Hu, Ph.D. Drexel University Philadelphia, PA, 19104
BGS Customer Relationship Management Chapter 7 Database and Customer Data Development Chapter 7 Database and Customer Data Development Thomson Publishing.
10 Data Mining. What is Data Mining? “Data Mining is the process of selecting, exploring and modeling large amounts of data to uncover previously unknown.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
CUSTOMER RELATIONSHIP MANAGEMENT
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Optimizing Your Message With Advanced Analytics Thursday March 19 th, 2015 Paul Maiste & Brett Mowry.
GBA IT Project Management Final Project – “ FoodMart Corp - Making use of Business Intelligence” July 12, 2004 N.Khuda.
More value from data using Data Mining Allan Mitchell SQL Server MVP.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
DisruptionDisruption Empower frontline staff, expand self care capabilities and provide consistency across channels. Design and execute new campaigns.
Copyright © 2005, SAS Institute Inc. All rights reserved. A Holistic Approach to Customer Lifecycle Management Name: Ken King Title: Industry Strategist.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
Lecture 3 Strategic E-Marketing Instructor: Hanniya Abid
Business Insight Services Where Does Location Intelligence Fit in An Enterprise Data Mining/BI Strategy? Tim Pletcher
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Copying distribution or use of the contents of this document is prohibited without written authorization from SafeHarbor Technology Corporation. Maximizing.
Business Solutions. Agenda Overview Business Solutions Benefits Company Summary.
Consul- ting Services Outsour- cing Services Techno- logy Services Local Profes- sional Services Competence Centers Business Intelligence WebTech SAP.
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
DAT377 Data Mining In SQL Server 2000 And SQL Server 2005 (Code Named “Yukon”) Paul Bradley Principal, Data Mining Technology Apollo Data Technologies.
MIS2502: Data Analytics Advanced Analytics - Introduction.
ORCALE CORPORATION:-Company profile Oracle Corporation was founded in the year 1977 and is the world’s largest s/w company and the leading supplier for.
Data Mining Copyright KEYSOFT Solutions.
Impact Research 1 Enabling Decision Making Through Business Intelligence: Preview of Report.
Chapter 1 MARKETING IS ALL AROUND US. The Scope of Marketing Marketing is activity, set of institutions, and processes for creating, communicating, delivering,
A Global fully incorporated Virtual ecommerce Software Solution.
Data Mining With SQL Server Data Tools Mining Data Using Tools You Already Have.
Show Me Potential Customers Data Mining Approach Leila Etaati.
1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
Global Mobile Analytics Market 2015 to 2020 No. Pages :115 Published on : October-2015.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
What we mean by Big Data and Advanced Analytics
9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Real Time Inventory November 2016.
Cloud University Live: 8 Steps to Build Your Cloud Go to Market Plan
Decision Support Systems
Data Based Decision Making
MIS2502: Data Analytics Advanced Analytics - Introduction
Chapter 11 Building a Customer-Centric Organization – Customer Relationship Management 11-1.
Carl Holmes Christy Lee
Delivering Business Insight with SQL Server 2005
Clear Demand Price & Promotion Optimization
MIS5101: Data Analytics Advanced Analytics - Introduction
TechEd /28/ :48 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.
Supporting End-User Access
KEY INITIATIVE Financial Data and Analytics
Presentation transcript:

Capabilities Apollo and SQL Server Data Mining Presented by Jeff Kaplan, Principal Client Services Paul Bradley, Ph.D., Principal Data Mining Technology

2 Agenda Apollo Overview Data Mining 101 Project REAL Case Study SQL Server 2005 Data Mining Demo Real-life Examples

3 Apollo Overview PART ONE

4 Company Background First company delivering true predictive analytic solutions 10 plus years in data mining and data warehousing Premier Partner for SQL Server 2005 Data Mining Cater to a wide range of business including Microsoft, Sprint, Wal-Mart, Barnes & Noble, Seattle Times, Knight Ridder Variety of Industries Retail and Consumer Goods Media Financial Services Manufacturing Public Services overview

5 Industry Recognition overview

6 Testimonials overview

7 Testimonials overview

8 Testimonials overview

9 Analytic Landscape

10 Capabilities overview Customer Acquisition Campaign Targeting Cross-sell/Up-sell Customer Segmentation Retention Modeling Behavioral Targeting Personalization Claim Analysis Call Center Analytics Data Warehousing Dashboard Reporting Marketing Sales & Distribution Correlation Analysis Key Driver Analysis Verbatim Summarization Market Research Operations Inventory Forecasting Sales Forecasting Pricing Optimization Next Best Offer Market Basket Analysis Recency & Frequency Modeling

11 Red Card Booking Call Center SQL-Server 2005 Stores Predictive Models Dashboard & Ad-hoc Reporting Customer Clustering Models Measure Promotion Success Web Direct Mail Phone Automate Predictions for Targeting, Forecasting, Detection, etc. Join Customer Data Sources Customer Targeting Models Score Model Results Deliver Targeted Predictions Run Predictive Algorithms overview

12 MS Data Mining PART TWO

13 Fastest Growing BI Segment (IDC) Data Mining Tools: $1.85B in 2006 Predictive Analytic projects yield a high median ROI of 145% Uses Marketing: Customer Acquisition and Targeting, Cross-Sell/Up-Sell Retail: Inventory Forecasting, Price Optimization Market Research: Driver Analysis, Verbatim Summarization Operations: Call Center Analytics Finance: Fraud Detection, Risk Models Mainstream Emergence E-commerce (e.g Amazon.com) Search (e.g. Vivisimo.com) Behavioral Advertising SQL-Server is in a Unique Position to Service Market Needs ms data mining Background

14 Evolution of SQL Server Data Mining SQL 2000 SQL 2005 Enter the Game Create industry standard Create industry standard Target developer audience Target developer audience V1.0 product with 2 algorithms V1.0 product with 2 algorithms Win Leadership Continue standards and developer effort Continue standards and developer effort Comprehensive feature set Comprehensive feature set Penetrate the Enterprise Penetrate the Enterprise Thought leadership Thought leadership ms data mining

15 SQL-Server 2005 OLAP Reports (Adhoc) Reports (Static) Data Mining Business Knowledge EasyDifficult Relative Business Value ms data mining Value of Data Mining

16 SQL-Server 2005 BI Platform Analysis Services OLAP & Data Mining Integration Services ETL SQL Server Relational Engine Reporting Services Management Tools Development Tools ms data mining

17 SQL Server 2005 BI Platform Embed Data Mining: Development Tool Integration Make Decisions Without Coding Customized Logic Based on Client Data Logic Updated by Model Reprocessing – Applications Do Not Need to be Re- Written, Re-Compiled, and Re-Deployed Data Mining Key Points Price Point to Achieve Market Penetration Database Metaphors for Building, Managing, Utilizing Extracted Patterns and Trends APIs for Embedding Data Mining Functionality into Applications ms data mining

18 SQL-Server 2005 Algorithms Decision Trees Time Series Neural Net Neural NetClustering Sequence Clustering Sequence ClusteringAssociation Naïve Bayes ms data mining Linear and Logistic Regression

19 Project REAL PART THREE

20 Client Profile – Inventory Forecasting Create a Reference Implementation of a BI System Using Real Retail Data. Partners - Barnes & Noble, Microsoft, Scalability Experts, EMC, Unisys, Panorama, Apollo Forecast Out-of-Stock for 5 Book Titles Across Entire Chain (800 Stores) Predictive Models to Flag Items That Are Going to be Out-of-Stock Model on 48 Weeks of Data, Predictions for Month of December Models Predicted Out-of-Stock Occurrences > 90% Accuracy Conservative Sales Opportunity for just 5 Titles: $6,800 per year Extrapolate Across Millions of Titles - Million Dollar Sales Opportunity project real

21 Predictive Modeling Process + Each item belongs to a category For the category, create a set of store clusters predictive of sales in the category Category STEP 1 STEP 2 Identify the cluster which the store belongs to, for the category of that item. STEP 3 Utilize sales data predict item sales 2 weeks out. ITEMSTORE CATEGORY project real

22 Store Clustering Demo project real

23 Store Clustering Overview Average Category Sales 685, $14,366 Average Category Sales 2,532, $45,153 Average Category Sales 120, $2,081 Average Category Sales 8,936, $188,921 Average Category Sales 1,320, $22,805 project real

24 Out-of-Stock Data Preparation Summary Apollo Explored 3 Data Preparation Strategies 1. Use Sales, On-Hand, On-Order History Data for All Stores in the Same Cluster Build One Mining Structure per Cluster, For All Stores in that Cluster for Each Title Build One Mining Model per Store, per Cluster for Each Title Negative: Few OOS Examples per Store, Computation to Deploy One Mining Model per Store/Title Combination 2. Use Sales, On-Hand, On-Order History for All Stores, Across All Clusters Build One Mining Structure per Book, Use Cluster Membership of Store as Input Attribute Positive: Optimizes OOS Examples per Title by Considering All Stores Negative: Does Not Capture Derivative Sales Information 3. Removed Negative of Strategy 2 Included Historical Week-on-Week Sales Derivative Information for Each Title Increase the Information Content of the Source Data for Modeling project real

25 Creating Variables for Success Using: Sales and Inventory History from January 2004 to end of November 2004 Recommend two (2) years of Historical Data to Increase accuracy for training model Key: Store + Fiscal Year + WeekID Predicted Variables 1 Week Ahead OOS Boolean 1 Week Ahead Sales Bin (None, 1 to 2, 3 to 4, 4+) 2 Week Ahead OOS Boolean 2 Week Ahead Sales Bin (None, 1 to 2, 3 to 4, 4+) Input Attributes Store Cluster Membership (Derived from Store Cluster Model) Current Week Sales, On-Hand, On-Order Preceding 1-5 Week Sales, On-Hand, On-Order Sales Derivative Atttributes project real

26 Model Training and Testing Scenarios Purpose: Intelligence on Model Training Frequency Scenario 1: Train Models Every 2 Weeks Training Dataset: All Data Prior to Last 2 Fiscal Weeks in December 2004 Test Dataset: Last 2 Fiscal Week in December 2004 Scenario 2: Train Models Monthly Training Dataset: All Data Prior to End of Fiscal November 2004 Test Dataset: Fiscal Month of December 2004 project real

27 Balancing Training Data When Considering All Stores, Still Have Un-Balanced Datasets [# Store/Week Combinations Where OOS is False] >> [# Store/Week Combinations Where OOS is True] Common in Many Data Mining Applications Training Datasets were Balanced Sample Store/Week Combinations Where OOS is False to Obtain Equal Proportion of True/False Values “Cost” of Predictive Errors are Equal Requested by Client project real

28 Prediction Methods Algorithm Selection Microsoft Decision Trees for Predicting OOS Boolean flags Consistently High Overall Accuracy Straightforward Interpretation Data Preparation Scenario 2 Rebuild models monthly Predictive Models are Contextual and Optimized for Behavior in the Coming Month project real

29 Prediction Methods Modeling Methodology Benefits Scalability (Titles and Stores) Saves 4x to 5x on Computational Cost when Rebuilding Models (versus Neural Networks) 5 Minutes for All 5 Titles => 1 Minute per Title for All Stores project real

30 Out-of-Stock Prediction Demo project real

31 Predictive Models Identify Opportunities to Improve Forecasting Rules Save Scored Results to Database and Leverage UI to View KPI and Alerts for Store Managers and Inventory analysts project real

32 Inventory Prediction Results 1 week and 2 week prediction accuracies project real

33 Sales Opportunity Data Mining created revenue generating opportunity Based on 55 titles for Jan Dec 2004 (# of weeks OOS across all stores)(Apollo Boolean Predicted Accuracy) X (actual % of actual sales across all stores) x (retail price) = Yearly Increase in Sales Opportunity using Apollo OOS Predictions Sales bins produced $3.4K, $6.8K potential lift in sales project real

34 Client Profiles PART FOUR

35 Client Profile – Customer Acquisition Decrease Subscriber Churn Increase New Subscriptions Segment Geo-Demographic and Attitudinal Behaviors for Subscribers and Non-Subscribers Build Predictive Models to Identify Likely New Subscribers Using Analysis to Deliver Targeted Marketing Campaigns for Acquisition Increased Stop Saves by 2% client profiles

36 Client Profile – Cross sell / Up sell (Global Catalog Retailer) Increase Average Purchase Size Deploy Product Recommendations on their Website Modeling Historical Sales to Determine Product Affinities Incorporate Business Logic into Modeling Process (e.g. Same category recommendation) Increase Average Shopping Cart Size Increase Sales Lift Data Mining Driven Product Recommendation Performed Better than Manual Recommendations client profiles

37 Client Profile – Customer Support Automation Increase Visibility into Customer Service Center Increase Speed of Customer Support Utilizing Text Mining Engines to Automate Processing of Customer Support ( , Web Inquiries, etc.) Automating the Process of Rolling up Keywords into Concepts Customer Support Center has the Ability to View Trends in Minutes versus Weeks Improved Accuracy - Text Mining Engines Removed the Bias and Inaccuracies Often Occurring in Call Center Representative Notes and Tagging. client profiles

38 Client Profile – Key Driver Analysis Evaluate Customer Satisfaction Metrics Increase Customer Satisfaction Partnered with Apollo to Develop Market Research Database and Reporting Developed Models to Identify “Key” Satisfaction Drivers Successfully Identified Drivers to Increase Customer Satisfaction Delivered Driver Recommendations to Field Operations - Insight into Action Company Wide (sales, marketing, executive level) Visibility into Customer Satisfaction Metrics client profiles

Presented by Jeff Kaplan Principal Client Services