MIS2502: Data Analytics Advanced Analytics - Introduction.

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

Supporting End-User Access
McGraw-Hill/Irwin Business Research Methods, 10eCopyright © 2008 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Clarifying the Research.
McGraw-Hill/Irwin Business Research Methods, 10eCopyright © 2008 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Clarifying the Research.
Chapter 9 Business Intelligence Systems
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
Data Mining By Archana Ketkar.
Chapter 14 The Second Component: The Database.
Clarifying the Research Question through Secondary Data and Exploration Chapter 5 組員 黎旭崴 李承霖.
Data Mining Ketaki Borkar CS157A November 29, 2007.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Data mining By Aung Oo.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Mining: A Closer Look
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Enterprise systems infrastructure and architecture DT211 4
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
Chapter 5: Data Mining for Business Intelligence
Data Mining Techniques
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Data Mining: Introduction. Why Data Mining? l The Explosive Growth of Data: from terabytes to petabytes –Data collection and data availability  Automated.
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Final Exam Review. The following is a list of items that you should review in preparation for the exam. Note that not every item in the following slides.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Banking on Analytics Dr A S Ramasastri Director, IDRBT.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
New Developments in Business Intelligence ( Decision Support Systems) BUS 782.
1 What is Data Mining? l Data mining is the process of automatically discovering useful information in large data repositories. l There are many other.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Secondary Data Searches
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Data Mining Functionalities
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
MIS5101: Data Analytics Advanced Analytics - Introduction
Data Warehousing and Data Mining
Supporting End-User Access
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Data Mining: Introduction
MIS2502: Data Analytics Introduction to Advanced Analytics
Welcome! Knowledge Discovery and Data Mining
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Presentation transcript:

MIS2502: Data Analytics Advanced Analytics - Introduction

The Information Architecture of an Organization Transactional Database Analytical Data Store Stores real-time transactional data Stores historical transactional and summary data Data entry Data extraction Data analysis Now we’re here…

The difference between OLAP and data mining Analytical Data Store The (dimensional) data warehouse feed both… OLAP can tell you what is happening, or what has happened Data mining can tell you why it is happening, and help predict what will happen …like a pivot table …like what we’ll do with SAS

The Evolution of Advanced Data Analytics Evolutionary StepBusiness QuestionEnabling TechnologiesCharacteristics Data Collection (1960s) "What was my total revenue in the last five years?" Storage: Computers, tapes, disks Retrospective, static data delivery Data Access (1980s) "What were unit sales in New England last March?" Relational databases (RDBMS), Structured Query Language (SQL) Retrospective, dynamic data delivery at record level Data Warehousing/ Decision Support (1990s) "What were unit sales in New England last March?” Now “drill down” to Boston? On-line analytical processing (OLAP), dimensional databases, data warehouses Retrospective, dynamic data delivery at multiple levels Data Mining and Predictive Analytics (2000s and beyond) "What’s likely to happen to Boston unit sales next month? Why?" Advanced algorithms, parallel computing, massive databases Prospective, proactive information delivery

Origins of Data Mining Draws ideas from – Artificial intelligence – Pattern recognition – Statistics – Database systems Traditional techniques may not work because of – Sheer amount of data – High dimensionality – Heterogeneous, distributed nature of data Artificial intelligence Pattern recognition Statistics Database systems Data Mining

Data Mining and Predictive Analytics is Extraction of implicit, previously unknown, and potentially useful information from data Exploration and analysis of large data sets to discover meaningful patterns

What data mining is not… What are the sales by quarter and region? How do sales compare in two different stores in the same state? Sales analysis Which is the most profitable store in Pennsylvania? Which product lines are the highest revenue producers this year? Profitability analysis Which salesperson produced the most revenue this year? Does salesperson X meet this quarter’s target? Sales force analysis If these aren’t data mining examples, then what are they ? If these aren’t data mining examples, then what are they ?

Data Mining Tasks Use some variables to predict unknown or future values of other variables Likelihood of a particular outcome Prediction Methods Find human-interpretable patterns that describe the data Description Methods from Fayyad et al., Advances in Knowledge Discovery and Data Mining, 1996

Case Study A marketing manager for a brokerage company Problem: High churn (customers leave) – Turnover (after 6 month introductory period) is 40% – Customers get a reward (average: $160) to open an account – Giving incentives to everyone who might leave is expensive – Getting a customer back after they leave is expensive

…a solution One month before the end of the introductory period, predict which customers will leave Offer those customers something based on their future value Ignore the ones that are not predicted to churn

Data Mining Tasks Descriptive Clustering Association Rule Discovery Sequential Pattern Discovery Visualization Predictive Classification Regression Neural Networks Deviation Detection

Decision Trees Used to classify data according to a pre-defined outcome Based on characteristics of that data Uses Predict whether a customer should receive a loan Flag a credit card charge as legitimate Determine whether an investment will pay off

A more realistic one… Will a customer buy some product given their demographics? What are the characteristics of customers who are likely to buy?

Clustering Used to determine distinct groups of data Based on data across multiple dimensions Here you have four clusters of web site visitors. What does this tell you? Here you have four clusters of web site visitors. What does this tell you? Uses Customer segmentation Identifying patient care groups Performance of business sectors

Uses What products are bought together? Amazon’s recommendation engine Telephone calling patterns Association Mining Find out which items predict the occurrence of other items Also known as “affinity analysis” or “market basket” analysis

Bottom line In large sets of data, these patterns aren’t obvious And we can’t just figure it out in our head We need analytics software We’ll be using SAS to perform these three analyses on large sets of data