Data Resource Management – MGMT 4170. An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.

Slides:



Advertisements
Similar presentations
Supporting End-User Access
Advertisements

OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Back to Table of Contents
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Final Review and Study Guide MIS2502, Spring 2011 Section 03.
Chapter 9 Business Intelligence Systems
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Database Processing for Business Intelligence Systems
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Data Mining: A Closer Look
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Enterprise systems infrastructure and architecture DT211 4
Peter Myers Bitwise Solutions Pty Ltd. Predictive Analytics PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned.
BGS Customer Relationship Management Chapter 7 Database and Customer Data Development Chapter 7 Database and Customer Data Development Thomson Publishing.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
Chapter 5: Data Mining for Business Intelligence
Business Intelligence. Topics Chart Online Analytical Process, OLAP – Excel’s Pivot table – Data visualization with dashboard Data warehousing Data Mining.
Data Mining Techniques
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Chapter 10 Target Markets: Segmentation, Evaluation, and Positioning
Enabling Organization-Decision Making
Lesson 4 : Chapter 4 Building an E-commerce Presence: Web Sites, Mobile Sites, and Apps Copyright © 2014 Pearson Education, Inc.
Chapter 7 DATA, TEXT, AND WEB MINING Pages , 311, Sections 7.3, 7.5, 7.6.
Copyright © 2009 Pearson Education, Inc. Slide 6-1 Chapter 6 E-commerce Marketing Concepts.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
More value from data using Data Mining Allan Mitchell SQL Server MVP.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Chapter 11 Business Intelligence Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall 11-1.
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Business Intelligence Systems Appendix J DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
EXAM REVIEW MIS2502 Data Analytics. Exam What Tool to Use? Evaluating Decision Trees Association Rules Clustering.
AN INTELLIGENT AGENT is a software entity that senses its environment and then carries out some operations on behalf of a user, with a certain degree of.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
What is Data Mining? process of finding correlations or patterns among dozens of fields in large relational databases process of finding correlations or.
Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd
Business Intelligence - 2 BUS 782. Topics Data warehousing Data Mining.
Business Intelligence. Topics Chart Online Analytical Process, OLAP – Excel’s Pivot table – Data visualization with dashboard Scenario Management Data.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Information Management and Market Research. Marketing Research Links…. Consumer, Customer, and Public Marketer through information Marketing Research:
Data Mining and Decision Support
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Week 11 Knowledge Discovery Systems: Systems That Create Knowledge.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
Data Mining Copyright KEYSOFT Solutions.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Show Me Potential Customers Data Mining Approach Leila Etaati.
Copyright  2007 McGraw-Hill Pty Ltd PPTs t/a Marketing Research 2e by Lukas, Hair, Bush and Ortinau Slides prepared by Judy Rex 19-1 Chapter Nineteen.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Ahmed K. Ezzat, SQL Server 2008 and Data Mining Overview 1 Data Mining and Big Data.
OLAP Theory-English version On-Line Analytical processing (Buisness Intelligence) Ing.Skorkovský,CSc Department of Corporate Economy Faculty of Economics.
01-Business intelligence
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
MIS5101: Data Analytics Advanced Analytics - Introduction
Sangeeta Devadiga CS 157B, Spring 2007
Week 11 Knowledge Discovery Systems & Data Mining :
Supporting End-User Access
Presentation transcript:

Data Resource Management – MGMT 4170

An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical Data OLAP CUBE 2 Warehouse Cube OLAP CUBE 3 Store Cube Transaction Processing System -RDBMS- Relational Data ETL Packages Cube Developer DW DeveloperETL Developer Operational Users OLAP client Excel OLAP client Application OLAP client web based DM Model 1 DM Model 2 DM Model 3 Data Miner

Information Retrieval Evolution

Predictive Analysis PresentationExplorationDiscovery Passive Interactive Proactive Role of Software Business Insight Static Reports Dynamic Reports - Parameters OLAP Data mining DM Enables Predictive Analysis copyright Rafal Lukawiecki Cross tabbing - Pivoting

Basic Data Mining Terminology Data mining needs historical data. Hence the importance of the data warehouse. The DW does not only provide historical data; it is designed to respond to business needs by OLAP cubes and data mining models. The data mining software would use this historical data to build prediction models like customer behavior or product sales. We build a model, we test it in cases we know the answer, we verify its predictive power and then we apply the model in situations where we do not know the answer.

Data Mining Modeling Data warehouse historical data Data Mining Algorithm Data Mining Model We build a model from data we have in the data warehouse. We then test the model to verify its predictive power. We can then apply the model in new data.

Historical Data

Data Mining Models

What skills a data miner needs Good business sense and good-to-excellent working relationships with the business folks. Good-to-excellent knowledge of Integration Services and SQL. A good understanding of statistics and probability. Data mining experience.

Basic Data Mining Terminology Dependent variable(s): The variable we are trying to predict like the likelihood of purchase. Independent variable(s): The variables which provide the data used to build the model like home ownership, education level, cars owned, etc. Algorithm: The programmatic technique used to identify the relationships or patterns in the data.

Basic Data Mining Terminology Continuous variables: variables with decimal numbers or uncertain quantities are continuous. A column in an employee table such as Salary that contains a variety of actual salary values is a continuous variable. Discrete variables: You can add a column to the table during data preparation called SalaryRange, containing integers to represent encoded salary ranges (1 = "0 to $25,000"; 2 = "between $25,000 and $50,000"; and so on). This is a discrete variable.

Data Mining Models The basic algorithms include: Classification Estimation Prediction Affinity grouping Clustering Description and profiling.

Classification Definition Classification is the task of assigning each item in a set to one predetermined set based on its attributes or behaviors (buyer or non buyer). We can identify classes of consumers who have common geographic, demographic, economic, and behavioral attributes and can be expected to respond to certain opportunities in a similar way. Classification assigns an item to a specific class based on a discrete variable value like 0 or 1. Determining whether someone is likely to respond to a direct mail piece involves putting them in the category of Likely Responder or not. Algorithms to build the models: Decision Trees Neural Networks Naïve Bayes

Estimation (likelihood to respond) Definition Estimation is the continuous version of classification. That is to say, where classification returns a discrete value like 0 or 1, estimation returns a continuous number. For example, a promotions manager with a budget for 200,000 pieces and a list of 12 million prospects would use the predicted Response_Likeiihood variable to limit the target subset. Including only those prospects with a Response_Likelihood greater than some number, say 0.80, would give the promotions manager a target list of the top 200,000 prospects. Most of the estimation algorithms are based on regression analysis techniques. As a result, this category is often called regression. Algorithms to use: Decision Trees Neural Networks

Prediction (Predicting a value) Definition Prediction seeks to determine a value as accurately as possible before the value is known. This future- oriented element is what places prediction in its own category. For example, a lending company offering mortgages might want to predict the market value of a piece of property before it's sold regardless of the actual amount has been offered for the given property. In order to build a predictive data mining model, the company needs a training set that includes predictive attributes that are known prior to the sale, such as total square footage, number of bathrooms, city, school district, and the actual sale price of each property in the training set. The data mining algorithm uses this training set to build a model based on the relationships between the predictive variables and the known historical sale price. The model can then be used to predict the sale price of a new property based on the known input variables about that property. One feature of predictive models is that their accuracy can be tested. At some point in the future, the actual sale amount of the property will become known and can be compared to the predicted value. Algorithms to use: Decision Trees Neural Networks When prediction involves time series data, it is often called forecasting. Time Series is the first choice algorithm for predicting time series data, like monthly sales forecasts.

Association (market basket analysis) Definition Association looks for correlations among items. E-commerce systems are big users of association models in an effort to increase sales. This can take the form of an association modeling process known as market basket analysis. The online retailer first builds a model based on the contents of recent shopping carts and makes it available to the web server. As the shopper adds products to the cart, the system feeds the contents of the cart into the model. The model identifies items that commonly appear with the items currently in the cart. Most recommendation systems are based on association algorithms. Algorithms to use: Association Decision Trees

Clustering (Segmentation) Definition Clustering can be thought of as auto-classification. Clustering algorithms group cases into clusters that are as similar to one another, and as different from other clusters, as possible. The clusters are not predetermined, and it's up to the data miner to examine the clusters to understand what makes them unique. When applied to customers, this process is also known as customer segmentation. The idea is to segment the customers into smaller, homogenous groups that can be targeted with customized promotions and even customized products. One form of clustering is to identify frequent sequences in the data. For example, a consumer electronics product manufacturer's website might identify several clusters of users based on their browsing behavior. Algorithms to use: Clustering Sequence Clustering