Mining the Data Ira M. Schoenberger, FACHCA Senior Administrator 2011 AHCA/NCAL Quality Symposium Friday February 18, 2011.

Slides:



Advertisements
Similar presentations
Copyright © 2012, SAS Institute Inc. All rights reserved. INTRODUCTION TO DATA AND TEXT MINING ANDREW PEASE, 8 MARCH 2013.
Advertisements

Data Mining: What? WHY? HOW?
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
DAMA-NCR Tuesday, November 13, 2001 Laura Squier Technical Consultant What is Data Mining?
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
An Introduction to Assurance and Financial Statement Auditing
SLIDE 1IS 257 – Fall 2009 Data Mining and the Weka Toolkit University of California, Berkeley School of Information IS 257: Database Management.
Business Intelligence /Decision Models Dr. Richard Michon TRSM 1-040Ext. 7454
/faculteit technologie management Introduction to Data Mining a.j.m.m. (ton) weijters (slides are partially based on an introduction of Gregory Piatetsky-Shapiro)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining.
SLIDE 1IS 257 – Fall 2008 Data Mining and the Weka Toolkit University of California, Berkeley School of Information IS 257: Database Management.
Data Mining – Intro.
DataMining By Guan Hang Su CS157A section 2 fall 2005.
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge.
Software Process and Product Metrics
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
SLIDE 1IS 257 – Fall 2012 Data Mining and OLAP University of California, Berkeley School of Information IS 257: Database Management.
Teaching Data Mining: The New “Required Competency” for Marketing Professionals Today’s Presenters: Tom Nugent Kenneth Elliott, Ph.D.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
Chapter 19Copyright ©2008 by South-Western, a division of Thomson Learning. All rights reserved 1 MKTG Designed by Amy McGuire, B-books, Ltd. Prepared.
1 REVIEW LEARNING OUTCOME Customer Relationship Management LO I.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
DATA MINING Team #1 Kristen Durst Mark Gillespie Banan Mandura University of DaytonMBA APR 09.
Data Mining Chun-Hung Chou
4.04 Understand marketing- research activities to show command of their nature and scope.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
The CRISP-DM Process Model
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Data Mining Process A manifestation of best practices A systematic way to conduct DM projects Different groups has different versions Most common standard.
Chapter 19Copyright ©2008 by South-Western, a division of Thomson Learning. All rights reserved 1 MKTG Designed by Amy McGuire, B-books, Ltd. Prepared.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
The CRISP Data Mining Process. August 28, 2004Data Mining2 The Data Mining Process Business understanding Data evaluation Data preparation Modeling Evaluation.
1. 2 Traditional Income Statement LO1: Prepare a contribution margin income statement.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
CogNova Technologies 1 Evaluating Induced Models Evaluating Induced Models with Daniel L. Silver Daniel L. Silver Copyright (c), 2004 All Rights Reserved.
Data Analysis Superintendents Trust. Increase test scores and graduation rates through targeted efforts and investments that lead to student success Proactively.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
October 2-3, 2015, İSTANBUL Boğaziçi University Prof.Dr. M.Erdal Balaban Istanbul University Faculty of Business Administration Avcılar, Istanbul - TURKEY.
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Week 11 Knowledge Discovery Systems: Systems That Create Knowledge.
Data Mining Copyright KEYSOFT Solutions.
Research Word has a broad spectrum of meanings –“Research this topic on ….” –“Years of research has produced a new ….”
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Chapter 29 Conducting Market Research. Objectives  Explain the steps in designing and conducting market research  Compare primary and secondary data.
Data Mining Introduction to data mining concepts.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Machine Learning with Spark MLlib
Data Mining – Intro.
Data Mining in Higher Education
Data Based Decision Making
MIS2502: Data Analytics Advanced Analytics - Introduction
Data Mining 101 with Scikit-Learn
Knowledge Discovery Systems: Systems That Create Knowledge
Week 11 Knowledge Discovery Systems & Data Mining :
What is Data Mining? DAMA-NCR Tuesday, November 13, 2001 Laura Squier
KNOWLEDGE MANAGEMENT (KM) Session # 37
Data Warehousing Data Mining Privacy
CSE591: Data Mining by H. Liu
Presentation transcript:

Mining the Data Ira M. Schoenberger, FACHCA Senior Administrator 2011 AHCA/NCAL Quality Symposium Friday February 18, 2011

Agenda Understand the concept of Data Mining Discuss how information can be converted into knowledge about historical patterns and future trends Learn how the process can be used to increase quality in your organization

.. Data Mining Is A hot buzzword for a class of techniques that find patterns in data A user-centric, interactive process which leverages analysis technologies and computing power A group of techniques that find relationships that have not previously been discovered Not reliant on an existing database A relatively easy task that requires knowledge of the business problem/subject matter expertise

Data mining is not Brute-force crunching of bulk data “Blind” application of algorithms Going to find relationships where none exist Presenting data in different ways A database intensive task A difficult to understand technology requiring an advanced degree in computer science

Data Mining Versus Statistical Analysis Data Analysis –Tests for statistical correctness of models Are statistical assumptions of models correct? –Eg Is the R-Square good? –Hypothesis testing Is the relationship significant? –Use a t-test to validate significance –Tends to rely on sampling –Techniques are not optimised for large amounts of data –Requires strong statistical skills Data Mining –Originally developed to act as expert systems to solve problems –Less interested in the mechanics of the technique –If it makes sense then let’s use it –Does not require assumptions to be made about data –Can find patterns in very large amounts of data – Requires understanding of data and business problem

Examples of What Organizations are Doing with Data Mining: Fraud/Non-Compliance Anomaly detection –Isolate the factors that lead to fraud, waste and abuse –Target auditing and investigative efforts more effectively Credit/Risk Scoring Intrusion detection Parts failure prediction Customer Satisfaction Surveys Revenue/Profitability Expenses Recruiting/Attracting customers Maximizing profitability (cross selling, identifying profitable customers) Service Delivery and Customer Retention –Build profiles of customers likely to use which services Web Mining Clinical Indicators-falls, unplanned transfers, weight loss, pressure ulcers etc. Retention/Turnover

What data mining has done for... HSBC need to cross-sell more effectively by identifying profiles that would be interested in higher yielding investments and... Reduced direct mail costs by 30% while garnering 95% of the campaign’s revenue.

What data mining has done for... analyzed suspects’ cell phone usage to focus investigations. The US Drug Enforcement Agency needed to be more effective in their drug “busts” and

What data mining has done for… Scheduled its workforce to provide faster, more accurate answers to questions. The US Internal Revenue Service needed to improve customer service and...

Data Mining Process

Steps in the DM Process (1 & 2) Business Understanding: –Statement of Business Objective –Statement of Data Mining objective –Statement of Success Criteria Data Understanding –Explore the data and verify the quality –Find outliers

Steps in the DM Process (3) Data preparation: –Takes usually over 90% of our time Collection Assessment Data selection –active role in ignoring non-contributory data? –outliers? Transformations - create new variables

Steps in the DM Process (4) Model building –Selection of the modeling techniques is based upon the data mining objective –Modeling is an iterative process - different for supervised and unsupervised learning May model for either description or prediction

Steps in the DM Process (5) Model Evaluation –Evaluation of model: how well it performed on test data –Methods and criteria depend on model type: e.g., decision tree matrix,classification models, mean error rate with regression models –Interpretation of model: important or not, easy or hard depends on algorithm

Phases in the DM Process (6) Deployment –Determine how the results need to be utilized –Who needs to use them? –How often do they need to be used Deploy Data Mining results by: –Scoring a database –Utilizing results as business rules –interactive scoring on-line

How would you intrepid this?

How would you interpret this Data?

How would you Interpret this Data?

How would you interpret this Data?

Final Comments Data Mining can be utilized in any healthcare organization that needs to find patterns or relationships in their data. By using Data Mining you can have a reasonable level of assurance that your efforts will render useful, repeatable, and valid results.

QUESTIONS Ira M. Schoenberger FACHCA Senior Administrator Heritage Hall Campus