CSCI 200 Data MINING Lecture 1.

Slides:



Advertisements
Similar presentations
Data Mining in Computer Games By Adib Adam Hussain & Mohammed Sarfraz.
Advertisements

Examples of data mining Marketing & Advertisement (Case of Bank of America) In the past, each caller would have listened to the same marketing advertisement,
Target Markets: Segmentation and Evaluation
Chapter 9 Business Intelligence Systems
Chapter 9 Competitive Advantage with Information Systems for Decision Making © 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Building Knowledge-Driven DSS and Mining Data
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Data Mining: A Closer Look
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Business Intelligence
Science Inquiry Minds-on Hands-on.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,
Chapter 5: Data Mining for Business Intelligence
Data Mining Techniques
Exploring Marketing Research William G. Zikmund Chapter 2: Information Systems and Knowledge Management.
Data Mining Chun-Hung Chou
Understanding Data Analytics and Data Mining Introduction.
Business Intelligence. business intelligence is a broad category of applications and technologies for gathering, providing access to, and analyzing data.
Chapter 9 Business Intelligence and Information Systems for Decision Making.
Target Markets: Segmentation and Evaluation
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Chapter 1 Introduction to Data Mining
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Business Intelligence and Reports Creation - Basic Concepts - August BTA Bootcamp As a Villanova graduate you are well positioned for a career.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Business Plug-In B18 Business Intelligence.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Introduction to Information Systems Chapter One. IS for Management2 Information Concepts Knowledge Information Data Raw facts A collection of facts organized.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
Data Mining and Decision Support
Knowledge Discovery and Data Mining 19 th Meeting Course Name: Business Intelligence Year: 2009.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Supplemental Chapter: Business Intelligence Information Systems Development.
Chapter 1 marketing is all around us Section 1.1
01-Business intelligence
Chapter 5: Target Markets: Segmentation and Evaluation
Data Mining Functionalities
Data Mining.
SNS COLLEGE OF TECHNOLOGY
Decision Support Systems
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
MIS2502: Data Analytics Advanced Analytics - Introduction
Multivariate Analysis - Introduction
Big-Data Fundamentals
Adrian Tuhtan CS157A Section1
Data Mining: Concepts and Techniques Course Outline
Data Analysis.
DATA MINING.
Supporting End-User Access
MIS2502: Data Analytics Clustering and Segmentation
Course Introduction CSC 576: Data Mining.
MIS2502: Data Analytics Clustering and Segmentation
Day 5 Morning 1. Attendance/Recap/Questions
Business Intelligence
MIS2502: Data Analytics Introduction to Advanced Analytics
Chapter 12 Analyzing Semistructured Decision Support Systems
Business Intelligence
Marketing Research and Consumer Behavior Insights
Presentation transcript:

CSCI 200 Data MINING Lecture 1

Recommended Textbooks and Resources 1. Applied predictive Analytics: Principles and Techniques for the Professional Data Analyst, Dean Abbott, Wiley, ISBN-13: 978-1118727966 http://www.wiley.com/WileyCDA/WileyTitle/productCd-1118727967.html www.abbottanalytics.com 2. Data Mining and Business Analytics with R, Johannes Ledolter, Willey, ISBN: 978-1-118-44714-7 http://www.biz.uiowa.edu/faculty/jledolter/DataMining/ 3. Data Mining for Business Intelligence, Galit Shmueli, Nitin R. Patel, Peter C. Bruce, Willey, ISBN – 13: 9780470526828 http://www.dataminingbook.com/user/register

Recommended Textbooks and Resources 4. QlikView Your Business: An expert guide to Business Discovery with QlikView and Qlik Sense, Oleg Troyansky, Tammy Gibson, Charlie Leichtweis, Lars Bjork (Foreword by) http://www.wiley.com/WileyCDA/WileyTitle/productCd-1118949552.html 5. Discovering Knowledge in Data: An Introduction to Data Mining 2nd Edition by Daniel T. Larose http://www.dataminingconsultant.com/DKD2e.htm 6. Business Intelligence: A Managerial Approach, Efraim Turban; Ramesh Sharda; Dursun Delen; David King

Recommended Textbooks and Resources 7. Qlik Website http://www.qlik.com/ 8. R Studio, R https://www.rstudio.com/ https://www.r-project.org/ 9. W. Eckerson, Smart Companies in the 21st Century: The Secrets of Creating Successful Business Intelligent Solutions. http://download.101com.com/tdwi/research_report/2003BIReport_v7.pdf 10. Garcia, M., Harmsen, B. (2012). QlikView 11 for Developers. Birmingham: Packt Publishing

Data Mining Common Tasks Daniel T. Larose [5] Data mining is the process of discovering useful patterns and trends in large data sets. Common Data Mining Tasks Description Estimation Prediction Classification Clustering Association

Description Daniel T. Larose [5] Description of patterns and trends lying within the data. Exploratory data analysis, a graphical method of exploring the data in search of patterns and trends Example: a pollster may uncover evidence that those who have been laid off are less likely to support the present incumbent in the presidential election. Descriptions of patterns and trends often suggest possible explanations for such patterns and trends. For example, those who are laid off are now less well off financially than before the incumbent was elected, and so would tend to prefer an alternative.

Estimation and Prediction Daniel T. Larose [5] approximate the value of a numeric target variable using a set of numeric and/or categorical predictor variables Example: Estimating the amount of money a randomly chosen family of four will spend for back-to-school shopping this fall Example: Estimating the grade point average (GPA) of a graduate student, based on that student’s undergraduate GPA Prediction Similar to estimation, except that for prediction, the results lie in the future. Example: Predicting whether a particular molecule in drug discovery will lead to a profitable new drug for a pharmaceutical company.

Classification, Daniel T. Larose [5] Similar to estimation, except that the target variable is categorical rather than numeric Categorical variables represent types of data which may be divided into groups. Example: income bracket - high income, middle income, and low income Suppose the researcher would like to be able to classify the income bracket of new individuals not in the current database, based on age, gender, and occupation

Clustering, Daniel T. Larose [5] Grouping of records, observations, or cases into classes of similar objects. A cluster is a collection of records that are similar to one another, and dissimilar to records in other clusters. Clustering differs from classification in that there is no target variable for clustering. The clustering task does not try to classify, estimate, or predict the value of a target variable

Clustering, Daniel T. Larose [5] Nielsen MyBestSegments is in the clustering business which provides a demographic profile of each oi the geographic areas in the country, as defined by zip code Clustering Mechanisms - PRIZM segmentation system, which describes every American zip code area in terms of distinct lifestyle types. https://segmentationsolutions.nielsen.com/mybestsegments/Default.jsp?ID=0&menuOption=home&pageName=Home&filterstate=&sortby=segment_code&prevSegID=CLA.PZP

Association Daniel T. Larose [5] The association task for data mining is the job of finding which attributes “go together.” The task of association seeks to uncover rules for quantifying the relationship between two or more attributes Example: Investigating the proportion of subscribers to your company’s cell phone plan that respond positively to an offer of a service upgrade Example: Finding out which items in a supermarket are purchased together, and which items are never purchased together

Different Terms Same Meaning Dean Abbott Analytics is the process of using computational methods to discover and report influential patterns in data. The goal of analytics is to gain insight and often to affect decisions. 2005, Google Analytics. The ideas behind analytics are not new Different terms for analytics: cybernetics, data analysis, neural networks, pattern recognition, statistics, knowledge discovery, data mining, and now even data science.

Business Intelligence Efraim Turban; Ramesh Sharda; Dursun Delen; David King Business intelligence (BI) is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies. BI’s objectives: to enable interactive access (sometimes in real time) to data, to enable manipulation of data to give business managers and analysts the ability to conduct appropriate analysis.

Decisions Actions DATA INFORMATION Business Intelligence Efraim Turban; Ramesh Sharda; Dursun Delen; David King By analyzing historical and current data, situations, and performances, decision makers get valuable insights that enable them to make more informed and better decisions. The process of BI is based on the transformation of data to information, then to decisions, and finally to actions DATA INFORMATION Decisions Actions

Major Components of Business Intelligence Data Warehouse Business Analytics - a collection of tools for manipulating, mining, and analyzing the data in the data warehouse Business Performance Management (BPM) for monitoring and analyzing performance User Interface

Many Information Products RAW DATA Many Information Products From Data to Information - a data warehouse extracts data from multiple transaction or operational systems and integrates and stores the data in a dedicated database. This extraction and integration process turns data into a new product - information From Information to Knowledge. Then, users equipped with analytical tools access and analyze the information in the data warehouse. Their analysis identifies trends, patterns, and exceptions. Analytical tools enable users to turn information into knowledge.

BI environment takes raw material — data—and processes it into a many information products. From Knowledge to Rules. Armed with these insights, users then create rules from the trends and patterns they have discovered. From Rules to Plans and Action. Users then create plans that implement the rules. The plans turn knowledge and rules into action. Feedback Loop. Once the plan is executed, the cycle repeats itself.