DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

Supporting End-User Access
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
EXPERT SYSTEMS apply rules to solve a problem. –The system uses IF statements and user answers to questions in order to reason just like a human does.
Data warehouse example
Chapter 9 Business Intelligence Systems
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
MP3 / MD740 Strategy & Information Systems Oct. 13, 2004 Databases & the Data Asset, Types of Information Systems, Artificial Intelligence.
Data Mining Knowledge Discovery in Databases Data 31.
Data Mining.
Classical Techniques: Statistics, Neighborhoods, and Clustering.
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Knowledge Discovery Centre: CityU-SAS Partnership 1 Speakers: Prof Y V Hui, CityU Dr H P Lo, CityU Dr Sammy Yuen, CityU Dr K W Cheng, SAS Institute Mr.
Data Mining – Intro.
Data mining By Aung Oo.
DataMining By Guan Hang Su CS157A section 2 fall 2005.
Data Mining: A Closer Look
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Enterprise systems infrastructure and architecture DT211 4
Chapter 4 Data, Text, and Web Mining
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Warehousing by Industry Chapter 4 e-Data. Retail Data warehousing’s early adopters Capturing data from their POS systems  POS = point-of-sale Industry.
Data Mining Techniques
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Data Mining By : Tung, Sze Ming ( Leo ) CS 157B. Definition A class of database application that analyze data in a database using tools which look for.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Data Mining By Dave Maung.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Business Intelligence - 2 BUS 782. Topics Data warehousing Data Mining.
Prediction as Data Mining Task Definition and business-related examples Prepared by Huan Truong Omer Demir.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Data Mining Basics. “Copyright and Terms of Service Copyright © Texas Education Agency. The materials found on this website are copyrighted © and trademarked.
CHAPTER 8 DATA MINING BASICS.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
Data Mining Copyright KEYSOFT Solutions.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Department of Computer Science Sir Syed University of Engineering & Technology, Karachi-Pakistan. Presentation Title: DATA MINING Submitted By.
Data Mining is the process of analyzing data and summarizing it into useful information Data Mining is usually used for extremely large sets of data It.
Data Mining.
Introduction BIM Data Mining.
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Data and Applications Security Introduction to Data Mining
Adrian Tuhtan CS157A Section1
MIS5101: Data Analytics Advanced Analytics - Introduction
Data Science introduction.
Supporting End-User Access
Understanding Customer Behaviors with Information Technologies
MIS2502: Data Analytics Introduction to Advanced Analytics
Kenneth C. Laudon & Jane P. Laudon
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Presentation transcript:

DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER

Data Rich, Information Poor The amount of raw data stored in corporate databases is exploding. For instance, every day, Wal-Mart uploads 20 million point-of-sale transactions to an A&T massively parallel system with 483 processors running a centralized database. Raw data by itself, however, does not provide much information

Data Warehouses The drop in price of data storage has given companies willing to make the investment a tremendous resource: Data about their customers and potential customers stored in "Data Warehouses."Data Warehouses A data warehouse stores large quantities of data by specific categories so it can be more easily retrieved, interpreted, and sorted by users

Data Warehouses(cont’d) Companies will want to learn more about that data to improve knowledge of customers and markets The company benefits when meaningful trends and patterns are extracted from the data.

What is Data Mining? Data mining, or knowledge discovery, is the computer-assisted process of digging through and analyzing enormous sets of data and then extracting the meaning of the dataData mining Data mining tools predict behaviors and future trends, allowing businesses to make proactive, knowledge-driven decisions.

How Data Mining Works That technique that is used to perform these feats is called modeling Modeling is simply the act of building a model (a set of examples or a mathematical relationship) based on data from situations where the answer is known and then applying the model to other situations where the answers aren't known

Data Mining Technologies The analytical techniques used in data mining are often well-known mathematical algorithms and techniques What is new is the application of those techniques to general business problems made possible by the increased availability of data and inexpensive storage and processing power

Data Mining Technologies(cont’d) Some of the tools used for data mining are: Artificial neural networks - Non-linear predictive models that learn through training and resemble biological neural networks in structure.Artificial neural networks Decision trees - Tree-shaped structures that represent sets of decisions. These decisions generate rules for the classification of a dataset.

Data Mining Technologies(cont’d) Rule induction - The extraction of useful if-then rules from data based on statistical significance Genetic algorithms - Optimization techniques based on the concepts of genetic combination, mutation, and natural selection.Genetic algorithms Nearest neighbor - A classification technique that classifies each record based on the records most similar to it in an historical database.

Real-World Examples Details about who calls whom, how long they are on the phone, and whether a line is used for fax as well as voice can be invaluable in targeting sales of services and equipment to specific customers. But these tidbits are buried in masses of numbers in the database Using its data mining system, it discovered how to pinpoint prospects for additional services by measuring daily household usage for selected periods

Real-World Examples(cont’d) For example, households that make many lengthy calls between 3 p.m. and 6 p.m. are likely to include teenagers who are prime candidates for their own phones and lines

The Future of Data Mining In the short-term, the results of data mining will be in profitable, if mundane, business related areas. Micro-marketing campaigns will explore new niches. Advertising will target potential customers with new precision

Privacy Concerns What if every telephone call you make, every credit card purchase you make, every flight you take, every visit to the doctor you make, every warranty card you send in, every employment application you fill out, every school record you have, your credit record, every web page you visit... was all collected together?