Oracle Advanced Analytics

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

1. Abstract 2 Introduction Related Work Conclusion References.
Week 9 Data Mining System (Knowledge Data Discovery)
An overview of The IBM Intelligent Miner for Data By: Neeraja Rudrabhatla 11/04/1999.
Data Mining Concepts 1.1 COT5230 Data Mining Week 1 Data Mining Concepts M O N A S H A U S T R A L I A ’ S I N T E R N A T I O N A L U N I V E R S I T.
Data Mining – Intro.
Multimedia Data Mining Arvind Balasubramanian Multimedia Lab (ECSS 4.416) The University of Texas at Dallas.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Overview DM for Business Intelligence.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Anomaly detection with Bayesian networks Website: John Sandiford.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
CONFIDENTIAL1 Hidden Decision Trees to Design Predictive Scores – Application to Fraud Detection Vincent Granville, Ph.D. AnalyticBridge October 27, 2009.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Data Mining and Decision Support
DR. SATISH NARGUNDKAR GEORGIA STATE UNIVERSITY Analytics Overview.
Data Mining Copyright KEYSOFT Solutions.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
Data Mining Practical Machine Learning Tools and Techniques Chapter 6.3: Association Rules Rodney Nielsen Many / most of these slides were adapted from:
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
CSE 4705 Artificial Intelligence
Pengantar Sistem Informasi
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining – Intro.
Data Mining ICCM
Data Transformation: Normalization
Data Mining Motivation: “Necessity is the Mother of Invention”
Sentiment analysis algorithms and applications: A survey
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
School of Computer Science & Engineering
A Research Oriented Study Report By :- Akash Saxena
Data Mining 101 with Scikit-Learn
Waikato Environment for Knowledge Analysis
Introduction to Azure Machine Learning Studio
MIS5101: Data Analytics Advanced Analytics - Introduction
I don’t need a title slide for a lecture
Supporting End-User Access
Word Embedding Word2Vec.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
MIS2502: Data Analytics Introduction to Advanced Analytics
Topic 5: Cluster Analysis
Machine Learning – a Probabilistic Perspective
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Data Pre-processing Lecture Notes for Chapter 2
Presentation transcript:

Oracle Advanced Analytics PRODUCT Oracle Data Miner & Oracle Enterprise R Data mining, machine learning & data science in general are primarily algorithms. Open source R, in addition, is a programming language, primarily made for statistics, but it is a language and does much more. ORE is R that can run in the database and without data volume limitations. Both exist for the Oracle DB EE and for the BDA.

Oracle data mining algorithms (2/7): Technique Application areas Algorithm(s) CLASSIFICATION   Most commonly used technique for predicting a specific outcome such as response / no-response, high / medium / low-value customer, likely to buy / not buy. Logistic Regression —classic statistical technique but now available inside the Oracle Database and supports text and transactional data Naive Bayes —Fast, simple, commonly applicable Support Vector Machine—Next generation, supports text and wide data Decision Tree —Popular, provides human-readable rules REGRESSION Technique for predicting a continuous numerical outcome such as customer lifetime value, house value, process yield rates. Multiple Regression —classic statistical technique but now available inside the Oracle Database and supports text and transactional data Support Vector Machine —Next generation, supports text and wide data

Oracle data mining algorithms (4/7): Technique Application areas Algorithm(s) ATTRIBUTE IMPORTANCE   Ranks attributes according to strength of relationship with target attribute. Use cases include finding factors most associated with customers who respond to an offer, factors most associated with healthy patients. Minimum Description Length—Considers each attribute as a simple predictive model of the target class ANOMALY DETECTION Identifies unusual or suspicious cases based on deviation from the norm. Common examples include health care fraud, expense report fraud, and tax compliance.  One-Class Support Vector Machine —Trains on "normal" cases to flag unusual cases

Oracle data mining algorithms (6/7): Technique Application areas Algorithm(s) CLUSTERING   Useful for exploring data and finding natural groupings. Members of a cluster are more like each other than they are like members of a different cluster. Common examples include finding new customer segments, and life sciences discovery. Enhanced K-Means—Supports text mining, hierarchical clustering, distance based Orthogonal Partitioning Clustering—Hierarchical clustering, density based Expectation Maximization—Clustering technique that performs well in mixed data (dense and sparse) data mining problems. ASSOCIATION Finds rules associated with frequently co-occurring items, used for market basket analysis, cross-sell, root cause analysis. Useful for product bundling, in-store placement, and defect analysis. Apriori—Industry standard for market basket analysis

Oracle data mining algorithms (7/7): Technique Application areas Algorithm(s) FEATURE SELECTION and EXTRACTION   Produces new attributes as linear combination of existing attributes. Applicable for text data, latent semantic analysis, data compression, data decomposition and projection, and pattern recognition. Non-negative Matrix Factorization—Next generation, maps the original data into the new set of attributes Principal Components Analysis (PCA)—creates new fewer composite attributes that respresent all the attributes. Singular Vector Decomposition—established feature extraction method that has a wide range of applications.

Oracle text mining (x IBM Watson) Sentiment analysis on social media, document- centric and text-centric analyses etc. all use techniques based on analysis of text. And there is this strange myth that says that Oracle does not have text analytics. Take a look at these: Oracle Text on 12c: http://www.oracle.com/technetwork/testcontent/index-098492.html Oracle text mining: https://docs.oracle.com/cd/B28359_01/datamine.111/b28129/text.htm Tutorial (text mining with Oracle Data Miner): http://www.oracle.com/webfolder/technetwork/tutorials/obe/db/11g/r2/pr od/bidw/datamining/ODM11gR2-TextMining.htm Table 20 – Oracle Data Mining Algorithms that Support Text Algorithm Mining Function Naïve Bayes Classification Generalized Linear Models Classification. Regression Support Vector Machine Classification, Regression, Anomaly Detection k-Means Clustering Non-Negative Matrix Factorization Feature Extraction Apriori Association Rules Minimum Descriptor Length Attribute Importance

R in R Studio (screen capture): PREDICT FAILURE!