Intro to Machine Learning

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Godfather to the Singularity
Intelligent Environments1 Computer Science and Engineering University of Texas at Arlington.
An Overview of Machine Learning
Chapter 2: Pattern Recognition
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Introduction to Machine Learning course fall 2007 Lecturer: Amnon Shashua Teaching Assistant: Yevgeny Seldin School of Computer Science and Engineering.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Learning Programs Danielle and Joseph Bennett (and Lorelei) 4 December 2007.
Chapter 5 Data mining : A Closer Look.
Introduction to machine learning
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Overview DM for Business Intelligence.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.
NEURAL NETWORKS FOR DATA MINING
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Decision Trees Jyh-Shing Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Artificial Intelligence, Expert Systems, and Neural Networks Group 10 Cameron Kinard Leaundre Zeno Heath Carley Megan Wiedmaier.
Data Mining and Decision Support
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Computacion Inteligente Least-Square Methods for System Identification.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Linear Classifiers (LC) J.-S. Roger Jang ( 張智星 ) MIR Lab, CSIE Dept. National Taiwan University.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Introduction to Machine Learning, its potential usage in network area,
Usman Roshan Dept. of Computer Science NJIT
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
CS 9633 Machine Learning Support Vector Machines
Machine Learning with Spark MLlib
CSIE Dept., National Taiwan Univ., Taiwan
Machine Learning for Computer Security
Quadratic Classifiers (QC)
Introduction to Pattern Recognition
National Taiwan University
Neural Networks and Its Deep Structures
Intro to Machine Learning
School of Computer Science & Engineering
National Taiwan University
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Feature Selection for Pattern Recognition
Introductory Seminar on Research: Fall 2017
Machine Learning Ali Ghodsi Department of Statistics
Neuro-Computing Lecture 4 Radial Basis Function Network
Neuro-Fuzzy and Soft Computing for Speaker Recognition (語者辨識)
Overview of Machine Learning
PROJECTS SUMMARY PRESNETED BY HARISH KUMAR JANUARY 10,2018.
Deep Neural Networks (DNN)
Machine Learning in FinTech
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Intro to Machine Learning
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Overview of deep learning
Multivariate Methods Berlin Chen
Scientific Computing: Closing 科學計算:結語
Machine Learning with Clinical Data
Naive Bayes Classifiers (NBC)
Usman Roshan Dept. of Computer Science NJIT
Presentation transcript:

Intro to Machine Learning J.-S. Roger Jang (張智星) jang@mirlab.org http://mirlab.org/jang MIR Lab, CSIE Dept. National Taiwan University

Machine Learning (ML) Definition Concepts Field of study that gives computers the ability to learn without being explicitly programmed – by Arthur Samuel, 1959. Computational methods that use existing data to make predictions Concepts 視其所以,觀其所由,察其所安,人焉廋哉?人焉廋哉? 近朱者赤、近墨者黑 Artificial intelligence Machine learning

More about Machine Learning Quick examples Use height and weight to predict gender Use historical data to predict stock market Comparison Before ML  Rule-based After ML  Data-driven Prerequisites for ML Probability & statistics Linear algebra Optimization …

Broad Areas of ML Classification: Assign a category to a given object Determine if it will rain tomorrow Regression: Predict a real-value for a given object Predict the exchange rate Ranking: Order objects according to some criterion Rank the webpages returned by a search engine Clustering: Partition data into homogeneous groups Dimensionality reduction: Find low-dimensional manifold preserving some properties of the data Density estimation: Learning probability density function according to sample data

2018/12/6 Concept of Modeling Given desired i/o pairs (training set) of the form (x1, ..., xn; y), construct a model to match the i/o pairs Two steps in modeling structure identification: input selection, model complexity parameter identification: optimal parameters x1 xn . . . Unknown target system y Model f() y* Given an input condition, it very easy to derive the overall output. However, our task is more than that. What we want to do is to construct an appropriate fuzzy inference system that can match a desired input/output data set of a unknown target system. This is called fuzzy modeling and it a branch of nonlinear system identification. In general, fuzzy modeling involves two step. The first step is structure identification, where we have to find suitable numbers of fuzzy rules and membership functions. In FLT, this is accomplished by subtractive clustering proposed by Steve Chiu in Rockwell Science Research Center. The second step is parameter identification, where we have to find the optimal parameters for membership functions as well as output linear equations. In the FLT, this is done by ANFIS, which was proposed by me back in 1991. Since ANFIS uses some neural network training techniques, it is also classified as one of the neuro-fuzzy modeling approaches.

Example of Modeling Source: https://www.slideshare.net/tw_dsconf/ss-70083878

More ML Application Examples Optical character recognition Document classification Part-of-speech tagging Speech processing Recognition synthesis Image recognition Face recognition Info. retrieval Recommendation search Security Fraud detection (credit card, telephone) Network intrusion Video surveillance Speaker id. Self-driving cars

What People Do with ML For practitioners For researchers Acquire application domain knowledge Know properties of machine learning methods Get familiar with ML tools For researchers Design new learning algorithms Speed up the learning process Derive theoretical bound of accuracy Take care of data Big data Imbalanced data Missing data …

First Look at a Dataset GHW (gender-height-weight) dataset 10000 entries with 3 columns Gender: Categorical Height: Numerical Weight: Numerical Source

Multiple Ways of Prediction Multiple ways of prediction based on GHW dataset Classification: Height, weight  gender Regression: Gender, height  weight Regression: Gender, weight  height In general Classification: the output is categorical Regression: the output is numerical Sometimes it’s not so obvious! Quiz!

List of Classifiers Commonly used classifiers K-nearest-neighbor classifiers Quadratic classifiers Naïve Bayes classifiers Linear classifiers Single-layer perceptrons SVM Neural networks Multilayer perceptrons DNN Radial-basis function networks Classification and regression trees (CART) Random forests …

A Canonical Way to ML Data processing Model construction Dataset collection Feature extraction Missing value imputation Visualization Outlier removal Feature normalization Model construction Classifier Structure id Feature selection Feature extraction Parameter id Cross validation for perf. evaluation Error analysis Confusion matrix DET/ROC plots PR plot

One Step Further  AutoML Machine learning for automated algorithm design Keys Transfer learning Learning to learn Hyper-parameter tuning Resources ML4AAD Google’s Cloud AutoML Official site Media coverage

An Example & On-going Projects An example of canonical ML (with a flavor of AutoML) Leaf identification On-going projects Customer’s behavior prediction (Esun Bank) Plant image recognition (ITRI) Speaker verification (China Trust) Healthcare and medical data analytics (NTU Hospital)