Intro to Machine Learning

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Introduction to Machine Learning BITS C464/BITS F464
Data Visualization STAT 890, STAT 442, CM 462
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
CIS 678 Artificial Intelligence problems deduction, reasoning knowledge representation planning learning natural language processing motion and manipulation.
Chapter 2: Pattern Recognition
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Introduction to Machine Learning course fall 2007 Lecturer: Amnon Shashua Teaching Assistant: Yevgeny Seldin School of Computer Science and Engineering.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Learning Programs Danielle and Joseph Bennett (and Lorelei) 4 December 2007.
Chapter 5 Data mining : A Closer Look.
Introduction to machine learning
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
MACHINE LEARNING 張銘軒 譚恆力 1. OUTLINE OVERVIEW HOW DOSE THE MACHINE “ LEARN ” ? ADVANTAGE OF MACHINE LEARNING ALGORITHM TYPES  SUPERVISED.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
NEURAL NETWORKS FOR DATA MINING
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Decision Trees Jyh-Shing Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University.
CS 536 – Ahmed Elgammal CS 536: Machine Learning Fall 2005 Ahmed Elgammal Dept of Computer Science Rutgers University.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Artificial Intelligence, Expert Systems, and Neural Networks Group 10 Cameron Kinard Leaundre Zeno Heath Carley Megan Wiedmaier.
1 Statistics & R, TiP, 2011/12 Neural Networks  Technique for discrimination & regression problems  More mathematical theoretical foundation  Works.
Data Mining and Decision Support
Data Mining Copyright KEYSOFT Solutions.
Machine Learning in CSC 196K
Introduction to Machine Learning © Roni Rosenfeld,
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Computacion Inteligente Least-Square Methods for System Identification.
Introduction of Fuzzy Inference Systems By Kuentai Chen.
Introduction to Machine Learning. Introduce yourself Why you choose this course? Any examples of machine learning you know?
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Linear Classifiers (LC) J.-S. Roger Jang ( 張智星 ) MIR Lab, CSIE Dept. National Taiwan University.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Machine learning & object recognition Cordelia Schmid Jakob Verbeek.
Chapter 12 Case Studies Part B. Control System Design.
Introduction to Machine Learning, its potential usage in network area,
Usman Roshan Dept. of Computer Science NJIT
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning, Bio-informatics and Weka
Machine Learning with Spark MLlib
Introduction to Machine Learning
CSIE Dept., National Taiwan Univ., Taiwan
Machine Learning for Computer Security
Chapter 7. Classification and Prediction
Introduction to Pattern Recognition
National Taiwan University
Neural Networks and Its Deep Structures
School of Computer Science & Engineering
National Taiwan University
Machine Learning With Python Sreejith.S Jaganadh.G.
Feature Selection for Pattern Recognition
CH. 1: Introduction 1.1 What is Machine Learning Example:
Machine Learning Ali Ghodsi Department of Statistics
What is Pattern Recognition?
Kocaeli University Introduction to Engineering Applications
Intro to Machine Learning
Overview of Machine Learning
Deep Neural Networks (DNN)
Hierarchical Clustering
Machine Learning in FinTech
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Usman Roshan Dept. of Computer Science NJIT
Presentation transcript:

Intro to Machine Learning J.-S. Roger Jang (張智星) jang@mirlab.org http://mirlab.org/jang MIR Lab, CSIE Dept. National Taiwan University

Machine Learning (ML) Definition Concepts Field of study that gives computers the ability to learn without being explicitly programmed – by Arthur Samuel, 1959. Computational methods that use existing data to make predictions Concepts 視其所以,觀其所由,察其所安,人焉廋哉?人焉廋哉? 近朱者赤、近墨者黑 Artificial intelligence Machine learning

More about Machine Learning Examples Use height and weight to predict gender Use historical data to predict stock market … Prerequisites for ML Probability & statistics Linear algebra Optimization

Broad Areas of ML Classification: Assign a category to a given object Determine if it will rain tomorrow Regression: Predict a real-value for a given object Predict the exchange rate Ranking: Order objects according to some criterion Rank the webpages returned by a search engine Clustering: Partition data into homogeneous groups Dimensionality reduction: Find low-dimensional manifold preserving some properties of the data Density estimation: Learning probability density function according to sample data

2018/6/15 Concept of Modeling Given desired i/o pairs (training set) of the form (x1, ..., xn; y), construct a model to match the i/o pairs Two steps in modeling structure identification: input selection, model complexity parameter identification: optimal parameters x1 xn . . . Unknown target system y Model f() y* Given an input condition, it very easy to derive the overall output. However, our task is more than that. What we want to do is to construct an appropriate fuzzy inference system that can match a desired input/output data set of a unknown target system. This is called fuzzy modeling and it a branch of nonlinear system identification. In general, fuzzy modeling involves two step. The first step is structure identification, where we have to find suitable numbers of fuzzy rules and membership functions. In FLT, this is accomplished by subtractive clustering proposed by Steve Chiu in Rockwell Science Research Center. The second step is parameter identification, where we have to find the optimal parameters for membership functions as well as output linear equations. In the FLT, this is done by ANFIS, which was proposed by me back in 1991. Since ANFIS uses some neural network training techniques, it is also classified as one of the neuro-fuzzy modeling approaches.

Example of Modeling Source: https://www.slideshare.net/tw_dsconf/ss-70083878

More ML Application Examples Optical character recognition Document classification Part-of-speech tagging Speech processing Recognition synthesis Image recognition Face recognition Info. retrieval Recommendation search Security Fraud detection (credit card, telephone) Network intrusion Video surveillance Speaker id. Self-driving cars

What People Do with ML For practitioners For researchers Acquire application domain knowledge Know properties of machine learning methods Get familiar with ML tools For researchers Design new learning algorithms Speed up the learning process Derive theoretical bound of accuracy Take care of data Big data Imbalanced data Missing data …

First Look at a Dataset GHW (gender-height-weight) dataset 10000 entries with 3 columns Gender: Categorical Height: Numerical Weight: Numerical Source

Multiple Ways of Prediction Multiple ways of prediction based on GHW dataset Classification: Height, weight  gender Regression: Gender, height  weight Regression: Gender, weight  height In general Classification: the output is categorical Regression: the output is numerical Sometimes it’s not so obvious!