Download presentation
Presentation is loading. Please wait.
1
Intro to Machine Learning
J.-S. Roger Jang (張智星) MIR Lab, CSIE Dept. National Taiwan University
2
Machine Learning (ML) Definition Concepts
Field of study that gives computers the ability to learn without being explicitly programmed – by Arthur Samuel, 1959. Computational methods that use existing data to make predictions Concepts 視其所以,觀其所由,察其所安,人焉廋哉?人焉廋哉? 近朱者赤、近墨者黑 Artificial intelligence Machine learning
3
More about Machine Learning
Quick examples Use height and weight to predict gender Use historical data to predict stock market Comparison Before ML Rule-based After ML Data-driven Prerequisites for ML Probability & statistics Linear algebra Optimization …
4
Broad Areas of ML Classification: Assign a category to a given object
Determine if it will rain tomorrow Regression: Predict a real-value for a given object Predict the exchange rate Ranking: Order objects according to some criterion Rank the webpages returned by a search engine Clustering: Partition data into homogeneous groups Dimensionality reduction: Find low-dimensional manifold preserving some properties of the data Density estimation: Learning probability density function according to sample data
5
2018/12/6 Concept of Modeling Given desired i/o pairs (training set) of the form (x1, ..., xn; y), construct a model to match the i/o pairs Two steps in modeling structure identification: input selection, model complexity parameter identification: optimal parameters x1 xn . . . Unknown target system y Model f() y* Given an input condition, it very easy to derive the overall output. However, our task is more than that. What we want to do is to construct an appropriate fuzzy inference system that can match a desired input/output data set of a unknown target system. This is called fuzzy modeling and it a branch of nonlinear system identification. In general, fuzzy modeling involves two step. The first step is structure identification, where we have to find suitable numbers of fuzzy rules and membership functions. In FLT, this is accomplished by subtractive clustering proposed by Steve Chiu in Rockwell Science Research Center. The second step is parameter identification, where we have to find the optimal parameters for membership functions as well as output linear equations. In the FLT, this is done by ANFIS, which was proposed by me back in Since ANFIS uses some neural network training techniques, it is also classified as one of the neuro-fuzzy modeling approaches.
6
Example of Modeling Source:
7
More ML Application Examples
Optical character recognition Document classification Part-of-speech tagging Speech processing Recognition synthesis Image recognition Face recognition Info. retrieval Recommendation search Security Fraud detection (credit card, telephone) Network intrusion Video surveillance Speaker id. Self-driving cars
8
What People Do with ML For practitioners For researchers
Acquire application domain knowledge Know properties of machine learning methods Get familiar with ML tools For researchers Design new learning algorithms Speed up the learning process Derive theoretical bound of accuracy Take care of data Big data Imbalanced data Missing data …
9
First Look at a Dataset GHW (gender-height-weight) dataset
10000 entries with 3 columns Gender: Categorical Height: Numerical Weight: Numerical Source
10
Multiple Ways of Prediction
Multiple ways of prediction based on GHW dataset Classification: Height, weight gender Regression: Gender, height weight Regression: Gender, weight height In general Classification: the output is categorical Regression: the output is numerical Sometimes it’s not so obvious! Quiz!
11
List of Classifiers Commonly used classifiers
K-nearest-neighbor classifiers Quadratic classifiers Naïve Bayes classifiers Linear classifiers Single-layer perceptrons SVM Neural networks Multilayer perceptrons DNN Radial-basis function networks Classification and regression trees (CART) Random forests …
12
A Canonical Way to ML Data processing Model construction
Dataset collection Feature extraction Missing value imputation Visualization Outlier removal Feature normalization Model construction Classifier Structure id Feature selection Feature extraction Parameter id Cross validation for perf. evaluation Error analysis Confusion matrix DET/ROC plots PR plot
13
One Step Further AutoML
Machine learning for automated algorithm design Keys Transfer learning Learning to learn Hyper-parameter tuning Resources ML4AAD Google’s Cloud AutoML Official site Media coverage
14
An Example & On-going Projects
An example of canonical ML (with a flavor of AutoML) Leaf identification On-going projects Customer’s behavior prediction (Esun Bank) Plant image recognition (ITRI) Speaker verification (China Trust) Healthcare and medical data analytics (NTU Hospital)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.