ECE 8443 – Pattern Recognition Objectives: Course Introduction Typical Applications Resources: Syllabus Internet Books and Notes D.H.S: Chapter 1 Glossary.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Pattern Recognition and Machine Learning
ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.
LECTURE 11: BAYESIAN PARAMETER ESTIMATION
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Jensen’s Inequality (Special Case) EM Theorem.
Chapter 1: Introduction to Pattern Recognition
OUTLINE Course description, What is pattern recognition, Cost of error, Decision boundaries, The desgin cycle.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Pattern Classification, Chapter 1 1 Basic Probability.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
ECE 8443 – Pattern Recognition ECE 3163 – Signals and Systems Objectives: Pattern Recognition Feature Generation Linear Prediction Gaussian Mixture Models.
Introduction to Pattern Recognition Charles Tappert Seidenberg School of CSIS, Pace University.
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Classification. An Example (from Pattern Classification by Duda & Hart & Stork – Second Edition, 2001)
IBS-09-SL RM 501 – Ranjit Goswami 1 Basic Probability.
Perception Introduction Pattern Recognition Image Formation
ECE 8443 – Pattern Recognition LECTURE 03: GAUSSIAN CLASSIFIERS Objectives: Normal Distributions Whitening Transformations Linear Discriminants Resources.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 02: BAYESIAN DECISION THEORY Objectives: Bayes.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 03: GAUSSIAN CLASSIFIERS Objectives: Whitening.
2. Bayes Decision Theory Prof. A.L. Yuille Stat 231. Fall 2004.
Compiled By: Raj G Tiwari.  A pattern is an object, process or event that can be given a name.  A pattern class (or category) is a set of patterns sharing.
ECE 8443 – Pattern Recognition Objectives: Error Bounds Complexity Theory PAC Learning PAC Bound Margin Classifiers Resources: D.M.: Simplified PAC-Bayes.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Occam’s Razor No Free Lunch Theorem Minimum.
LML Speech Recognition Speech Recognition Introduction I E.M. Bakker.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Conjugate Priors Multinomial Gaussian MAP Variance Estimation Example.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
ECE 8443 – Pattern Recognition LECTURE 08: DIMENSIONALITY, PRINCIPAL COMPONENTS ANALYSIS Objectives: Data Considerations Computational Complexity Overfitting.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 07: BAYESIAN ESTIMATION (Cont.) Objectives:
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Introduction to Pattern Recognition (การรู้จํารูปแบบเบื้องต้น)
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.
Objectives: Terminology Components The Design Cycle Resources: DHS Slides – Chapter 1 Glossary Java Applet URL:.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
ECE 8443 – Pattern Recognition LECTURE 04: PERFORMANCE BOUNDS Objectives: Typical Examples Performance Bounds ROC Curves Resources: D.H.S.: Chapter 2 (Part.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 04: GAUSSIAN CLASSIFIERS Objectives: Whitening.
Lecture 3: MLE, Bayes Learning, and Maximum Entropy
Univariate Gaussian Case (Cont.)
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 10: PRINCIPAL COMPONENTS ANALYSIS Objectives:
Pattern Recognition NTUEE 高奕豪 2005/4/14. Outline Introduction Definition, Examples, Related Fields, System, and Design Approaches Bayesian, Hidden Markov.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Jensen’s Inequality (Special Case) EM Theorem.
PatReco: Introduction Alexandros Potamianos Dept of ECE, Tech. Univ. of Crete Fall
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Objectives: Student Introductions Review Course Syllabus Demonstrate Basic Concepts Resources: Syllabus Java Applet Internet Books and Notes URL:.../publications/courses/ece_8443/lectures/lecture_01.ppt.../publications/courses/ece_8443/lectures/lecture_01
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.
Artificial Intelligence
Machine Learning for Computer Security
Introduction Machine Learning 14/02/2017.
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
LECTURE 01: COURSE OVERVIEW
CS668: Pattern Recognition Ch 1: Introduction
Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas
Introduction to Pattern Recognition and Machine Learning
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
LECTURE 01: COURSE OVERVIEW
LECTURE 23: INFORMATION THEORY REVIEW
LECTURE 07: BAYESIAN ESTIMATION
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Presentation transcript:

ECE 8443 – Pattern Recognition Objectives: Course Introduction Typical Applications Resources: Syllabus Internet Books and Notes D.H.S: Chapter 1 Glossary LECTURE 01: COURSE OVERVIEW Audio: URL:

ECE 8443: Lecture 01, Slide 1 Pattern Recognition: “the act of taking raw data and taking an action based on the category of the pattern.” Common Applications: speech recognition, fingerprint identification (biometrics), DNA sequence identification Related Terminology:  Machine Learning: The ability of a machine to improve its performance based on previous results.  Machine Understanding: acting on the intentions of the user generating the data. Related Fields: artificial intelligence, signal processing and discipline-specific research (e.g., target recognition, speech recognition, natural language processing). Terminology

ECE 8443: Lecture 01, Slide 2 Which of these images are most scenic? How can we develop a system to automatically determine scenic beauty? (Hint: feature combination) Solutions to such problems require good feature extraction and good decision theory. Recognition or Understanding?

ECE 8443: Lecture 01, Slide 3 Feature Extraction

ECE 8443: Lecture 01, Slide 4 Regions of overlap represent the classification error Error rates can be computed with knowledge of the joint probability distributions (see OCW-MIT Fall-2006).OCW-MIT Fall-2006 Context is used to reduce overlap. In real problems, features are confusable and represent actual variation in the data. The traditional role of the signal processing engineer has been to develop better features (e.g., “invariants”). Features Are Confusable

ECE 8443: Lecture 01, Slide 5 Feature Extraction Post-Processing Classification Segmentation Sensing Input Decision Decomposition

ECE 8443: Lecture 01, Slide 6 Train Classifier Choose ModelChoose FeaturesEvaluate Classifier End Collect Data Start Key issues: “There is no data like more data.” Perceptually-meaningful features? How do we find the best model? How do we estimate parameters? How do we evaluate performance? Goal of the course: Introduce you to mathematically rigorous ways to train and evaluate models. The Design Cycle

ECE 8443: Lecture 01, Slide 7 I got 100% accuracy on...  Almost any algorithm works some of the time, but few real-world problems have ever been completely solved.  Training on the evaluation data is forbidden.  Once you use evaluation data, you should discard it. My algorithm is better because...  Statistical significance and experimental design play a big role in determining the validity of a result.  There is always some probability a random choice of an algorithm will produce a better result. Hence, in this course, we will also learn how to evaluate algorithms. Common Mistakes

ECE 8443: Lecture 01, Slide 8 Sorting Fish: incoming fish are sorted according to species using optical sensing (sea bass or salmon?) Feature Extraction Segmentation Sensing Problem Analysis:  set up a camera and take some sample images to extract features  Consider features such as length, lightness, width, number and shape of fins, position of mouth, etc. Image Processing Example

ECE 8443: Lecture 01, Slide 9 Conclusion: Length is a poor discriminator Length As A Discriminator

ECE 8443: Lecture 01, Slide 10 Lightness is a better feature than length because it reduces the misclassification error. Can we combine features in such a way that we improve performance? (Hint: correlation) Add Another Feature

ECE 8443: Lecture 01, Slide 11 Treat features as a N-tuple (two-dimensional vector) Create a scatter plot Draw a line (regression) separating the two classes Width And Lightness

ECE 8443: Lecture 01, Slide 12 Can we do better than a linear classifier? What is wrong with this decision surface? (Hint: generalization) Decision Theory

ECE 8443: Lecture 01, Slide 13 Why might a smoother decision surface be a better choice? (Hint: Occam’s Razor). This course investigates how to find such “optimal” decision surfaces and how to provide system designers with the tools to make intelligent trade-offs. Generalization and Risk

ECE 8443: Lecture 01, Slide 14 Degrees of difficulty: Real data is often much harder: Correlation

ECE 8443: Lecture 01, Slide 15 …. There are many excellent resources on the Internet that demonstrate pattern recognition concepts. There are many MATLAB toolboxes that implement state of the art algorithms. One such resource is a Java Applet that lets you quickly explore how a variety of algorithms process the same data.Java Applet An important first principle is:  There are no magic equations or algorithms.  You must understand the properties of your data and what a priori knowledge you can bring to bear on the problem. First Principle

ECE 8443: Lecture 01, Slide 16 How much can we trust isolated data points? Optimal decision surface is a line Optimal decision surface changes abruptly Optimal decision surface still a line Can we integrate prior knowledge about data, confidence, or willingness to take risk? Generalization And Risk

ECE 8443: Lecture 01, Slide 17 Bayesian Formulations Bayesian formulation for speech recognition: Objective: minimize the word error rate by maximizing Approach: maximize (training)  acoustic model (hidden Markov models, Gaussian mixtures, etc.  language model (finite state machines, N-grams)  acoustics (ignored during maximization) Bayes Rule allows us to convert the problem of estimating an unknown posterior probability to a process in which we can postulate a model, collect data under controlled conditions, and estimate the parameters of the model. Message Source Linguistic Channel Articulatory Channel Acoustic Channel Message WordsPhonesFeatures

ECE 8443: Lecture 01, Slide 18 Summary Pattern recognition vs. machine learning vs. machine understanding First principle of pattern recognition? We will focus more on decision theory and less on feature extraction. This course emphasizes statistical and data-driven methods for optimizing system design and parameter values. Second most important principle?