Modeling Consensus: Classifier Combination for WSD Authors: Radu Florian and David Yarowsky Presenter: Marian Olteanu.

Slides:

Advertisements

Similar presentations

INTRODUCTION TO MACHINE LEARNING Bayesian Estimation.

Advertisements

Pattern Recognition and Machine Learning

Ensemble Methods An ensemble method constructs a set of base classifiers from the training data Ensemble or Classifier Combination Predict class label.

Supervised Learning Recap

Pattern Classification, Chapter 2 (Part 2) 0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R.

Pattern Classification Chapter 2 (Part 2)0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O.

Chapter 4: Linear Models for Classification

Psychology 202b Advanced Psychological Statistics, II February 10, 2011.

Psychology 202b Advanced Psychological Statistics, II February 15, 2011.

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

2D1431 Machine Learning Boosting.

Announcements  Project proposal is due on 03/11  Three seminars this Friday (EB 3105) Dealing with Indefinite Representations in Pattern Recognition.

Ensemble Learning: An Introduction

Taking the Kitchen Sink Seriously: An Ensemble Approach to Word Sense Disambiguation from Christopher Manning et al.

Classification and application in Remote Sensing.

Optimal Adaptation for Statistical Classifiers Xiao Li.

Data mining and statistical learning - lecture 13 Separating hyperplane.

Kernel Methods Part 2 Bing Han June 26, Local Likelihood Logistic Regression.

Distributional clustering of English words Authors: Fernando Pereira, Naftali Tishby, Lillian Lee Presenter: Marian Olteanu.

Examples of Ensemble Methods

CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation X = {

Towards the automatic identification of adjectival scales: clustering adjectives according to meaning Authors: Vasileios Hatzivassiloglou and Kathleen.

EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.

Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.

COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.

LOGISTIC REGRESSION David Kauchak CS451 – Fall 2013.

Empirical Research Methods in Computer Science Lecture 7 November 30, 2005 Noah Smith.

CS Statistical Machine learning Lecture 10 Yuan (Alan) Qi Purdue CS Sept

Today Ensemble Methods. Recap of the course. Classifier Fusion

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Ensembles. Ensemble Methods l Construct a set of classifiers from training data l Predict class label of previously unseen records by aggregating predictions.

Ensemble Learning Spring 2009 Ben-Gurion University of the Negev.

CLASSIFICATION: Ensemble Methods

ISQS 6347, Data & Text Mining1 Ensemble Methods. ISQS 6347, Data & Text Mining 2 Ensemble Methods Construct a set of classifiers from the training data.

Overview of the final test for CSC Overview PART A: 7 easy questions –You should answer 5 of them. If you answer more we will select 5 at random.

Yuya Akita , Tatsuya Kawahara

1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.

Chapter1: Introduction Chapter2: Overview of Supervised Learning

Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.

Ensemble Methods in Machine Learning

ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Ensemble Methods: Bagging, Boosting Readings: Murphy 16.4; Hastie 16.

11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.

Classification Ensemble Methods 1

Introduction to Machine Learning Multivariate Methods 姓名 : 李政軒.

Ensemble Methods Construct a set of classifiers from the training data Predict class label of previously unseen records by aggregating predictions made.

Decision Trees IDHairHeightWeightLotionResult SarahBlondeAverageLightNoSunburn DanaBlondeTallAverageYesnone AlexBrownTallAverageYesNone AnnieBlondeShortAverageNoSunburn.

NTU & MSRA Ming-Feng Tsai

Statistical Models for Automatic Speech Recognition Lukáš Burget.

Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.

Introduction to Classifiers Fujinaga. Bayes (optimal) Classifier (1) A priori probabilities: and Decision rule: given and decide if and probability of.

CMPS 142/242 Review Section Fall 2011 Adapted from Lecture Slides.

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.

University of Waikato, New Zealand

Probability Theory and Parameter Estimation I

Learning Coordination Classifiers

Ch3: Model Building through Regression

Computer vision: models, learning and inference

Special Topics In Scientific Computing

Classifiers Fujinaga.

Introduction to Data Mining, 2nd Edition

Machine Learning Ensemble Learning: Voting, Boosting(Adaboost)

Ensemble Methods for Machine Learning: The Ensemble Strikes Back

EE513 Audio Signals and Systems

Pattern Recognition and Machine Learning

Parametric Methods Berlin Chen, 2005 References:

Multivariate Methods Berlin Chen, 2005 References:

INTRODUCTION TO Machine Learning 3rd Edition

EM Algorithm and its Applications

What is Artificial Intelligence?

Presentation transcript:

Modeling Consensus: Classifier Combination for WSD Authors: Radu Florian and David Yarowsky Presenter: Marian Olteanu

Introduction Ensembles (classifier combination)  If errors are uncorrelated, decrease error by a factor of 1/N  In practice, all classifiers tend to make errors at hard examples

Approach & Features Automatic POS tagging and lemma extraction Features  Bag of words  Local  Syntactic

Classifier methods (6) Vector-based  Enhanced Naïve Bayes Weighted  Cosine  BayesRatio (good for sparse data)

Classifier methods (cont.) MMVC (Mixture Maximum Variance Correction)  2 stages  Second stage: select sense with variance over threshold

Classifier methods (cont.) Discriminative Models  TBL (Transformation Based Learning)  Non-hierarchical decision lists

Combining classifiers Agreement

Combining classifiers (cont.) Three methods 1. Combine posterior sense probability distribution

Combining classifiers (cont.) determined:  Linear regression Minimize mean square error (MSE)  Expectation-Maximization (EM)  Approximate k with the performance of the classifier (PB)

Combining classifiers (cont.) 2. Combination based on Order Statistics

Combining classifiers (cont.) 3. Voting  (each classifier chose only one sense) Win the one with max. # of votes TagPair  Each classifier votes  Each pair of classifiers votes for the sense most likely by the joint classification Combining – stacking

Evaluation

Evaluation (unseen data)