Machine Learning Theory Maria-Florina (Nina) Balcan Lecture 1, August 23 rd 2011.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Statistical Machine Learning- The Basic Approach and Current Research Challenges Shai Ben-David CS497 February, 2007.
Introduction to Support Vector Machines (SVM)
Godfather to the Singularity
New Horizons in Machine Learning Avrim Blum CMU This is mostly a survey, but last part is joint work with Nina Balcan and Santosh Vempala [Workshop on.
INTRODUCTION TO MACHINE LEARNING David Kauchak CS 451 – Fall 2013.
Machine learning continued Image source:
Machine Learning Theory Machine Learning Theory Maria Florina Balcan 04/29/10 Plan for today: - problem of “combining expert advice” - course retrospective.
Boosting Approach to ML
Class Project Due at end of finals week Essentially anything you want, so long as it’s AI related and I approve Any programming language you want In pairs.
Design and Analysis of Algorithms Maria-Florina (Nina) Balcan Lecture 1, Jan. 14 th 2011.
Semi-Supervised Learning
Maria-Florina Balcan Modern Topics in Learning Theory Maria-Florina Balcan 04/19/2006.
Co-Training and Expansion: Towards Bridging Theory and Practice Maria-Florina Balcan, Avrim Blum, Ke Yang Carnegie Mellon University, Computer Science.
Evolutionary Computational Intelligence Lecture 10a: Surrogate Assisted Ferrante Neri University of Jyväskylä.
Active Learning of Binary Classifiers
Unsupervised Learning: Clustering Rong Jin Outline  Unsupervised learning  K means for clustering  Expectation Maximization algorithm for clustering.
Machine Learning (Extended) Dr. Ata Kaban
Computational Learning Theory
Support Vector Machines Based on Burges (1998), Scholkopf (1998), Cristianini and Shawe-Taylor (2000), and Hastie et al. (2001) David Madigan.
Maria-Florina Balcan Carnegie Mellon University Margin-Based Active Learning Joint with Andrei Broder & Tong Zhang Yahoo! Research.
Active Learning with Support Vector Machines
Introduction to Boosting Aristotelis Tsirigos SCLT seminar - NYU Computer Science.
Machine Learning Rob Schapire Princeton Avrim Blum Carnegie Mellon Tommi Jaakkola MIT.
Maria-Florina Balcan A Theoretical Model for Learning from Labeled and Unlabeled Data Maria-Florina Balcan & Avrim Blum Carnegie Mellon University, Computer.
Part I: Classification and Bayesian Learning
Machine Learning Theory Maria-Florina Balcan Lecture 1, Jan. 12 th 2010.
Introduction to machine learning
Introduction to Data Mining Engineering Group in ACL.
Incorporating Unlabeled Data in the Learning Process
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Crash Course on Machine Learning
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
ECE 8443 – Pattern Recognition Objectives: Error Bounds Complexity Theory PAC Learning PAC Bound Margin Classifiers Resources: D.M.: Simplified PAC-Bayes.
Universit at Dortmund, LS VIII
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Boosting of classifiers Ata Kaban. Motivation & beginnings Suppose we have a learning algorithm that is guaranteed with high probability to be slightly.
Machine Learning.
1 Machine Learning (Extended) Dr. Ata Kaban Algorithms to enable computers to learn –Learning = ability to improve performance automatically through experience.
SVM Support Vector Machines Presented by: Anas Assiri Supervisor Prof. Dr. Mohamed Batouche.
Machine Learning in Ad-hoc IR. Machine Learning for ad hoc IR We’ve looked at methods for ranking documents in IR using factors like –Cosine similarity,
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
Machine Learning Tutorial Amit Gruber The Hebrew University of Jerusalem.
Maria-Florina Balcan Active Learning Maria Florina Balcan Lecture 26th.
CHAPTER 8 DISCRIMINATIVE CLASSIFIERS HIDDEN MARKOV MODELS.
Bing LiuCS Department, UIC1 Chapter 8: Semi-supervised learning.
Connections between Learning Theory, Game Theory, and Optimization Maria Florina (Nina) Balcan Lecture 2, August 26 th 2010.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Agnostic Active Learning Maria-Florina Balcan*, Alina Beygelzimer**, John Langford*** * : Carnegie Mellon University, ** : IBM T.J. Watson Research Center,
Maria-Florina Balcan 16/11/2015 Active Learning. Supervised Learning E.g., which s are spam and which are important. E.g., classify objects as chairs.
Distribution Specific Learnability – an Open Problem in Statistical Learning Theory M. Hassan Zokaei-Ashtiani December 2013.
Reinforcement Learning AI – Week 22 Sub-symbolic AI Two: An Introduction to Reinforcement Learning Lee McCluskey, room 3/10
Kernels and Margins Maria Florina Balcan 10/13/2011.
Learning with General Similarity Functions Maria-Florina Balcan.
Introduction to Classification & Clustering Villanova University Machine Learning Lab Module 4.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Machine Learning for Computer Security
Introduction to Classification & Clustering
Neural Network Architecture Session 2
CSSE463: Image Recognition Day 11
Data Mining: Concepts and Techniques Course Outline
CSSE463: Image Recognition Day 11
Semi-Supervised Learning
INF 5860 Machine learning for image classification
A Theory of Learning and Clustering via Similarity Functions
CSSE463: Image Recognition Day 11
CSSE463: Image Recognition Day 11
Presentation transcript:

Machine Learning Theory Maria-Florina (Nina) Balcan Lecture 1, August 23 rd 2011

2 Image Classification Document Categorization Speech Recognition Branch Prediction Protein Classification Spam Detection Fraud Detection Machine Learning Playing Games Computational Advertising

what kinds of tasks we can hope to learn, and from what kind of data Goals of Machine Learning Theory Develop and analyze models to understand: what types of guarantees might we hope to achieve prove guarantees for practically successful algs (when will they succeed, how long will they take?); Algorithms Interesting connections to other areas including: Combinatorial Optimization Probability & Statistics Game Theory Information TheoryComplexity Theory develop new algs that provably meet desired criteria

4 Example: Supervised Classification Goal: use s seen so far to produce good prediction rule for future data. Not spam spam Decide which s are spam and which are important. Supervised classification

5 example label Reasonable RULES: Predict SPAM if unknown AND (money OR pills) Predict SPAM if 2money + 3pills –5 known > 0 Represent each message by features. (e.g., keywords, spelling, etc.) Example: Supervised Classification Linearly separable

6 Two Main Aspects of Supervised Learning Algorithm Design. How to optimize? Automatically generate rules that do well on observed data. Confidence Bounds, Generalization Guarantees, Sample Complexity Confidence for rule effectiveness on future data. Well understood for passive supervised learning.

7 Semi-Supervised Learning Using cheap unlabeled data in addition to labeled data. Active Learning The algorithm interactively asks for labels of informative examples. Other Protocols for Supervised Learning Theoretical understanding severely lacking until a couple of years ago. Lots of progress recently. We will cover some of these. Learning with Membership Queries Statistical Query Learning

Structure of the Class Simple algos and hardness results for supervised learning. Classic, state of the art algorithms: AdaBoost and SVM (kernel based mehtods). Basic models: PAC, SLT. Standard Sample Complexity Results (VC dimension) Weak-learning vs. Strong-learning Passive Supervised Learning Modern Sample Complexity Results Rademacher Complexity Margin analysis of Boosting and SVM

Structure of the Class Classification noise and the Statistical-Query model Incorporating Unlabeled Data in the Learning Process. Online Learning and Game Theory Learning Real Valued Functions Incorporating Interaction in the Learning Process: Active Learning Learning with Membership Queries connections to Boosting Other Learning Paradigms Other Topics

Admin Course web page: 4-5 hwk assignments. Exercises/problems (pencil-and-paper problem-solving variety). Project: explore a theoretical question, try some experiments, or read a couple of papers and explain the idea. Short writeup and possibly presentation. Small groups ok. Take-home exam. [50%] [35%] [15%]