General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301

Slides:



Advertisements
Similar presentations
EECS738 Xue-wen Chen EECS 738: Machine Learning Fall 2011, Prof. Xue-wen Chen The University of Kansas.
Advertisements

CHAPTER 2: Supervised Learning. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Learning a Class from Examples.
INTRODUCTION TO Machine Learning 2nd Edition
IS5152 Decision Making Technologies
An Overview of Machine Learning
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
CIS 678 Artificial Intelligence problems deduction, reasoning knowledge representation planning learning natural language processing motion and manipulation.
Statistical pattern recognition and machine learning Jen-Chang Liu, Spring, 2008.
CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.
EECS 349 Machine Learning Instructor: Doug Downey Note: slides adapted from Pedro Domingos, University of Washington, CSE
CSE 546 Data Mining Machine Learning Instructor: Pedro Domingos.
Three kinds of learning
Lecture #1COMP 527 Pattern Recognition1 Pattern Recognition Why? To provide machines with perception & cognition capabilities so that they could interact.
Learning Programs Danielle and Joseph Bennett (and Lorelei) 4 December 2007.
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
CSE 515 Statistical Methods in Computer Science Instructor: Pedro Domingos.
Introduction to machine learning
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
For Better Accuracy Eick: Ensemble Learning
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
General Information Course Id: COSC6342 Machine Learning Time: MO/WE 2:30-4p Instructor: Christoph F. Eick Classroom:SEC 201
Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.
CSCI 347 – Data Mining Lecture 01 – Course Overview.
CEN 592 PATTERN RECOGNITION Spring Term CEN 592 PATTERN RECOGNITION Spring Term DEPARTMENT of INFORMATION TECHNOLOGIES Assoc. Prof.
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
Machine Learning Queens College Lecture 13: SVM Again.
General information CSE : Probabilistic Analysis of Computer Systems
Anomaly detection with Bayesian networks Website: John Sandiford.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 10a-11:30a Instructor: Christoph F. Eick Classroom:AH123
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 16 Nov, 3, 2011 Slide credit: C. Conati, S.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Ensemble Learning Spring 2009 Ben-Gurion University of the Negev.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
1 Chapter 6. Classification and Prediction Overview Classification algorithms and methods Decision tree induction Bayesian classification Lazy learning.
Classification Derek Hoiem CS 598, Spring 2009 Jan 27, 2009.
Christoph F. Eick: COSC 6368 and ‘What is AI?” 1 COSC 6368 and “What is AI?” 1.Introduction to AI (today, and TH) What is AI? Sub-fields of AI Problems.
Instructor: Pedro Domingos
COSC 6340 Projects & Homeworks Spring Learn how to define tables Learn how to load and create an Oracle database Learn how to define user views.
CPE542: Pattern Recognition Course Introduction Dr. Gheith Abandah د. غيث علي عبندة.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Machine Learning in CSC 196K
Machine Learning: A Brief Introduction Fu Chang Institute of Information Science Academia Sinica ext. 1819
FNA/Spring CENG 562 – Machine Learning. FNA/Spring Contact information Instructor: Dr. Ferda N. Alpaslan
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Usman Roshan Dept. of Computer Science NJIT
Brief Intro to Machine Learning CS539
Who am I? Work in Probabilistic Machine Learning Like to teach 
Machine Learning Models
Machine Learning for Computer Security
2009: Topics Covered in COSC 6368
Eick: Introduction Machine Learning
Special Topics in Data Mining Applications Focus on: Text Mining
Data Mining: Concepts and Techniques Course Outline
COSC 6340 Projects & Homeworks Spring 2002
CHAPTER 3: Bayesian Decision Theory
Basic Intro Tutorial on Machine Learning and Data Mining
CSE 515 Statistical Methods in Computer Science
INTRODUCTION TO Machine Learning
Neural Network Design and Application
Christoph F. Eick: A Gentle Introduction to Machine Learning
Derek Hoiem CS 598, Spring 2009 Jan 27, 2009
Usman Roshan Dept. of Computer Science NJIT
Machine learning CS 229 / stats 229
What is Artificial Intelligence?
Presentation transcript:

General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301 Homepage:

2 What is Machine Learning? Machine Learning is the Machine Learning is the study of algorithms thatstudy of algorithms that improve their performanceimprove their performance at some taskat some task with experiencewith experience Role of Statistics: Inference from a sample Role of Statistics: Inference from a sample Role of Computer science: Efficient algorithms to Role of Computer science: Efficient algorithms to Solve optimization problemsSolve optimization problems Representing and evaluating the model for inferenceRepresenting and evaluating the model for inference

3 Applications of Machine Learning Supervised Learning Supervised Learning ClassificationClassification PredictionPrediction Unsupervised Learning Unsupervised Learning Association AnalysisAssociation Analysis ClusteringClustering Preprocessing and Summarization of Data Preprocessing and Summarization of Data Reinforcement Learning and Adaptation Reinforcement Learning and Adaptation Activities Related to Models Activities Related to Models Learning parameters of modelsLearning parameters of models Choosing/Comparing modelsChoosing/Comparing models Evaluating Models (e.g. predicting their accuracy)Evaluating Models (e.g. predicting their accuracy)

Prerequisites Background Probabilities Probabilities Distributions, densities, marginalization…Distributions, densities, marginalization… Basic statistics Basic statistics Moments, typical distributions, regressionMoments, typical distributions, regression Basic knowledge of optimization techniques Basic knowledge of optimization techniques Algorithms Algorithms basic data structures, complexity…basic data structures, complexity… Programming skills Programming skills We provide some background, but the class will be fast paced We provide some background, but the class will be fast paced Ability to deal with “abstract mathematical concepts” Ability to deal with “abstract mathematical concepts”

Textbooks Textbook: Ethem Alpaydin, Introduction to Machine Learning, MIT Press, Mildly Recommended Textbooks: 1.Christopher M. Bishop, Pattern Recognition and Machine Learning, Tom Mitchell, Machine Learning, McGraw-Hill, 1997.

Grading Spring Exams61-69% 3 Projects and 4HW35-40% Attendance 1% NOTE: PLAGIARISM IS NOT TOLERATED. Remark: Weights are subject to change

Topics Covered in 2011 (Based on Alpaydin) Topic 1: Introduction to Machine Learning Topic 2: Supervised Learning Topic 3: Bayesian Decision Theory (excluding Belief Networks) Topic 5: Parametric Model Estimation Topic 6: Dimensionality Reduction Centering on PCA Topic 7: Clustering1: Mixture Models, K-Means and EM Topic 8: Non-Parametric Methods Centering on kNN and density estimation Topic 9: Clustering2: Density-based Approaches Topic 10 Decision Trees Topic 11: Comparing Classifiers Topic 12: Combining Multiple Learners Topic 13: Linear Discrimination Centering on Support Vector Machines Topic 14: More on Kernel Methods Topic 15: Graphical Models Centering on Belief Networks Topic 16: Applications of Machine Learning---Urban Driving, Netflix, etc. Topic 17: Hidden Markov Models Topic 18: Reinforcement Learning Topic 19: Neural Networks Topic 20: Computational Learning Theory Remark: Topics 17, 19, and 20 likely will be only briefly covered or skipped---due to the lack of time.

Course Projects 1.February 2011: Homework1 (available Feb. 6)Individual Project; Classification and Prediction; learn how obtain, use, and evaluate models(available Feb. 10). 2.March/April 2011: Group Project, giving a survey about a subfield of Machine Learning, Homework2 (available after Spring Break) 3.Second Half April 2011: Individual Project (Short); Reinforcement Learning and Adaptation: Learn how to act intelligently in an unknown/changing environment

Course Elements Total: classes lectures lectures 3-4 classes for review and discussing course projects 3-4 classes for review and discussing course projects 2 classes will be allocated for student presentations 2 classes will be allocated for student presentations 2 exams 2 exams Graded and ungraded paper and pencil problems Graded and ungraded paper and pencil problems

Schedule ML Spring 2011 WeekTopic Jan 17 Introduction Jan 24 Supervised Learning Jan 31 Bayesian Decision Theory, Parametric Approaches Feb. 7 Multivariate Methods, Project1, Homework1 Feb. 14 Multivariate Methods, Dim. Reduction Feb. 21 Clustering1 Feb. 28 Non-parametric Methods, Review1 March 7 Decision Trees, Review2, Project2, Midterm Exam March 21 Decision Trees, Clustering2, Reinforcement Learning March 28 Reinforcement Learning April 4 Ensembles, SVM April 11 SVM, Project 3, Project2 SP April 18 Project2 SP, More on Kernels, Project3, Comparing Learners April 28 only May 3 only Review3, Graphical Models, Kaelbling Article, TE Post Analysis Project1, Review 4 April 14, 2011 Green: will use other teaching material

Dates to Remember Dates to rememberEvents March 10 + May 12, 2pExams April 14+19Project2 Student Project Presentations March 15 /17No class (Spring Break) March 24, April 16/18, May 3 Submit Project Report /Software/Deliverable

Exams  Will be open notes/textbook  Will get a review list before the exam  Exams will center (80% or more) on material that was covered in the lecture  Exam scores will be immediately converted into number grades  We only have 2009 sample exams; I taught this course only once recently

Other UH-CS Courses with Overlapping Contents 1. COSC 6368: Artificial Intelligence  Strong Overlap: Decision Trees, Bayesian Belief Networks  Medium Overlap: Reinforcement Learning  COSC 6335: Data Mining  Strong Overlap: Decision trees, SVM, kNN, Density- based Clustering based Clustering  Medium Overlap: K-means, Decision Trees, Preprocessing/Exploratory DA, AdaBoost Preprocessing/Exploratory DA, AdaBoost  COSC 6343: Pattern Classification  Medium Overlap: all classification algorithms, feature selection—discusses those topics taking a different perspective. a different perspective.