Https://xkcd.com/894/.

Slides:



Advertisements
Similar presentations
Introduction to Machine Learning BITS C464/BITS F464
Advertisements

Godfather to the Singularity
INTRODUCTION TO MACHINE LEARNING David Kauchak CS 451 – Fall 2013.
PROBABILISTIC MODELS David Kauchak CS451 – Fall 2013.
PROBABILISTIC MODELS David Kauchak CS451 – Fall 2013.
DECISION TREES David Kauchak CS 451 – Fall Admin Assignment 1 available and is due on Friday (printed out at the beginning of class) Door code for.
2015/6/1Course Introduction1 Welcome! MSCIT 521: Knowledge Discovery and Data Mining Qiang Yang Hong Kong University of Science and Technology
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
MACHINE LEARNING 1. Introduction. What is Machine Learning? Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Need.
Introduction to Machine Learning Anjeli Singh Computer Science and Software Engineering April 28 th 2008.
CIS 678 Artificial Intelligence problems deduction, reasoning knowledge representation planning learning natural language processing motion and manipulation.
Introduction to Machine Learning course fall 2007 Lecturer: Amnon Shashua Teaching Assistant: Yevgeny Seldin School of Computer Science and Engineering.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Part I: Classification and Bayesian Learning
INTRODUCTION TO Machine Learning 3rd Edition
Data Mining: A Closer Look
Introduction to machine learning
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
MACHINE LEARNING 張銘軒 譚恆力 1. OUTLINE OVERVIEW HOW DOSE THE MACHINE “ LEARN ” ? ADVANTAGE OF MACHINE LEARNING ALGORITHM TYPES  SUPERVISED.
MULTICLASS CONTINUED AND RANKING David Kauchak CS 451 – Fall 2013.
Pattern Recognition & Machine Learning Debrup Chakraborty
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
MACHINE LEARNING BASICS David Kauchak CS159 Fall 2014.
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Yazd University, Electrical and Computer Engineering Department Course Title: Advanced Software Engineering By: Mohammad Ali Zare Chahooki 1 Introduction.
1 Machine Learning (Extended) Dr. Ata Kaban Algorithms to enable computers to learn –Learning = ability to improve performance automatically through experience.
Machine Learning Tutorial Amit Gruber The Hebrew University of Jerusalem.
1 Machine Learning 1.Where does machine learning fit in computer science? 2.What is machine learning? 3.Where can machine learning be applied? 4.Should.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
CHAPTER 1: Introduction. 2 Why “Learn”? Machine learning is programming computers to optimize a performance criterion using example data or past experience.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Brief Intro to Machine Learning CS539
Teck Chia Partner, Exponent.vc
CSE 4705 Artificial Intelligence
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Who am I? Work in Probabilistic Machine Learning Like to teach 
Machine Learning for Computer Security
Probability David Kauchak CS158 – Fall 2013.
Deep learning David Kauchak CS158 – Fall 2016.
Machine Learning overview Chapter 18, 21
Machine Learning overview Chapter 18, 21
Introducing Apache Mahout
Eick: Introduction Machine Learning
Machine Learning With Python Sreejith.S Jaganadh.G.
Introduction to Data Science Lecture 7 Machine Learning Overview
CH. 1: Introduction 1.1 What is Machine Learning Example:
What is Pattern Recognition?
INTRODUCTION.
3.1.1 Introduction to Machine Learning
Chapter 1: Introduction
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Christoph F. Eick: A Gentle Introduction to Machine Learning
CS639: Data Management for Data Science
Machine Learning basics
Welcome! Knowledge Discovery and Data Mining
Machine learning: What is it?
Machine learning CS 229 / stats 229
Machine Learning in Business John C. Hull
Evaluation David Kauchak CS 158 – Fall 2019.
Presentation transcript:

https://xkcd.com/894/

Introduction to Machine Learning David Kauchak CS 158 – Fall 2016

Why are you here? What is Machine Learning? Why are you taking this course? What topics would you like to see covered?

Machine Learning is… Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence.

Machine Learning is… Machine learning is programming computers to optimize a performance criterion using example data or past experience. -- Ethem Alpaydin The goal of machine learning is to develop methods that can automatically detect patterns in data, and then to use the uncovered patterns to predict future data or other outcomes of interest. -- Kevin P. Murphy The field of pattern recognition is concerned with the automatic discovery of regularities in data through the use of computer algorithms and with the use of these regularities to take actions. -- Christopher M. Bishop

Machine Learning is… Machine learning is about predicting the future based on the past. -- Hal Daume III

Machine Learning is… predict learn Training Data Testing Data Machine learning is about predicting the future based on the past. -- Hal Daume III past future predict learn Training Data model/ predictor Testing Data model/ predictor

Machine Learning, aka data mining: data analysis, not prediction, though often involves some shared techniques inference and/or estimation in statistics pattern recognition in engineering signal processing in electrical engineering induction optimization

Goals of the course: learn about… Different machine learning problems Common techniques/tools used theoretical understanding practical implementation Proper experimentation and evaluation Dealing with large (huge) data sets Parallelization frameworks Programming tools

Goals of the course Be able to laugh at these signs (or at least know why one might…)

Administrative Course page: Assignments http://www.cs.pomona.edu/~dkauchak/classes/cs158/ Assignments Weekly Mostly programming (Java, mostly) Some written/write-up Generally due Sunday evenings Two “midterm” exams and one final Late Policy Collaboration

Course expectations Plan to stay busy! Applied class, so lots of programming Machine learning involves math

Other things to note Videos before class Lots of class participation! Read the book (it’s good)

Machine learning problems What high-level machine learning problems have you seen or heard of before?

Data examples Data

Data examples Data

Data examples Data

Data examples Data

Supervised learning examples labeled examples Supervised learning: given labeled examples

Supervised learning Supervised learning: given labeled examples label model/ predictor label3 label4 label5 Supervised learning: given labeled examples

Supervised learning Supervised learning: learn to predict new example model/ predictor predicted label Supervised learning: learn to predict new example

Supervised learning: classification label apple Classification: a finite set of labels apple banana banana Supervised learning: given labeled examples

Classification Example Differentiate between low-risk and high-risk customers from their income and savings

Classification Applications Face recognition Character recognition Spam detection Medical diagnosis: From symptoms to illnesses Biometrics: Recognition/authentication using physical and/or behavioral characteristics: Face, iris, signature, etc ...

Supervised learning: regression label -4.5 Regression: label is real-valued 10.1 3.2 4.3 Supervised learning: given labeled examples

Regression Example Price of a used car x : car attributes (e.g. mileage) y : price y = wx+w0

Regression Applications Economics/Finance: predict the value of a stock Epidemiology Car/plane navigation: angle of the steering wheel, acceleration, … Temporal trends: weather over time …

Supervised learning: ranking label 1 Ranking: label is a ranking 4 2 3 Supervised learning: given labeled examples

Ranking example Given a query and a set of web pages, rank them according to relevance

Ranking Applications User preference, e.g. Netflix “My List” -- movie queue ranking iTunes flight search (search in general) reranking N-best output lists …

Unsupervised learning Unupervised learning: given data, i.e. examples, but no labels

Unsupervised learning applications learn clusters/groups without any label customer segmentation (i.e. grouping) image compression bioinformatics: learn motifs …

Reinforcement learning left, right, straight, left, left, left, straight GOOD left, straight, straight, left, right, straight, straight BAD left, right, straight, left, left, left, straight 18.5 left, straight, straight, left, right, straight, straight -3 Given a sequence of examples/states and a reward after completing that sequence, learn to predict the action to take in for an individual example/state

Reinforcement learning example Backgammon … WIN! … LOSE! Given sequences of moves and whether or not the player won at the end, learn to make good moves

Reinforcement learning example http://www.youtube.com/watch?v=VCdxqn0fcnE

Other learning variations What data is available: Supervised, unsupervised, reinforcement learning semi-supervised, active learning, … How are we getting the data: online vs. offline learning Type of model: generative vs. discriminative parametric vs. non-parametric

Representing examples What is an example? How is it represented?

Features examples features How our algorithms actually “view” the data Features are the questions we can ask about the examples f1, f2, f3, …, fn f1, f2, f3, …, fn f1, f2, f3, …, fn f1, f2, f3, …, fn

Features examples features How our algorithms actually “view” the data Features are the questions we can ask about the examples red, round, leaf, 3oz, … green, round, no leaf, 4oz, … yellow, curved, no leaf, 8oz, … green, curved, no leaf, 7oz, …

Classification revisited label examples red, round, leaf, 3oz, … apple learn green, round, no leaf, 4oz, … apple model/ classifier yellow, curved, no leaf, 8oz, … banana green, curved, no leaf, 7oz, … banana During learning/training/induction, learn a model of what distinguishes apples and bananas based on the features

Classification revisited predict Apple or banana? model/ classifier red, round, no leaf, 4oz, … The model can then classify a new example based on the features

Classification revisited predict model/ classifier Apple red, round, no leaf, 4oz, … Why? The model can then classify a new example based on the features

Classification revisited Training data Test set label examples red, round, leaf, 3oz, … apple ? green, round, no leaf, 4oz, … apple red, round, no leaf, 4oz, … yellow, curved, no leaf, 4oz, … banana green, curved, no leaf, 5oz, … banana

Classification revisited Training data Test set label examples red, round, leaf, 3oz, … apple ? green, round, no leaf, 4oz, … apple red, round, no leaf, 4oz, … yellow, curved, no leaf, 4oz, … banana Learning is about generalizing from the training data green, curved, no leaf, 5oz, … banana What does this assume about the training and test set?

Past predicts future Training data Test set

Past predicts future Training data Test set Not always the case, but we’ll often assume it is!

Past predicts future Training data Test set Not always the case, but we’ll often assume it is!

More technically… We are going to use the probabilistic model of learning There is some probability distribution over example/label pairs called the data generating distribution Both the training data and the test set are generated based on this distribution What is a probability distribution?

Probability distribution Describes how likely (i.e. probable) certain events are

Probability distribution Training data High probability Low probability round apples curved bananas apples with leaves … curved apples red bananas yellow apples …

data generating distribution Training data Test set data generating distribution

data generating distribution Training data Test set data generating distribution

data generating distribution Training data Test set data generating distribution