16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521.

Slides:



Advertisements
Similar presentations
How to finish your project on time ?
Advertisements

LECTURE 1: COURSE INTRODUCTION Xiaowei Yang. Roadmap Why should you take the course? Who should take this course? Course organization Course work Grading.
16-824: Learning-based Methods in Vision Instructors: Alexei (Alyosha) Efros 225 Smith Hall Leon Sigal
Welcome to the seminar course
COMP 208/214/215/216 – Lecture 5 Presentation Skills.
16-721: Advanced Machine Perception Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: David Bradley 2216 NSH Web Page:
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
CORNELL UNIVERSITY CS 764 Seminar in Computer Vision Ramin Zabih Fall 1998.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
CEP Welcome September 1, Matthew J. Koehler September 1, 2005CEP Cognition and Technology Who’s Who?  Team up with someone you don’t.
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Tomasz Malisiewicz Smith Hall.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Announcements Kevin Matzen office hours – Tuesday 4-5pm, Thursday 2-3pm, Upson 317 TA: Yin Lou Course lab: Upson 317 – Card access will be setup soon Course.
15-463: Rendering and Image Processing Staff Prof: Alexei Efros TA: James Hays Web Page
16-899A Pixels to Percepts Instructors: Alexei (Alyosha) Efros, 225 Smith Hall, CMU.Lavanya Sharan, Disney Research Pittsburgh Web Page:
(15-862): Computational Photography Staff Prof: Alexei Efros 4207 TA: Mohit Gupta Web Page
Visual Object Recognition Rob Fergus Courant Institute, New York University
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 TA: Jim McCann Graphics Lab Web Page
(15-862): Computational Photography Staff Prof: Alexei Efros 4207 Web Page Newsgroup:
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Independent Work Fall 2007 David Walker. Welcome Junior ABs (both semesters) Junior ABs (both semesters) Senior ABs (thesis) Senior ABs (thesis) BSEs.
Welcome to CS 395/495 Measurement and Analysis of Online Social Networks.
Welcome to CS 395/495 Internet Architectures. What is this class about? (1) Goal: to help you understand what the future Internet will look like –What.
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Class Info Website for materials available at: – We’ll use Moodle for turning in assignments, grades, s, blog.
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
Human abilities Presented By Mahmoud Awadallah 1.
COMP Introduction to Programming Yi Hong May 13, 2015.
Some years ago, CarTech CEO told us: What I want to see in our new hires is:
Sight Words.
How do we know that we solved vision? : Learning-Based Methods in Vision A. Efros, CMU, Spring 2009.
Computing & Information Sciences Kansas State University Wednesday, 03 Dec 2008CIS 530 / 730: Artificial Intelligence Lecture 38 of 42 Wednesday, 03 December.
By: Leo Chu Hamza Suhail.  The class is all about is learning how to look at things in ways that you never have before and to apply critical thinking.
Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
The Call to Write, Third Edition
Science Fair How To Get Started… (
How to Read Research Papers? Xiao Qin Department of Computer Science and Software Engineering Auburn University
English Composition Jonathan Watts. Welcome back to class! I hope you had a wonderful weekend! Today we will talk about Essay Development –Pg
Guidelines for Paper Presentation Mei-Chen Yeh 03/30/2010.
Why is computer vision difficult?
Welcome to EECS 395/495 Networking Problems in Cloud Computing.
UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
CPSC : Data-driven Computer Graphics Jinxiang Chai.
CIS 842: Specification and Verification of Reactive Systems Lecture ADM: Course Administration Copyright , Matt Dwyer, John Hatcliff, Robby. The.
6.819 / 6.869: Advances in Computer Vision Guidelines: Short Talks and CVPR style papers Website:
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 TAs: Alvaro Collet and Laura Trutoiu
A Brief Introduction on Face Detection Mei-Chen Yeh 04/06/2010 P. Viola and M. J. Jones, Robust Real-Time Face Detection, IJCV 2004.
Genius Hour An Introduction and Overview. Intro Video
Computer Vision No. 1 What is the Computer Vision?
Welcome to EECS 395/495 Online Advertising: A Systems Approach.
Operating Systems CMPSC 473 Introduction and Overview August 24, Lecture 1 Instructor: Bhuvan Urgaonkar.
How to do a great Power Point Document. The Principles 1. The audience should focus on YOU, not the screen 2. Keep it simple! Both text and visual layout.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
Guidelines for Project Presentation Mei-Chen Yeh 04/03/2012.
Dana Nau: CMSC 722, AI Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
CS 558 Computer Vision John Oliensis. Today’s class What is vision What is computer vision How we can solve vision problems –Important tools –Overall.
Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September.
Welcome to My Reading Recovery Lesson. Rereading Familiar Books In every lesson every day I get to read lots of little books. I get to pick some of my.
Welcome to Class 2! Theme: Physical Science and Constructivism Please sit at the table marked with your ideal teaching level (could be based on current.
Welcome to EECS 395/495 IoT Networks Seminar
Welcome to EECS 395/495 Networking Problems in Cloud Computing
CAP 6412: Advanced Computer Vision
GSP 470/570 Advanced Geospatial Analysis and Modeling
Presentation transcript:

16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521 NSHJean-Francois Lalonde Web Page: urses/LBMV07/ urses/LBMV07/

Today Introduction Why This Course? Administrative stuff Overview of the course Image Datasets Projects / Challenges

A bit about me Alexei (Alyosha) Efros Relatively new faculty (RI/CSD) Ph.D 2003, from UC Berkeley (signed by Arnie!) Research Fellow, University of Oxford, ’03-’04 Teaching I am still learning… The plan is to have fun and learn cool things, both you and me! Social warning: I don’t see well Research Vision, Graphics, Data-driven “stuff”

PhD Thesis on Texture and Action Synthesis Antonio Criminisi’s son cannot walk but he can fly Smart Erase button in Microsoft Digital Image Pro:

Why this class? The Old Days™: 1. Graduate Computer Vision 2. Advanced Machine Perception

Why this class? The New and Improved Days: 1. Graduate Computer Vision 2. Advanced Machine Perception Physics-based Methods in Vision Geometry-based Methods in Vision Learning-based Methods in Vision

Describing Visual Scenes using Transformed Dirichlet ProcessesTransformed Dirichlet Processes. E. Sudderth, A. Torralba, W. Freeman, and A. Willsky. NIPS, Dec The Hip & Trendy Learning

Learning as Last Resort

from [Sinha and Adelson 1993] EXAMPLE: Recovering 3D geometry from single 2D projection Infinite number of possible solutions!

Learning-based Methods in Vision This class is about trying to solve problems that do not have a solution! Don’t tell your mathematician frineds! This will be done using Data: E.g. what happened before is likely to happen again Google Intelligence (GI): The AI for the post-modern world! Why is this even useful? Even a decade ago at ICCV99 Faugeras claimed it wasn’t!

The Vision Story Begins… “What does it mean, to see? The plain man's answer (and Aristotle's, too). would be, to know what is where by looking.” -- David Marr, Vision (1982)

Vision: a split personality “What does it mean, to see? The plain man's answer (and Aristotle's, too). would be, to know what is where by looking. In other words, vision is the process of discovering from images what is present in the world, and where it is.” Answer #1: pixel of brightness 243 at position (124,54) …and depth.7 meters Answer #2: looks like bottom edge of whiteboard showing at the top of the image Which Do we want? Is the difference just a matter of scale? depth map

Measurement vs. Perception

Brightness: Measurement vs. Perception

Proof!

Lengths: Measurement vs. Perception Müller-Lyer Illusion

Vision as Measurement Device Real-time stereo on Mars Structure from Motion Physics-based Vision Virtualized Reality

…but why do Learning for Vision? “What if I don’t care about this wishy-washy human perception stuff? I just want to make my robot go!” Small Reason: For measurement, other sensors are often better (in DARPA Grand Challenge, vision was barely used!) For navigation, you still need to learn! Big Reason: The goals of computer vision (what + where) are in terms of what humans care about.

So what do humans care about? slide by Fei Fei, Fergus & Torralba

Verification: is that a bus? slide by Fei Fei, Fergus & Torralba

Detection: are there cars? slide by Fei Fei, Fergus & Torralba

Identification: is that a picture of Mao? slide by Fei Fei, Fergus & Torralba

Object categorization sky building flag wall banner bus cars bus face street lamp slide by Fei Fei, Fergus & Torralba

Scene and context categorization outdoor city traffic … slide by Fei Fei, Fergus & Torralba

Rough 3D layout, depth ordering

Challenges 1: view point variation Michelangelo

Challenges 2: illumination slide credit: S. Ullman

Challenges 3: occlusion Magritte, 1957

Challenges 4: scale slide by Fei Fei, Fergus & Torralba

Challenges 5: deformation Xu, Beihong 1943

Challenges 6: background clutter Klimt, 1913

Challenges 7: object intra-class variation slide by Fei-Fei, Fergus & Torralba

Challenges 8: local ambiguity slide by Fei-Fei, Fergus & Torralba

Challenges 9: the world behind the image

In this course, we will: Take a few baby steps…

Goals Read some interesting papers together Learn something new: both you and me! Get up to speed on big chunk of vision research understand 70% of CVPR papers! Use learninig-based vision in your own work Try your hand in a large vision project Learn how to speak Learn how think critically about papers

Course Organization Requirements: 1.Paper Presentations (50%) Paper Presenter Paper Evaluator 2.Class Participation (20%) Keep annotated bibliography Ask questions / debate / flight / be involved! 3.Final Project (30%) Do something with lots of data (at least 500 images) Groups of 1 or 2

Paper Advocate 1.Pick a paper from list That you like and willing to defend Sometimes I will make you do two papers, or background 2.Meet with me before starting, to talk about how to present the paper(s) 3.Prepare a good, conference-quality presentation (20-45 min, depending on difficulty of material) 4.Meet with me again 2 days before class to go over the presentation Office hours at end of each class 5.Present and defend the paper in front of class

Paper Evaluator For some papers, we will have Evaluators Sign up for a paper you find interesting 1.Get the code online (or implement if easy) 2.Run it on a toy problem, play with parameters 3.Run it on a new dataset 4.Prepare short min presentation detailing results 5.Discuss the paper critically

Class Participation Keep annotated bibliography of papers you read (always a good idea!). The format is up to you. At least, it needs to have: Summary of key points A few Interesting insights, “aha moments”, keen observations, etc. Weaknesses of approach. Unanswered questions. Areas of further investigation, improvement. Submit your thoughts for current paper(s) at the end of each class (printout)

Class Participation Be active in class. Voice your ideas, concerns. You need to participate JF will be watching and keeping track!

Final Project Can grow out of paper presentation, or your own research But it needs to use large amounts of data! 1-2 people per project. Project proposals in a few weeks. Project presentations at the end of semester. Results presented as a CVPR-format paper. Hopefully, a few papers may be submitted to conferences.

End of Semester Awards We will vote for: Best Paper Presenter Best Paper Evaluator \Best Project Prize: dinner in a nice restaurant

Course Outline

Datasets See web page