16-721: Advanced Machine Perception Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: David Bradley 2216 NSH Web Page:

Slides:



Advertisements
Similar presentations
16-824: Learning-based Methods in Vision Instructors: Alexei (Alyosha) Efros 225 Smith Hall Leon Sigal
Advertisements

Welcome to the seminar course
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
CORNELL UNIVERSITY CS 764 Seminar in Computer Vision Ramin Zabih Fall 1998.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Tomasz Malisiewicz Smith Hall.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Announcements Kevin Matzen office hours – Tuesday 4-5pm, Thursday 2-3pm, Upson 317 TA: Yin Lou Course lab: Upson 317 – Card access will be setup soon Course.
15-463: Rendering and Image Processing Staff Prof: Alexei Efros TA: James Hays Web Page
16-899A Pixels to Percepts Instructors: Alexei (Alyosha) Efros, 225 Smith Hall, CMU.Lavanya Sharan, Disney Research Pittsburgh Web Page:
Highlights Lecture on the image part (10) Automatic Perception 16
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Visual Object Recognition Rob Fergus Courant Institute, New York University
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521.
What should be done at the Low Level?
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 TA: Jim McCann Graphics Lab Web Page
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
(15-862): Computational Photography Staff Prof: Alexei Efros 4207 Web Page Newsgroup:
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Welcome to CS 395/495 Measurement and Analysis of Online Social Networks.
Computer Vision (CSE/EE 576) Staff Prof: Steve Seitz TA: Aseem Agarwala Web Page
Welcome to CS 395/495 Internet Architectures. What is this class about? (1) Goal: to help you understand what the future Internet will look like –What.
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Medical Law and Bioethics
Computer Integrated Surgery II /646/452 Spring 2004 Russell H. Taylor Th-Fri 1:00-2:15.
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License Creative Commons Attribution-NonCommercial-ShareAlike.
CIS 601 Fall 2003 Introduction to Computer Vision Longin Jan Latecki Based on the lectures of Rolf Lakaemper and David Young.
Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.
Cornell Notes Note-taking strategy that will improve your study skills and your grades!!
Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,
Computing & Information Sciences Kansas State University Wednesday, 03 Dec 2008CIS 530 / 730: Artificial Intelligence Lecture 38 of 42 Wednesday, 03 December.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
Why is computer vision difficult?
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
CPSC : Data-driven Computer Graphics Jinxiang Chai.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 TAs: Alvaro Collet and Laura Trutoiu
Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth
A Brief Introduction on Face Detection Mei-Chen Yeh 04/06/2010 P. Viola and M. J. Jones, Robust Real-Time Face Detection, IJCV 2004.
Welcome to EECS 395/495 Online Advertising: A Systems Approach.
Operating Systems CMPSC 473 Introduction and Overview August 24, Lecture 1 Instructor: Bhuvan Urgaonkar.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
CS 558 Computer Vision John Oliensis. Today’s class What is vision What is computer vision How we can solve vision problems –Important tools –Overall.
Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September.
COS 429: Computer Vision Thanks to Chris Bregler.
Taking Lecture Notes Presented by Moorpark College Writing Center.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Lecture 25: Introduction to Recognition
Lecture 25: Introduction to Recognition
ICCV 2009 Kyoto, Short Course, September 24
Brief Review of Recognition + Context
Computer Vision (CSE 455) Staff Web Page Handouts
Computer Vision (CSE 455) Staff Web Page Handouts
Wrap-up Computer Vision Spring 2019, Lecture 26
GSP 470/570 Advanced Geospatial Analysis and Modeling
Presentation transcript:

16-721: Advanced Machine Perception Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: David Bradley 2216 NSH Web Page: urses/AP06/ urses/AP06/

Today Introduction Why Perception? Administrative stuff Overview of the course Image Datasets

A bit about me Alexei (Alyosha) Efros Relatively new faculty (RI/CSD) Ph.D 2003, from UC Berkeley (signed by Arnie!) Research Fellow, University of Oxford, ’03-’04 Teaching I am still learning… The plan is to have fun and learn cool things, both you and me! Social warning: I don’t see well Research Vision, Graphics, Data-driven “stuff”

PhD Thesis on Texture and Action Synthesis Antonio Criminisi’s son cannot walk but he can fly Smart Erase button in Microsoft Digital Image Pro:

The story begins… “All happy families are alike; each unhappy family is unhappy in its own way.” -- Lev Tolstoy, Anna Karenina “What does it mean, to see? The plain man's answer (and Aristotle's, too). would be, to know what is where by looking.” -- David Marr, Vision (1982)

Vision: a split personality “What does it mean, to see? The plain man's answer (and Aristotle's, too). would be, to know what is where by looking. In other words, vision is the process of discovering from images what is present in the world, and where it is.” Answer #1: pixel of brightness 243 at position (124,54) …and depth.7 meters Answer #2: looks like bottom edge of whiteboard showing at the top of the image Is the difference just a matter of scale? depth map

Measurement vs. Perception

Brightness: Measurement vs. Perception

Proof!

Lengths: Measurement vs. Perception Müller-Lyer Illusion

Vision as Measurement Device Real-time stereo on Mars Structure from Motion Physics-based Vision Virtualized Reality

…but why? Reason #1: Semester too short, can’t cover everything Other great classes offered at CMU, e.g.: –Appearance Modeling (Srinivas Narasimhan, every fall) –Medical Vision (Yanxi Liu) –Structure from Motion (Martial Hebert, sometime?) “But what if I don’t care about this wishy-washy human perception stuff? I just want to make my robot go!” Reason #2: For measurement, other sensors are often better (in DARPA Grand Challenge, vision was barely used!) Reason #3: The goals of computer vision (what + where) are in terms of what humans care about.

So what do humans care about? slide by Fei Fei, Fergus & Torralba

Verification: is that a bus? slide by Fei Fei, Fergus & Torralba

Detection: are there cars? slide by Fei Fei, Fergus & Torralba

Identification: is that a picture of Mao? slide by Fei Fei, Fergus & Torralba

Object categorization sky building flag wall banner bus cars bus face street lamp slide by Fei Fei, Fergus & Torralba

Scene and context categorization outdoor city traffic … slide by Fei Fei, Fergus & Torralba

Rough 3D layout, depth ordering

Challenges 1: view point variation Michelangelo

Challenges 2: illumination slide credit: S. Ullman

Challenges 3: occlusion Magritte, 1957

Challenges 4: scale slide by Fei Fei, Fergus & Torralba

Challenges 5: deformation Xu, Beihong 1943

Challenges 6: background clutter Klimt, 1913

Challenges 7: object intra-class variation slide by Fei-Fei, Fergus & Torralba

Challenges 8: local ambiguity slide by Fei-Fei, Fergus & Torralba

Challenges 9: the world behind the image

In this course, we will: Take a few baby steps…

Course Organization Requirements: 1.Paper Presentations (50%) Paper Advocate Paper Demo Presenter Paper Opponent 2.Class Participation (20%) Keep annotated bibliography Post questions / comments on Quick-topic Ask questions / debate / flight / be involved! 3.Final Project (30%) Do something with lots of data (at least 500 images) Groups of 1, 2, or 3

Paper Advocate 1.Pick a paper from list That you like and willing to defend Sometimes I will make you do two papers, or background 2.Meet with me before starting to talk about how to present the paper(s) 3.Prepare a good, conference-quality presentation (20-45 min, depending on difficulty of material) 4.Meet with me again 2 days before class to go over the presentation Office hours at end of each class 5.Present and defend the paper in front of class

Paper Demo Presenter For some papers, we will have separate demo presentations 1.Sign up for a paper you find interesting 2.Get the code online (or implement if easy) 3.Run it on a toy problem, play with parameters 4.Run it on a new dataset 5.Prepare short 5-10 min presentation detailing results 6.Can cooperate with Paper Advocate

Paper Opponent 1.Sign up for a paper you don’t like / suspicious about 2.Prepare an argument (with or without slides) against the paper: Paper weaknesses Relevance to real problems Existence of better alternative approaches Etc. 3.Present in front of class (5-10 min)

Class Participation Keep annotated bibliography of papers you read (always a good idea!). The format is up to you. At least, it needs to have: Summary of key points A few Interesting insights, “aha moments”, keen observations, etc. Weaknesses of approach. Unanswered questions. Areas of further investigation, improvement. Submit your thoughts for current paper(s) before each class (printout)

Class Participation In addition, submit interesting observations or questions to QuickTopic before class for public discussion. Be active in class. Voice your ideas, concerns. You need to participate: either in class or in QuickTopic every week! Dave will be watching and keeping track!

Final Project Can grow out of paper presentation, or your own research But it needs to use large amounts of data! 1-3 people per project. Project proposals in a few weeks. Project presentations at the end of semester. Results presented as a CVPR-format paper. Hopefully, a few papers may be submitted to conferences.

End of Semester Awards We will vote for: Best Paper Presenter Best Paper Opponent Best Demo Best Project Prize: dinner in a nice restaurant

Course Outline Physiology of Vision (1 lecture) Overview of Human Visual Percetion (1 lecture) Need presenter for Monday! Part I: Low-level vision (images as texture) Texture segmentation, image retrieval, scene models, “Bag of words” representations Part II: Mid-level vision (segmentation) Principles of grouping, Normalized Cuts, Mean-shift, DD- MCMC, Graph-cut, super-pixels Part III: 2D Recognition Window scanning (Schniderman+Kanade, Viola+Jones) Correspondence Matching (schanfer matching, housedorf distance, shape contexts, invariant features, active appearance models) Recognition with Segmentation (top-down + buttom-up) Words and Pictures

Course Outline (cont.) Part IV: Intrinsic Images Shading vs. reflectance Recovering surface orientations and depth Style vs. content Part V: Dealing with Data Isomap, LLE, Non-negative Matrix Factorization Part VI: Tracking and Motion Segmentation Particle filtering, examplar-based, layers Sign up to present one paper on Wed on QuickTopic

Datasets See web page