Visual Scene Understanding (CS 598) Derek Hoiem Course Number: 46411 Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.

Slides:



Advertisements
Similar presentations
Cognitive Systems, ICANN panel, Q1 What is machine intelligence, as beyond pattern matching, classification and prediction. What is machine intelligence,
Advertisements

CS-550 Syllabus1 CS-550: Comparative Operating Systems Instructor: Dr. Marius D. Soneru Fall 2001.
Wrap Up. We talked about Filters Edges Corners Interest Points Descriptors Image Stitching Stereo SFM.
Slide 01-1COMP 7370, Auburn University COMP 7370 Advanced Computer and Network Security Dr. Xiao Qin Auburn University
Introduction to CS 491 / 2 Senior Design Project I / II Prof. Dr. H. Altay Güvenir.
Consistency of Assessment
CSSE463: Image Recognition Matt Boutell F-224 x8534
Logistics: –My office hours: T, Th 4-5pm or by appointment –Class Web page:
Welcome to CS 450 Internet Security: A Measurement-based Approach.
1 MA375 Introduction To Numerical Computing Fall 2003 Prof. Tim Warburton
April 13, 2004CS WPI1 CS 562 Advanced SW Engineering General Dynamics, Needham Tuesdays, 3 – 7 pm Instructor: Diane Kramer.
ECS 298 Photorealistic Image Synthesis course overview Brian Budge Center for Image Processing and Integrated Computing Computer Science Department University.
Computer Vision. Computer vision is concerned with the theory and technology for building artificial Computer vision is concerned with the theory and.
EECS 395/495 Algorithmic Techniques for Bioinformatics General Introduction 9/27/2012 Ming-Yang Kao 19/27/2012.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Computer Science 102 Data Structures and Algorithms V Fall 2009 Lecture 1: administrative details Professor: Evan Korth New York University 1.
Quick Overview of Robotics and Computer Vision. Computer Vision Agent Environment camera Light ?
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
Chapter 4 Designing Significant Learning Experiences II: Shaping the Experience.
Computer Network Fundamentals CNT4007C
05/16/001 MRKT 520-MARKETING MANAGEMENT DR. Ugur Yucelt Office Phone: Summer 2002 MW:6:00-9:10 pm Office Hours: MW: 5:00-6:00pm.
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
COMP Introduction to Programming Yi Hong May 13, 2015.
Meta-Cognition, Motivation, and Affect PSY504 Spring term, 2011 January 13, 2010.
Student Centered Teaching Through Universal Instructional Design Part II.
Computer Networks CEN 5501C Spring, 2008 Ye Xia (Pronounced as “Yeh Siah”)
HCI / CprE / ComS 575: Computational Perception
CIS 601 Fall 2003 Introduction to Computer Vision Longin Jan Latecki Based on the lectures of Rolf Lakaemper and David Young.
04/30/13 Last class: summary, goggles, ices Discrete Structures (CS 173) Derek Hoiem, University of Illinois 1 Image: wordpress.com/2011/11/22/lig.
Understanding the Academic Structure of the US Classroom: Syllabus.
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
1 Database Management for Electronic Commerce and EBusiness Walt Scacchi, Ph.D. GSM 274/FEMBA 274 Spring 2002.
Introduction EE 520: Image Analysis & Computer Vision.
CS 6961: Structured Prediction Fall 2014 Course Information.
CS 858 – Hot Topics in Computer and Communications Security Winter 2009 Introduction.
Conceptual and Experimental Vision Introduction R.Bajcsy, S.Sastry and A.Yang Fall 2006.
CSCE 496/896 Self-Managing Computer Systems Ying Lu 106 Schorr Center
CM220: Unit 1 Seminar “You must be the change you wish to see in the world.” ~ Mohandas Gandhi.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
CS Welcome to CS 5383, Topics in Software Assurance, Toward Zero-defect Programming Spring 2007.
IST 210: Organization of Data
September 3, 2013Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2013 Instructor: Marc Pomplun Instructor: Marc Pomplun.
Chapter 10. The Explorer System in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans On, Kyoung-Woon Biointelligence Laboratory.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Capstone Project Fall Course Information Instructor Ye Zhao –Office: MSB 220 – Fall 2015 (MSB162) –Time: Tue, Thu 10:45am.
CM220 College Composition II Friday, January 29, Unit 1: Introduction to Effective Academic and Professional Writing Unit 1 Lori Martindale, Instructor.
Spring, 2005 CSE391 – Lecture 1 1 Introduction to Artificial Intelligence Martha Palmer CSE391 Spring, 2005.
Quiz Week 8 Topical. Topical Quiz (Section 2) What is the difference between Computer Vision and Computer Graphics What is the difference between Computer.
Computer Networks CNT5106C
Introduction: What is AI? CMSC Introduction to Artificial Intelligence January 3, 2002.
CSCE 990 Advanced Distributed Systems Seminar Ying Lu 104 Schorr Center
Introduction: What is AI? CMSC Introduction to Artificial Intelligence January 7, 2003.
6/11/2016 Welcome! Kaplan University Robin Clark CS 410 – Advanced Career Development Strategies.
Course Overview Stephen M. Thebaut, Ph.D. University of Florida Software Engineering.
Introduction to the course Aug 30, Day 1 Object-oriented Programming thru Video Games TIDE 1840 Harry Howard Tulane University.
COS 429: Computer Vision Thanks to Chris Bregler.
Computer Network Fundamentals CNT4007C
Welcome to CS 4390/CS5381: Introduction to Formal Methods
Robotics – Syllabus and Logistics
CSC207 Fall 2016.
Computer Networks CNT5106C
Autonomous Cyber-Physical Systems: Course Introduction
Object detection as supervised classification
Crowd Simulation (INFOMCRWS) - Course Introduction
CS & CS Capstone Project & Software Development Project
CAP 6412: Advanced Computer Vision
CSS-304: Computer Systems Interface
CS 332 Visual Processing in Computer and Biological Vision Systems
Presentation transcript:

Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday 11:00am – 12:15pm Office Hours: Tuesday and Thursday 12:15-1pm; by appointment Contact: Siebel 3312

Today Introductions Overview of logistics Overview of class material

Vision: What is it good for? Biological (Humans) Technological (Computers) Note: Unfortunately, these got erased when my computer crashed

Course Logistics

Class Content Overview Tutorials and Perspectives Paper reading I)Spatial Inference II)Objects III)Actions IV)Context and Integration

Visual Scene Understanding Visual scene understanding is the ability to infer general principles and current situations from imagery in a way that helps achieve goals.

Visual Scene Understanding Visual scene understanding is the ability to infer general principles and current situations from imagery in a way that helps achieve goals.

Visual Scene Understanding Visual scene understanding is the ability to infer general principles and current situations from imagery in a way that helps achieve goals.

Visual Scene Understanding Visual scene understanding is the ability to infer general principles and current situations from imagery in a way that helps achieve goals.

I. Spatial Inference

Getting Around

Spatial Inference: applications Household Robots Automated Vehicles Graphics Applications Predict object size/position

Spatial Inference: open questions How do we represent space? – Surface orientations, depth maps, voxels? How do we infer it from available sensory data (image, stereo, motion, laser range finder)?

II. Objects

Finding Things and Observing Them Image classification: Are there any dogs? Photo credit: iansand – flickr.com

Finding Things and Observing Them Object Localization: Where are the dog(s)?

Finding Things and Observing Them Verification: Is this a dog?

Finding Things and Observing Them Description: Furry, small, nice, side view

Finding Things and Observing Them Identification: My friend Sally?

Recognizing Stuff SKY SAND WATER

Object Recognition: applications Photo Search Security Robots

Object Recognition: open questions How many examples does it take to learn one category well? How many examples does it take to learn 100 categories well? How do these answers depend on the level of supervision? Can recognition be solved with simple methods and massive amounts of data? How can we quickly recognize an object? How can we scale up to deal with thousands of categories?

III. Actions

Taking Action [Saxena et al. 2008]

Recognizing Actions KTH Dataset Figure from Laptev et al. 2008

Recognizing Actions Figure from Laptev et al. 2008

Reading Emotions Photo credit: Comstok

Actions: applications SecurityVideo Search

Actions: open questions How are actions defined? Does it make sense to categorize them? – If not, how do we recognize them? What are good visual representations for inferring actions? How can we recognize activities?

IV. Context and Integration [Hoiem et al. 2008]

Context and Integration [Hoiem et al. 2008] Objects + scene categories  better detection Movement + objects  action/activity recognition Space + objects  navigation

Context and Integration: applications Everything that vision is good for

Context and Integration: open questions Should context be explicit (e.g., “cars drive on the road”) or implicit (feature-based)? How do we model and learn the interactions between different processes and scene characteristics? How do we deal with the growing complexity as more and more pieces are put together?

General Problems in Computer Vision Better understanding of limitations and their sources – Need new experimental paradigms Improve generalization – Aim to generalize across datasets, categories, and tasks – Work on knowledge sharing and transfer Vision as a way of learning about the world – Integration into AI – Systems that acquire knowledge over time

Successes of Computer Vision Point matching (e.g. 2d3)2d3 – Tracking – Structure from motion – Stitching Product inspection Multiview 3d reconstruction Face recognition and modeling Object recognition on pre-2000 datasets Interactive segmentation (ongoing)

To Do Register on bulletin board Post comments on Thursdays reading (due tomorrow) Look over schedule and decide which days to present (due next Tues) Start thinking about projects – Let me know if you want a specific pairing (due Tues)

Questions?

Goals Make you a better researcher (esp. in vision) – More knowledge – Better critical thinking skills – Improved communication skills – Improved research skills

Grades Participation: 25% – Posting – Class discussion Presentation: 25% Projects: 50% – Proposal, progress report, final paper, and oral

Policies Attendance required (see syllabus) Give credit where due No formal prerequisites Everything needs to be on time

Reading Read well Post comments to bulletin board at least 24 hours before class

Presentations Presenter – Everyone does two – Good quality coverage of topic (40 min) – See syllabus for guidelines – Sign up by next Tuesday (at latest) – TBAs are your choice (decide at least 4 weeks in advance) Demonstrator – If all days are taken, pair up – One person’s job will be to demonstrate some aspect of the algorithm (e.g., where it succeeds and fails) by running it on many examples – May require implementation Note taker

Projects Timeline – Proposal: Feb 12 (3 ½ weeks!) – Progress report: Mar 19 – Presentation: paper May 5, oral later Progress report Presentation – Paper – Oral In pairs – Can choose partner or be randomly paired Suggestions on web Potentially will lead to publication (e.g. NIPS)

To Do Register on bulletin board Post comments on Thursdays reading (due tomorrow) Look over schedule and decide which days to present (due next Tues) Start thinking about projects – Let me know if you want a specific pairing (due Tues)

Questions?