How do we know that we solved vision? 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009.

Slides:



Advertisements
Similar presentations
3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical.
Advertisements

Chapter 3.5 Debugging Games
Game Design Serious Games Miikka Junnila.
Scientific Method Life Science Mr. P.
AI and Autonomous Vehicles Daniel Landers COMP Intelligent and Interactive Systems 3/10/2007.
James Tam Introduction To Computer Science In this section you will get an overview of some areas of Computer Science.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
Intelligence without Reason
Video: savior, or “more of the same”? : Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 © A.A. Efros.
Jochen Triesch, UC San Diego, 1 Real Artificial Life: Robots.
Integrating POMDP and RL for a Two Layer Simulated Robot Architecture Presented by Alp Sardağ.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Static vector fields Wind can be simulated using vector fields. The specific vectors in the field determine the direction and size of the wind force applied.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
27-Jun-15 Profiling code, Timing Methods. Optimization Optimization is the process of making a program as fast (or as small) as possible Here’s what the.
Mechanics, Dynamics & Aesthetics The Complexity of Games Expectations of Players Efficiency Game Mathematics o Collision Detection & Response o Object.
Introduction to mobile robots Slides modified from Maja Mataric’s CSCI445, USC.
1cs426-winter-2008 Notes  Assignment 1 is out, due immediately after reading week (Feb 25)  Please read: William T. Reeves, "Particle systems: a technique.
Artificial Intelligence
Quick Overview of Robotics and Computer Vision. Computer Vision Agent Environment camera Light ?
© 2014 Carnegie Robotics LLC. Use or disclosure of document data is subject to the restrictions on the title pagePage 1 Lessons Learned The most important.
Thermostat or Thermometer Which Are You?. Are you setting the climate or simply reflecting it?
IGCSE ICT Control.
1 Perception and VR MONT 104S, Spring 2008 Lecture 22 Other Graphics Considerations Review.
OPEN UP NOTEBOOKS TO WARM-UPS PAGE DO NOW There are over fifty members in my family; Yet none quarrel. We all live merrily. Four leaders have.
1 Constant Following Distance Simulations CS547 Final Project December 6, 1999 Jeremy Elson.
CS206Evolutionary Robotics “Cogito ergo sum.”. CS206Evolutionary Robotics “Cogito ergo sum.” “I think, therefore I am.” “Do I exist?” “There is something.
The AI Challenge: Who are we? Images Copyright Twentieth Century Fox, Paramount, Sony;
Lecture 1 Note: Some slides and/or pictures are adapted from Lecture slides / Books of Dr Zafar Alvi. Text Book - Aritificial Intelligence Illuminated.
Human abilities Presented By Mahmoud Awadallah 1.
How We’re Going to Solve the AI Problem Pedro Domingos Dept. Computer Science & Eng. University of Washington.
The Particle Theory, Temperature and Thermal Energy.
Introduction to Programming ICS2O Findlay. Learning Goals  We will learn  The definitions of a computer, program and programming language.  The different.
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
Computer Graphics 2 In the name of God. Outline Introduction Animation The most important senior groups Animation techniques Summary Walking, running,…examples.
Introduction EE 520: Image Analysis & Computer Vision.
CSC 423 ARTIFICIAL INTELLIGENCE Intelligence Agents.
Artificial Intelligence By Michelle Witcofsky And Evan Flanagan.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
How Solvable Is Intelligence? A brief introduction to AI Dr. Richard Fox Department of Computer Science Northern Kentucky University.
I Robot.
Robotics. What is a robot? Have you seen a robot? Where? – In a photo? – Video? – In real life?
Braitenberg Vehicles. A little history… Valentino Braitenberg (born 1926) is a cyberneticist and former director at the Max Planck Institute for Biological.
The Scientific Method Variables (Physics Unit 1) What is a variable? What types of variables are there?
Cognitive ability: Challenge: How to recognize objects in a scene; where are the object’s boundaries? This problem is known as ‘image segmentation’ in.
Energy forms and transformations. What is energy? We use the word all the time – but very few people have a strong understanding what it is It.
Sorting: Implementation Fundamental Data Structures and Algorithms Klaus Sutner February 24, 2004.
Thermostat or Thermometer Which Are You?. Are you setting the climate or simply reflecting it?
Braitenberg Vehicles. A little history… Valentino Braitenberg (born 1926) is a cyberneticist and former director at the Max Planck Institute for Biological.
CS851 – Biological Computing February 6, 2003 Nathanael Paul Randomness in Cellular Automata.
What is Artificial Intelligence?
Young Guns Production By Lisa Figueira. Planning.
Warm-Up Identify an important decision you made in the last month. Explain what factors influenced your decision. Were you pleased with the outcome?
AGENDA: flash cards and HW are due Friday Warm Up: 1. Brainstorm a list of about 5 moving objects. Share your ideas about what makes those objects move.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Week 2 - Tutorial Interactive Digital Moving Image Production | CU3003NI | - Pratik Man Singh Pradhan.
1 How to Take Tests 4 How to Take a True/False Test.
Introduction to Computer Science – Chapter 6 CSc 2010 Spring 2011 Marco Valero.
Christoph F. Eick: Thoughts on the Rook Project Challenges of Playing Bridge Well 
LCC 6310 Computation as an Expressive Medium Lecture 11.
Animation Animation is about bringing things to life Technically: –Generate a sequence of images that, when played one after the other, make things move.
  Computer vision is a field that includes methods for acquiring,prcessing, analyzing, and understanding images and, in general, high-dimensional data.
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
The Robot Revolution has been Postponed (until we can debug the sensors) Bill Smart Oregon State University
Machine learning & object recognition Cordelia Schmid Jakob Verbeek.
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Course Instructor: knza ch
Chapter 1: Creating a Program.
Presentation transcript:

How do we know that we solved vision? : Learning-Based Methods in Vision A. Efros, CMU, Spring 2009

Columbia Object Image Library (COIL-100) (1996)

Corel Dataset

Yu & Shi, 2004

Average Caltech categories (Torralba)

{ } all photos Flickr.com

Flickr Paris

Real Paris

Automated Data Collection Kang, Efros, Hebert, Kanade, 2009

Something More Objective? Middlebury Stereo Dataset Famous Tsukuba Image

Issue 1 We might be testing too soon… Need to evaluate the entire system: –Give it enough data –Ground it in the physical world –Allow it to affect / manipulate its environment Do we need to solve Hard AI? –Maybe not. We don’t need Human Vision per se – how about Rat Vision?

Issue 2 We might be looking for “magic” where none exist…

Valentino Braitenberg, Vehicles Source Material: Pearl_Street/Margin/Vehicles/index.html Introduces a series of (hypothetical) simple robots that seem, to the outside observer, to exhibit complex behavior. The complex behavior does not come from a complex brain, but from a simple agent interacting with a rich environment. Vehicle 1: Getting around A single sensor is attached to a single motor. Propulsion of the motor is proportional to the signal detected by the sensor. The vehicle will always move in a straight line, slowing down in the cold, speeding up in the warm. Braitenberg: “Imagine, now, what you would think if you saw such a vehicle swimming around in a pond. It is restless, you would say, and does not like warm water. But it is quite stupid, since it is not able to turn back to the nice cold sport it overshot in its restless ness. Anyway, you would say, it is ALIVE, since you have never seen a particle of dead matter move around quite like that.”

More complex vehicles

Moral of the Story “Law of Uphill Analysis and Downhill Invention: machines are easy to understand if you’re creating them; much harder to understand ‘from the outside’. Psychological consequence: if we don’t know the internal structure of a machine, we tend to overestimate its complexity.”

Turing Tests for Vision Your thoughts…

Have we solved vision if we solve all the boundary cases? Varum

Computer Vision Database Zhaoyin Jia Object segmentation/recognition Detailed segmented/labeled, all the scenes in life. Semantic meaning in image/video Human understanding of the image/story behind the image Feeling/reaction after understanding During the Spring break Before the deadline Failed in Best project in LoveKissIn the class Cute Adorable Safe More threatened Run faster Need more help Threatened Run Call for help

How do we know that we solved vision? General Rule: Turing test If CVS == HVS in Training & Performance & Speed & Failure case Then We declare vision is solved. Beers and Being laid off. Verifiable Specific Rules: Challenges in Training Full-automatic object Discovery & Categorization from unlabeled, long video sequence. Multi-view robust real-time Recognition of ten of thousands of objects, given few trainings of each object. Challenges in Performance Pixel-wise Localization and Registration in cluttered and degraded scene; Long-term real-time robust Tracking for generic objects in cluttered and degraded video sequence. Human failure – human vision illusion Able to explain human vision illusions, and Reproduce them. Conclusion: Good luck for all! Yuandong Tian : Learning-based method in vision

Turing Test for Vision From the blog: – No overall test. Vision is task-dependent. Do one problem at a time. –Use Computer Graphics to generate tons of test data –A well-executed Grand Challenge Genre Classification in Video –The Ultimate Dataset (25-year-old grad student) –Need to handle corner cases / illusions. “Dynamic range of difficulty”. –It’s all about committees, independent evaluations, and releasing source code –It’s hopeless…