Visual Recognition: The Big Picture Jitendra Malik University of California at Berkeley.

Slides:



Advertisements
Similar presentations
Chapter 4: The Visual Cortex and Beyond
Advertisements

Physical Education Vocabulary. Body Composition The amount of fat tissues and lean tissue in the body.
Growing up!. Ages 0 – 6 months  Turn their head toward sounds and movement  Gradually holds own head up  Watch an adult's face when feeding  Smile.
P I PLAYING GAMES SOCIAL STUDIES L1 - L2 MUSIC and RHYTHOM MATHS ART DRAMA.
The Visual System: Feature Detection Model Lesson 17.
Classification spotlights
Newborn & Infant Development Georgia CTAE Resource Network Instructional Resources Office Written by: LaDonna Steele Bartmas July 2009.
100 most common verbs In English Press the word to hear it!
Level 1 Enter the water unaided from side or steps Breath control holding the side of the pool, submerge face three times rhythmically exhaling beneath.
My Group’s Current Research on Image Understanding.
DANCE CONCEPTS REVIEW. SPACE SIZE: large, small LEVEL: high, mid-level, low SHAPE: curved, straight DIRECTIONS: forward, backward, sideways, diagonal,
SAFETY FIRST on the playground Bici South.  When you see a RED background, there are children who are NOT being safe on the playground.  When you see.
Craig Nicholson British Swimming
1 Contours and Junctions in Natural Images Jitendra Malik University of California at Berkeley (with Jianbo Shi, Thomas Leung, Serge Belongie, Charless.
16.899A: Physiology (contd) Lavanya Sharan January 24 th, 2011.
1 Biological Neural Networks Example: The Visual System.
Recognizing Objects and Actions in Images Jitendra Malik U.C. Berkeley.
Front-end computations in human vision Jitendra Malik U.C. Berkeley References: DeValois & DeValois,Hubel, Palmer, Spillman &Werner, Wandell Jitendra Malik.
Computer Vision Group University of California Berkeley Visual Grouping and Object Recognition Jitendra Malik * U.C. Berkeley * with S. Belongie, C. Fowlkes,
Visual Cognition I basic processes. What is perception good for? We often receive incomplete information through our senses. Information can be highly.
Computational Vision Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
Categorization: Scenes & Objects (P) Lavanya Sharan March 16th, 2011.
Computational Vision Jitendra Malik, UC Berkeley.
LYNN VERMEIREN, LAUREN WALTHER, RYAN BOYER, LYNDA MASTERSON, ELIZABETH LANDAU Developmental Milestones 3-36 Months.
What can parents do to help promote the sensory and motor development in their children to lay the foundation for early school success?
Motor Development. What IS “motor development”? Crawling.
Language/Literacy Speak in complete sentences Use new vocabulary to express feelings & ideas Say ABCs Exposure to rhyming (say/recall simple nursery.
Developing Object Control Skills
Developmental Stages of Infants
Physical Health and Motor Development. Agenda Body Growth Brain Development Sensory Development Influences on Growth and Development Gross Motor Development.
Animal Adaptations S4L2. Students will identify factors that affect the survival or extinction of organisms such as adaptation, variation of behaviors.
Preschool Physical Development 3 & 4 year olds
1 Contours and Junctions in Natural Images Jitendra Malik University of California at Berkeley (with Jianbo Shi, Thomas Leung, Serge Belongie, Charless.
Handwritten digit recognition Jitendra Malik. Handwritten digit recognition (MNIST,USPS) LeCun’s Convolutional Neural Networks variations (0.8%, 0.6%
Technical & Expressive Nature of Dance Technical Nature.
Multiple Intelligences
Copy the chart Age Physical Cognitive Social Communication.
Subject wearing a VR helmet immersed in the virtual environment on the right, with obstacles and targets. Subjects follow the path, avoid the obstacels,
WHAT IS COMMUNICATION?. COMMUNICATION Different types of communication:  One to one conversations  Group conversations  Formal communication  Informal.
Sign Language – 5 Parameters Sign Language has an internal structure. They can be broken down into smaller parts. These parts are called the PARAMETERS.
By Amy M. Burns © Pre-Kindergarten and Kindergarten Music and Movement Class Activity: Moving with Music.
Physical Literacy and the Young Child Patricia Maude MBE Homerton College, University of Cambridge, England.
Computer Vision Group University of California Berkeley On Visual Recognition Jitendra Malik UC Berkeley.
Human vision Jitendra Malik U.C. Berkeley. Visual Areas.
What can we do ? Hello !. Name the sound. Read the words. a: t h b ai r w z f p эе can’t, park, car tree, pet, eat, flute can, black, animal big, bike,
NAMES I CAN… Take responsibility for my belongings Get changed appropriately Demonstrate 3 Rules: Toilet - Shower – Blow nose Wear my costume (hat if.
Multiple Intelligences What is intelligence?. We are all smart. We are smart in different ways. One way is not better than another.
Babysitting Class Orange City Area Health System Characteristics and Milestones of Children.
The Human Visual System Background on Vision Human vision – the best system around Deep network models.
1 Computational Vision CSCI 363, Fall 2012 Lecture 12 Review for Exam 1.
Computational Vision Jitendra Malik University of California, Berkeley.
Slide NumberStroke 2Front Crawl 3Backstroke 4Breaststroke 5Butterfly.
Настоящее продолженное. I go to school. – Я хожу в школу. I am going to school now. – Я иду в школу сейчас. Present Simple Present Continuous.
yfcoart factory Warm-up hagul laugh Warm-up manomdc command Warm-up.
Gross Motor Skills Motor skills used in sitting, crawling, walking, and reaching for things.
Gross Motor Skills Gross motor skills refer to activities that involve the use of the large muscles of the neck, trunk, arms, and legs.
Unit 5 Whose dog is it? B Let’s learn. jump play sleep eat walk do homework drink climb.
Visual Computation and Learning Lab
P. E. Activity Sheets Knockconan N.S. 2010
Verb Tense Worksheet (1)
Visual Attributes in Video
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Optic Nerve Projections
ACTION WONDERFUL! Sing Run Play football Eat Dance Walk
1 Copyright © 2005 – 2006 MES-English.com.
The Visual System: Feature Detection Model
What animal is it?.
Present Continuous He She is They are It WELL DONE! dancing drinking
Clear the court. Dynamic warmup Legs + Arms
Presentation transcript:

Visual Recognition: The Big Picture Jitendra Malik University of California at Berkeley

The more you look, the more you see!

PASCAL Visual Object Challenge

We want to locate the object Orig. ImageSegmentation Orig. ImageSegmentation

The Visually Tagged Human Project And we want to detect and label parts..

Computer Vision Group UC Berkeley Categorization at Multiple Levels Tiger Grass Water Sand outdoor wildlife Tiger tail eye legs head back shadow mouth

Examples of Actions Movement and posture change –run, walk, crawl, jump, hop, swim, skate, sit, stand, kneel, lie, dance (various), … Object manipulation –pick, carry, hold, lift, throw, catch, push, pull, write, type, touch, hit, press, stroke, shake, stir, turn, eat, drink, cut, stab, kick, point, drive, bike, insert, extract, juggle, play musical instrument (various)… Conversational gesture –point, … Sign Language

Computer Vision Group University of California Berkeley We need to identify Objects Agents Relationships among objects with objects, objects with agents, agents with agents … Events and Actions

Computer Vision Group UC Berkeley Taxonomy and Partonomy Taxonomy: E.g. Cats are in the order Felidae which in turn is in the class Mammalia –Recognition can be at multiple levels of categorization, or be identification at the level of specific individuals, as in faces. Partonomy: Objects have parts, they have subparts and so on. The human body contains the head, which in turn contains the eyes. These notions apply equally well to scenes and to activities. Psychologists have argued that there is a “basic-level” at which categorization is fastest (Eleanor Rosch et al). In a partonomy each level contributes useful information for recognition.

Visual Processing Areas

Macaque Visual Areas

Computer Vision Group UC Berkeley Object Detection can be very fast On a task of judging animal vs no animal, humans can make mostly correct saccades in 150 ms (Kirchner & Thorpe, 2006) –Comparable to synaptic delay in the retina, LGN, V1, V2, V4, IT pathway. –Doesn’t rule out feed back but shows feed forward only is very powerful Detection and categorization are practically simultaneous (Grill-Spector & Kanwisher, 2005)

Hubel and Wiesel (1962) discovered orientation sensitive neurons in V1

These cells respond to edges and bars..

Computer Vision Group UC Berkeley Orientation based features were inspired by V1 (SIFT, GIST, HOG, GB etc)

Computer Vision Group UC Berkeley Attneave’s Cat (1954) Line drawings convey most of the information

Rolls et al (2000)

Convolutional Neural Networks (LeCun et al)