USING LINKING FEATURES IN LEARNING NON-PARAMETRIC PART MODELS * Ammar Kamal Hattab ENGN2560 Final Project Presentation May 17, 2013 * Leonid Karlinsky,

Slides:



Advertisements
Similar presentations
Evidential modeling for pose estimation Fabio Cuzzolin, Ruggero Frezza Computer Science Department UCLA.
Advertisements

The chains model for detecting parts by their context Leonid Karlinsky, Michael Dinerstein, Daniel Harari, and Shimon Ullman.
Top-Down & Bottom-Up Segmentation
Sweeping a Variable Resistor Wheatstone Bridge. Place the Parts 1.VDC 2.R, which you place 3 times. The numbering of the resistor increases sequentially.
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
Part Based Models Andrew Harp. Part Based Models Physical arrangement of features.
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Semantic Texton Forests for Image Categorization and Segmentation We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד.
An Online Microsoft Word Tutorial & Evaluation Begin.
Face Alignment with Part-Based Modeling
Logging In Go to web site:
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
2D Human Pose Estimation in TV Shows Vittorio Ferrari Manuel Marin Andrew Zisserman Dagstuhl Seminar July 2008.
The University of Wisconsin-Madison Universal Morphological Analysis using Structured Nearest Neighbor Prediction Young-Bum Kim, João V. Graça, and Benjamin.
Knowing a Good HOG Filter When You See It: Efficient Selection of Filters for Detection Ejaz Ahmed 1, Gregory Shakhnarovich 2, and Subhransu Maji 3 1 University.
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Scale Invariant Feature Transform (SIFT)
MAT 1000 Mathematics in Today's World Winter 2015.
Predicting Matchability - CVPR 2014 Paper -
I mage and M edia U nderstanding L aboratory for Performance Evaluation of Vision-based Real-time Motion Capture Naoto Date, Hiromasa Yoshimoto, Daisaku.
Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.
MAT 1000 Mathematics in Today's World Winter 2015.
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Disambiguation of References to Individuals Levon Lloyd (State University of New York) Varun Bhagwan, Daniel Gruhl (IBM Research Center) Varun Bhagwan,
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
Object detection, tracking and event recognition: the ETISEO experience Andrea Cavallaro Multimedia and Vision Lab Queen Mary, University of London
Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
Association Rule Mining on Multi-Media Data Auto Annotation on Images Bhavika Patel Hau San Si Tou Juveria Kanodia Muhammad Ahmad.
Professor: S. J. Wang Student : Y. S. Wang
Combining Regression Trees and Radial Basis Function Networks paper by: M. Orr, J. Hallam, K. Takezawa, A. Murray, S. Ninomiya, M. Oide, T. Leonard presentation.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Endnote Tutorial How to Navigate Click to move forward Click to go back Click to return to Menu page.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
Human pose recognition from depth image MS Research Cambridge.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.
1 Image Matching using Local Symmetry Features Daniel Cabrini Hauagge Noah Snavely Cornell University.
Timo Ahonen, Abdenour Hadid, and Matti Pietikainen
By: David Gelbendorf, Hila Ben-Moshe Supervisor : Alon Zvirin
Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations ZUO ZHEN 27 SEP 2011.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Learning video saliency from human gaze using candidate selection CVPR2013 Poster.
Project 3 SIFT Matching by Binary SIFT
Frank Bergschneider February 21, 2014 Presented to National Instruments.
What is an Object? —— an experimental evaluation Presented by: Yao Pan.
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Animating Human Locomotion
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Compositional Human Pose Regression
Nonparametric Semantic Segmentation
Modeling the world with photos
Boosting Nearest-Neighbor Classifier for Character Recognition
Sketching an Isometric Circle
Limits of Accuracy.
Synthesis of Motion from Simple Animations
How to Submit Google Docs to the Homework Drop Box
Limits of Accuracy.
Sketching an Isometric Circle
ECE734 Project-Scale Invariant Feature Transform Algorithm
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Given that {image} {image} Evaluate the limit: {image} Choose the correct answer from the following:
Presentation transcript:

USING LINKING FEATURES IN LEARNING NON-PARAMETRIC PART MODELS * Ammar Kamal Hattab ENGN2560 Final Project Presentation May 17, 2013 * Leonid Karlinsky, Shimon Ullman, ECCV (3) 2012

Project Goal tll Nk Torso trl bll brl

Linking Features Method links The elbow appearance “links” the correct arm part candidates How do we choose the right lower arm candidate?  To use local features in strategic locations  To provide evidence on the connectivity of the part candidates

ALGORITHM STEPS

Training Steps Movie File Linking Features SIFT Extract Annotate Parts Features Training Model Database Save

Testing Steps SIFT Max With Orientations Training Model Database KDE Using Linking Features

MY PROGRESS

Mid Presentation Status  I was able to generate parts candidates  I was able to use linking features to find the correct configuration of two part candidates P = P =

Problems  Applying it to many images resulted in many errors:  In the detected center location of the parts  In the detected orientations of the parts  So to fix these errors : 1. Added two circles to the two ends of each part stick. 2. Fixed the voting locations (each feature votes for 25 locations) 3. Evaluated many different orientations

 Instead of using boxes only to collect features for different parts  Adding two circles to both ends enhances finding candidate part centers 1- Adding Two Circles

2- Finding Correct Voting Locations  Each test image feature votes for candidate center locations (using Nearest Neighbors)  The correct candidate center locations could be found by adding  the offset between training Nearest Neighbors features and their center locations  to the feature location

2- Finding Correct Voting Locations  Example: Eye Feature

2- Finding Correct Voting Locations  Example: Eye Feature Nearest Neighbors

2- Finding Correct Voting Locations Eye Feature (Test Image)One of the Nearest Neighbors (Training Image) Candidate Center Training Center Using the offset Voting of the Head Center Location

2- Finding Correct Voting Locations Using the Eye FeatureUsing All the Feature In the Image 25 voting locations Voting of the Head Center Location

2- Finding Correct Voting Locations Voting of the Torso Center Location

2- Finding Correct Voting Locations Voting of the Upper Left Arm Center Location

3- Using Many Orientations  To fix the problem of wrong orientations I used 7 orientations instead of three (as in the paper) to find the correct part orientation

EVALUATION AND RESULTS

Dataset  I have tested the algorithm using a movie file,  32 frames for training  50 frames for testing  Running the algorithm on this file took around 10 hours

Evaluation Criterion  I used the standard PCP criterion (Percentage of Correctly Detected Body Parts) for parts detection which is used by the author of the paper.  PCP t Criterion: both endpoints of the detected part should be within t ground-truth part length from the ground-truth part endpoints. Ground Truth Detected Part

Result  Result Detection:

My Result  Result PCP Curve  Result PCP 0.5 =  96% of the parts are returned with 0.5 L from the ground truth

Paper Results Paper Results PCP 0.5 Paper PCP Curve

Conclusion  My implementation gave higher PCP 0.5  due to the use of smaller dataset (50 images)  with fewer hard positions  Compared to the paper which applied it to large datasets with hundreds of images

Conclusion about Linking Features Algorithm  Provides high detection results comparable to state of the art methods.  Doesn’t need prior kinematic constrains  Linking Features add much values compared to part candidates scores alone  Could be combine with other methods to have better results.  Poor speed performance  Needs more clarification

END

My Results

2- Finding Correct Voting Locations  Each test image feature votes for candidate center locations (using Nearest Neighbors) with voting weight proportional with : d r descriptors distance to r th neighbor o is the offset between feature and center