CVPR 2012 POSTER Mobile Object Detection through Client-Server based Vote Transfer.

Slides:



Advertisements
Similar presentations
Computer Vision Group UC Berkeley How should we combine high level and low level knowledge? Jitendra Malik UC Berkeley Recognition using regions is joint.
Advertisements

1 / 18 Network Characteristics of Video Streaming Traffic Ashwin Rao, Yeon-sup Lim *, Chadi Barakat, Arnaud Legout, Don Towsley *, and Walid Dabbous INRIA.
Université du Québec École de technologie supérieure Face Recognition in Video Using What- and-Where Fusion Neural Network Mamoudou Barry and Eric Granger.
Mobile Application Development Keshav Bahadoor. Part 1 Cross Platform Web Applications.
Good afternoon everyone
A Discriminative Key Pose Sequence Model for Recognizing Human Interactions Arash Vahdat, Bo Gao, Mani Ranjbar, and Greg Mori ICCV2011.
1 / 21 Network Characteristics of Video Streaming Traffic Ashwin Rao †, Yeon-sup Lim *, Chadi Barakat †, Arnaud Legout †, Don Towsley *, and Walid Dabbous.
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
SocialSync: Synchronization for a Smartphone Camera Network Richard Latimer Rice University September 7, 2014.
Patch to the Future: Unsupervised Visual Prediction
Face Detection, Pose Estimation, and Landmark Localization in the Wild
IT’S NOT POLITE TO POINT: DESCRIBING PEOPLE WITH UNCERTAIN ATTRIBUTES CVPR 2013 Poster.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection CVPR2013 POSTER.
Retrieving Actions in Group Contexts Tian Lan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010.
Beyond Actions: Discriminative Models for Contextual Group Activities Tian Lan School of Computing Science Simon Fraser University August 12, 2010 M.Sc.
4EyesFace-Realtime face detection, tracking, alignment and recognition Changbo Hu, Rogerio Feris and Matthew Turk.
Automatic Facial Landmark Tracking in Video Sequences using Kalman Filter Assisted Active Shape Models Utsav Prabhu, Keshav Seshadri, Marios Savvides 報告人.
Multi-criteria infrastructure for location-based applications Shortly known as: Localization Platform Ronen Abraham Ido Cohen Yuval Efrati Tomer Sole'
Linux GUI X-Windows System. X-Windows System A stable and extremely powerful GUI system that provides a complete Client/Server framework. The X-org foundation.
APPLAUS: A Privacy-Preserving Location Proof Updating System for Location-based Services Zhichao Zhu and Guohong Cao Department of Computer Science and.
Object Recognition by Discriminative Methods Sinisa Todorovic 1st Sino-USA Summer School in VLPR July, 2009.
Oral Defense by Sunny Tang 15 Aug 2003
WebQuilt and Mobile Devices: A Web Usability Testing and Analysis Tool for the Mobile Internet Tara Matthews Seattle University April 5, 2001 Faculty Mentor:
MACHINE VISION GROUP Multimodal sensing-based camera applications Miguel Bordallo 1, Jari Hannuksela 1, Olli Silvén 1 and Markku Vehviläinen 2 1 University.
PI : TV A PI MEDIA CENTER Michael Hoskins Network Security & Electronics Program, Dept. of Applied Engineering & Technology, Eastern Kentucky University.
Hamed Pirsiavash, Deva Ramanan, Charless Fowlkes
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Real-time Action Recognition by Spatiotemporal Semantic and Structural Forest Tsz-Ho Yu, Tae-Kyun Kim and Roberto Cipolla Machine Intelligence Laboratory,
Bag of Video-Words Video Representation
Cloud Services for Improved User Experience in Sharing Mobile Videos Authors: Dejan Kovachev, Yiwei Cao and Ralf Klamma Advanced Community Information.
Building Face Dataset Shijin Kong. Building Face Dataset Ramanan et al, ICCV 2007, Leveraging Archival Video for Building Face DatasetsLeveraging Archival.
A General Framework for Tracking Multiple People from a Moving Camera
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Class-Specific Hough Forests for Object Detection Zhen Yuan Hsu Advisor:S.J.Wang Gall, J., Lempitsky, V.: Class-specic hough forests for object detection.
Stable Multi-Target Tracking in Real-Time Surveillance Video
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR
DETECTING AND TRACKING TRACTOR-TRAILERS USING VIEW-BASED TEMPLATES Masters Thesis Defense by Vinay Gidla Apr 19,2010.
ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised and weakly-supervised discovery of events in video (and audio) Fernando De la Torre.
An Effective & Interactive Approach to Particle Tracking for DNA Melting Curve Analysis 李穎忠 DEPARTMENT OF COMPUTER SCIENCE & INFORMATION ENGINEERING NATIONAL.
Recognition Using Visual Phrases
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Skeleton Based Action Recognition with Convolutional Neural Network
Markerless Augmented Reality Platform Design and Verification of Tracking Technologies Author:J.M. Zhong Date: Speaker:Sian-Lin Hong.
Visual Odometry David Nister, CVPR 2004
Unsupervised Salience Learning for Person Re-identification
Learning video saliency from human gaze using candidate selection CVPR2013 Poster.
A REAL-TIME DEFORMABLE DETECTOR 謝汝欣 OUTLINE  Introduction  Related Work  Proposed Method  Experiments 2.
Multi-view Synchronization of Human Actions and Dynamic Scenes Emilie Dexter, Patrick Pérez, Ivan Laptev INRIA Rennes - Bretagne Atlantique
On Detection of Multiple Object Instances using Hough Transforms Olga Barinova Moscow State University Victor Lempitsky University of Oxford Pushmeet Kohli.
Hierarchical Motion Evolution for Action Recognition Authors: Hongsong Wang, Wei Wang, Liang Wang Center for Research on Intelligent Perception and Computing,
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
A Study of Finance App on the Android Platform and
A Study of Google Map API on Android Platform and an “EasyEat”
Guillaume-Alexandre Bilodeau
Logo: if available Project Name:
Implementing Localization
Compositional Human Pose Regression
Proposed architecture of a Fully Integrated
Augmented Reality SDK Introduction
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Logo: if available Project Name:
Combining Geometric- and View-Based Approaches for Articulated Pose Estimation David Demirdjian MIT Computer Science and Artificial Intelligence Laboratory.
Image processing and computer vision
Toonvas Spen의 기적 허기훈 김선호 안지훈.
By Savannah Hiatt INT 492, Spring 2017 Introduction
Category-Sensitive Question Routing in Community Question Answering
School of Information Systems Singapore Management University
Weekly Learning Alex Omar Ruiz Irene.
Presentation transcript:

CVPR 2012 POSTER Mobile Object Detection through Client-Server based Vote Transfer

Outline Introduction Frame detection Mobile application blue-print Experiment Conclusion

Introduction Android OS

Introduction Short video sequence

Introduction Main Contribution:  Novel hough forest based multi-frame object detection framework  Vote transfer  Client-server framework

Frame detection Single-Frame detection  Hough forest [10] [10] J. Gall and V. Lempitsky. Class-specific hough forests for object detection. In CVPR, 2009.

Frame detection P={L,c,d}

Frame detection Multi-Frame detection  Motivation  Different express with single frame detection

Frame detection Multi-Frame detection  Vote transfer

Frame detection Multi-Frame detection  Vote transfer

Frame detection

Mobile application blue-print Client-server

Experiment Datasets  A new multi-view dataset that we collected  the Car Show Dataset introduced by Ozuysal et al [19]  [19] Pose estimation for category specific multiview object localization. In CVPR, 2009

Experiment Vote transfer  Giving each a weight  Reference frame’s weight=1  Other frames’s weight= 2 -i/10, i={10,20,30,40,50}

Experiment Single vs Multi-frame Performance

Experiment Single vs Multi-frame Performance

Experiment Tracking analysis

Experiment Image resolution

Experiment Mobile platform: Client-Server analysis Client:  Motorola Atrix 4g dual-core phone Android 2.2  Image size:640*480 Server:  2.4GHZ triple-core desktop For more information to Motorola Atrix ATRIX-4G/72112,en_US,pd.html?cgid=mobile-phones

Experiment Mobile platform: Client-Server analysis  Single frame  Multi frame

Conclusion A new approach to multi-frame object detection using Hough Forest Realistic implementation Client-server approach on mobile platform About future work: Pose estimation, how view-point changes can foster pose estimation

Thanks for your listening.