Human Intention Prediction Using Two-Stream Spatio-Temporal Features

Slides:



Advertisements
Similar presentations
TRECVID 2011 Surveillance Event Detection Speaker: Lu Jiang Longfei Zhang, Lu Jiang, Lei Bao, Shohei Takahashi, Yuanpeng Li, Alexander.
Advertisements

Decision Tree Approach in Data Mining
1 Challenge the future HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences Omar Oreifej Zicheng Liu CVPR 2013.
Alyssa Wolejko Advisor: Nick Webb. How do we distinguish differences?  Terminology: A pitching motion is a single pitch thrown by a pitcher  Each pitch.
UML (Sequence Diagrams, Collaboration and State Chart Diagrams) Presentation By - SANDEEP REDDY CHEEDEPUDI (Student No: ) - VISHNU CHANDRADAS (Student.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.
Decision Tree under MapReduce Week 14 Part II. Decision Tree.
On the Relationship between Visual Attributes and Convolutional Networks Paper ID - 52.
Efficient Scalable Video Compression by Scalable Motion Coding Review and Implementation of DWT Based Approach Syed Jawwad Bukhari
WEEK 6: DEEP TRACKING STUDENTS: SI CHEN & MEERA HAHN MENTOR: AFSHIN DEGHAN.
Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
Tracker-based Peer Selection using ALTO Map Information draft-yang-tracker-peer-selection-00 Y. Richard Yang Richard Alimi, Ye Wang, David Zhang, Kai Lee.
Abstract Developing sign language applications for deaf people is extremely important, since it is difficult to communicate with people that are unfamiliar.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Micro expression Detection using Strain Patterns - Sridhar Godavarthy Based on V.Manohar, D.B. Goldgof, S.Sarkar, Y. Zhang, "Facial Strain Pattern as a.
Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim
Transfer Learning Motivation and Types Functional Transfer Learning Representational Transfer Learning References.
Unified Modeling Language* Keng Siau University of Nebraska-Lincoln *Adapted from “Software Architecture and the UML” by Grady Booch.
Human pose recognition from depth image MS Research Cambridge.
Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,
First-Person Activity Recognition: What Are They Doing to Me? M. S. Ryoo and Larry Matthies Jet Propulsion Laboratory, California Institute of Technology,
Skeleton Based Action Recognition with Convolutional Neural Network
Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue
Machine Learning in Compiler Optimization By Namita Dave.
MIS 451 Building Business Intelligence Systems Classification (1)
BY International School of Engineering {We Are Applied Engineering} Disclaimer: Some of the Images and content have been taken from multiple online sources.
Unsupervised Learning of Video Representations using LSTMs
Decision Trees an introduction.
San Diego May 22, 2013 Giovanni Saponaro Giampiero Salvi
Presented by: Chung-Hsien Yu
Deep Recurrent Q-Learning of Behavioral Intervention
Gedas Adomavicius Jesse Bockstedt
A Personal Tour of Machine Learning and Its Applications
Semi-supervised Machine Learning Gergana Lazarova
DNN-Based Urban Flow Prediction
Ajita Rattani and Reza Derakhshani,
Context-Aware Modeling and Recognition of Activities in Video
Welcome to GIS in Water Resources 2017
An Inteligent System to Diabetes Prediction
Hu Li Moments for Low Resolution Thermal Face Recognition
Lei Sha, Jing Liu, Chin-Yew Lin, Sujian Li, Baobao Chang, Zhifang Sui
Introduction to Geographic Information Science
Classification by Decision Tree Induction
Two-Stream Convolutional Networks for Action Recognition in Videos
Welcome to GIS in Water Resources 2018
Multiple Instance Learning: applications to computer vision
How to use this template?
2 variants: Global fusion & Local perturbation
Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang
Xiaodan Liang Sun Yat-Sen University
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Pattern recognition in gait activities using a floor sensor system
Welcome to GIS in Water Resources 2013
Object Classification through Deconvolutional Neural Networks
Extraction of Multi-scale Outlier Hierarchy From Spatio-temporal Data Stream Jianming Lv.
An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.
CNN-based Action Recognition Using Adaptive Multiscale Depth Motion Maps And Stable Joint Distance Maps Junyou He, Hailun Xia, Chunyan Feng, Yunfei Chu.
Chengyang Zhang Computer Science Department University of North Texas
Open Topics.
Mitsuo Kawato ATR Computational Neuroscience Labs
Project on Implementation of Wave Front Planner Algorithm
Color-Attributes-Related Image Retrieval
Presented By: Harshul Gupta
Multi-UAV Detection and Tracking
Presenter: Sihong LIN Adviser: Bei ZHOU 9 July, 2019
Week 3 Volodymyr Bobyr.
… 1 2 n A B V W C X 1 2 … n A … V … W … C … A X feature 1 feature 2
Week 7 Presentation Ngoc Ta Aidean Sharghi
Presentation transcript:

Human Intention Prediction Using Two-Stream Spatio-Temporal Features Shengchao Li, Lin Zhang, and Xiumin Diao School of Engineering Technology, Purdue University Phone: +1 (765) 494-2212; Email: diaox@purdue.edu ACM/IEEE International Conference on Human-Robot Interactions March 11-14, 2018, Daegu, Korea

Experiment Overview The experiment is about a human participant pitching a ball toward a robot (represented by a target area in the figure). The target area is divided into 9 grids which represent the 9 targets that the ball can hit. We expect “the robot” to be able to predict the intention of the participant, namely, which of the nine targets the participant is pitching the ball to.

Training Data RGB images sequence Optical flow sequence

Approach Spatial network: Optimize early fusion methods of RGB images and optical flow for spatial network. Terminate to a leaf node when: No more instances exist, No additional attributes exist, All instances belong to the same class Classifications apply to all instances that reach the leaf node

Approach Temporal network: Capture motion information in temporal network. Terminate to a leaf node when: No more instances exist, No additional attributes exist, All instances belong to the same class Classifications apply to all instances that reach the leaf node

Approach Two-stream network with average fusion: Final average fusion to improve prediction performance. Terminate to a leaf node when: No more instances exist, No additional attributes exist, All instances belong to the same class Classifications apply to all instances that reach the leaf node

Results In this example, the leaf nodes represent a class. However, this does not have to be the case. Some

Thank you!