Human Intention Prediction Using Two-Stream Spatio-Temporal Features

Slides:

Advertisements

Similar presentations

TRECVID 2011 Surveillance Event Detection Speaker: Lu Jiang Longfei Zhang, Lu Jiang, Lei Bao, Shohei Takahashi, Yuanpeng Li, Alexander.

Advertisements

Decision Tree Approach in Data Mining

1 Challenge the future HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences Omar Oreifej Zicheng Liu CVPR 2013.

Alyssa Wolejko Advisor: Nick Webb. How do we distinguish differences?  Terminology: A pitching motion is a single pitch thrown by a pitcher  Each pitch.

UML (Sequence Diagrams, Collaboration and State Chart Diagrams) Presentation By - SANDEEP REDDY CHEEDEPUDI (Student No: ) - VISHNU CHANDRADAS (Student.

Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.

Decision Tree under MapReduce Week 14 Part II. Decision Tree.

On the Relationship between Visual Attributes and Convolutional Networks Paper ID - 52.

Efficient Scalable Video Compression by Scalable Motion Coding Review and Implementation of DWT Based Approach Syed Jawwad Bukhari

WEEK 6: DEEP TRACKING STUDENTS: SI CHEN & MEERA HAHN MENTOR: AFSHIN DEGHAN.

Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.

Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning

Tracker-based Peer Selection using ALTO Map Information draft-yang-tracker-peer-selection-00 Y. Richard Yang Richard Alimi, Ye Wang, David Zhang, Kai Lee.

Abstract Developing sign language applications for deaf people is extremely important, since it is difficult to communicate with people that are unfamiliar.

GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.

Micro expression Detection using Strain Patterns - Sridhar Godavarthy Based on V.Manohar, D.B. Goldgof, S.Sarkar, Y. Zhang, "Facial Strain Pattern as a.

Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim

Transfer Learning Motivation and Types Functional Transfer Learning Representational Transfer Learning References.

Unified Modeling Language* Keng Siau University of Nebraska-Lincoln *Adapted from “Software Architecture and the UML” by Grady Booch.

Human pose recognition from depth image MS Research Cambridge.

Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,

First-Person Activity Recognition: What Are They Doing to Me? M. S. Ryoo and Larry Matthies Jet Propulsion Laboratory, California Institute of Technology,

Skeleton Based Action Recognition with Convolutional Neural Network

Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue

Machine Learning in Compiler Optimization By Namita Dave.

MIS 451 Building Business Intelligence Systems Classification (1)

BY International School of Engineering {We Are Applied Engineering} Disclaimer: Some of the Images and content have been taken from multiple online sources.

Unsupervised Learning of Video Representations using LSTMs

Decision Trees an introduction.

San Diego May 22, 2013 Giovanni Saponaro Giampiero Salvi

Presented by: Chung-Hsien Yu

Deep Recurrent Q-Learning of Behavioral Intervention

Gedas Adomavicius Jesse Bockstedt

A Personal Tour of Machine Learning and Its Applications

Semi-supervised Machine Learning Gergana Lazarova

DNN-Based Urban Flow Prediction

Ajita Rattani and Reza Derakhshani,

Context-Aware Modeling and Recognition of Activities in Video

Welcome to GIS in Water Resources 2017

An Inteligent System to Diabetes Prediction

Hu Li Moments for Low Resolution Thermal Face Recognition

Lei Sha, Jing Liu, Chin-Yew Lin, Sujian Li, Baobao Chang, Zhifang Sui

Introduction to Geographic Information Science

Classification by Decision Tree Induction

Two-Stream Convolutional Networks for Action Recognition in Videos

Welcome to GIS in Water Resources 2018

Multiple Instance Learning: applications to computer vision

How to use this template?

2 variants: Global fusion & Local perturbation

Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang

Xiaodan Liang Sun Yat-Sen University

MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.

Pattern recognition in gait activities using a floor sensor system

Welcome to GIS in Water Resources 2013

Object Classification through Deconvolutional Neural Networks

Extraction of Multi-scale Outlier Hierarchy From Spatio-temporal Data Stream Jianming Lv.

An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.

CNN-based Action Recognition Using Adaptive Multiscale Depth Motion Maps And Stable Joint Distance Maps Junyou He, Hailun Xia, Chunyan Feng, Yunfei Chu.

Chengyang Zhang Computer Science Department University of North Texas

Mitsuo Kawato ATR Computational Neuroscience Labs

Project on Implementation of Wave Front Planner Algorithm

Color-Attributes-Related Image Retrieval

Presented By: Harshul Gupta

Multi-UAV Detection and Tracking

Presenter: Sihong LIN Adviser: Bei ZHOU 9 July, 2019

Week 3 Volodymyr Bobyr.

… 1 2 n A B V W C X 1 2 … n A … V … W … C … A X feature 1 feature 2

Week 7 Presentation Ngoc Ta Aidean Sharghi

Presentation transcript:

Human Intention Prediction Using Two-Stream Spatio-Temporal Features Shengchao Li, Lin Zhang, and Xiumin Diao School of Engineering Technology, Purdue University Phone: +1 (765) 494-2212; Email: diaox@purdue.edu ACM/IEEE International Conference on Human-Robot Interactions March 11-14, 2018, Daegu, Korea

Experiment Overview The experiment is about a human participant pitching a ball toward a robot (represented by a target area in the figure). The target area is divided into 9 grids which represent the 9 targets that the ball can hit. We expect “the robot” to be able to predict the intention of the participant, namely, which of the nine targets the participant is pitching the ball to.

Training Data RGB images sequence Optical flow sequence

Approach Spatial network: Optimize early fusion methods of RGB images and optical flow for spatial network. Terminate to a leaf node when: No more instances exist, No additional attributes exist, All instances belong to the same class Classifications apply to all instances that reach the leaf node

Approach Temporal network: Capture motion information in temporal network. Terminate to a leaf node when: No more instances exist, No additional attributes exist, All instances belong to the same class Classifications apply to all instances that reach the leaf node

Approach Two-stream network with average fusion: Final average fusion to improve prediction performance. Terminate to a leaf node when: No more instances exist, No additional attributes exist, All instances belong to the same class Classifications apply to all instances that reach the leaf node

Results In this example, the leaf nodes represent a class. However, this does not have to be the case. Some

Thank you!