Enabling User Interactions with Video Contents Khalad Hasan, Yang Wang, Wing Kwong and Pourang Irani.

Slides:



Advertisements
Similar presentations
CSE 424 Final Presentation Team Members: Edward Andert Shang Wang Michael Vetrano Thomas Barry Roger Dolan Eric Barber Sponsor: Aviral Shrivastava.
Advertisements

EdgeSplit: Facilitating the Selection of Off-Screen Objects Zahid Hossain Khalad Hasan Hai-Ning Liang Pourang Irani.
A Natural Interactive Game By Zak Wilson. Background This project was my second year group project at University and I have chosen it to present as it.
San Diego SPUG -
Kien A. Hua Division of Computer Science University of Central Florida.
Whole-Home Gesture Recognition Using Wireless Signals —— MobiCom’13 Author: Qifan Pu et al. University of Washington Presenter: Yanyuan Qin & Zhitong Fei.
Hand Gesture for Taking Self Portrait Shaowei Chu and Jiro Tanaka University of Tsukuba Japan 12th July 15 minutes talk.
How to Position the Cursor? An Exploration of Absolute and Relative Cursor Positioning for Back-of-Device Input Khalad Hasan Xing-Dong Yang Hai-Ning Liang.
By : Adham Suwan Mohammed Zaza Ahmed Mafarjeh. Achieving Security through Kinect using Skeleton Analysis (ASKSA)
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Smartwatches + Head-Worn Displays: the ‘New’ Smartphone Marcos Serrano 1, Khalad Hasan 2, Barrett Ens 2, Xing-Dong Yang 3, Pourang Irani 2 1 IRIT- University.
Video Object Tracking and Replacement for Post TV Production LYU0303 Final Year Project Spring 2004.
ADVISE: Advanced Digital Video Information Segmentation Engine
TAUCHI – Tampere Unit for Computer-Human Interaction Manual Text Entry: Experiments, Models, and Systems Poika Isokoski Tampere Unit for Computer-Human.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
CS335 Principles of Multimedia Systems Multimedia and Human Computer Interfaces Hao Jiang Computer Science Department Boston College Nov. 20, 2007.
VOICe 1.5 Enabling Technology - Final Project Gabe Su.
Stanford hci group / cs376 research topics in human-computer interaction I/O Toolkits Scott Klemmer 29 November 2005.
REALTIME OBJECT-OF-INTEREST TRACKING BY LEARNING COMPOSITE PATCH-BASED TEMPLATES Yuanlu Xu, Hongfei Zhou, Qing Wang*, Liang Lin Sun Yat-sen University,
Application Programming Interface For Tracking Face & Eye Motion Team Members Tharaka Roshan Pathberiya Nimesh Saveendra Chamara Susantha Gayan Gunarathne.
CMPUT 617 (Topics in Computing Science): Advanced Image Analysis Nilanjan Ray Fall 2012 Computing Science University of Alberta.
Jan SedmidubskySeptember 23, 2014Motion Retrieval for Security Applications Jan Sedmidubsky Jakub Valcik Pavel Zezula Motion Retrieval for Security Applications.
A Brief Overview of Computer Vision Jinxiang Chai.
GIP: Computer Graphics & Image Processing 1 1 Medical Image Processing & 3D Modeling.
EWatchdog: An Electronic Watchdog for Unobtrusive Emotion Detection based on Usage Analysis Rayhan Shikder Department.
Topic regards: ◆ Browsing of Search Results ◆ Video Retrieval using Spatio-Temporal ◆ Object Tracking ◆ Face tracking Yuan-Hao Lai.
Multimedia Specification Design and Production 2013 / Semester 2 / week 8 Lecturer: Dr. Nikos Gazepidis
PortableVision-based HCI A Hand Mouse System on Portable Devices 連矩鋒 (Burt C.F. Lien) Computer Science and Information Engineering Department National.
Two Handed and Gaze Input Stanford and Princeton Lecture Nov 29, 1999 Shumin Zhai.
KYLE PATTERSON Automatic Age Estimation and Interactive Museum Exhibits Advisors: Prof. Cass and Prof. Lawson.
GENERAL PRESENTATION SUBMITTED BY:- Neeraj Dhiman.
INTRODUCTION Generally, after stroke, patient usually has cerebral cortex functional barrier, for example, the impairment in the following capabilities,
A Method for Hand Gesture Recognition Jaya Shukla Department of Computer Science Shiv Nadar University Gautam Budh Nagar, India Ashutosh Dwivedi.
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Human Computer Interaction © 2014 Project Lead The Way, Inc.Computer Science and Software Engineering.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
卓越發展延續計畫分項三 User-Centric Interactive Media ~ 主 持 人 : 傅立成 共同主持人 : 李琳山,歐陽明,洪一平, 陳祝嵩 水美溫泉會館研討會
 Detecting system  Training system Human Emotions Estimation by Adaboost based on Jinhui Chen, Tetsuya Takiguchi, Yasuo Ariki ( Kobe University ) User's.
Model of the Human  Name Stan  Emotion Happy  Command Watch me  Face Location (x,y,z) = (122, 34, 205)  Hand Locations (x,y,z) = (85, -10, 175) (x,y,z)
 Motivated by desire for natural human-robot interaction  Encapsulates what the robot knows about the human  Identity  Location  Intentions Human.
User Attention Tracking in Large Display Face Tracking and Pose Estimation Yuxiao Hu Media Computing Group Microsoft Research, Asia.
Skeleton Based Action Recognition with Convolutional Neural Network
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
CONTENT FOCUS FOCUS INTRODUCTION INTRODUCTION COMPONENTS COMPONENTS TYPES OF GESTURES TYPES OF GESTURES ADVANTAGES ADVANTAGES CHALLENGES CHALLENGES REFERENCE.
Final Year Project. Project Title Kalman Tracking For Image Processing Applications.
IEEE International Conference on Multimedia and Expo.
A Recognition Method of Restricted Hand Shapes in Still Image and Moving Image Hand Shapes in Still Image and Moving Image as a Man-Machine Interface Speaker.
Quiz Week 8 Topical. Topical Quiz (Section 2) What is the difference between Computer Vision and Computer Graphics What is the difference between Computer.
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
Connor Carey. Aims  Record road scene from Android  Detect speed sign  Determine speed limit  Compare to current speed(GPS)  Alert driver if speeding.
Byron Hood | version 0.3 Computer Systems Lab Project Sign Language Recognition.
CIRP Annals - Manufacturing Technology 60 (2011) 1–4 Augmented assembly technologies based on 3D bare-hand interaction S.K. Ong (2)*, Z.B. Wang Mechanical.
MULTIMODAL AND NATURAL COMPUTER INTERACTION Domas Jonaitis.
Articulated Human Pose Estimation using Gaussian Kernel Correlation
Saliency-guided Video Classification via Adaptively weighted learning
NBKeyboard: An Arm-based Word-gesture keyboard
Video-based human motion recognition using 3D mocap data
Khalad Hasan Xing-Dong Yang Hai-Ning Liang Pourang Irani
Specialist hardware devices for physically disabled users
Assoc. Prof. Dr. Syed Abdul-Rahman Al-Haddad
Progress Report Meng-Ting Zhong 2015/9/10.
Invisible Computing Angie, Collin, Nick, Will.
I/O Toolkits Scott Klemmer · 16 November 2006.
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Natural User Interaction with Perceptual Computing
Evaluating Language Learning Technology
Face Detection Gender Recognition 1 1 (19) 1 (1)
Sign Language Recognition With Unsupervised Feature Learning
Presentation transcript:

Enabling User Interactions with Video Contents Khalad Hasan, Yang Wang, Wing Kwong and Pourang Irani

Motion Control 2

3 End Call Voice Control

4 Face Recognition

5 Gaps Interaction with video contents Suitable querying and interface component Selection technique Motivation

6 Gaps Interaction with video contents Suitable querying and interface component Selection technique Motivation

7 Computer Vision Comparison among state-of-art algorithms Apply best algorithm to extract objects HCI Interaction with video contents Techniques for selection Contribution

Object Detection & Tracking 8

9 Tracking-Learning-Detection (TLD): Kalal et al. Datasets:

10 Struck: Hare et al.

11 StruckTLDStruckTLD Time(sec) Frame/Sec Coke Girl Tiger Tiger Average Speed Comparison

12 TLDStruck Coke Girl Tiger Tiger Average Precision

Interactions 13

14 Input Device

15 Kinect Input

Static Target Selection Target

Moving Target Selection

18 Selection Techniques Left-hand with Basic Left-hand with Ghost Left-hand with Crossing Depth with Basic Depth with Ghost Depth with Crossing

19 Selection Techniques Left-hand with Basic

Action Target Ghost (Khalad et al. CHI 2011)

21 Selection Techniques Left-hand with Ghost

22 Selection Techniques Left-hand with Crossing

23 Selection Techniques Selection Left-hand Depth

24 Results Technique Basic Ghost Crossing Task Completion Time (ms) 6,000 ― 4,000 ― 2,000 ― 0 ― Left-hand Depth

25 Results Technique Basic Ghost Crossing Average Number of Attempts 2.0 ― 1.5 ― 1.0 ― 0.5 ― 0 ― Left-hand Depth

26

27 Take-Home TLD is faster & accurate Both hands for Kinect based interactions Selection is best achieved with static proxies

28 Future work Computer Vision Multiple tracked objects Online detection & tracking HCI Selection Techniques New form of interactions with Kinect

Thank you