CS335 Principles of Multimedia Systems Multimedia and Human Computer Interfaces Hao Jiang Computer Science Department Boston College Nov. 20, 2007.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Detecting Faces in Images: A Survey
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
SECOND MIDTERM REVIEW CS 580 Human Computer Interaction.
 INTRODUCTION  STEPS OF GESTURE RECOGNITION  TRACKING TECHNOLOGIES  SPEECH WITH GESTURE  APPLICATIONS.
Detecting Pedestrians by Learning Shapelet Features
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Human Computer Interaction
John Hu Nov. 9, 2004 Multimodal Interfaces Oviatt, S. Multimodal interfaces Mankoff, J., Hudson, S.E., & Abowd, G.D. Interaction techniques for ambiguity.
Class 6 LBSC 690 Information Technology Human Computer Interaction and Usability.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Stanford hci group / cs376 research topics in human-computer interaction Multimodal Interfaces Scott Klemmer 15 November 2005.
Human Posture Recognition with Convex Programming Hao Jiang, Ze-Nian Li and Mark S. Drew School of Computing Science Simon Fraser University Burnaby, BC,
0 HCI Today Talk about HCI Success Stories Talk about HCI Success Stories Talk about Norman’s Paper Talk about Norman’s Paper Start talking about The human.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
T.Sharon-A.Frank 1 Internet Resources Discovery (IRD) Multimedia - Motivation & Definition.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Face Detection using the Viola-Jones Method
   Input Devices Main Memory Backing Storage PROCESSOR
Multimedia Specification Design and Production 2013 / Semester 2 / week 8 Lecturer: Dr. Nikos Gazepidis
PortableVision-based HCI A Hand Mouse System on Portable Devices 連矩鋒 (Burt C.F. Lien) Computer Science and Information Engineering Department National.
11.10 Human Computer Interface www. ICT-Teacher.com.
GENERAL PRESENTATION SUBMITTED BY:- Neeraj Dhiman.
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Microsoft Research Irfan Ullah Dept. of Info. and Comm. Engr. Myongji University.
Portable Vision-Based HCI A Real-Time Hand Mouse System on Portable Devices 連矩鋒 (Burt C.F. Lien) Department of Computer Science and Information Engineering.
Reconstructing 3D mesh from video image sequences supervisor : Mgr. Martin Samuelčik by Martin Bujňák specifications Master thesis
Stanford hci group / cs376 u Scott Klemmer · 16 November 2006 Speech & Multimod al.
MULTIMEDIA DEFINITION OF MULTIMEDIA
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Face detection Slides adapted Grauman & Liebe’s tutorial
Human Computer Interaction © 2014 Project Lead The Way, Inc.Computer Science and Software Engineering.
卓越發展延續計畫分項三 User-Centric Interactive Media ~ 主 持 人 : 傅立成 共同主持人 : 李琳山,歐陽明,洪一平, 陳祝嵩 水美溫泉會館研討會
1 CP586 © Peter Lo 2003 Multimedia Communication Human Computer Interaction.
Robust Real Time Face Detection
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
CONTENT FOCUS FOCUS INTRODUCTION INTRODUCTION COMPONENTS COMPONENTS TYPES OF GESTURES TYPES OF GESTURES ADVANTAGES ADVANTAGES CHALLENGES CHALLENGES REFERENCE.
Introduction to HCI Lecture #1.
Immersive Virtual Characters for Educating Medical Communication Skills J. Hernendez, A. Stevens, D. S. Lind Department of Surgery (College of Medicine)
Project Course in Pen- Based Computing Fall 2007 Tablet Flash Cards YoungJoo Jeong Manoj Kintali Dave Hacker.
Stanford hci group / cs376 u Jeffrey Heer · 19 May 2009 Speech & Multimodal Interfaces.
Pen Based User Interface Issues CSE 490RA January 25, 2005.
Basic Element of Electronics Data Processing Hardware Hardware Software Software Networking Networking Person involved in Computer Fields Person involved.
Multimodal and Natural computer interaction Evelina Stanevičienė.
MULTIMODAL AND NATURAL COMPUTER INTERACTION Domas Jonaitis.
AUTHOR PRADEEP KUMAR B.tech 1 st year CSE branch Gnyana saraswati college of eng. & technology Dharmaram(b)
Lecture Input Devices Keyboard. Mouse Microphone Digital Camera Scanner.
Lesson 4 Alternative Methods Of Input.
Alternative Methods Of Input
Standard Methods of Input.
Human Computer Interaction (HCI)
Methods of Computer Input and Output
11.10 Human Computer Interface
AHED Automatic Human Emotion Detection
Human Computer Interaction (HCI)
Lesson 4 Alternative Methods Of Input.
Multimodal Interfaces
The Implementation of a Glove-Based User Interface
Multimodal Human-Computer Interaction New Interaction Techniques 22. 1
Map of Human Computer Interaction
AHED Automatic Human Emotion Detection
Lecture 29: Face Detection Revisited
Chapter 9 System Control
Presentation transcript:

CS335 Principles of Multimedia Systems Multimedia and Human Computer Interfaces Hao Jiang Computer Science Department Boston College Nov. 20, 2007

CS335 Principles of Multimedia Systems Outline  Human Computer Interface.  Multimedia interfaces and applications.  Vision based interfaces –Systems using vision based interfaces. –Some basic methods in vision interfaces Background subtraction. Color blob detection and tracking. Face detection (Viola and Jones face detector).

CS335 Principles of Multimedia Systems Human Computer Interface  Different meanings of HCI –The study of people, computer technology and the ways they interactive with each other. –The design, evaluation and implementation of computer systems for human use. –The research to study how to make computer systems more useable.

CS335 Principles of Multimedia Systems Aspects of HCI  Design –Intuition, experiences and design principles.  Human factors –Experiments testing the perception, memory and cognition models of human being.  Devices –Physical devices such as mouse, keyboards and other fancy HCI hardware.  Software and systems –Such as windows GUI, shell commands in Unix etc.

CS335 Principles of Multimedia Systems HCI Studies  Computers –What are they good at and bad at? –Input, output, processing, networking, etc.  People –What are people good at? –What is the limitation of human abilities? –Input, output, memory, perception and cognition model. –No average answer. Must accommodate diversities.  Usability –More than the size of widgets, where are the buttons and syntax of commands.  Context –Environment, physical constraints, cognition constraints, task, etc.

CS335 Principles of Multimedia Systems The Quality of HCI  There are fives factors that affect the design and implementation of HCI –Time to learn. –Speed of performance. –Rate of errors. –Retention over time. –Subjective satisfaction.

CS335 Principles of Multimedia Systems Multimedia Interfaces  Multimedia interface combines different sorts of media modalities to achieve more natural and more efficient human computer communication. –Speech, audio, touch, video input (patterns, posture, gesture) etc.  “Put that there” is an example multimedia HCI that combines speech, gestures and graphics. Put that there, MIT 1981

CS335 Principles of Multimedia Systems Early and Late Integration Mode 1 Mode 2 Classification Features Output Mode 1 Mode 2 Features Classification Output Classification Early integration Late integration

CS335 Principles of Multimedia Systems Advantages of Multimedia Interface  More alternative methods.  The flexibility of combining different schemes.  More efficient for specific tasks.  Greater precision of information.  Help handicapped people to use computers. Pen gesture interface Virtual Reality

CS335 Principles of Multimedia Systems Vision Based Interfaces  Vision based interfaces uses computer vision techniques to achieve more effective HCI.  Vision based interfaces are based on techniques such as: Hand tracking, Hand gesture, Arm gesture Hand Pointing Face detection Head tracking Gaze Expression Lip reading Posture, body gesture and action recognition

CS335 Principles of Multimedia Systems Example Systems Motion Capturing

CS335 Principles of Multimedia Systems Example Systems  The MIT Kidsroom Posture recognition Action detection

CS335 Principles of Multimedia Systems Methods in Vision Interfaces  Background subtraction.  Estimating the background B(n) = B(n)*alpha + I(n)*(1-alpha) Foreground Map = |Image – Background|>threshold alpha is a positive floating point number close to 1.

CS335 Principles of Multimedia Systems Color Blob Detection We would like to compare the template color histogram with each location in the image. We first convert each image pixel by its bin number. s(x,y) = s(x,y-1) + i(x,y) h(x,y) = h(x-1,y)+ s(x,y) (x,y) h(x,y) is the color histogram ab cd

CS335 Principles of Multimedia Systems Blob Tracking (Filtering Scheme) Search window t1 t2 t3 t4 : Predicted location.: detected object. : Final estimation. p x y x = p * alpha + y * (1-alpha)

CS335 Principles of Multimedia Systems Blob Tracking (DP Approach) t2 t3 t4 AB BC BD t1 A B C D E F CF CE DF DE t cost(F) + alpha ||2C-B-F|| const

CS335 Principles of Multimedia Systems Mean Shift Tracking  Instead of exhaustive search in a window, if object moves slowly, we can use a more efficient location updating scheme – Mean shift.  In the following equation, x(i) are points in the search window and y is the location of object, w(i) are weights.

CS335 Principles of Multimedia Systems Mean Shift Tracking Result

CS335 Principles of Multimedia Systems Face Detection  Viola and Jones face detector These features can be computed using integral images similarly to the color histograms discussed before.

CS335 Principles of Multimedia Systems Face Detection  It uses boosting scheme to automatically choose the optimal weight for the classifier based on each kind of feature.  To further improve the efficiency, a cascaded classifier is used.

CS335 Principles of Multimedia Systems Summary  We only touch the very surface of multimedia interfaces.  Currently, the dominant HCI is still GUI with keyboard and mouse.  Multimedia interfaces are beginning to attract more and more interests.  We need more robust and efficient media recognition schemes to support effective multimedia interfaces.