Gesture Recognition in a Class Room Environment Michael Wallick CS766.

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.
Active Appearance Models
Automatic Color Gamut Calibration Cristobal Alvarez-Russell Michael Novitzky Phillip Marks.
July 27, 2002 Image Processing for K.R. Precision1 Image Processing Training Lecture 1 by Suthep Madarasmi, Ph.D. Assistant Professor Department of Computer.
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
September 10, 2013Computer Vision Lecture 3: Binary Image Processing 1Thresholding Here, the right image is created from the left image by thresholding,
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
CS 376b Introduction to Computer Vision 04 / 04 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 02 / 27 / 2008 Instructor: Michael Eckmann.
ART: Augmented Reality Table for Interactive Trading Card Game Albert H.T. Lam, Kevin C. H. Chow, Edward H. H. Yau and Michael R. Lyu Department of Computer.
CSE 160 – Lecture 10 Programs 1 and 2. Program 1 Write a “launcher” program to specify exactly where programs are to be spawned, gather output, clean.
Cindy Song Sharena Paripatyadar. Use vision for HCI Determine steps necessary to incorporate vision in HCI applications Examine concerns & implications.
Highlights Lecture on the image part (10) Automatic Perception 16
Virtual Dart – An Augmented Reality Game on Mobile Device Supervised by Prof. Michael R. Lyu LYU0604Lai Chung Sum ( )Siu Ho Tung ( )
Automatic Camera Calibration for Image Sequences of a Football Match Flávio Szenberg (PUC-Rio) Paulo Cezar P. Carvalho (IMPA) Marcelo Gattass (PUC-Rio)
Abstract Overall Algorithm Target Matching Error Checking: By comparing what we transform from Kinect Camera coordinate to robot coordinate with what we.
Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.
A Vision-Based System that Detects the Act of Smoking a Cigarette Xiaoran Zheng, University of Nevada-Reno, Dept. of Computer Science Dr. Mubarak Shah,
What to do when you are done. PRINTING WITH PHOTOSHOP.
Computer Vision Systems for the Blind and Visually Disabled. STATS 19 SEM Talk 3. Alan Yuille. UCLA. Dept. Statistics and Psychology.
EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.
Sort-Last Parallel Rendering for Viewing Extremely Large Data Sets on Tile Displays Paper by Kenneth Moreland, Brian Wylie, and Constantine Pavlakos Presented.
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
CS 376b Introduction to Computer Vision 02 / 26 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 04 / 02 / 2008 Instructor: Michael Eckmann.
Presentation by: K.G.P.Srikanth. CONTENTS  Introduction  Components  Working  Applications.
GROUP 1 Advantages of electronic system over its corresponding traditional system.
NATIONAL TECHNICAL UNIVERSITY OF ATHENS Image, Video And Multimedia Systems Laboratory Background
Recognition of spoken and spelled proper names Reporter : CHEN, TZAN HWEI Author :Michael Meyer, Hermann Hild.
September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.
Lecture 03 Area Based Image Processing Lecture 03 Area Based Image Processing Mata kuliah: T Computer Vision Tahun: 2010.
Video Segmentation Prepared By M. Alburbar Supervised By: Mr. Nael Abu Ras University of Palestine Interactive Multimedia Application Development.
#? rahul swaminathan (T-Labs) & professor patrick baudisch hci2 hasso-plattner institute determining depth.
CS 376b Introduction to Computer Vision 03 / 21 / 2008 Instructor: Michael Eckmann.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Figure ground segregation in video via averaging and color distribution Introduction to Computational and Biological Vision 2013 Dror Zenati.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Synthesizing Natural Textures Michael Ashikhmin University of Utah.
CS 376b Introduction to Computer Vision 02 / 11 / 2008 Instructor: Michael Eckmann.
© ACTS-MoMuSys All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France.
Automatic Video Editing Stanislav Sumec. Motivation  Multiple source video data – several cameras in the meeting room, several meeting rooms in teleconference,
Making a great Project 2 OCR 1994/2360. Implementation This is about how you make your system. It should have enough detail for a competent user to be.
Application of Facial Recognition in Biometric Security Kyle Ferris.
Image Processing A Study in Pixel Averaging Building a Resolution Pyramid With Parallel Computing Denise Runnels and Farnaz Zand.
Quiz Week 8 Topical. Topical Quiz (Section 2) What is the difference between Computer Vision and Computer Graphics What is the difference between Computer.
Machine Vision ENT 273 Hema C.R. Binary Image Processing Lecture 3.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Che-An Wu Background substitution. Background Substitution AlphaMa p Trimap Depth Map Extract the foreground object and put into another background Objective.
Detection, Tracking and Recognition in Video Sequences Supervised By: Dr. Ofer Hadar Mr. Uri Perets Project By: Sonia KanOra Gendler Ben-Gurion University.
Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information.
Equivalence partitioning
Applying Deep Neural Network to Enhance EMPI Searching
Answering ‘Where am I?’ by Nonlinear Least Squares
Multiple Regression.
הפקולטה להנדסת חשמל - המעבדה לבקרה ורובוטיקה גילוי תנועה ועקיבה אחר מספר מטרות מתמרנות הטכניון - מכון טכנולוגי לישראל TECHNION.
Mean transform , a tutorial
Optical Flow For Vision-Aided Navigation
Application of Facial Recognition in Biometric Security
Image filtering Images by Pawan Sinha.
David Harwin Adviser: Petros Faloutsos
Binocular Stereo Vision
Image filtering Images by Pawan Sinha.
Digital Image Processing Week IV
Lecture 8: Image alignment
The Image The pixels in the image The mask The resulting image 255 X
Application of Facial Recognition in Biometric Security
Presentation transcript:

Gesture Recognition in a Class Room Environment Michael Wallick CS766

Virtual Videography Place cameras in an environment Automatically edit video off-line Output should look like a professional editor

Our Implementation Looking at the classroom domain Recorded one semester of CS559 (Computer Graphics)

Computer Vision in Virtual Videography Understand what is happening on the chalkboard Writing on the board Understand what the professor is doing Location Actions

Chalkboard… Partition the board into regions Regions are semantically related groups of writing Regions can be approximated using computer vision Let’s treat this as a black box … it just happens

Gesture Recognition Understand gestures or actions by a performer Generally used as an input to a computer Understand what the professor is doing Pointing Writing Reaching

Writing can be confused with Pointing and Reaching

Template Matching for G. R. Generate templates of known gestures Match an unknown frame with a template matching algorithm Sum of Squared Difference Cross Correlation Image Difference …

Implement of Gesture Recognition The user selects several template images Pointing Reaching

Format the templates Separate the lecturer Crop the image Resize the images 256x256

Build the Recognition Mask Load each template into the mask For each “on” pixel, increment the mask at that location

Recognizing Gestures Separate the lecturer from foreground Crop and resize For every “on” pixel, increment the “Score” by that value in the mask Compute Confidence as (float) (Score/Mask_Total) Compute Confidence for all gestures

A Gesture Matches if Confidence is: Under 50% but much larger than other gestures Over 50% and not too close to other gestures

Example: Ground State

Example: Pointing

Example: Reaching

Mistakes Overall the results are good Sometimes individual frames are not correct

Solution For each frame, look at surrounding frames Label frame with gesture of the majority

Where to go from here… Use the regions to Validate the gestures Determine what is being pointed at Incorporate the writing information with the gestures Write paper and webpage!

Conclusions We want to use gesture recognition for Virtual Videography Gestures can be used to drive camera model Find gestures by template matching For each frame, take the “average” around a region of frames to correct errors

Thank You! Questions/ Comments?