Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information.

Slides:



Advertisements
Similar presentations
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Advertisements

Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.
An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.
Extracting Minimalistic Corridor Geometry from Low-Resolution Images Yinxiao Li, Vidya, N. Murali, and Stanley T. Birchfield Department of Electrical and.
Facial feature localization Presented by: Harvest Jang Spring 2002.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
September 10, 2013Computer Vision Lecture 3: Binary Image Processing 1Thresholding Here, the right image is created from the left image by thresholding,
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
1 Hierarchical Image-Motion Segmentation using Swendsen-Wang Cuts Adrian Barbu Siemens Corporate Research Princeton, NJ Acknowledgements: S.C. Zhu, Y.N.
Høgskolen i Gjøvik Saleh Alaliyat Video - based Fall Detection in Elderly's Houses.
Medical Imaging Mohammad Dawood Department of Computer Science University of Münster Germany.
Lecture 6 Image Segmentation
Medical Imaging Mohammad Dawood Department of Computer Science University of Münster Germany.
J. Mike McHugh,Janusz Konrad, Venkatesh Saligrama and Pierre-Marc Jodoin Signal Processing Letters, IEEE Professor: Jar-Ferr Yang Presenter: Ming-Hua Tang.
Golnaz Abdollahian, Cuneyt M. Taskiran, Zygmunt Pizlo, and Edward J. Delp C AMERA M OTION -B ASED A NALYSIS OF U SER G ENERATED V IDEO IEEE TRANSACTIONS.
Research on high-definition video vehicles location and tracking Xiong Changzhen, LiLin IEEE, Distributed Computing and Applications to Business Engineering.
Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Segmentation Divide the image into segments. Each segment:
Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
MULTIPLE MOVING OBJECTS TRACKING FOR VIDEO SURVEILLANCE SYSTEMS.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Implementing Codesign in Xilinx Virtex II Pro Betim Çiço, Hergys Rexha Department of Informatics Engineering Faculty of Information Technologies Polytechnic.
Medical Imaging Dr. Mohammad Dawood Department of Computer Science University of Münster Germany.
Digital Image Processing CCS331 Relationships of Pixel 1.
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
Digital Image Processing Lecture 18: Segmentation: Thresholding & Region-Based Prof. Charlene Tsai.
Chapter 10 Image Segmentation.
Chapter 10, Part II Edge Linking and Boundary Detection The methods discussed in the previous section yield pixels lying only on edges. This section.
G52IVG, School of Computer Science, University of Nottingham 1 Edge Detection and Image Segmentation.
National Taiwan A Road Sign Recognition System Based on a Dynamic Visual Model C. Y. Fang Department of Information and.
Monitoring Human Behavior in an Office Environment Douglas Ayers and Mubarak Shah * Research Conducted at the University of Central Florida *Now at Harris.
Image Segmentation and Edge Detection Digital Image Processing Instructor: Dr. Cheng-Chien LiuCheng-Chien Liu Department of Earth Sciences National Cheng.
A Face processing system Based on Committee Machine: The Approach and Experimental Results Presented by: Harvest Jang 29 Jan 2003.
Bo QIN, Zongshun MA, Zhenghua FANG, Shengke WANG Computer-Aided Design and Computer Graphics, th IEEE International Conference on, p Presenter.
Vehicle Segmentation and Tracking From a Low-Angle Off-Axis Camera Neeraj K. Kanhere Committee members Dr. Stanley Birchfield Dr. Robert Schalkoff Dr.
CS654: Digital Image Analysis
Stable Multi-Target Tracking in Real-Time Surveillance Video
The Implementation of Markerless Image-based 3D Features Tracking System Lu Zhang Feb. 15, 2005.
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.
Image Segmentation by Histogram Thresholding Venugopal Rajagopal CIS 581 Instructor: Longin Jan Latecki.
Real-Time Tracking with Mean Shift Presented by: Qiuhua Liu May 6, 2005.
Video Tracking G. Medioni, Q. Yu Edwin Lei Maria Pavlovskaia.
Machine Vision ENT 273 Regions and Segmentation in Images Hema C.R. Lecture 4.
Wonjun Kim and Changick Kim, Member, IEEE
SUREILLANCE IN THE DEPARTMENT THROUGH IMAGE PROCESSING F.Y.P. PRESENTATION BY AHMAD IJAZ & UFUK INCE SUPERVISOR: ASSOC. PROF. ERHAN INCE.
Learning and Removing Cast Shadows through a Multidistribution Approach Nicolas Martel-Brisson, Andre Zaccarin IEEE TRANSACTIONS ON PATTERN ANALYSIS AND.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Detection, Tracking and Recognition in Video Sequences Supervised By: Dr. Ofer Hadar Mr. Uri Perets Project By: Sonia KanOra Gendler Ben-Gurion University.
National Taiwan Normal A System to Detect Complex Motion of Nearby Vehicles on Freeways C. Y. Fang Department of Information.
Machine Vision ENT 273 Lecture 4 Hema C.R.
Automatic Video Shot Detection from MPEG Bit Stream
IMAGE SEGMENTATION USING THRESHOLDING
Image Segmentation.
DIGITAL SIGNAL PROCESSING
Histogram—Representation of Color Feature in Image Processing Yang, Li
Presented by: Cindy Yan EE6358 Computer Vision
DICOM 11/21/2018.
Presented by :- Vishal Vijayshankar Mishra
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.
Danger Prediction by Case-Based Approach on Expressways
Computer and Robot Vision I
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
Presentation transcript:

Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information Engineering National Taiwan Normal University 1

Outline Introduction System Flow Chart ◦ Preprocessing Stage ◦ Improved Region Growing Method ◦ Object Segmentation ◦ Gesture Analysis Experimental Results Conclusions and Future Work 2

Introduction Smart Classroom ◦ Smart Classroom usually refers to a room with an instructor station, including a computer and audiovisual equipment. ◦ These equipments allow the instructor to teach using a wide variety of media. Classroom 2.0 ◦ A new concept of a smart classroom ◦ It tries to create a future classroom environment with modern technologies (ex., video processing and speech recognition) embedded during the process of teaching and learning. 3

Introduction Four systems are developed in Classroom 2.0: ◦ The Intelligent Roll-Call System  To recognize students who are present in the classroom ◦ The Intelligent Teaching Response System  To control the technologies used in the classroom, including the control of the software systems ◦ The Intelligent Content Retrieval System  To give feedback on the quality of an answer to a given question ◦ The Intelligent Classroom Exception Recognition System  To recognize and track the behaviors of students in a classroom and collect data for educational research 4

Introduction The student gesture recognition system ◦ A subsystem of the Intelligent Classroom Exception Recognition system ◦ Goal: To recognize the student gestures in a classroom ◦ The student gestures:  raising of hands to ask/answer questions  dozing off during the lecture  taking naps  standing up  sitting down 5

Introduction Assumptions: ◦ Preserve some flexibility of the focus and the position of the camera ◦ Handle various lightings ◦ Do not construct the background image 6 Reflected light Fluorescent lights Windows

System Flow Chart 7 Video sequence input Main line localization Motion detection Foreground pixel extraction Foreground pixel identification Improved region growing method Object segmentation Gesture analysis

Main Line Localization 8 Edge detection Output main line localization Main line identification Video sequence input Horizontal edge extraction Horizontal edge projection Main line candidate extraction

Main Line Localization 9 Edge detection Output main line localization Main line identification Video sequence input Horizontal edge extraction Horizontal edge projection Main line candidate extraction

Motion Detection 10 I t-1 (p), I t (p): the intensity values of a pixel p at time t-1 and t, respectively. M(p): the motion magnitude of pixel p.

Foreground Pixel Extraction 11 40% 5%. Accumulate the pixel numbers from the input frame to form a histogram of the Hue and Cr values. Assume background occupies larger regions than foreground. The system normalizes and sorts these pixel numbers on the histogram. The top 40% pixels are classified as the background pixels, and the bottom 5% pixels are classified as the candidates of the foreground pixels. Background pixels Foreground pixels

Foreground Pixel Identification 12 α, β, and γ: constants. F t (p): the foreground pixel probability of pixel p at time t M(p): the normalized motion magnitude of pixel p C(p): the normalized location value of pixel p in the sorted Hue-Cr histogram F t-1 (p): the foreground pixel probability of pixel p at time t-1 Foreground pixels Background pixels

System Flow Chart 13 Sequential image input Main line localization Motion detection Foreground pixel extraction Foreground pixel identification Improved region growing method Object segmentation Gesture analysis

Improved Region Growing Method Main function ◦ select the foreground pixels between the maximum and minimum main lines ◦ set the selected pixels as the seeds to grow the foreground regions (call RegionGrowing function) Function Main() { the set of foreground pixels, and let r = 0, If (the y _axis location of x is between the maximum and minimum main lines){ If ( x does not belong to any labelled region){ Label the region number of x as r ; RegionGrowing( x ); r = r +1; } } } 14

Improved Region Growing The RegionGrowing function ◦ Select a pixel from the foreground regions  All neighbouring pixels whose properties are similar to the selected pixel are classified into the same region.  If the neighbouring pixels of the selected pixel belong to the background regions, then the selected pixel is set as the boundary pixel of the foreground regions to stop this iteration. Until all pixels in the foreground regions have been processed 15

Object Segmentation Weight initialization : the region areas of node n and n’, respectively : the distance between the means of node n and n’ ◦ The weight of the link will be large if the areas of node n and n’ are large and the distance between these two nodes are short. ◦ The weight indicates the possibility of two nodes belonging to the same object. 16

Object Segmentation 17 Weight adaption ◦ Increase weight ◦ if the link connects two nodes which have the same neighbours ◦ the possibility that these two nodes belong to the same object is high ◦ Decrease weight ◦ if the link connects two nodes which contain no common neighbours ◦ the possibility that these two nodes belong to the same object is low

Object Segmentation Select objects which contain motion pixels 18

Gesture Analysis The system divides the student gestures into six classes. ◦ These six classes can be regarded as states and be constructed to form a finite state machine.  Assume that the initial state of the finite automaton is the normal gesture.  Define 14 rules (corresponding to transitions a to l ) to translate the states.  Consider the following attributes:  The area, the center position, and the motion of the object  The areas, the center positions, and the colors of the regions belonging to the object 19 Lie prone

Gesture Analysis 20 Raise two hands Raise the left hand Normal Raise the right hand Stand up Lie prone a b c d e f g h i j k l m n Rule a : If the center positions of some left regions of object i move upwards, and the area and center position of object i increase, then the student i may be raising his/her left hand.

Experimental Results 21 Input images Region growing Object segmentation An example: a student raises her right hand

Experimental Results 22 Raises her left hand Raises two hands

Experimental Results 23 Lies prone Stands up

Experimental Results Total numbers of gestures Accuracy rate Correct recognition numbers False positive Raise the left hand 3171%228 Raise the right hand 3373%2410 Raise two hands 3272%235 Lie prone 3278%252 Stand up 3168%2118 Total 15972%

Experimental Results 25 Normal Raise the right hand Raise the left hand Stand up Raise two hands Lie prone

Conclusions and Future Work Conclusions We propose a students’ gesture analysis system in the lecture theatre. - Main line location extraction - Foreground pixel identification - Region growing - Object segmentation - Gesture analysis Future work - Combine prior knowledge - Improve the rules of the finite automaton - Use multiple PTZ cameras 26

Thank you for your attention! 27