M4 – Video Processing, Brno University of Technology1 M4 – Video Processing Igor Potůček, Michal Španěl, Ibrahim Abu Kteish, Olivier Lai Kan Thon, Pavel.

Slides:



Advertisements
Similar presentations
ARTIFICIAL PASSENGER.
Advertisements

Jan SedmidubskyOctober 28, 2011MUFIN: Large-scale Similarity Search Leader: prof. Pavel Zezula Members: Dr. Michal Batko Dr. Vlastislav.
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
Descriptive schemes for facial expression introduction.
Model-based Image Interpretation with Application to Facial Expression Recognition Matthias Wimmer
Hand Gesture for Taking Self Portrait Shaowei Chu and Jiro Tanaka University of Tsukuba Japan 12th July 15 minutes talk.
The Steerable Projector and Camera Unit in an Instrumented Environment Lübomira Spassova Saarland University, Saarbrücken, Germany.
Weiwei Zhang, Jian Sun, and Xiaoou Tang, Fellow, IEEE.
Face Alignment with Part-Based Modeling
Robust 3D Head Pose Classification using Wavelets by Mukesh C. Motwani Dr. Frederick C. Harris, Jr., Thesis Advisor December 5 th, 2002 A thesis submitted.
3D Face Modeling Michaël De Smet.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
GMM-Based Multimodal Biometric Verification Yannis Stylianou Yannis Pantazis Felipe Calderero Pedro Larroy François Severin Sascha Schimke Rolando Bonal.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Potential Projects RGBD gesture recognition with the Microsoft Kinect Person recognition by parts.
Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.
LYU0603 A Generic Real-Time Facial Expression Modelling System Supervisor: Prof. Michael R. Lyu Group Member: Cheung Ka Shun ( ) Wong Chi Kin ( )
Recent Developments in Human Motion Analysis
Generic Object Recognition -- by Yatharth Saraf A Project on.
A Study of Approaches for Object Recognition
MUltimo3-D: a Testbed for Multimodel 3-D PC Presenter: Yi Shi & Saul Rodriguez March 14, 2008.
Face Detection: a Survey Speaker: Mine-Quan Jing National Chiao Tung University.
Recognition of Human Gait From Video Rong Zhang, C. Vogler, and D. Metaxas Computational Biomedicine Imaging and Modeling Center Rutgers University.
Language and Speaker Identification using Gaussian Mixture Model Prepare by Jacky Chau The Chinese University of Hong Kong 18th September, 2002.
EECE 279: Real-Time Systems Design Vanderbilt University Ames Brown & Jason Cherry MATCH! Real-Time Facial Recognition.
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
WP -6: Human Tracking and Modelling Year–I Objectives: Simple upper-body models and articulated tracks from test videos. Year-I Achievements: Tracking.
Capturing the Motion of Ski Jumpers using Multiple Stationary Cameras Atle Nes Faculty of Informatics and e-Learning Trondheim University.
A Vision-Based System that Detects the Act of Smoking a Cigarette Xiaoran Zheng, University of Nevada-Reno, Dept. of Computer Science Dr. Mubarak Shah,
TEAM-1 JACKIE ABBAZIO SASHA PEREZ DENISE SILVA ROBERT TESORIERO Face Recognition Systems.
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science Technische Universität München Adaptive.
Database Construction for Speech to Lip-readable Animation Conversion Gyorgy Takacs, Attila Tihanyi, Tamas Bardi, Gergo Feldhoffer, Balint Srancsik Peter.
Facial Feature Detection
West Virginia University
The Camera Mouse: Visual Tracking of Body Features to Provide Computer Access for People With Severe Disabilities.
Irfan Essa, Alex Pentland Facial Expression Recognition using a Dynamic Model and Motion Energy (a review by Paul Fitzpatrick for 6.892)
Multimodal Interaction Dr. Mike Spann
Face Model Fitting with Generic, Group-specific, and Person- specific Objective Functions Chair for Image Understanding and Knowledge-based Systems Institute.
September M4 meeting Delft1 VUT Brno meeting data collection for M4 Petr Jenderka, Stanislav Sumec, Igor Potůček, Petr Motlíček, Jan Černocký,
Trends in Computer Vision Automatic Video Surveillance.
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
OCR Cambridge Nationals
Multimodal Information Analysis for Emotion Recognition
1 Ying-li Tian, Member, IEEE, Takeo Kanade, Fellow, IEEE, and Jeffrey F. Cohn, Member, IEEE Presenter: I-Chung Hung Advisor: Dr. Yen-Ting Chen Date:
A Seminar Report On Face Recognition Technology A Seminar Report On Face Recognition Technology 123seminarsonly.com.
A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis J. Wu, J. M. Pedersen, D. Putthividhya, D. Norgaard,
卓越發展延續計畫分項三 User-Centric Interactive Media ~ 主 持 人 : 傅立成 共同主持人 : 李琳山,歐陽明,洪一平, 陳祝嵩 水美溫泉會館研討會
Computer Vision Michael Isard and Dimitris Metaxas.
Natural Tasking of Robots Based on Human Interaction Cues Brian Scassellati, Bryan Adams, Aaron Edsinger, Matthew Marjanovic MIT Artificial Intelligence.
CAMEO: Year 1 Progress and Year 2 Goals Manuela Veloso, Takeo Kanade, Fernando de la Torre, Paul Rybski, Brett Browning, Raju Patil, Carlos Vallespi, Betsy.
Biometrics.
Face Detection Using Skin Color and Gabor Wavelet Representation Information and Communication Theory Group Faculty of Information Technology and System.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Automatic Video Editing Stanislav Sumec. Motivation  Multiple source video data – several cameras in the meeting room, several meeting rooms in teleconference,
A Prototype System for 3D Dynamic Face Data Collection by Synchronized Cameras Yuxiao Hu Hao Tang.
WBI/WCI - SKM 14 July Analysis and Knowledge Extraction from Video & Audio Rick Parent Jim Davis Raghu Machiraju Deleon Wang Department of Computer.
Learning Goals, Scales, and Learning Activities Clarity and Purpose.
Face Recognition Technology By Catherine jenni christy.M.sc.
Jeopardy Shots Composition EditingDeveloping Director Q $100 Q $200 Q $300 Q $400 Q $500 Q $100 Q $200 Q $300 Q $400 Q $500 Final Jeopardy.
Face Detection 蔡宇軒.
CONTENTS:  Introduction.  Face recognition task.  Image preprocessing.  Template Extraction and Normalization.  Template Correlation with image database.
CIRP Annals - Manufacturing Technology 60 (2011) 1–4 Augmented assembly technologies based on 3D bare-hand interaction S.K. Ong (2)*, Z.B. Wang Mechanical.
Faculty of Information Technology, Brno University of Technology, CZ
San Diego May 22, 2013 Giovanni Saponaro Giampiero Salvi
High Performance Architecture for Object Detection in Streamed Video
How we film and what we call it
Automated Detection of Human Emotion
Computer Vision Readings
Presentation transcript:

M4 – Video Processing, Brno University of Technology1 M4 – Video Processing Igor Potůček, Michal Španěl, Ibrahim Abu Kteish, Olivier Lai Kan Thon, Pavel Zemčík Faculty of Information Technolgoy, Brno University of Technology, Czech Republic M4 Meeting, September 2003, Delft, The Netherlands

M4 – Video Processing, Brno University of Technology2 Outline: Video processing block diagram Mouth parametrization Face detection and feature extraction Demo Other video work in Brno Conclusions

Gesture interpretation Face (rough) positioning, recognition M4 – Video Processing, Brno University of Technology3 Video Processing Block Diagram Finished In progress Planned Raw (hyperbolic mirror image) „Corrected“ geometry Pre-identified body parts Image acquisition Colour-based methods Identification of face, eyes, mouth (hands) Gabor wavelet networks Detected mouth in large video (reference) Geometry/graphics Mouth parametrization Mouth area (motion and shape) in low res. Statistical methods Features for audio Pattern matching Annotation/control functions Annotation functions

M4 – Video Processing, Brno University of Technology4 Mouth Parametrization Mouth parametrization may be useful for speech recognition anhancement Low-resolution videos do not allow „proper“ lips tracking (in the case of meting rooms the heads are always small) Statistical mouth area parametrization needed; however, we also need a reference method with „visibly correct“ mouth parametrization to be able to compare the results 2 methods - „full size“ lips tracking and mouth parametrization and „statistical“ approach for small-size face images while the „full size“ version serves just as a reference (currently no intention for further research in it)

M4 – Video Processing, Brno University of Technology5 Mouth Parametrization Three main parameters: width w, height h, and curvature c A hope exists that these parameters are enough as they can be quite well extracted from the low resolution video If the parametrization model is not sufficient, it can be further extended (probably through subcontracting)

M4 – Video Processing, Brno University of Technology6 Face Detection and Feature Extraction The process of face detection if based on pre-processing of the colours in the video It is necessary to positively identify the faces in the video Gabor wavelet networks seem to be suitable instrument for that purpose (fixed-size networks with 56 or 112 wavelets, see paper of M. Španěl) Training for general face detection and positioning (spatial orientation, 5 pan, 5 tilt positions, 15° step) Can be re-applied for face features, such as eyes or mouth to fine-tune the results and specifically to adjust the scale

M4 – Video Processing, Brno University of Technology7 Face Detection and Feature Extraction The Gabor wavelet networks can olso be used for (limited capability) face recognition Well suitable for occlusion problems solution – does not loose track of people Can be used to distinguish „registered“ people and „newcomers“ Requires totally different training – not „zero price“ but still cheap

M4 – Video Processing, Brno University of Technology8 Demo Face tracking with feature detection and identification

M4 – Video Processing, Brno University of Technology9 Demo Face tracking with feature detection and identification

M4 – Video Processing, Brno University of Technology10 Demo Face tracking with feature detection and identification

M4 – Video Processing, Brno University of Technology11 Demo Face tracking with feature detection and identification

M4 – Video Processing, Brno University of Technology12 Demo Face tracking with feature detection and identification

M4 – Video Processing, Brno University of Technology13 Demo Mouth parametrization

M4 – Video Processing, Brno University of Technology14 Other Video Work in Brno Automatic video editing – PROLOG-based, prolog clauses generated from scenarios (general intentions and rules) and processed audiovisual sequences – generates „cutting“ instructions that are used by an off-line video editing software Video annotation tool – shown before, recently extended to multi-channel capabilitiy and extended XML usage The more detailed description of the tools/approaches can be seen at:

M4 – Video Processing, Brno University of Technology15 Conclusions Audio/video multimodal processing not yet very successfull Video features detection fairly successfull Face detection very reliable In the future work (possibly in AMI), we would like to perform some fo the tasks in real-time and in embedded video camera systems Thanks for the attention