WBI/WCI - SKM, 14 July 2003. Analysis and Knowledge Extraction from Video & Audio. Rick Parent, Jim Davis, Raghu Machiraju, Deleon Wang. Department of Computer.

Presentation transcript:

Analysis and Knowledge Extraction from Video & Audio
Rick Parent, Jim Davis, Raghu Machiraju, Deleon Wang
Department of Computer and Information Science, Ohio State University

Overview
Problem, Solution, Motivation, Applications
- Human operators & large data sets
- Extract important events; focus on human behavior; use multimodal approach
- Security (real-time processing); annotating recorded video; processing archival material
- Streaming data from video & audio

Objectives
- Detect and track people to extract audio-visual events
- Present graphical summaries to the human operator via a secure web-based interface
- 3-level system:
  - Person/action detection
  - Sequential long-term tracking
  - Multi-modal identification
- Incrementally construct an event model to focus attention and resources on tracking and recognizing people across sequences
- Build prototype system
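One way to picture the incrementally built event model is as a growing log of events tagged with the system level that produced them. The sketch below is only an illustration under that assumption. The names `Event`, `EventModel`, `person_id`, and the level strings are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    time: float      # seconds from the start of the stream
    level: str       # "detection", "tracking", or "identification" (assumed labels)
    person_id: int   # hypothetical identifier for a tracked person
    label: str       # short description of what was observed


@dataclass
class EventModel:
    events: list = field(default_factory=list)

    def add(self, event):
        """Record one new event; the model grows as the stream is processed."""
        self.events.append(event)

    def summary(self):
        """Group events by person, ordered by time, for display to an operator."""
        by_person = {}
        for e in sorted(self.events, key=lambda e: e.time):
            by_person.setdefault(e.person_id, []).append((e.time, e.level, e.label))
        return by_person


model = EventModel()
model.add(Event(2.0, "identification", 1, "speaker A"))
model.add(Event(0.5, "detection", 1, "person"))
model.add(Event(1.0, "tracking", 2, "walking"))
print(model.summary())
```

A per-person summary like this could feed the graphical summaries mentioned in the objectives, though the slides do not say how those summaries are built.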

Person Detection and Activity Recognition (Jim Davis)
- Thermal-based image analysis and person detection
- Framework for recognizing basic human activities
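The slide does not describe the detection method. As a rough illustration of thermal-based person detection, the sketch below assumes people appear warmer than the background, thresholds the image, and groups warm pixels into connected regions. The function name `detect_warm_regions`, the threshold, and the `min_pixels` filter are all assumptions made for this example, not the project's algorithm.

```python
from collections import deque


def detect_warm_regions(image, threshold, min_pixels=4):
    """Return bounding boxes (top, left, bottom, right) of warm connected regions.

    `image` is a list of rows of temperature-like values. Pixels at or above
    `threshold` count as warm; regions smaller than `min_pixels` are dropped.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or image[r][c] < threshold:
                continue
            # Breadth-first flood fill over 4-connected warm neighbours.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right, count = r, c, r, c, 0
            while queue:
                y, x = queue.popleft()
                count += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx] and image[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if count >= min_pixels:
                boxes.append((top, left, bottom, right))
    return boxes


# Toy 6x6 "thermal image": background 20, one warm 3x2 blob, one hot stray pixel.
image = [[20] * 6 for _ in range(6)]
for y in range(1, 4):
    for x in range(2, 4):
        image[y][x] = 35
image[5][5] = 36
print(detect_warm_regions(image, threshold=30))
```

The size filter discards isolated hot pixels. A real system would likely need background modeling as well, since a fixed threshold fails when the scene is as warm as the people in it.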

Sequential-frame tracking (Raghu Machiraju, Rick Parent)
- Monitor across sequences
- Characterize motions
- Capture appearance
- Track human figure poses

Robust Speaker Recognition (Deleon Wang)
- Usable speech extraction from multiple-speaker audio
- By tracking pitch and extracting voiced segments
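The slide names pitch tracking and voiced-segment extraction but not the algorithm. The sketch below uses plain normalized autocorrelation, a common textbook stand-in, on a single-speaker signal. It does not attempt the multi-speaker separation the project targets. The function names, frame length, pitch range, and voicing threshold are assumptions chosen for illustration.

```python
import math


def frame_pitch(frame, sample_rate, fmin=80.0, fmax=400.0, voicing_threshold=0.5):
    """Return an estimated pitch in Hz for one frame, or None if it looks unvoiced."""
    n = len(frame)
    mean = sum(frame) / n
    x = [s - mean for s in frame]
    energy = sum(s * s for s in x)
    if energy == 0.0:
        return None  # silence carries no pitch
    lag_min = int(sample_rate / fmax)
    lag_max = min(int(sample_rate / fmin), n - 1)
    best_lag, best_corr = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Autocorrelation at this lag, normalized by the frame energy.
        corr = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        if corr > best_corr:
            best_lag, best_corr = lag, corr
    if best_lag is None or best_corr < voicing_threshold:
        return None
    return sample_rate / best_lag


def voiced_segments(signal, sample_rate, frame_len=400):
    """Return (start, end) sample ranges covering runs of consecutive voiced frames."""
    segments, start = [], None
    for begin in range(0, len(signal) - frame_len + 1, frame_len):
        voiced = frame_pitch(signal[begin:begin + frame_len], sample_rate) is not None
        if voiced and start is None:
            start = begin
        elif not voiced and start is not None:
            segments.append((start, begin))
            start = None
    if start is not None:
        segments.append((start, begin + frame_len))
    return segments


# Toy signal at 8 kHz: silence, then a 200 Hz tone, then silence again.
rate = 8000
tone = [math.sin(2 * math.pi * 200 * i / rate) for i in range(800)]
signal = [0.0] * 400 + tone + [0.0] * 400
print(voiced_segments(signal, rate))
```

Frames whose strongest autocorrelation peak falls below the threshold are treated as unvoiced, so only strongly periodic stretches are kept as candidate usable speech.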

Deliverables
- Demonstration subsystems:
  - Person detection
  - Long-term tracking
  - Speech recognition
- 6 months: review of basic work
- 12 months: demo of capabilities, summary report

Expenditures
- 6 student-quarters of support over 12 months:
  - 2 quarters: Person detection (Davis)
  - 3 quarters: Tracking (Machiraju & Parent)
  - 1 quarter: Speech (Wang)