Smart Meeting Systems Josh Reilly. Why are Smart Meeting Systems worth studying?

Slides:



Advertisements
Similar presentations
ARTIFICIAL PASSENGER.
Advertisements

Multimedia Components (Develop & Delivery System)
Using Audacity Audacity is a free, easy-to- use audio editor and recorder for a variety of operating systems.
Hand Gesture for Taking Self Portrait Shaowei Chu and Jiro Tanaka University of Tsukuba Japan 12th July 15 minutes talk.
Team Pakistan: Ahmad Humayun Ozair Muazzam Tayyab Javed Yahya Cheema.
By : Adham Suwan Mohammed Zaza Ahmed Mafarjeh. Achieving Security through Kinect using Skeleton Analysis (ASKSA)
Using Multiple Synchronized Views Heymo Kou.  What is the two main technologies applied for efficient video browsing? (one for audio, one for visual.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
ADVISE: Advanced Digital Video Information Segmentation Engine
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
1 Discussion Class 10 Informedia. 2 Discussion Classes Format: Question Ask a member of the class to answer. Provide opportunity for others to comment.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Real-Time Systems and the Aware Home Anind K. Dey Ubiquitous Computing Future Computing Environments.
1 CS 430 / INFO 430 Information Retrieval Lecture 24 Usability 2.
FACE RECOGNITION BY: TEAM 1 BILL BAKER NADINE BROWN RICK HENNINGS SHOBHANA MISRA SAURABH PETHE.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
“ Walk to here ” : A Voice Driven Animation System SCA 2006 Zhijin Wang and Michiel van de Panne.
A Brief Survey on Face Recognition Systems Amir Omidvarnia March 2007.
Your Interactive Guide to the Digital World Discovering Computers 2012.
Introduction ‘Have you ever played video games before? Look at the joystick movement. When you move the joystick to the left, the plane on the TV screen.
Introduce about sensor using in Robot NAO Department: FTI-FHO-FPT Presenter: Vu Hoang Dung.
MULTIMEDIA TECHNOLOGIES FOR ENHANCED TELE-COLLABORATION Zhengyou Zhang Principal Researcher Communications and Collaboration Systems.
Guide to MCSE , Second Edition, Enhanced 1 Objectives Understand and use the Control Panel applets Describe the versatility of the Microsoft Management.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
DIVA - University of Fribourg - Switzerland Seminar presentation, jan Lawrence Michel, MSc Student Portable Meeting Recorder.
Lecture #32 WWW Search. Review: Data Organization Kinds of things to organize –Menu items –Text –Images –Sound –Videos –Records (I.e. a person ’ s name,
September 29, 2002Ubicomp 021 NIST Meeting Data Collection Jean Scholtz National Institute of Standards and Technology Gaithersburg, MD USA.
Multimedia Databases (MMDB)
Chapter 5 Input By: Matthew D McCoog What Is Input? Any data or instructions entered into the memory of a computer.
Privacy Protection for Life-log Video Jayashri Chaudhari, Sen-ching S. Cheung, M. Vijay Venkatesh Department of Electrical and Computer Engineering Center.
Ubiquitous Computing Computers everywhere. Where are we going? What happens when the input is your car pulls into the garage, and the output is the heat.
Mi-Gyeong Gwak, Christian Vargas, Jonathan Vinson
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
1 CS 430 / INFO 430 Information Retrieval Lecture 23 Non-Textual Materials 2.
NM – LREC 2008 /1 N. Moreau 1, D. Mostefa 1, R. Stiefelhagen 2, S. Burger 3, K. Choukri 1 1 ELDA, 2 UKA-ISL, 3 CMU s:
CPT 123 Internet Skills Class Notes Audio/Video Communication Session.
COMPUTER PARTS AND COMPONENTS INPUT DEVICES
Input By Hollee Smalley. What is Input? Input is any data or instructions entered into the memory of a computer.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Computer Vision Technologies for Remote Collaboration Using Physical Whiteboards, Projectors and Cameras Zhengyou Zhang Microsoft Research mailto:
Page 1 Alliver™ Page 2 Scenario Users Contents Properties Contexts Tags Users Context Listener Set of contents Service Reasoner GPS Navigator.
Video Production 101 James Falkofske. Sections PlanningScripting Shooting Setups Camera Movements LightingAudio.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
 Designed to monitor the movement of people in given area.  Used video cameras to transmit a signal to a specific place on a limited set of monitors.
Audio Location Accurate Low-Cost Location Sensing James Scott Intel Research Cambridge Boris Dragovic Intern in 2004 at Intel Research Cambridge Studying.
CAMEO: Year 1 Progress and Year 2 Goals Manuela Veloso, Takeo Kanade, Fernando de la Torre, Paul Rybski, Brett Browning, Raju Patil, Carlos Vallespi, Betsy.
Beyond the PC Kiosks & Handhelds Albert Huang Larry Rudolph Oxygen Research Group MIT CSAIL.
Automatic Storytelling in Comics
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
GUI Meets VUI: Some Possible Guidelines James A. Larson VP, Larson Technical Services 4/21/20151© 2015 Larson Technical Services.
Reducing uncertainty in speech recognition Controlling mobile devices through voice activated commands Neil Gow, GWXNEI001 Stephen Breyer-Menke, BRYSTE003.
An MPEG-7 Based Semantic Album for Home Entertainment Presented by Chen-hsiu Huang 2003/08/12 Presented by Chen-hsiu Huang 2003/08/12.
Product: Microsoft Kinect Team I Alex Styborski Brandon Sayre Brandon Rouhier Section 2B.
WBI/WCI - SKM 14 July Analysis and Knowledge Extraction from Video & Audio Rick Parent Jim Davis Raghu Machiraju Deleon Wang Department of Computer.
Elluminate Live! Participant's Guide Ensure your computer meets the minimum system requirements recommended for running an Elluminate Live! session on.
WHAT IN THE WORLD IS IT?! Computer Processing Computer Basics by Bill Cowan Page 9.
MPEG 7 &MPEG 21.
CCTV - Security Cameras
Presented By Bhargav (08BQ1A0435).  Images play an important role in todays information because A single image represents a thousand words.  Google's.
Digital Video Library - Jacky Ma.
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
Simon Tucker and Steve Whittaker University of Sheffield
                      Digital Audio 1.
ConnectPro User Guide for Students
Title of poster... M. Author1, D. Author2, M. Author3
Title of ePoster... M. Author1, D. Author2, M. Author3
Discussion Class 9 Informedia.
Presentation transcript:

Smart Meeting Systems Josh Reilly

Why are Smart Meeting Systems worth studying?

Objectives of a Smart Meeting System Improves the productivity of a team by automating the: –Capture of the meeting –Processing of the meeting for valuable information –Displaying of that information accurately and effectively to the end user through a client application

Organization of Smart Meeting System Processes A smart meeting system can be decomposed into three sets of processes  Meeting Capture  Meeting Recognition  Semantic Processing

Organization of Smart Meeting System Processes

Meeting Capture Gathering raw inputs from the meeting  Video Capture  Audio Capture  Other Context

Video Capture Video feeds from:  Cameras for the attendees Could use a single static camera Could use a single camera with pan, tilt, zoom (PTZ) capabilities Recommend camera view of every contributor's face  Visual Aids Separate camera Digital feed from device

Microsoft Distributed Meetings Project Camera Placement

Microsoft Distributed Meetings Project Video Capture RingCam Array of 90 º Cameras 360 º Panoramic view

Audio Capture Use an array of microphones  Placed on the table  Placed on the ceiling  Worn on the person Levels need to be controlled so that they are similar levels for each contributor

Microsoft Distributed Meetings Project Audio Capture RingCam  Has an array of microphones on its base.

Other Context Capture RFID to track attendees  Attendees swipe their RFID cards when they enter the meeting to add their ID to the list of people attending this meeting Motion Detectors  to track the locations of attendees within the room

Organization of Smart Meeting System Processes

Meeting Recognition The processing of the raw capture before it is organized into something useful Steps:  Person Identification  Attention Detection  Activity Recognition  Hot Spot Recognition  Summarization

Person Identification is associating sections of video, audio, and the visual aids that were captured from the meeting with the attendee(s) that they belong to Face Recognition Face Tracking Speech Recognition SSL Beamforming

Person Identification Face Recognition Facial Recognition Identify the person speaking from a list of attendees Eigenface Approach Challenges Poor Quality Images Poor Room Lighting Continuously changing facial expressions Occlusion

Face Recognition The Eigenface Approach All faces are assumed to be made up of different percentages of different eigenfaces A set of eigenfaces is a set of very generalized pictures of faces that were generated so that each has a basic ingredient that can be used to make a face Eigenfaces from AT&T Laboratories Cambridge

Person Identification Speech Recognition Speech Recognition Match the voice of the person speaking to someone on the list of attendees Using Voice recognition in conjunction with face recognition allows for an accurate identification of the speaker Sound Source Localization (SSL) Used to determine which camera is pointed at the speaker Could be used to point PTZ camera Beamforming

Person Identification Writer Recognition Writer Recognition When someone writes on the whiteboard, they may not be in clear view of the cameras Writing recognition algorithms can be used to identify who wrote what during a meeting

Attention Detection Attempt to determine who is looking at whom during a meeting. Provides information used for activity recognition and hot spot recognition Done using: Hidden Markov Models (HMM) Sound Source Localization (SSL) Known layout of room

Activity Recognition Determine what is happening during the meeting Step 1: Determine what each individual is doing at each point during the meeting Person Identification, Attention Detection, SSL, Gesture Recognition Step 2: Take that information to determine what activity the entire group is engaging in at each point during the meeting

Hot Spot Recognition Find the important parts of the meeting Using sound queues Ex: Changes in pitch Using activity recognition When people are nodding When their focus changes

Summarization Takes all of the information that the smart meeting system has learned about the meeting and creates a quick overview of the events that took place during that meeting. This information will be used in the semantic processing stage

Organization of Smart Meeting System Processes

Semantic Processing Takes the information from the meeting recognition step and makes it usable by the end user. Meeting Annotation Meeting Indexing Meeting Browsing

Meeting Annotation Describe the raw data from the meeting from each viewpoint Attempt to label all meeting segments Implicitly Automatically Explicitly By Hand

Meeting Annotation Implicit Automated Annotation Assumes that the meeting recognition processes performed with relatively high efficiency Tags every person in the video Narrates what was happening during the meeting Has not been achieved

Meeting Annotation Explicit Annotation By Hand When the recognition processes fail to gather sufficient correct information about the raw data Users will have to go through the meeting and tag the people attending as well as indicate what events are happening all through the meeting

Meeting Indexing Indexing is done at all levels of data from a raw audio feed to the annotations The best form of indexing to use is the event-based indexing An index is created every time an event occurs This is the best way for users to find a specific spot in the meeting when performing a query

Meeting Browsing The interface that the end user uses to retrieve information from the meetings Functions: Can browse/search a list of all meetings for a specific meeting Can browse/search the contents of the chosen meeting Aided by tools like bookmarks, a meeting outline, and queries (content, people, camera angles, visual aids, etc...)

Meeting Browsing Microsoft Distributed Meetings

Remote Attendee Use the smart meeting system as the attendee's eyes and ears Microsoft's PING project Uses a monitor and speaker to display the remote attendee's voice and audio during the meeting However, the remote attendee is often ignored

Carnegie Mellon University’s Meeting System Architecture Lacks Activity Recognition Hot Spot Recognition Annotations

University of California, San Diego AVIARY System Architecture 2 PCs 4 Static Cameras 4 PTZ Cameras No SSL

Ricoh Portable Meeting Recorder

Ricoh Portable Meeting Recorder Doughnut Camera

Ricoh Portable Meeting Recorder Meeting Browser

Technology Limitations Speech recognition and facial recognition algorithms are not yet as efficient as they should be in order for a smart meeting system to perform accurately

Workspace Limitations Cameras and microphones can block view, distract, or intimidate attendees during the meeting Security and Privacy needs to be addressed

References [1] Zhiwen Yu and Yuichi Nakamura Smart meeting systems: A survey of state-of-the-art and open issues. ACM Comput. Surv. 42, 2, Article 8 (March 2010), 20 pages. DOI= / [2]Ross Cutler, Yong Rui, Anoop Gupta, Jj Cadiz, Ivan Tashev, Li-wei He, Alex Colburn, Zhengyou Zhang, Zicheng Liu, Steve Silverberg. (2002). Distributed Meetings. A Meeting Capture and Broadcasting System. 10 pages. [3]Harold Fox The eFacilitator: A Meeting Capture Application and Infrastructure. 89 pages. [4]Yong Rui, Eric Rudolph, Li-wei He, Rico Malvar, Michael Cohen, Ivan Tashev Ping: A Group-To- Individual Distributed meeting System. 4 pages. [5]Dar-Shyang Lee, Berna Erol, Jamey Graham, Jonathan Hull, Norihiko Murata Portable Meeting Recorder. 10 pages.