Twenty-First Century Automatic Speech Recognition: Meeting Rooms and Beyond ASR 2000 September 20, 2000 John Garofolo

Slides:



Advertisements
Similar presentations
INDIANAUNIVERSITYINDIANAUNIVERSITY GENI Global Environment for Network Innovation James Williams Director – International Networking Director – Operational.
Advertisements

Developing a Mobile-Optimized Web Instrument for the Consumer Expenditure Diary Survey Nhien To Brandon Kopp Jean Fox Erica Yu Federal CASIC Workshops.
Nymity (and other made up words) Dan Cutting July 2003.
Probabilistic Adaptive Real-Time Learning And Natural Conversational Engine Seventh Framework Programme FP7-ICT
Irek Defée Signal Processing for Multimodal Web Irek Defée Department of Signal Processing Tampere University of Technology W3C Web Technology Day.
Integrating Educational Technology into the Curriculum
Ying Wang EDN 303 Fall Objectives Define curriculum-specific learning Explain the difference between computer, information, and integration literacy.
Maximizing Data and Data Services Monday, October 14, 2013 Location: Denver CO© 2013 Child Care Aware ® of America.
Web Conferencing at GSE Harvard IT Summit June 23, 2011.
Input to the Computer * Input * Keyboard * Pointing Devices
Meeting Recorder Adam Janin
Class 6 LBSC 690 Information Technology Human Computer Interaction and Usability.
Developing a Mobile Strategy Alex Richards & Rachel Wetherall 1 st November 2011.
Handhelds and Collaborative Command and Control Brad Myers Human Computer Interaction Institute Carnegie Mellon University February 23, 2001.
ASR Evaluation Julia Hirschberg CS Outline Intrinsic Methods –Transcription Accuracy Word Error Rate Automatic methods, toolkits Limitations –Concept.
Desktop, Mobile & Web Based GIS/ Collaborative GIS Lecture 4.
Dr. Peter Parnes Associate Professor Luleå University of Technology October 18, 2005 teknik medie.
Your Interactive Guide to the Digital World Discovering Computers 2012.
Usable Security – CS 6204 – Fall, 2009 – Dennis Kafura – Virginia Tech Medical Applications Tejinder Judge Usable Security – CS 6204 – Fall, 2009 – Dennis.
CHAPTER 2 Input & Output Prepared by: Mrs.sara salih 1.
 * Transparencies! 35mm slide projector, a VCR?  * Conferences calls from phones & desk style  units (star shaped speaker phones)  * 2 way Video.
Mithra Busler May 11 th, 2010 NPTNJ 1 Interactive Whiteboards in the classroom.
(CONTROLLER-FREE GAMING
® Automatic Scoring of Children's Read-Aloud Text Passages and Word Lists Klaus Zechner, John Sabatini and Lei Chen Educational Testing Service.
PERVASIVE COMPUTING AT NIST Martin Herman, Chief INFORMATION ACCESS & USER INTERFACES DIVISION INFORMATION TECHNOLOGY LABORATORY NATIONAL INSTITUTE OF.
Michael Margel Dec CSC 2524 SURFBRD. What is SURFBRD? SURFace-Based Remote Desktop Pronounced “Surfboard” A desktop environment that allows users.
September 29, 2002Ubicomp 021 NIST Meeting Data Collection Jean Scholtz National Institute of Standards and Technology Gaithersburg, MD USA.
Integrating Educational Technology into the Curriculum
New Meeting IDIAP Daniel Gatica-Perez, Iain McCowan, Samy Bengio Corpus Administration – Joanne Schulz Technical Assistance – Thierry Collado,
Center for Human Computer Communication Department of Computer Science, OG I 1 Designing Robust Multimodal Systems for Diverse Users and Mobile Environments.
Privacy Protection for Life-log Video Jayashri Chaudhari, Sen-ching S. Cheung, M. Vijay Venkatesh Department of Electrical and Computer Engineering Center.
ST01 - Introduction 1 Introduction Lecturer: Smilen Dimitrov Sensors Technology – MED4.
Collaborate Moderator Training Louis Algaze
ATLAS Demystified: A Practical Introduction Christophe Laprun, Jonathan Fiscus, John Garofolo, Sylvain Pajot National Institute of Standards and Technology.
A Flexible and Extensible Architecture for Linguistic Annotation Steven Bird *, David Day †, John Garofolo ‡, John Henderson †, Christophe Laprun ‡ and.
Deans/VPs Meeting January 2009 UB’s Strategic Plan for IT Elias G. Eldayrie CIO.
COMPUTER PARTS AND COMPONENTS INPUT DEVICES
Input By Hollee Smalley. What is Input? Input is any data or instructions entered into the memory of a computer.
Usability in Pervasive Computing Environment Advance Usability October 18, 2004 Anuj A. Nanavati.
Collaborative Annotation of the AMI Meeting Corpus Jean Carletta University of Edinburgh.
Specialized Input and Output. Inputting Sound ● The microphone is the most basic device for inputting sounds into a computer ● Microphones capture sounds.
What is a computer? Computer is a device for processing information.
Traffic Models Discussion September 2003 IEEE C /86.
Designing Speech Interfaces for Kiosks Max Van Kleek Buddhika Kottahachchi Tyler Horton Paul Cavallaro.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
A centre of expertise in digital information managementwww.ukoln.ac.uk Beyond - Strategies For Collaborative Working In The 21st Century: Welcome.
1 Dialogue, Speech and Images: The Companions Project Data Set Yorick Wilks, David Benyon, Christopher Brewster, Pavel Ircing, and Oli Mival
Online Software 8-July-98 Commissioning Working Group DØ Workshop S. Fuess Objective: Define for you, the customers of the Online system, the products.
L C SL C S The Intelligent Room’s MeetingManager: A Look Forward Alice Oh Stephen Peters Oxygen Workshop, January, 2002.
Choosing interaction devices: hardware components
Welcome to CPS 210 Graduate Level Operating Systems –readings, discussions, and programming projects Systems Quals course –midterm and final exams Gateway.
Pervasive Computing offers Adaptable Interfaces Signals, Standards, Metadata, and ICADI June 26, 2003.
MCS  FUTURESLABARGONNE  CHICAGO Rick Stevens, Terry Disz, Lisa Childers, Bob Olson Argonne National Laboratory
Main Computer Components
Video Room Set Up 4 Major Types of Video Conferencing Solutions.
Societal-Scale Computing: The eXtremes Scalable, Available Internet Services Information Appliances Client Server Clusters Massive Cluster Gigabit Ethernet.
Dan Bohus Researcher Microsoft Research in collaboration with: Eric Horvitz, ASI Zicheng Liu, CCS Cha Zhang, CCS George Chrysanthakopoulos, Robotics Tim.
9/30/2001Craig Ganoe Methods Supporting Usability Evaluation of the Collaborative Meeting Place Craig Ganoe Project Description LiNC (Learning.
Microsoft Research Faculty Summit Dan Bohus, Eric Horvitz Microsoft Research.
Information Networks. Internet It is a global system of interconnected computer networks that link several billion devices worldwide. It is an international.
© 2011 DigitalDay | MOBILE WEB INFORMATION ARCHITECTURE Best Practices Workshop 1.
SMART SPACES TESTBED Marty Herman, Chief
Technology Support Strategy for UNI Learning Spaces
SURFBRD Michael Margel Dec CSC 2524.
Chapter 6. Data Collection in a Wizard-of-Oz Experiment in Reinforcement Learning for Adaptive Dialogue Systems by: Rieser & Lemon. Course: Autonomous.
Collaboration with Google Drive
Measure E Technology Update
Advanced NLP: Speech Research and Technologies
User Training.
Presentation transcript:

Twenty-First Century Automatic Speech Recognition: Meeting Rooms and Beyond ASR 2000 September 20, 2000 John Garofolo

Challenges Target for the new millenium in ASR Technology: –Meeting Room Transcription and Annotation Task multiple sensors –stationary, mobile, and arrays of mics in conjunction with video input devices noise and microphone robustness speaker-independent recognition speaker identification  automatic production of usable transcriptions with speakers identified and with properly formatted, capitalized, and punctuated text.  Perfect research task to move forward the state-of-the-art Development infrastructure will require –new metrics, evaluation tools –new I/O specifications –research corpora, new methods of collecting, compiling, and annotating data

NIST Proposed Initiative Collaborate with ASR research community to create evaluation infrastructure Develop corpus design and transcription and ASR system output specifications Revise and update NIST SCLITE ASR scoring software to extend beyond classical word error rate measurements Collaborate with NIST Smart Space Lab to collect, transcribe, and annotate a pilot meeting room transcription corpus Sponsor Evaluations and Workshops

Meeting Room Pilot Corpus Meeting type: –Possible focus group discussions requiring information lookup and real consensus building Participants: –At least 4 per meeting plus moderator –Native speakers? Multi-microphones: –Head-mounted ‘control’ –Microphone array –Lapel mikes worn by, or desk-top mikes for each participant –Table/wall-mounted stationary mikes Video: –Wide-angle view positioned so that it can be correlated with mike array for source location. Possibly other views to capture faces head-on. Annotation: –Transcription (words with capitalization/punctuation) –Speaker ID –Background noise conditions –Some initial exploration of annotating dialogue, people movement, gestures, lip movement, interaction with devices

NIST Smart Space Test Bed Laboratory 59-mic array, assorted conventional mics Cameras/video capture Large screen display Pervasive devices –Palm tops –Tablets –Wireless LAN Data collection servers Gigabit Ethernet High-bandwidth data flow system  Well-suited for creating pilot meeting corpus Camera Element Microphone Array Beams Equipment Room Large Screen Display Camera Elements

Approach for NIST will collaborate closely with a few research sites who will be the early users of the data to create the project specifications. –Via list and Web site NIST will create a pilot meeting room data collection –Data storage will be a significant issue NIST will create evaluation software for the new domain –Update SCLITE + detection-based scoring software If feasible, NIST will coordinate an experimental evaluation –Late summer/early Fall 2001 NIST will host a workshop (~October 2001) –to discuss research issues –to introduce the pilot corpus to the wider research community –to discuss evaluation metrics and the dry-run evaluation –to plan for future efforts (kickoff for larger DARPA program?)

21 st Century Automatic Speech Recognition: Meeting Rooms and Beyond John Garofolo NIST Speech Group: NIST Smart Space Lab: ASR 2000 September 20, 2000