Meeting Recorder Adam Janin

Slides:



Advertisements
Similar presentations
Chapter 1:Introduction to the world of computers
Advertisements

Topic 5 Instructional audio OWT 410. Instructional audio Digital audio Definition of podcast Type of podcast Steps for creating audio podcasts Tools for.
4.03 IT PowerPoint Objective 4.03—Understand Information Technology activities and careers.
Bc EV-20 PM-20EV Sound Repeater.
Interview Room Recorder IR2. Interview Room Recorder IR2 is designed to make the interview process easier to record, facilitate, annotate, and export.
The physical parts of Computer
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
COMPUTER CONCEPTS Computer Information Systems. COURSE COMPETENCIES Explain the functions of computer system components. Describe the information processing.
Discovering Computers: Chapter 1
CSCE101 – Op Codes (continued), 4.4, 4.5, 5.2 Tuesday, October 31, 2006 and Thursday, November 2, 2006.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
A Prototype Personal Dictation System Adam Janin
Lecturing with Digital Ink Richard Anderson University of Washington.
GCSE English Language Unit 3 Speaking & Listening
Support for Palm Pilot Collaboration Including Handwriting Recognition.
Why is ASR Hard? Natural speech is continuous
 Captures screen activity and synchronized audio.  Allows you to share info with people not able to attend session.  Customers can view recording at.
Use voice tag in mobile By : Sara Ayad Aldehany To : Mariam Almie.
Readily Available Technology to Support Writing Tools for Struggling Writers.
McGraw-Hill Technology Education © 2006 by the McGraw-Hill Companies, Inc. All rights reserved. 77 CHAPTER INPUT AND OUTPUT Page 150.
DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.
CHAPTER 2 Input & Output Prepared by: Mrs.sara salih 1.
4.03 IT PowerPoint Objective 4.03—Understand Information Technology activities and careers.
Speech Recognition SR Commands Alternative Input Handhelds.
Chapter 5 Input. What Is Input? What are the input devices? Input device is any hardware component used to enter data or instructions Data or instructions.
Twenty-First Century Automatic Speech Recognition: Meeting Rooms and Beyond ASR 2000 September 20, 2000 John Garofolo
Using an Ipad to teach English. Why use an Ipad Using an Ipad to teach is a good Idea as teachers can access more information in order to help get the.
Chapter 11: Interaction Styles. Interaction Styles Introduction: Interaction styles are primarily different ways in which a user and computer system can.
BravoBrava Mississippi State University Can Advances in Speech Recognition make Spoken Language as Convenient and as Accessible as Online Text? Joseph.
Real-Time Speech Recognition Subtitling in Education Respeaking 2009 Dr Mike Wald University of Southampton.
Interfacing with the Machine Jay Desloge SENS Corporation Sumit Basu Microsoft Research.
Chapter 5 Input By: Matthew D McCoog What Is Input? Any data or instructions entered into the memory of a computer.
Johannes Spitzbart Phonogrammarchiv, Austrian Academy of Sciences Österreichische Tage der Digitalen Geisteswissenschaften save the data - workshop on.
A brief overview of Speech Recognition and Spoken Language Processing Advanced NLP Guest Lecture August 31 Andrew Rosenberg.
1 An Assessment of a Speech-Based Programming Environment Andrew Begel Microsoft Research (formerly UC Berkeley)
Audio on the Web Teaching OntheNet 2002 Minneapolis, MN June 23-25, 2002.
Special Education Software and Programs Demetrios Houmas
Fall 2002CS/PSY Dialog Design 3 How to use a PDA.
Chapter 5: Input CSC 151 Beth Myers Kristy Heller Julia Zachok.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
ICT – 8th grade ASPAEN – Gimnasio Los Cerezos Angela I. Arango Echeverry Hardware: Input devices.
MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES INTRODUCTION 6/1/ A.Aruna, Assistant Professor, Faculty of Information Technology.
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and Audio Brandon Muramatsu Andrew McKinney
Dirk Van CompernolleAtranos Workshop, Leuven 12 April 2002 Automatic Transcription of Natural Speech - A Broader Perspective – Dirk Van Compernolle ESAT.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Unlocking Audio/Video Content with Speech Recognition Behrooz Chitsaz Director, IP Strategy Microsoft Research Frank Seide Lead.
Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
What is Input?  Input  Processing  Output  Storage Everything we enter into the computer to do is Input.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Five Components of a Computer Input Device – keyboard, scanner, PDA/stylus, digital camera, mouse, MP3 player, fax machine, microphone Storage Device –
Computer Information Systems
Building Community around Tools for Automated Video Transcription for Rich Media Notebooks: The SpokenMedia Project Brandon Muramatsu MIT,
Input and Output Devices
OneNote.
The IT Environment Section 5 ICA11v1.0
Welcome to your first Live Session on Adobe Connect
Automatic Speech Recognition
Group Decision Support Systems
Artificial Intelligence for Speech Recognition
3.0 Map of Subject Areas.
Understand Information Technology activities and careers.
CS 2610 Project Presentation Presented By- Zuha Agha and Tazin Afrin
4.03 IT PowerPoint Objective 4.03—Understand Information Technology activities and careers.
Welcome to your first Online Session
4.03 IT PowerPoint Objective 4.03—Understand Information Technology activities and careers.
PLTW Terms PLTW Vocabulary Set #10.
PLTW Terms PLTW Vocabulary Set #10.
Presentation transcript:

Meeting Recorder Adam Janin

2 A Portable Meeting Recorder n Record meetings in natural settings n Support multiple speakers n Allowing correction and annotation n Support indexing and searching n Self-contained (using IRAM)

3 Vector IRAM n High absolute performance u Floating point and integer vector units u High bandwidth to memory n High performance per watt u Low instruction dispatch rate for vectors u On-chip memory n Well-suited for Meeting Recorder u ASR is (mostly) easy to vectorize u Supports floating point u Compiler

5 Why Meeting Recorder? n Why not written notes? u Unreadable handwriting u Detracts from listening u Incomplete record u Hard to search n Why not a tape recorder? u Hard to annotate u Playback can be disruptive u Too much data u Hard to search

6 Natural Settings n Record impromptu meetings n Robust to noise, reverberation, etc. n Large vocabulary n Spontaneous speech n Portable n Self-contained n Hands-free

7 Multiple Speakers n Robust to accents, speaking styles n Interrupts n Mark speaker change n Adapt to common participants

8 Correction and Annotation n Correcting – informing the system that it has made an error u If the system has a good idea of alternatives, it may be faster to correct than to edit u Recognizer can adapt to user and vocabulary n Annotation – Supplementing and modifying the output u “That’s not what I meant to say” u Pen vs. speech input u General ink and text annotations

9 Indexing and searching n Index by speaker and text content n Store and replay audio n Allow text and spoken queries n Perfect speech recognition is not required! u N-best lists u Function vs. content words u Audio record

10 Collaboration n Multiple Meeting Recorders in a meeting connected via a wireless network n Offline u Share notes (à la NotePals) u Improve transcript quality n Online u “Chat” u Microphone array

11 Data Collection Phase n Meeting room at ICSI n 5 wireless head-mounted microphones n 3 PZM omni-directional microphones n HandSpring PDA (probably wired network) n Primarily speech and NLP meetings n Collect and hand-transcribe about 40 hours of meetings

12 Issues n Will people object to being recorded? n Will their speaking style change? n Are the limitations of the data collection phase important? n Is cross-domain training feasible? n Lots of UI issues… n Lots of IR issues... n Lots of ASR issues…

13 Prototype of a Personal Dictation System n Records a single user dictating text n Allows correction and limited annotation Correct transcripts Insert annotations Search/playback Audio frontend Speech recognizer Correction server n Hosted system Palm Pilot Sun Workstation

Prototype Screen Shots Richard Posner, who heads the United States Court of Appeals...