12/5/20151 Spoken Language Processing Julia Hirschberg CS 4706.

Slides:



Advertisements
Similar presentations
Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Advertisements

Lecture 1: IntroductionIntro to IT COSC1078 Introduction to Information Technology Lecture 1 Introduction James Harland
Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.
Probabilistic Adaptive Real-Time Learning And Natural Conversational Engine Seventh Framework Programme FP7-ICT
CS 4705 Natural Language Processing Julia Hirschberg COMS 4705 Fall 2010.
Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to Jean-Philippe Goldman, Fadi Biadsy.
INTONATION Chapters 15 & 16.
1 Spoken Dialogue Systems Dialogue and Conversational Agents (Part IV) Chapter 19: Draft of May 18, 2005 Speech and Language Processing: An Introduction.
Prof. James A. Landay University of Washington Spring 2012 Introduction & Course Overview CSE 441 – Advanced HCI March 27, 2012.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Course Overview Lecture 1 Spoken Language Processing Prof. Andrew Rosenberg.
Lecture 1: IntroductionIntro to IT COSC1078 Introduction to Information Technology Lecture 2 Overview James Harland
Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,
Automatic Content Extraction for Voic Using Ninja Goal: Make voic more accessible Enable faster browsing of many voic s Access from different.
Spoken Language Processing Lab Who we are: Julia Hirschberg, Stefan Benus, Fadi Biadsy, Frank Enos, Agus Gravano, Jackson Liscombe, Sameer Maskey, Andrew.
Lecture 1: IntroductionIntro to IT COSC1078 Introduction to Information Technology Lecture 1 Introduction James Harland
OBJECT ORIENTED PROGRAMMING I LECTURE 1 GEORGE KOUTSOGIANNAKIS
6/28/20151 Spoken Dialogue Systems: Human and Machine Julia Hirschberg CS 4706.
Games For People Who Are Blind By: Ben Ehrich Scott Holland Megan Wallace.
Automatic Speech Recognition
Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.
ISSUES IN SPEECH RECOGNITION Shraddha Sharma
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
 Mrs. DeBoard’s Contact Information  Phone:   Website: deboardvirtualbio.wikispaces.com  Office Hours:
NM7613: Music Signal Analysis and Retrieval 音樂訊號分析與檢索 Jyh-Shing Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University.
Introduction to Natural Language Processing Heshaam Faili University of Tehran.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
E.O.G. Jeopardy! Poetry Elements EOG terms Story Elements Resources Author’s Purpose Q $100 Q $200 Q $300 Q $400 Q $500 Q $100 Q $200 Q $300 Q $400 Q.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
Welcome to IIT and cs115!.
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
INTRODUCTION TO ATHLETIC MEDICINE TRIMESTER PROJECT.
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Chapter 3 Culture and Language. Chapter Outline  Humanity and Language  Five Properties of Language  How Language Works  Language and Culture  Social.
Syllabus Highlights CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
1 Natural Language Processing Lecture Notes 14 Chapter 19.
1 Computation Approaches to Emotional Speech Julia Hirschberg
12/8/20151 Introduction to the Course and to Speech Synthesis Julia Hirschberg.
Intro to CIT 594
Using Google's Web Speech API with Moodle for language learning tasks
Basic structure of sphinx 4
How to Learn English Efficiently By : Ali Servat.
Language in Cognitive Science. Research Areas for Language Computational models of speech production and perception Signal processing for speech analysis,
CS 4705 Natural Language Processing Who am I? Julia Hirschberg –Computational Linguist in CS –Focus: Spoken Language Processing –Lab: The Speech Lab,
Experiences with Undergraduate Research (Natural Language Processing for Educational Applications) Professor Diane Litman University of Pittsburgh.
CS112: Course Overview George Mason University. Today’s topics Go over the syllabus Go over resources – Marmoset – Blackboard – Piazza – Textbook Highlight.
INDEFINITE AND DEFINITE ARTICLES Learn THE facts..
CIS101 Introduction to Computing Week 01. Agenda What is CIS101? Class Introductions Using your Pace Introduction to Blackboard and online learning.
Natural Language and Speech (parts of Chapters 8 & 9)
Using Oral Recordings to Promote Focused Speaking and Reflective Listening Patricia N. Early Georgia State University.
Prof. James A. Landay University of Washington Winter 2009 Introduction & Course Overview CSE 441 – Advanced HCI January 6, 2009.
Assignment 1 – Voice Activated Systems Meryem Gurel PowerPack : Physical Computing, Wireless Networks and Internet of Things 10/7/2013 German W Aparicio.
Presented By: O. Govinda Rao 3 rd MCA AITAM CH. Hari Prasad 3 rd MCA AITAM.
Speech Recognition Xiaofeng Lai. What is speech recognition?  Speech recognition :  This is the ability of a machine or program to identify words and.
SPEECH TECHNOLOGY An Overview Gopala Krishna. A
G. Anushiya Rachel Project Officer
Course Projects Speech Recognition Spring 1386
Why Study Spoken Language?
Spoken Language Processing
Issues in Spoken Dialogue Systems
Why Study Spoken Language?
Advanced NLP: Speech Research and Technologies
Spoken Language Processing:Summing Up
Speech Processing August 4, /2/2018.
Advanced NLP: Speech Research and Technologies
Emotional Speech Julia Hirschberg CS /8/2018.
PROJ2: Building an ASR System
Accelerated Introduction to Computer Science
Spoken Language Processing
Presentation transcript:

12/5/20151 Spoken Language Processing Julia Hirschberg CS 4706

12/5/20152 Speech Processing –How do you produce sounds that other people interpret as language? –How does your hearer decode what you are trying to convey? –In a conversation, how do you know when it’s your turn to talk? –Once you decide what you want to say, how do you decide how to say it?: How do you decide where to pause in the sentence? How do you decide what words to emphasize? How do you decide what intonational contour to use? How do you convey your own feelings and emotions?

12/5/20153 Applications for Speech Technologies Speech synthesis (TTS): AT&T, IBM (Jeopardy 2/14- 16), SitePalAT&TIBM JeopardySitePal Speech recognition (ASR): Nuance, Sphinx, HTK Speech to Speech Translation: TRANSTAC (DARPA) Speech Search: Google Voice SearchVoice Search Homeland Security: Deception Detection, Dialect and Language ID, and Speaker ID, trust Spoken Dialogue Systems: –Over-the-phone services: Android or SiriAndroid Siri –Tutoring systems: KTH’s VilleVille –Amtrak Julie (or here)Juliehere

12/5/20154 What will we do in this course? Learn about fundamental aspects of speech signals and how to analyze them –Acoustic/prosodic information: pitch, energy –Phonemes and phones: sounds of a language –Intonation: pitch, intensity, timing Study two basic speech technologies, TTS and ASR and their application in Spoken Dialogue Systems (SDS) –Build your own SDS using the Festival and SphinxToolkits in a domain of your choice

Sample Domains from Previous Years Pizza orders Movie recommendations Spoken interface to games –Colossal Cave Adventure –Froggie Voice-controlled audio book reader Sports event ticket search Voice Yelp Spoken cookie recipes 12/5/20155

Student Clinic Appointment Systems Querying Dow Jones Information via Yahoo Finance NFL player stats iPhone task manager Music browsing 12/5/20156

7 Course information Course syllabus and readingsCourse syllabus –Jurafsky & Martin, second editionJurafsky & Martin –Articles in syllabus Speech tools and speech lab Courseworks: discussion, course files, gradebook TAs: Erica Cooper and Rivka Levitan

12/5/20158 Projects Build and demo a Spoken Dialogue System Teams of 2 or 3 –Organize your own or –Advertise for team members on Courseworks discussion category “Find a team” or –Send mail to prof or TAs -- early 4 Deadlines: –Project Description –TTS component (using Festival tools) –ASR component (using HTK tools) –Beta-test (does your system work?) –SDS demos (during final exam period)

12/5/20159 Honor policy on syllabuspolicy Late policy on syllabus (5 ‘free’ late days per project for the semester) My office hours: M 4:15-6:15, CEPSR 705 Rivka Levitan : TBD, CEPSR 7LW1 (Speech Lab) – Erica Cooper: TBD, CEPSR 7LW1 (Speech Lab) –

12/5/ Questions?