Spoken Language Processing

Slides:

Advertisements

Similar presentations

Lecture 1: IntroductionIntro to IT COSC1078 Introduction to Information Technology Lecture 1 Introduction James Harland

Advertisements

Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.

CS 4705 Natural Language Processing Julia Hirschberg COMS 4705 Fall 2010.

CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:

Course Overview Lecture 1 Spoken Language Processing Prof. Andrew Rosenberg.

Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,

CS 4705 Lecture 1 CS4705 Introduction to Natural Language Processing.

CS 4705 Natural Language Processing What is Natural Language Processing? The study of human languages and how they can be represented computationally.

Games For People Who Are Blind By: Ben Ehrich Scott Holland Megan Wallace.

Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.

 Mrs. DeBoard’s Contact Information  Phone:   Website: deboardvirtualbio.wikispaces.com  Office Hours:

Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.

CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.

CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.

Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.

+ Introduction to Class IST210 Class Lecture. + Course Objectives Understand the importance of data, databases, and database management Design and implement.

Lecture 1: IntroductionIntro to IT COSC1078 Introduction to Information Technology Lecture 1 Introduction James Harland

12/5/20151 Spoken Language Processing Julia Hirschberg CS 4706.

Using Google's Web Speech API with Moodle for language learning tasks

CS 4705 Natural Language Processing Who am I? Julia Hirschberg –Computational Linguist in CS –Focus: Spoken Language Processing –Lab: The Speech Lab,

INDEFINITE AND DEFINITE ARTICLES Learn THE facts..

MUSIC – Unit 1 What is Music?. Music is organized sound.

Course Information CSE 2031 Fall Instructor U.T. Nguyen Office: CSE Home page:

Baseball Challenge! Today’s Game is pitched by Ms. Howe.

CS 274: Internet Programming

SPEECH TECHNOLOGY An Overview Gopala Krishna. A

G. Anushiya Rachel Project Officer

Welcome to 7th Grade Pre-Registration

Digital Image Processing Fall Course Syllabus

Class of 2019 Course Selection

To access, go to: <YOUR SUBDOMAIN HERE>

WELCOME PARENTS! Please sign in.

CSc 1302 Principles of Computer Science II

CSc 020: Programming Concepts and Methodology II

Brand Voice Guidelines

Course Projects Speech Recognition Spring 1386

Computer Science 102 Data Structures CSCI-UA

Why Study Spoken Language?

INTRODUCTION Google has taken on Amazon's Alexa, Apple's Siri, and Microsoft's Cortana with its own voice assistant called Google Assistant. Google first.

Issues in Spoken Dialogue Systems

Spoken Dialogue Systems

Spoken Dialogue Systems: Human and Machine

Course Overview Juan Carlos Niebles and Ranjay Krishna

An Introduction to Online Timetabling

Speech Conductor Team Six (see below)

Mr. Smith’s Schedule 1st Period Computer Science I

Why Study Spoken Language?

Advanced NLP: Speech Research and Technologies

Advanced Placement® Macroeconomics

CSC2310 Principles of Computer Programming

Spoken Language Processing:Summing Up

Speech Processing August 4, /2/2018.

Advanced NLP: Speech Research and Technologies

Spoken Dialogue Systems

Emotional Speech Julia Hirschberg CS /8/2018.

PROJ2: Building an ASR System

Welcome to ENG1DI with Westra!

Accelerated Introduction to Computer Science

Spoken Dialogue Systems: System Overview

Introduction to.

MESA RIDGE GRIZZLIES.

August 29th, 2017 Mr. Isho Chemistry

Machine Learning in FinTech

Intro to CIT 594

Tools for Speech Analysis

Welcome to the First-Year Experience!

New Student Orientation

Computer Engineering Department Islamic University of Gaza

Spoken Language Processing

Low Level Cues to Emotion

Presentation transcript:

Spoken Language Processing Julia Hirschberg CS 4706 9/19/2018

Speech Processing How do you produce sounds that other people interpret as language? How does your hearer decode what you are trying to convey? In a conversation, how do you know when it’s your turn to talk? Once you decide what you want to say, how do you decide how to say it?: How do you decide where to pause in the sentence? How do you decide what words to emphasize? How do you decide what intonational contour to use? How do you convey your own feelings and emotions? 9/19/2018

Applications for Speech Technologies Speech synthesis (TTS): AT&T, IBM (Jeopardy 2/14-16), SitePal Speech recognition (ASR): Nuance, Sphinx, HTK Speech to Speech Translation: TRANSTAC (DARPA) Speech Search: Google Voice Search Homeland Security: Deception Detection, Dialect and Language ID, and Speaker ID, trust Spoken Dialogue Systems: Over-the-phone services: Android or Siri Tutoring systems: KTH’s Ville Amtrak Julie (or here) Watson on Jeopardy feb 14-16 TTS: Charles St. West is the home of Mr. St. Clair. The city hall parking lot was chock full of cars. Jorge Posada catches for the Yankees. 9/19/2018

What will we do in this course? Learn about fundamental aspects of speech signals and how to analyze them Acoustic/prosodic information: pitch, energy Phonemes and phones: sounds of a language Intonation: pitch, intensity, timing Study two basic speech technologies, TTS and ASR and their application in Spoken Dialogue Systems (SDS) Build your own SDS using the Festival and SphinxToolkits in a domain of your choice 9/19/2018

Sample Domains from Previous Years Pizza orders Movie recommendations Spoken interface to games Colossal Cave Adventure Froggie Voice-controlled audio book reader Sports event ticket search Voice Yelp Spoken cookie recipes 9/19/2018

Student Clinic Appointment Systems Querying Dow Jones Information via Yahoo Finance NFL player stats iPhone task manager Music browsing 9/19/2018

Course syllabus and readings Jurafsky & Martin, second edition Course information Course syllabus and readings Jurafsky & Martin, second edition Articles in syllabus Speech tools and speech lab Courseworks: discussion, course files, gradebook TAs: Erica Cooper and Rivka Levitan 9/19/2018

Projects Build and demo a Spoken Dialogue System Teams of 2 or 3 Organize your own or Advertise for team members on Courseworks discussion category “Find a team” or Send mail to prof or TAs -- early 4 Deadlines: Project Description TTS component (using Festival tools) ASR component (using HTK tools) Beta-test (does your system work?) SDS demos (during final exam period) 9/19/2018

Honor policy on syllabus Late policy on syllabus (5 ‘free’ late days per project for the semester) My office hours: M 4:15-6:15, CEPSR 705 Rivka Levitan : TBD, CEPSR 7LW1 (Speech Lab) – rlevitan@cs.columbia.edu Erica Cooper: TBD, CEPSR 7LW1 (Speech Lab) – ecooper@cs.columbia.edu 9/19/2018

Questions? 9/19/2018