NSF Grant Number: IIS-0095940 PI: Joseph Picone Institution: Mississippi State University Title: Integrating Prosody, Speech Recognition, Parsing In Spoken-Language.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Information Extraction from Spoken Language Dr Pierre Dumouchel Scientific Vice-President, CRIM Full Professor, ÉTS.
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Speech Synthesis Markup Language SSML. Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted.
MINING FEATURE-OPINION PAIRS AND THEIR RELIABILITY SCORES FROM WEB OPINION SOURCES Presented by Sole A. Kamal, M. Abulaish, and T. Anwar International.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Methods in Computational Linguistics II Queens College Lecture 1: Introduction.
Department of Mathematics and Computer Science
Presenters: Başak Çakar Şadiye Kaptanoğlu.  Typical output of an IR system – static predefined summary ◦ Title ◦ First few sentences  Not a clear view.
Big Ideas in Cmput366. Search Blind Search Iterative deepening Heuristic Search A* Local and Stochastic Search Randomized algorithm Constraint satisfaction.
1/23 Applications of NLP. 2/23 Applications Text-to-speech, speech-to-text Dialogues sytems / conversation machines NL interfaces to –QA systems –IR systems.
Software and Multimedia
2011 Proposal Writing Workshop Part II: Features of Effective Proposals.
ELN – Natural Language Processing Giuseppe Attardi
WP5.4 - Introduction  Knowledge Extraction from Complementary Sources  This activity is concerned with augmenting the semantic multimedia metadata basis.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Text summarization MEAD NewsInEssence Cross-document structure Sentence compression Lexrank Political science Discourse dynamics Centrality identification.
May 2006CLINT-CS Verbmobil1 CLINT-CS Dialogue II Verbmobil.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
Summary  Extractive speech summarization aims to automatically select an indicative set of sentences from a spoken document to concisely represent the.
INSTITUTE FOR SIGNAL AND INFORMATION PROCESSING Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi State.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
National Technical University of Ukraine “Kiev Polytechnic Institute” Heat and energy design faculty Department of automation design of energy processes.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, 龙星计划课程 : 信息检索 Course Summary ChengXiang Zhai ( 翟成祥 ) Department of.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
SEESCOASEESCOA SEESCOA Meeting Activities of LUC 9 May 2003.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
ELISQ Systems Demonstration Sagnik Ray Choudhury Doha -- May 2015.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
By Kyle McCardle.  Issues with Natural Language  Basic Components  Syntax  The Earley Parser  Transition Network Parsers  Augmented Transition Networks.
STEWARD: A Spatio-Textual Document Search Engine for HUDUSER.ORG Prof. Hanan Samet Department of Computer Science, University of Maryland, College Park,
© W. Wahlster, DFKI IST ´98 Workshop „The Language of Business - the Business of Language“ Vienna, 2 December 1998 German Research Center for Artificial.
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
Linguistic knowledge for Speech recognition
Investigating Pitch Accent Recognition in Non-native Speech
Copyright © American Speech-Language-Hearing Association
Text-To-Speech System for English
Poetry Questions NCFE Review.
Daniel Bevis William King Villanova University Spring 2006 CS9010
Course Projects Speech Recognition Spring 1386
3.0 Map of Subject Areas.
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Software and Multimedia
Rapidly Retargetable Translingual Detection
Software and Multimedia
Text Analytics Giuseppe Attardi Università di Pisa
HUMAN LANGUAGE TECHNOLOGY: From Bits to Blogs
EXPERIMENTS WITH UNIT SELECTION SPEECH DATABASES FOR INDIAN LANGUAGES
Turn-taking and Disfluencies
Studying Spoken Language Text 17, 18 and 19
Advanced NLP: Speech Research and Technologies
Automatic Speaker Identification Using Sentinel Word Discrimination
Grant Number: IIS Institution of PI: WPI PIs: Matthew O
Advanced NLP: Speech Research and Technologies
Key features of effective school-based models
Voice Activation for Wealth Management
Disillusion, Defiance, and Discontent
Project Title Investigator’s Name / Lead Institution
Grant Number: IIS Institution of PI: Brigham Young University PI’s: David W. Embley, Stephen W. Liddle, Deryle W. Lonsdale Title:
Spontaneous Voice Driven Interaction with Avatars: Discriminating Alerting and Referential Contexts of Sentinel Words The Problem The Solution e-WUW:
Lexical selection: activates successive lexical items.
Presentation transcript:

NSF Grant Number: IIS-0095940 PI: Joseph Picone Institution: Mississippi State University Title: Integrating Prosody, Speech Recognition, Parsing In Spoken-Language Research Objectives: Significant Results: Approach Graphic: Broader Impact: Better understand how integration of prosodic information, speech recognition and parsing can impact the problem of information extraction from spoken documents. Apart from disfluency cues, sentence-internal prosody speech does not provide strong cues for parsing (compared to punctuation) Methods based on machine learning and statistical modeling can be used to identify and correct speech repairs Four key themes in the proposed research: (1) utilizing parsing in information retrieval, (2) integrating prosodic information in parsing spoken language, (3) incorporating uncertainty in parsing to handle speech recognition errors, (4) improvements to speech recognition of spontaneous speech. All components share a probabilistic formulation. Initial steps towards information extraction from spoken documents or from any text (such as encyclopedias), and can serve as the basis for a sorely needed sophisticated web browser technology and data mining applications.