Collecting, Storing, Coding, and Analyzing Spoken Tutorial Dialogue Corpora Diane Litman LRDC & Pitt CS.

Slides:



Advertisements
Similar presentations
A Common Standard for Data and Metadata: The ESDS Qualidata XML Schema Libby Bishop ESDS Qualidata – UK Data Archive E-Research Workshop Melbourne 27 April.
Advertisements

Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Overview of IS Controls, Auditing, and Security Fall 2005.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Using Multiple Synchronized Views Heymo Kou.  What is the two main technologies applied for efficient video browsing? (one for audio, one for visual.
Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems Kate Forbes-Riley, Diane Litman, Scott Silliman, Amruta Purandare.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
AUTOMATIC ORGANIZING AND FORMATTING FOR LECTURE NOTES SHIQING (LICIA) HE ADIVISOR: PROF.KRISTINA STRIEGNITZ SPRING 2014 STRUCTURING THE UNSTRUCTURED NOTE:
UNDERSTANDING JAVA APIS FOR MOBILE DEVICES v0.01.
Forecasting Presence and Availability Joe Tullio CS8803.
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
CSC1016 Coursework Clarification Derek Mortimer March 2010.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Lecturing with Digital Ink Richard Anderson University of Washington.
~ Multimodal Communication ~ HOW TO: From raw data to data annotation.
 Mark & Sons Future Technology Co. (hereafter, MSFT) is a $40 billion public company that provides high-technology products and services.  Currently,
 A data processing system is a combination of machines and people that for a set of inputs produces a defined set of outputs. The inputs and outputs.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.
DIVA - University of Fribourg - Switzerland Seminar presentation, jan Lawrence Michel, MSc Student Portable Meeting Recorder.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
The NITE XML Toolkit Jean Carletta University of Edinburgh HCRC Language Technology Group.
NXT meets the ICSI Corpus Jean Carletta and Jonathan Kilgour University of Edinburgh HCRC Language Technology Group.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
Lights, Camera, Caption! Presented by Kaela Parks.
Interactive Dialogue Systems Professor Diane Litman Computer Science Department & Learning Research and Development Center University of Pittsburgh Pittsburgh,
TEMPORAL DATA AND REAL- TIME ALGORITHMS CHAPTER 4 – GROUP 3.
Contactforum: Digitale bibliotheken voor muziek. 3/6/2005 Real music libraries in the virtual future: for an integrated view of music and music information.
Creating Web Applications Using ASP.NET Chapter Microsoft Visual Basic.NET: Reloaded 1.
CHAPTER FOUR COMPUTER SOFTWARE.
CIS750 – Seminar in Advanced Topics in Computer Science Advanced topics in databases – Multimedia Databases V. Megalooikonomou Introduction.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
circle Adding Spoken Dialogue to a Text-Based Tutorial Dialogue System Diane J. Litman Learning Research and Development Center & Computer Science Department.
Comparing Synthesized versus Pre-Recorded Tutor Speech in an Intelligent Tutoring Spoken Dialogue System Kate Forbes-Riley and Diane Litman and Scott Silliman.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Collaborative Annotation of the AMI Meeting Corpus Jean Carletta University of Edinburgh.
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
VLDB Demo WISE-Integrator: A System for Extracting and Integrating Complex Web Search Interfaces of the Deep Web Hai He, Weiyi Meng, Clement Yu, Zonghuan.
ENTERFACE 08 Project 1 “MultiParty Communication with a Tour Guide ECA” Mid-term presentation August 19th, 2008.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
The Games Corpus Design, implementation and annotation Agustín Gravano Spoken Language Processing Group Columbia University.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Building & Evaluating Spoken Dialogue Systems Discourse & Dialogue CS 359 November 27, 2001.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
SEESCOASEESCOA SEESCOA Meeting Activities of LUC 9 May 2003.
Metacognition and Learning in Spoken Dialogue Computer Tutoring Kate Forbes-Riley and Diane Litman Learning Research and Development Center University.
Improving (Meta)cognitive Tutoring by Detecting and Responding to Uncertainty Diane Litman & Kate Forbes-Riley University of Pittsburgh Pittsburgh, PA.
Using Natural Language Processing to Analyze Tutorial Dialogue Corpora Across Domains and Modalities Diane Litman, University of Pittsburgh, Pittsburgh,
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Bridging the Generation Gap Through Stories Aro Muttilainen Oliphant Sammander Sen.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Prosodic Cues to Disengagement and Uncertainty in Physics Tutorial Dialogues Diane Litman, Heather Friedberg, Kate Forbes-Riley University of Pittsburgh.
Chapter 1 WHAT IS A COMPUTER Faculty of ICT & Business Management Tel : BCOMP0101 Introduction to Information Technology.
INTRODUCTION TO INFORMATION SYSTEMS LECTURE 9: DATABASE FEATURES, FUNCTIONS AND ARCHITECTURES PART (2) أ/ غدير عاشور 1.
A Generic Toolkit for Electronic Editions of Medieval Manuscripts
Physical Data Model – step-by-step instructions and template
Chapter 6. Data Collection in a Wizard-of-Oz Experiment in Reinforcement Learning for Adaptive Dialogue Systems by: Rieser & Lemon. Course: Autonomous.
Multimedia Information Retrieval
PROJECTS SUMMARY PRESNETED BY HARISH KUMAR JANUARY 10,2018.
Presentation transcript:

Collecting, Storing, Coding, and Analyzing Spoken Tutorial Dialogue Corpora Diane Litman LRDC & Pitt CS

ITSPOKE Tutorial Dialogue Corpora Students engage in spoken dialogue with tutors, in the qualitative physics domain –human tutors –(fully automated) computer tutors –‘wizard’ computer tutors

Data Collection Speech-enhanced computer interfaces –Head-mounted microphones –Currently no video –Humans can be at different locations Human and Wizard Tutoring –Dialogue speech files Computer Tutoring –Utterance speech files Coordinated system logs

Data Storage Wav, raw audio, ogg formats Sampling –16k samples per second –16 bits per sample Stereo (dialogue level) and mono (utterance level)

Coding and Analysis Initially WaveSurfer –Open Source tool for sound visualization and manipulation Speech/sound analysis Sound annotation and transcription Praat is similar Recently moved to NXT (NITE XML Toolkit) –Also Open Source –

NXT Mature open-source libraries to support heavily annotated corpora whether they be multimodal; textual; monologue; dialogue A powerful integrated query language Built in tools for common tasks + Java API for custom tools Media sync built in Command line tools for data analysis

NXT meets the ICSI Corpus Jean Carletta and Jonathan Kilgour University of Edinburgh HCRC Language Technology Group

ICSI Meeting Corpus 75 natural meetings from research groups –close-talking and far-field microphones orthographic transcription "speech quality" tags (e.g., emphasis) dialogue acts hot spots

The NITE XML Toolkit library support for data handling and search using a data model that can express both timing and complex structure multiple file stand-off XML data storage some standard GUIs, data utilities library support for writing tailored GUIs

extract from Bdb001.A.words.xml time - line extract from Bdb001.A.speech-quality.xml Stand-off XML

Tasks pre-NXT: up-translation and tokenization hand annotation (topic segmentation, dialogue acts, extractive summaries,...) automatic annotation/indexing by query match

Queries in NXT ($a w):(TEXT($a) ~ /th.*/):: ($s speechquality):($s ^ $a) && Find instances of words starting with “th” For each find instances of speech quality tags of type "emphasis" that dominate the word Discard words that are not dominated by at least one such tag Use queries to understand data, verify quality, index.

NXT as Meeting Browser Browser = display + signal indexing + search NXT data displays: –synchronize with signal –highlight search results

Issues Already can't load all the ICSI data at once on some machines NXT supports display of one meeting at a time but browsing may be over several meetings Really complicated queries are often too slow for browser response times Key: Pre-indexing of query results, tailored data builds

NXT meets the BEETLE Corpus Johanna Moore’s Group University of Edinburgh

Coding Tutorial Dialogue Partitioned the dialogue into a set of non-overlapping segments with the following category names: –Content Dialogue that contains information relevant to the topics in the lessons. –Management Dialogue that does not contain information relevant to the lesson topics, but deals with the flow of the lesson. –Metacognition Dialogue that contains the student or tutor’s feeling about his or her understanding of the lesson material or each other. –Social Dialogue that serves as motivation, encouragement, humor, or establishing rapport.

Coding Student Utts for Sig Events Consider common theories of effective learning events Constructivism / generative learning –Osborne & Wittrock, 1983 Impasses –Van Lehn, et. al., 2003 Accountable talk –Wolf, Crosson & Resnick, 2006 Deep processing / cognitive effort –Thomas & Rohwer, 1993 Motivated, self-directed learner –Thomas & Rohwer, 1993 Student produces a lot of new information Student utts are incorrect or correct w/ low confidence Student utts are both accurate & deep Student utts are deep (regardless of accuracy)‏ High frequency of internally motivated student utts NOVELTY 1 ACCURACY 2 & CONFIDENCE 3 ACCURACY 2 & DEPTH 4 DEPTH 4 INITIATIVE 5

Student Utterance Coding Five major dimensions –Accuracy Correct, some missing, some errors, incorrect –Signs of “deep” processing or cognitive effort Present versus absent –Explain/justify/support statement with evidence/reasoning –Summarize or paraphrase –Express relationships or make connections between constructs –Questions or challenges statements from lesson or tutor Wolf, Crosson & Resnick (2006)‏ –Signs of low confidence Present versus absent (Bhatt, Evens & Argamon, 2004)‏ –Origin Externally versus internally motivated –Novelty Old versus new information

A B C battery X Question: If bulb B is damaged, what do you think will happen to bulbs A and C? Non-Accountable Talk: utt69: student: A and C will not light up Accuracy = Correct; Cognitive Processing = Absent Cognitive Effort and Potential Impasse: utt122a: student: bulb a will light but b and c won't since b is damaged and breaks the closed path circuit Accuracy = Incorrect; Cognitive Processing = Present Potential Impasse: utt97: student: both would be either dim or not light I would think Accuracy = Partially Correct; Cognitive Processing = Absent; Signs of Low Confidence = Yes utt83a: student: both bulbs A and C will go out because this scenario would act the same as if there was an open circuit Accuracy = Correct; Cognitive Processing = Present Accountable Talk: