Carnegie Mellon Mostow 12/7/2015, p. 1 The Sounds of Silence: Towards Automated Evaluation of Student Learning in a Reading Tutor that Listens Jack Mostow.

Slides:



Advertisements
Similar presentations
READING P-Peer A-Assisted L-Learning S-Strategies
Advertisements

Tuning Jenny Burr August Discussion Topics What is tuning? What is the process of tuning?
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 12.
Natalie Fong English Centre, The University of Hong Kong Good Practices in a Second Language Classroom: An Alternating Use of ICT in Independent Learning.
IndusInd Bank – My Account My Number
CURR 511 WIKI By: Amanda Paganin, Elizabeth Depue, Brendan Crowley, Amber Allenbrandt.
© 2010 Artur Dubrawski 1 T-Cube Web Interface in RTBP: A Review of R&D Challenges Artur Dubrawski, Ph.D, M.Eng. Director, Auton Lab Senior Systems Scientist,
Running Records.
Team Members: Cliff Clark, Gary Hollingshead, and Elizabeth Reese 1 TEAM FAULT FINDERS SENIOR DESIGN REVIEW Sponsors: Idaho Power Others Involved: Schweitzer.
Do you suffer from judgement creep? A group moderation session will soon put you right!
Mining Data from Randomized Within-Subject Experiments in an Automated Reading Tutor Joseph E. Beck and Jack Mostow Project LISTEN (
Detecting Prosody Improvement in Oral Rereading Minh Duong and Jack Mostow Project LISTEN Carnegie Mellon University The research.
Variables Variable = something that can change in different conditions in a study VARIABLES HAVE TO VARY!!
Observation Watch, listen, and learn…. Agenda  Observation exercise Come back at 3:40.  Questions?  Observation.
CS CS 5150 Software Engineering Lecture 12 Usability 2.
Carnegie Mellon Project LISTEN 17/22/2004 Some Useful Design Tactics for Mining ITS Data Jack Mostow Project LISTEN (
Carnegie Mellon Project LISTEN16/29/2004 Which Help Helps? Effects of Various Types of Help on Word Learning in an Automated Reading Tutor that Listens.
Final Project CS HCI Kim T Le. Screen Readers for Blind.
Cindy Song Sharena Paripatyadar. Use vision for HCI Determine steps necessary to incorporate vision in HCI applications Examine concerns & implications.
Intro to Evaluation See how (un)usable your software really is…
Variables Variable = something that can change in different conditions in a study VARIABLES HAVE TO VARY!!
Lessons from generating, scoring, and analyzing questions in a Reading Tutor for children Jack Mostow Project LISTEN (
ICS 463, Intro to Human Computer Interaction Design: 8. Evaluation and Data Dan Suthers.
 New communication technologies are being developed and these changes affect not only literacy instruction but also our definition of literacy itself.
1 EmuPlayer Music Recommendation System Based on User Emotion Using Vital-sensor KMSF- sunny 親: namachan さん.
Setup Guide for Win 7 Speech Recognition 6/30/2014 Debbie Hebert, PT, ATP Central AT Services.
RUNNING RECORDS GUIDED READING &. © STEPS Professional Development3 THE MULTIDIMENSIONAL MODEL OF READING connecting comparing Reading Strategies synthesising.
1 SPACE Reading Tutor: Design and Functionality Leen Cleuren Jacques Duchateau Pol Ghesquière Hugo Van hamme.
® Automatic Scoring of Children's Read-Aloud Text Passages and Word Lists Klaus Zechner, John Sabatini and Lei Chen Educational Testing Service.
My Reading e-Portfolio
Assistive Technology for Students with Auditory Processing Disabilities.
Job Aids video storyboard Matt Boardman, Chiao-Ling Lu.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
How Spread Works. Spread Spread stands for Speech and Phoneme Recognition as Educational Aid for the Deaf and Hearing Impaired Children It is a game used.
Experiments & Statistics. Experiment Design Playtesting Experiments don’t have to be “big”--many game design experiments take only 30 minutes to design.
Mobile HCI Presented by Bradley Barnes. Mobile vs. Stationary Desktop – Stationary Users can devote all of their attention to the application. Very graphical,
Demo. Overview Overall the project has two main goals: 1) Develop a method to use sensor data to determine behavior probability. 2) Use the behavior probability.
Student Use of CALL Software and its Effect on Learners Alan Bessette Poole Gakuin University GloCALL
Reading With Your Kids A parent’s guide to helping your child with reading at home.
Improving the Help Selection Policy in a Reading Tutor that Listens Cecily Heiner, Joseph E. Beck, Jack Mostow Project LISTEN
First Steps Daily Lesson Plan 1. Re-Reading (Fluency) 2. Word-Study 3. Writing 4. New Read Transition Lesson Plan 1. Instructional Reading 2. Word-Study.
Search. Search issues How do we say what we want? –I want a story about pigs –I want a picture of a rooster –How many televisions were sold in Vietnam.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.4 Analyzing Dependent Samples.
Interaction Design Dr. Jim Rowan Foley Introduction What’s in the Book that we’ll cover.
English 95, Read Right 2 credits per session Welcome to English 95.
September 8,  Analyze Running Record for Don’t Let the Pigeon Drive the Bus  Review calculating reading level of texts  Practice using Miscue.
Carnegie Mellon How does the amount of context in which words are practiced affect fluency growth? Experimental results Jack Mostow, Jessica Nelson, Martin.
The Interactive Strategies Approach to Early Literacy Intervention (ISA) Michelle Eackles RDG 692 Best Practices in Early Literacy Instruction Diane M.
Cobb County Classroom Walk-Through Overview.
Evaluation / Usability. ImplementDesignAnalysisEvaluateDevelop ADDIE.
DEVELOPING READING FLUENCY A Project LIFT Training Module CORE - Center at Oregon for Research in Education Module 4 – Part 2.
Principles of Demonstrative Instructional Video Peyton R. Glore Assistant Professor School of Information Technology Macon State College October 17, 2007.
1 Title One Summer Reading Tips 2006 “Reading is to the mind what exercise is to the body. As by the one, health is preserved strengthened, and invigorated;
Voice Computing and Reaching the 3B People at the Bottom of the Pyramid Raj Reddy Carnegie Mellon University Pittsburgh, PA Sep 20, 2016 Heidelberg.
Human Computer Interaction Lecture 21 User Support
Improving Reading Fluency
The development of speech comprehension
Education Occupation Project 5th period
Wei Chen, Jack Mostow Gregory Aist Project LISTEN
(adapted from Keri Huddleston, 2016)
Design & Evaluation Methods
Micro-analysis of Fluency Gains in a Reading Tutor that Listens:
Detecting Prosody Improvement in Oral Rereading
SY DE 542 User Testing March 7, 2005 R. Chow
Nigel G. Ward, Anais G. Rivera, Karen Ward, David G. Novick
Independent versus Computer-Guided Oral Reading:
An Embedded Experiment to Evaluate the Effectiveness of Vocabulary Previews in an Automated Reading Tutor Jack Mostow, Joe Beck, Juliet Bey, Andrew Cuneo,
Jack Mostow* and Joseph Beck Project LISTEN (
Educational Data Mining Success Stories
Experimenter-defined measures in a Reading Tutor that Listens
Presentation transcript:

Carnegie Mellon Mostow 12/7/2015, p. 1 The Sounds of Silence: Towards Automated Evaluation of Student Learning in a Reading Tutor that Listens Jack Mostow and Gregory Aist Project LISTEN, Carnegie Mellon University

Carnegie Mellon Mostow 12/7/2015, p. 2 Pilot study in urban elementary school Goals: Analyze extended use of Reading Tutor Analyze extended use of Reading Tutor Identify opportunities for improvement Identify opportunities for improvementProtocol: Principal chose 8 lowest third-grade readers Principal chose 8 lowest third-grade readers Aide took each kid daily to use Reading Tutor in small room Aide took each kid daily to use Reading Tutor in small room Kid chose text to read ( Weekly Reader, poems, …) Kid chose text to read ( Weekly Reader, poems, …)Milestones: Oct. 96: deployed Pentium, trained users, refined design Oct. 96: deployed Pentium, trained users, refined design Nov. 96: school pre-tested individually Nov. 96: school pre-tested individually June 97: school post-tested individually June 97: school post-tested individually

Carnegie Mellon Mostow 12/7/2015, p. 3 User may: click Back click Help click Go click word read Tutor may: go on read word recue word read phrase User-Tutor interaction (11/7/96 version used in pilot study)

Carnegie Mellon Mostow 12/7/2015, p. 4 Data recorded by Reading Tutor Sessions from Nov. 96 to May 97 (excluding outliers) 29 to 57 sessions per kid, averaging 14 minutes 29 to 57 sessions per kid, averaging 14 minutes Not used during vacations, downtime, absences Not used during vacations, downtime, absences 6 gigabytes of data.WAV files of kids’ spoken utterances.WAV files of kids’ spoken utterances.SEG files of time-aligned speech recognizer output.SEG files of time-aligned speech recognizer output.LOG files of Reading Tutor events.LOG files of Reading Tutor events

Carnegie Mellon Mostow 12/7/2015, p. 5 What to evaluate? Usability (can kids use it?) 1993 Wizard of Oz experiments 1993 Wizard of Oz experiments Lab and in-school user tests of successive versions Lab and in-school user tests of successive versions Assistiveness (do kids perform better with than without?) 1994 Reading Coach boosted comprehension by ~20% 1994 Reading Coach boosted comprehension by ~20% But: evaluation obtrusive, costly, sparse, subjective, noisy But: evaluation obtrusive, costly, sparse, subjective, noisy Learning (do kids improve over time?) Within tutor: this talk Within tutor: this talk On unassisted reading: pre-/post-test by school On unassisted reading: pre-/post-test by school More than with alternatives: future studies More than with alternatives: future studies

Carnegie Mellon Mostow 12/7/2015, p. 6 How should the Reading Tutor evaluate learning? Evaluation should be Ecologically valid -- based on normal system use Ecologically valid -- based on normal system use Authentic -- student chooses material Authentic -- student chooses material Unobtrusive -- invisible to student Unobtrusive -- invisible to student Automatic -- objective, cheap Automatic -- objective, cheap Fast -- computable in real-time on PC Fast -- computable in real-time on PC Robust -- to student, recognizer, and tutor behavior Robust -- to student, recognizer, and tutor behavior Data-rich -- based on many observations Data-rich -- based on many observations Sensitive -- detect subtle effects Sensitive -- detect subtle effects So estimate improvement in assisted performance

Carnegie Mellon Mostow 12/7/2015, p. 7 How to estimate performance? Accuracy = % of text words matched by recognizer output Coarse-grained Coarse-grained Sensitive to missed words Sensitive to missed words Doesn’t penalize requests for help Doesn’t penalize requests for help Inter-word latency = time interval between aligned text words Finer-grained Finer-grained Sensitive to hesitations, insertions Sensitive to hesitations, insertions Robust to many speech recognizer errors Robust to many speech recognizer errors

Carnegie Mellon Mostow 12/7/2015, p. 8 Estimation of accuracy and latency (Nov. 96 example from video) Text: If the computer thinks you need help, it talks to you. Student said: if the computer...takes your name...help it...take...s to you Recognizer heard: IF THE COMPUTER THINKS YOU IF THE HELP IT TO TO YOU Tutor estimated 81% accuracy; inter-word latencies: If the computer thinks you need…help, it talks...to you. ? cs

Carnegie Mellon Mostow 12/7/2015, p. 9 Improvement in accuracy and latency (same kid reads “help” in May 97) Text: When some kids jump rope, they help other people too. Student said: when some kids jump rope they help other people too Recognizer heard: WHEN SOME KIDS JUMP ROPE THEY HELP OTHER PEOPLE TOO Tutor estimated 100% accuracy; inter-word latencies: When some kids jump rope, they help other people too. ? cs

Carnegie Mellon Mostow 12/7/2015, p. 10 Which performance improvements count? Echoing the sentence doesn’t count. So look only at the first try. So look only at the first try. Picking stories with easier words doesn’t count. So look at changes on the same word. So look at changes on the same word. Memorizing the story doesn’t count. So look only at encounters of words in new contexts. So look only at encounters of words in new contexts. Remembering recent words doesn’t count. So look only at the first time a word is seen that day. So look only at the first time a word is seen that day.

Carnegie Mellon Mostow 12/7/2015, p. 11 firstlast Accuracy increased 16% on same word from first to last day seen in new context

Carnegie Mellon Mostow 12/7/2015, p. 12 firstlast Latency decreased 35% on same word from first to last day read in new context

Carnegie Mellon Mostow 12/7/2015, p. 13 Is accuracy and latency estimation... Ecologically valid? Reading Tutor used in school Authentic? kids choose stories Unobtrusive? evaluate assisted reading invisibly Automatic? align recognizer output against text Fast? real-time on Pentium Robust? to much student, recognizer, and tutor behavior Data-rich? utterances, aligned words Sensitive? detects significant but subtle effects (< 0.1 sec)

Carnegie Mellon Mostow 12/7/2015, p. 14 Conclusion Does the Reading Tutor help? Yes, with assisted reading Yes, with assisted reading Transfers to unassisted reading! Transfers to unassisted reading! Research questions: Who benefits how much, when, and why? Who benefits how much, when, and why? How should we improve the Tutor? How should we improve the Tutor? For more information: