Carnegie Mellon Project LISTEN 17/22/2004 Some Useful Design Tactics for Mining ITS Data Jack Mostow Project LISTEN (www.cs.cmu.edu/~listen)www.cs.cmu.edu/~listen.

Slides:



Advertisements
Similar presentations
Take a piece of pizza from the counter.
Advertisements

Language Experience Stories
Using the Four Block Literacy Framework for Students with Disabilities
Stages of Literacy Development
Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon.
Strategies that Lead to Transference
Copyright (c) 2003 Allyn & Bacon Chapter 12 Facilitating Reading This multimedia product and its contents are protected under copyright law. The following.
Teaching English Reading in a Bilingual Classroom.
LITERACY IN PRIMARY/JUNIOR DIANE NEWMAN PROFESSIONAL DEVELOPMENT OECTA.
Learning decomposition WARNING. Goals Understand what learning decomposition is –And basic intuition See how it was applied to a variety of problems Think.
The New English Curriculum
Language and Cognition Colombo, June 2011 Day 8 Aphasia: disorders of comprehension.
Developing Active Readers Everyday D.A.R.E
Educational data mining overview & Introduction to Exploratory Data Analysis Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction.
Sigh, smell touch, see, tuh, uh smell They tell us about five senses which we use, in our daily life Touching Seeing Use our nose to smell Uh, apples?
Teaching Phonemic and Phonological Awareness in the early grades Leecy Wise.
Mining Data from Randomized Within-Subject Experiments in an Automated Reading Tutor Joseph E. Beck and Jack Mostow Project LISTEN (
Detecting Prosody Improvement in Oral Rereading Minh Duong and Jack Mostow Project LISTEN Carnegie Mellon University The research.
Carnegie Mellon Project LISTEN16/29/2004 Which Help Helps? Effects of Various Types of Help on Word Learning in an Automated Reading Tutor that Listens.
1 Are all questions created equal?: Factors that influence cloze question difficulty. Brooke Soden Hensler Carnegie Mellon University (starting graduate.
Lessons from generating, scoring, and analyzing questions in a Reading Tutor for children Jack Mostow Project LISTEN (
Carnegie Mellon Project LISTEN17/22/2004 If I Have a Hammer: Computational Linguistics in a Reading Tutor that Listens Jack Mostow Project LISTEN (
Teaching Phonemic Awareness
Gardening Simulation Created by: Sherry Burrill & Laura Hurlbirt OLIT 533 Computer Simulations Dr. Dennis Lester The University of New Mexico Inspired.
Kindergarten Unit On Maps and Globes
By Anna Strole. Research RAND: Reading Study Group Report on reading comprehension Shows that there are 3 domains to comprehension: Instruction Teacher.
1 Reading Comprehension: Strategies for Success Presented by Katie Bain English Language Fellow.
Katie Bain elfellowkbain.wordpress.com.
RUNNING RECORDS GUIDED READING &. © STEPS Professional Development3 THE MULTIDIMENSIONAL MODEL OF READING connecting comparing Reading Strategies synthesising.
® Automatic Scoring of Children's Read-Aloud Text Passages and Word Lists Klaus Zechner, John Sabatini and Lei Chen Educational Testing Service.
Everett Public Schools DOK to Unit Planning LMS Department August 28, 2014.
THE PREDICTIVE ASSESSMENT OF READING (PAR) February 11, 2013 Carrie Malloy & Julie Smith.
Reading and Technology Revised 8/1/2005 Holy Family University Educ 509, Summer II 2005 Adrienne Mosiondz Website:
Framework for Diagnostic Teaching. Framework The framework for diagnostic teaching places a premium on tailoring programs that specifically fit all readers.
Sebrina Carroll, MS SPCAA HS/EHS. What is Family Literacy  Learning to communicate and understand the world through the context of the family.  Family.
C&I 209: MA/RR and Retells- Putting the Pieces Together
Glogster EETT Training Mathew Swerdloff November 30, 2010.
Georgia Pathway to Language and Literacy What is Georgia Pathway? Georgia Pathway is a Community of Practice (CoP) of professionals, advocates, and parents.
Classroom Strategies Classroom Strategies. Our classroom strategies are the most effective ways to build fluency, vocabulary, comprehension, and writing.
Planning Literacy Instruction EDC424 Dr. Julie Coiro.
READING Phonemic Awareness Word Recognition Comprehension Hope M. Koppers.
Characteristics of Sheltered Instruction  Warm, affective environment  High levels of student interaction, including small- group and cooperative learning.
The New English Curriculum September The new programme of study for English is knowledge-based; this means its focus is on knowing facts. It is.
Using Inspiration In Your Classroom S. Taylor-Foley Consultant for Research, Planning and Technology, South Shore District School Board.
Improving the Help Selection Policy in a Reading Tutor that Listens Cecily Heiner, Joseph E. Beck, Jack Mostow Project LISTEN
Using the Early Development Instrument to Support School Readiness NURTURING NEW ROOTS Supporting the Newcomer Family 6 th Ontario Professional Development.
Day 1. Literacy development Why are we here? Historical trends in beginning reading. Language and reading development.
September 8,  Analyze Running Record for Don’t Let the Pigeon Drive the Bus  Review calculating reading level of texts  Practice using Miscue.
Carnegie Mellon Mostow 12/7/2015, p. 1 The Sounds of Silence: Towards Automated Evaluation of Student Learning in a Reading Tutor that Listens Jack Mostow.
Year 1 Reading Workshop. End of Year Expectations Word ReadingComprehension As above and: Letters and Sounds Phases 4 to 5.  Respond speedily with the.
Carnegie Mellon How does the amount of context in which words are practiced affect fluency growth? Experimental results Jack Mostow, Jessica Nelson, Martin.
Qualitative Reading Inventory
FEBRUARY 17, 2014 TCH 264: Emergent Literacy. National Reading Panel NRP was formed in 1997 to research and assess effective literacy instructional practices.
Establishing a reading intervention program Strategies to use with struggling and/or reluctant readers Does reading intervention work for all ages? Reporting.
Goals and Objectives for the Unit Plan. Quick Assessment n Goals & objectives should contain: –Clear Conditions –Observable Behavior –Measurable Criteria.
OCTOBER 16, 2014 Milton School. Decoding Inferential Comprehension Critical Comprehension Love of Reading Literal Comprehension Word Study, Vocabulary,
What is Integrated Group Reading? (2)
What is Integrated Group Reading? (2)
Ready to Read Using Dialogic Reading
Micro-analysis of Fluency Gains in a Reading Tutor that Listens:
Detecting Prosody Improvement in Oral Rereading
Lessons from Project LISTEN: What have we learned from a Reading Tutor that listens? Jack Mostow, Director Project LISTEN ( Carnegie.
CEN3722 Human Computer Interaction Learning
Independent versus Computer-Guided Oral Reading:
An Embedded Experiment to Evaluate the Effectiveness of Vocabulary Previews in an Automated Reading Tutor Jack Mostow, Joe Beck, Juliet Bey, Andrew Cuneo,
Jack Mostow* and Joseph Beck Project LISTEN (
Warm-Up: Take a ¼ sheet of paper.
Educational Data Mining Success Stories
Experimenter-defined measures in a Reading Tutor that Listens
IERI educational data mining panel
Presentation transcript:

Carnegie Mellon Project LISTEN 17/22/2004 Some Useful Design Tactics for Mining ITS Data Jack Mostow Project LISTEN ( Carnegie Mellon University Funding: National Science Foundation ITS 04 Workshop on Analyzing Student-Tutor Interaction Logs to Improve Educational Outcomes, Maceio, Brazil

Carnegie Mellon Project LISTEN 27/22/2004 Outline 1. Project LISTEN’s Reading Tutor 2. Modify tutor to get mineable data 3. Map data stream to analyzable data set 4. Mine data set to discover insights

Carnegie Mellon Project LISTEN 37/22/2004 Project LISTEN’s Reading Tutor (video)

Carnegie Mellon Project LISTEN 47/22/2004 Project LISTEN’s Reading Tutor (video) John Rubin (2002). The Sounds of Speech (Show 3). On Reading Rockets (Public Television series commissioned by U.S. Department of Education). Washington, DC: WETA. Available at

Carnegie Mellon Project LISTEN 57/22/2004 Thanks to fellow LISTENers Tutoring:  Dr. Joseph Beck, mining tutorial data  Prof. Albert Corbett, cognitive tutors  Prof. Rollanda O’Connor, reading  Prof. Kathy Ayres, stories for children  Joe Valeri, activities and interventions  Becky Kennedy, linguist Listening:  Dr. Mosur Ravishankar, recognizer  Dr. Evandro Gouvea, acoustic training  John Helman, transcriber Programmers:  Andrew Cuneo, application  Karen Wong, Teacher Tool Field staff:  Dr. Roy Taylor  Kristin Bagwell  Julie Sleasman Grad students:  Hao Cen, HCI  Cecily Heiner, MCALL  Peter Kant, Education  Shanna Tellerman, ETC Plus:  Advisory board  Research partners  DePaul  UBC  U. Toronto  Schools

Carnegie Mellon Project LISTEN 67/22/2004 Project LISTEN’s Reading Tutor: A rich source of experimental data database:  9 schools  > 200 computers  > 50,000 sessions  > 1.5M tutor responses  > 10M words recognized  Embedded experiments  Randomized trials

Carnegie Mellon Project LISTEN 77/22/2004 Modify tutor to get mineable data Log operations at grain size and level of interest  Click at time t: motor control  Click “Goldilocks”: item selection Reify operations to log them analyzably  Handwriting or speech  typed input  Freehand drawing  graphical palette (Geometry Tutor)  Free-form responses  menu selection (Self 88)  Natural language  sentence starters (Goodman 03) Time student and tutor actions  Time allocation reflects motivation (ITS 02)  Hasty responses indicate guessing (TICL 04)  Latency reflects automaticity (TICL 04)

Carnegie Mellon Project LISTEN 87/22/2004 Modify tutor: add relevant data Randomize tutorial decisions  What skill to test, what help to give Probe skills  Assess cognitive development (Arroyo 00)  Test vocabulary words (IJAIE 01)  Insert automated comprehension questions (TICL 04) Import student data  Gender, age, IQ (Shute 96)  Prior knowledge (Corbett 00)  Pretest scores (TICL 04) Hand-label when appropriate  Transcribe (some) spoken input (FLET 04)

Carnegie Mellon Project LISTEN 97/22/2004 Modify tutor: an example Randomize: explain some new words but not others. Probe: test each new word the next day. Did kids do better on explained vs. unexplained words?  Overall: NO; 38%  36%, N = 3,171 trials (IJAIE 01).  Rare, 1-sense words tested 1-2 days later: YES! 44% >> 26%, N = 189.

Carnegie Mellon Project LISTEN 107/22/2004 Map data stream to data set: structure data into a single type  Data stream: heterogeneous events over time  Data set: elements with the same features Segment into shorter episodes  Tutorial action(s) + student response (Beck 00) Slice into narrower strands  Successive encounters of a specific word (AMLDP 98)  Successive instances of a specific skill (learning curves) Measure aggregated events  Allocation of time among activities (ITS 02) Formulate data as experimental trials  Context where the trial occurred  Decision made in this trial  Outcome based on subsequent events

Carnegie Mellon Project LISTEN 117/22/2004 Data stream: Map data stream to data set: Formulate data as experimental trials Outcome: read fluently? Decision (randomized) Student clicks ‘read.’ ‘I love to read stories.’ ‘People sit down and …’ ‘… read a book.’ Student is reading a story Student needs help on a word Tutor chooses what help to give Student continues reading Student sees word in a later sentence Time passes… Context:

Carnegie Mellon Project LISTEN 127/22/2004 Map data stream to data set: trials Context:Decision: Outcome:

Carnegie Mellon Project LISTEN 137/22/2004 Mine data set to make discoveries Count outcome frequency  Success rate of each help type (ICALL 04) Fit a parametric model  Knowledge tracing (Corbett 95) Train a model  Statistics, e.g. regression (TICL 04)  Machine learning, e.g. decision trees (AIED 01)

Carnegie Mellon Project LISTEN 147/22/2004 Count outcome frequency: which help types worked best? Same day:Later day: Grade 1 words:Say In ContextSay In Context, Onset Rime Grade 2 words:Say In ContextSay In Context, Rhymes With Rhymes With Grade 3 words:Say In ContextRhymes WithRhymes With, One Grapheme One Grapheme  Best: Rhymes With 69.2% ± 0.4%Rhymes With  Worst: Recue 55.6% ± 0.4%Recue Compare within level to control for word difficulty. Supplying the word helped best in the short term… But rhyming hints had longer lasting benefits.

Carnegie Mellon Project LISTEN 157/22/2004 Summary: modify, map, mine. 1. Modify tutor to make data mineable.  Log, reify, time, hand-label, import, probe, randomize. 2. Map data streams to data sets.  Segment, slice, measure. 3. Mine data set to make discoveries.  Count, fit, train. See videos, papers, etc. at Thank you! Questions?

Carnegie Mellon Project LISTEN 167/22/2004 Modify tutor to get mineable data word features

Carnegie Mellon Project LISTEN 177/22/2004 Structure of Reading Tutor database Story Encounter List storiesPick stories Sentence Encounter Read sentence Show one sentence at a time Word Encounter Read each word Listens and helps StudentReading Tutor Session Login List readers

Carnegie Mellon Project LISTEN 187/22/2004 Map data stream to data set: formulate data as experimental trials ContextDecisionOutcome Student is stuck Prompt or cough? Next event in dialog FF 2000 Before a new word Explain it or not? Test word next day IJAIE 01 Click on wordWhat help to give? Word read OK next time? SSSR 04  Context where the trial occurred  Decision made in this trial  Outcome based on subsequent events

Carnegie Mellon Project LISTEN 197/22/2004 Learning curves for students’ help requests Try to predict subset  Grade 1-2 level  1-6 prior encounters Selected data  53 students  175,961 words  29,278 help requests Train predictive model  Count help requests 5x  Predict other kids’ data  71% accuracy

Carnegie Mellon Project LISTEN 207/22/2004 Count outcome frequency (average success rate 66.1%) Whole word:  24,841 Say In ContextSay In Context  56,791 Say WordSay Word Decomposition:  6,280 SyllabifySyllabify  14,223 Onset RimeOnset Rime  19,677 Sound OutSound Out  22,933 One GraphemeOne Grapheme Analogy:  13,165 Rhymes WithRhymes With  13,671 Starts LikeStarts Like Semantic:  14,685 RecueRecue  2,285 Show PictureShow Picture  488 Sound EffectSound Effect Which types stood out?  Best: Rhymes With 69.2% ± 0.4%Rhymes With  Worst: Recue 55.6% ± 0.4%Recue Example: ‘People sit down and read a book.’