Learning decomposition WARNING. Goals Understand what learning decomposition is –And basic intuition See how it was applied to a variety of problems Think.

Slides:



Advertisements
Similar presentations
Structural Equation Modeling
Advertisements

Brief introduction on Logistic Regression
Logistic Regression Psy 524 Ainsworth.
Topic 12 – Further Topics in ANOVA
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
The Research Consumer Evaluates Measurement Reliability and Validity
1 Reliability in Scales Reliability is a question of consistency do we get the same numbers on repeated measurements? Low reliability: reaction time High.
Teaching Reluctant Students to Read A CTAP project by Cindy Ranney, Elementary Special Education Teacher, Portola, California.
EVAAS EVALUATION EFFECTIVENESS …AND WHAT DOES IT SAY??? Brawley Middle School November 27, 2012.
INFERENTIAL STATISTICS. Descriptive statistics is used simply to describe what's going on in the data. Inferential statistics helps us reach conclusions.
Oral Presentation Rubrics Standards-based Assessment of and for Learning.
Business Statistics for Managerial Decision
Realism in Assessment of Effort Estimation Uncertainty: It Matters How You Ask By Magne Jorgensen IEEE Transactions on Software Engineering Vol. 30, No.
Confidence Intervals for Proportions
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Top 10 Myths and Concerns About the CRT Brought to you today by Nevada 8th Graders.
Who are the participants? Creating a Quality Sample 47:269: Research Methods I Dr. Leonard March 22, 2010.
Mining Data from Randomized Within-Subject Experiments in an Automated Reading Tutor Joseph E. Beck and Jack Mostow Project LISTEN (
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Evaluating Hypotheses
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
1 Overview of Major Statistical Tools UAPP 702 Research Methods for Urban & Public Policy Based on notes by Steven W. Peuquet, Ph.D.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Extensive Reading Research in Action
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Overview of Major Statistical Tools UAPP 702 Research Methods for Urban & Public Policy Based on notes by Steven W. Peuquet, Ph.D. 1.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.
Making a great Project 2 OCR 1994/2360. Analysis This is the key to getting it right. Too many candidates skip through this section. It’s worth 20% of.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 5: Methods for Assessing Associations.
Chapter 5 Building Assessment into Instruction Misti Foster
Improving the Help Selection Policy in a Reading Tutor that Listens Cecily Heiner, Joseph E. Beck, Jack Mostow Project LISTEN
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Part III Taking Chances for Fun and Profit Chapter 8 Are Your Curves Normal? Probability and Why it Counts.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Chapter 1 Statistical Thinking What is statistics? Why do we study statistics.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Ensembles. Ensemble Methods l Construct a set of classifiers from training data l Predict class label of previously unseen records by aggregating predictions.
Working In Groups: An Overview Dr. Rob Danin English Language Specialist
Stat 13, Tue 5/29/ Drawing the reg. line. 2. Making predictions. 3. Interpreting b and r. 4. RMS residual. 5. r Residual plots. Final exam.
Robust Estimators.
Assessment Information from multiple sources that describes a student’s level of achievement Used to make educational decisions about students Gives feedback.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Variables It is very important in research to see variables, define them, and control or measure them.
Assessment Procedures for Counselors and Helping Professionals, 7e © 2010 Pearson Education, Inc. All rights reserved. English Language Learners Assessing.
1 Modeling change Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy
Machine Learning in Practice Lecture 6 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 Measuring Poverty: Inequality Measures Charting Inequality Share of Expenditure of Poor Dispersion Ratios Lorenz Curve Gini Coefficient Theil Index Comparisons.
Chapter 26 Inferences for Regression. An Example: Body Fat and Waist Size Our chapter example revolves around the relationship between % body fat and.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Copyright © 2010 Pearson Education, Inc. Slide
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Statistics 19 Confidence Intervals for Proportions.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
¿What's The Best Way To Teach Children To Read? According To The National Reading Panel.
EXPERIMENTAL RESEARCH
Question So, I’ve done my factor analysis.
Journal Suppose you were asked to select the best person to be your teacher from among a group of applicants. How would you go about making the selections?
Jack Mostow* and Joseph Beck Project LISTEN (
Educational Data Mining Success Stories
Model generalization Brief summary of methods
Confidence Intervals for Proportions
Confidence Intervals for Proportions
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Learning decomposition WARNING

Goals Understand what learning decomposition is –And basic intuition See how it was applied to a variety of problems Think about how to apply it to your data

Introduction to Reading Tutor More free-form than most of the cognitive tutors Random interventions Kids or tutor can initiate help Turn taking Never quite sure what student is trying to do

Project LISTEN’s Reading Tutor

What is a practice opportunity? (and are they all equally valuable?) Before story, tutor teaches ‘elephant’ Student sees word ‘elephant’ in sentence Student clicks for help on it Student reads it ‘Elephant’ occurs twice in the next sentence How many practice opportunities? Did instruction have any benefit? Did seeing word immediately afterwards help?

Procedure Determine (hopefully motivated) learning decompositions Find data that reflect learning Solve as a non-linear regression model –Fit model to each student Interpret model coefficient

Question: Does learner control result in more learning? In Reading Tutor, students pick story half the time Tutor picks other half Tutor selects stories much faster than student Suspect motivational benefit from learner control (willing to tolerate system over a school year) Is there a cognitive benefit? Compare learning of words that occur in student- vs. tutor-selected stories.

Find data that reflect learning Students perform many actions Only want those that indicate “real” learning to count Assumptions –First opportunity each day is purest marker Albert, Ken, and Joe have all observed difficulties with closely space trials –Don’t count stories student has already read Need outcome measure –Fuse accuracy, speed, and help performance

Approach Day Asked for help? Reading time (Sec) Prior opportunities Outcome Student selected Tutor selected 1Yes No No No No No0.332

Procedure Determine (hopefully motivated) learning decompositions Find data that reflect learning Solve as a non-linear regression model –Fit model to each student Interpret model coefficient (B)

Learning curves Better Worse Performance = Ae -bt Input: number of prior trials (t) Output: expected performance

What if all trials aren’t equal? Normal model = Ae -bt Think about student vs. tutor chosen story –t1 = trials where student chose story –t2 = trials where tutor chose story Learning decomp model = Ae -b(t1+B*t2) –B determines relative efficacy of trials of type t1 and t2

Use regression to find relative weight of tutor-selected prior opportunities Day Asked for help? Reading time (Sec) Prior opportunities Outcome Student selected Tutor selected 1Yes No No No No No0.322

Fit model to each student’s data StudentB Chris Smith0.3 Pat Johnson1.2 Sam Jackson0.5 Jessie Stevens0.9 Reagan Ronald0.7

Interpret B parameter B is scaling parameter –B>1  students benefit from tutor control –B  1  no benefit either way –B<1  student control is better B  0.8 for tutor-chosen stories (median) –Students learn more from student chosen stories (not my H0) What could be other causes of result?

Which students benefit? Top-down approach: –Think of plausible subgroups –See how/if B varies among them E.g. 1 st grader had 0.98, 2 nd graders 0.89, and 3 rd graders 0.49 –Suggests older kids benefit more (getting pickier?) Many possibilities, want to avoid fishing expedition

Which students benefit? Bottom- up approach StudentBBenefits from tutor control? Chris Smith0.3No Pat Johnson1.2Yes Sam Jackson0.5No Jessie Stevens0.9? Reagan Ronald0.7No Use regression results as training labels for classifier Predictors: –Gender –Grade –Test score (grade normed) –Disability status Boys benefit from learner control

Other learning decompositions: practice effects Open debate if more learning from rereading stories or reading new stories Generally believed spaced practice better for long term retention (but not short) Results –Reading new material better than rereading old stories (B = 0.5) –Later practice opportunities on same day are ineffective (B = 0.2)

Other learning decompositions: impact of instruction Reading Tutor has a bunch of random bits of instruction Do they do anything? –Solution: model instruction as an encounter and give it a weight Impact of instruction (in progress) –Spelling intervention worth 0.75 exposures –Word ID intervention worth 0.36 exposure –Neither is particularly effective –(but, first analytic approach to find any effect)

Using learning decomposition to model transfer (Xiaonan Zhang) How do students represent words? –Naïve model: words are independent –What about “cat” vs “cats”? Alternate models: –Word roots (cats, cat  CAT) –Rimes (bat, cat  AT) T1 = # prior times have read word T2 = # prior times have read root T3 = # prior times have read rime Substantial transfer at level of word root –55% as good as seeing the word itself

Hopefully Understand approach –Think of two types of learning that may have unequal impact –Divide up trials –Perform curve fitting See that it applies to variety of problems But…

Concerns We say things like “rereading is not as effective as reading different stories” –Is it safe to make causal inference from observational data? Wide- vs. Re-reading: troublesome –What if lower proficiency is true cause? Massed vs. Distributed practice: ok (?) Student vs. Tutor control: ok Interventions: ok What about student initiated help?

Interesting view (Jack Mostow) Each student has a B parameter E.g. Chris Smith has B=0.3 for rereading –  Chris Smith learns 30% as much from rereading as wide reading –Impossible for traits of Chris Smith to be a confound (proficiency, disability, etc.) –But, states could still be a problem E.g. Chris only rereads after sleeping poorly

Compare LFA and Learning Decomposition Similar: –Use learning curves and performance data –Insight: a model that better predicts student performance is a better model of student’s mental processes (modulo complexity) Different: –Bottom-up vs. top-down –Each manipulates different aspect of representation

Bottom-up vs. Top-down Learning decomposition –Start with theory-driven idea –Estimate effect (if any) –No search LFA –Start with variety of factors –Perform search –Might not correspond to higher level construct Not necessarily a bad thing

Consider transfer at level of word roots Learning decomp: –Student exposure to words of same root is 55% as good as seeing the word i.e. cats, cats, cat  cat, cat i.e. accepts, accepts, accept  accept, accept LFA –Cats and cat are same skill (perfect transfer) i.e. cats, cats, cat > cat, cat –Accepts and accept are different skills i.e. accepts, accepts, accept < accept, accept

Consider transfer at level of word roots Learning decomp: –Student exposure to words of same root is 55% as good as seeing the word i.e. cats, cats, cat = cat, cat i.e. accepts, accepts, accept = accept, accept LFA –Cats and cat are same skill (perfect transfer) i.e. cats, cats, cat > cat, cat –Achieve and achieving are different skills i.e. accepts, accepts, accept < accept, accept

Consider transfer at level of word roots Learning decomp: –Student exposure to words of same root is 55% as good as seeing the word i.e. cats, cats, cat = cat, cat i.e. accepts, accepts, accept = accept, accept LFA –Cats and cat are same skill (perfect transfer) i.e. cats, cats, cat > cat, cat –Achieve and achieving are different skills i.e. accepts, accepts, accept < accept, accept

Student learning history SkillPrior practice opportunities Skill Skill20 Skill30 Skill13 Skill21

Learning factors analysis SkillPrior practice opportunities Skill Skill20 Skill30 Skill13 Skill21 Did student utilize skill1 here? Is it better to think of it as skill1’?

Learning decomposition SkillPrior practice opportunities Skill Skill20 Skill30 Skill13 Skill21 Did the student really have 3 prior practice opportunities? = 3, but is there a better way of counting?

Wrapup Why model individual points Scope of learning decomposition How learning decomp differs from LFA