Searching for Patterns: Sean Early PSLC Summer School 2007 Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance.

Slides:



Advertisements
Similar presentations
Mastery Learning is a style of assessment in which the student must demonstrate mastery of the assignment by correctly answering a certain number of problems.
Advertisements

What Behaviors Indicate a Student is Meeting Course Goals and Objectives? “Indicators”
Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon.
Smarter Balanced Assessment: What do parents need to know?
Educational data mining overview & Introduction to Exploratory Data Analysis Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction.
Knowledge Inference: Advanced BKT Week 4 Video 5.
Objective: Demonstrate mastery of cycle content. Today students will take an assessment on the whole cycle's content. They will first reflect briefly on.
Modeling Student Knowledge Using Bayesian Networks to Predict Student Performance By Zach Pardos, Neil Heffernan, Brigham Anderson and Cristina Heffernan.
Professional Performance Process OPD 863 – Using the Rubric Our Journey Continues! 21 st Century Supervision and Evaluation.
MATH ASSESSMENT TEST OCMA May, HISTORY OF MAT Test originally developed in late 60’s.
Effective Skill Assessment Using Expectation Maximization in a Multi Network Temporal Bayesian Network By Zach Pardos, Advisors: Neil Heffernan, Carolina.
The ASSISTment Project Trying to Reduce Bottom-out hinting: Will telling students how many hints they have left help? By Yu Guo, Joseph E. Beck& Neil T.
Addressing the Testing Challenge with a Web-Based E - Assessment System that Tutors as it Assesses Nidhi Goel Course: CS 590 Instructor: Prof. Abbott.
Learning Characteristics in Math Chapter 7. Cognitive Deficits in Math Neurological deficits: conflicting results across studies Memory deficits: perceptual.
Conclusion Our prediction model did a good job at predict 8 th grade math proficiency. It can be used to estimate 10 th grade score fairly well, too. But.
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
Educational data mining overview & Introduction to Exploratory Data Analysis with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer.
Who is Really Responsible for On-line Students’ Technical Support? James R. Lackey, Ph.D. Oklahoma State University Stillwater, Oklahoma.
Educational Data Mining and DataShop John Stamper Carnegie Mellon University 1 9/12/2012 PSLC Corporate Partner Meeting 2012.
AN EVALUATION OF THE EIGHTH GRADE ALGEBRA PROGRAM IN GRAND BLANC COMMUNITY SCHOOLS 8 th Grade Algebra 1A.
Welcome to the TAYLOR ELEMENTARY SCHOOL Introduction to MCAS.
Common Core State Standards. What are Standards? They ensure that students are ready to be successful in school and in the workforce. They set the tone.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Elements of Multiple Regression Analysis: Two Independent Variables Yong Sept
Learning and Teaching Conference 2014 Remembering the Student in Student Satisfaction Dr David Bowles.
The Stock Market Eleanor Skinner Ganesha High School Pomona, California CTAP 295 Teaching with Technology.
Theodore C. Clark Oroville High School, Oroville, CA.
Case Study – San Pedro Week 1, Video 6. Case Study of Classification  San Pedro, M.O.Z., Baker, R.S.J.d., Bowers, A.J., Heffernan, N.T. (2013) Predicting.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 2, 2012.
2005 CALIFORNIA CASH FOR COLLEGE A Project of the California Student Aid Commission Workshop Evaluation Results April 30, 2005.
Copyright © 2008 Pearson Education MathXL ® for School Teacher Training Series MathXL ® for School Overview.
Case Study Presentation
May 13, 2011 Getting to Know the Common Core State Standards (CCSS)
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
CJT 765: Structural Equation Modeling Class 12: Wrap Up: Latent Growth Models, Pitfalls, Critique and Future Directions for SEM.
1 lending ssessment with nstruction rogram /
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Terje Manger Beate Buanes Roth In co-operation with Arve Asbjørnsen, Ole Johan Eikeland, and Lise Jones Bergen Cognition and Learning Group University.
Data mining with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University.
D/RS 1013 Data Screening/Cleaning/ Preparation for Analyses.
Welcome to Astronomy 113 “ It would seem that you have no useful skill or talent whatsoever, he said.
Blended Value Accounting & Social Enterprise Success John Anner, PhD SIERC Annual Conference Auckland, New Zealand 12 February, 2016.
Construction Engineering 221 Probability and Statistics.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
BUS 308 Entire Course (Ash Course) FOR MORE CLASSES VISIT BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308 Week 1.
Inferential Statistics Psych 231: Research Methods in Psychology.
CIS 115 ALL EXERCISES DEVRY UNIVERSITY (DEVRY)  CIS 115 All Exercises Devry University CIS 115 ALL ILABS DEVRY UNIVERSITY (DEVRY)  CIS 115 All iLabs.
Math 1316 Prof. Thomas Jay. Homework: Homework problems are designed to reinforce the lecture/reading. If you miss a class, you should attempt to do.
LEARNING GOALS AND SCALES. LEARNING GOALS FOR TODAY Teachers will understand the characteristics of learning goals. Teachers will understand the difference.
Data-Driven Education
Math for Mastery Information
BUS 308 mentor innovative education/bus308mentor.com
Correlation & Regression
Math 1316 Prof. Thomas Jay.
Question 1 What are students graded based on?
Using Learning Analytics in Personalized Learning
Concept of Test Validity
Week 1 Gates Introduction to Information Technology cosc 010 Week 1 Gates
Big Data, Education, and Society
Addressing the Assessing Challenge with the ASSISTment System
Neil T. Heffernan, Joseph E. Beck & Kenneth R. Koedinger

Multiple Regression Chapter 14.
Mike Timms and Cathleen Kennedy University of California, Berkeley
What was your favorite/least favorite part of summer?
OSSLT Get Ready!.
Model selection Stepwise regression.
What’s the plan? First, we are going to look at the correlation between two variables: studying for calculus and the final percentage grade a student gets.
Presentation transcript:

Searching for Patterns: Sean Early PSLC Summer School 2007 Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance score? Method: Logistic Regression using LFA preliminary analysis to determine assistance score Outcome: Deeply aggregated data shows that both are good predictors (p<.001); Complete case analysis incomplete due to hurdles [data scrubbing, software, insufficient memory]

Searching for Patterns Sean Early University of Southern California PSLC Summer School 2007 Data Mining Track June 22, 2007

Background Assistments data set from school year (Heffernan) N=179 MCAS 39 step table Data covers one year of 8 th grade math online tutoring system results

Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance score?

Method Logistic Regression using LFA preliminary analysis to determine assistance score LFA represents an extension of Rasch modeling such that the probability of a correct response is equal to the underlying ability of the student, the difficulty of the knowledge component, and the number of opportunities that the student has had to respond correctly to that knowledge component

Model The Learning Factors Algorithm can be represented as: ln(p/1-p)= α i + β j + Γ j t-1

Results Preliminary results using standard logistic regression showed no significant difference in the predictive power of errors and requests for assistance These variables were significantly correlated with each other, and with post- test such that the fewer errors or hints, the better the final performance

Unanswered Questions The simple logistic regression model leaves much to be desired. It fails to account for the alpha term (where the student’s initial skill level was) or for the rate of growth across time. Both of these questions can be addressed through the LFA model, as the alpha term represents an intercept term while the gamma value represents opportunities to learn before mastery.

Data Mining Hurdles I Didn’t See Coming: or, What I Wish I Knew Then Data scrubbing (deleting repeated cases due to multiple knowledge components in one problem) needs to happen before creating aggregate data sets Excel is more powerful than I thought (Pivot Tables may indeed be the best thing MS has given us) 65,000 cases of information applied to a logistic regression algorithm makes my computer seize up with a case of the “not todays” Codebooks are our friends

Final Thoughts For those of us who believe in the mastery v. performance orientation framework, please take my lack of a more concrete product to share as evidence of my mastery orientation. I feel as though I learned a great deal this week. I also feel that with about 10 more hours of work and a more powerful machine at my side, I just may be able to answer my initial question in a way that captures some of the more interesting subtleties in the data set.