Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 4, 2013.

Slides:



Advertisements
Similar presentations
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 23, 2012.
Advertisements

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 30, 2012.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 12, 2012.
Educational Data Mining Overview Ryan S.J.d. Baker PSLC Summer School 2010.
Next Semester CSCI 5622 – Machine learning (Matt Wilder)  great text by Hastie, Tibshirani, & Friedman great text ECEN 5018 – Game Theory ECEN 5322 –
2-May-15 GUI Design. 2 HMI design There are entire college courses taught on HMI (Human-Machine Interface) design This is just a very brief presentation.
Knowledge Inference: Advanced BKT Week 4 Video 5.
CSE 5522: Survey of Artificial Intelligence II: Advanced Techniques Instructor: Alan Ritter TA: Fan Yang.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 27, 2012.
Ryan S.J.d. Baker Adam B. Goldstein Neil T. Heffernan Detecting the Moment of Learning.
Week 8 Video 4 Hidden Markov Models.
Effective Skill Assessment Using Expectation Maximization in a Multi Network Temporal Bayesian Network By Zach Pardos, Advisors: Neil Heffernan, Carolina.
Explicit Direct Instruction Critical Elements. Teaching Grade Level Content  The higher the grade the greater the disparity  Test Scores go up when.
CS 280 Data Structures Professor John Peterson. Goals Understand “Programming in the small” Java programming Know what’s under the hood in complex libraries.
Searching for Patterns: Sean Early PSLC Summer School 2007 Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance.
CSC 160 Computer Programming for Non-Majors Introduction Prof. Adam M. Wittenstein
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
Discussion examples Andrea Zhok.
CSE 515 Statistical Methods in Computer Science Instructor: Pedro Domingos.
Feature Engineering Studio February 23, Let’s start by discussing the HW.
Determining the Significance of Item Order In Randomized Problem Sets Zachary A. Pardos, Neil T. Heffernan Worcester Polytechnic Institute Department of.
Meta-Cognition, Motivation, and Affect PSY504 Spring term, 2011 January 19, 2010.
BIT 115: Introduction To Programming1 Sit in front of a computer Log in –Username: 230class –password: –domain: student Bring up the course web.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Meta-Cognition, Motivation, and Affect PSY504 Spring term, 2011 January 13, 2010.
Student Forum March5, pm - Collaborate Students will share their thoughts on topics including: --experiences with online courses --ways instructors.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 13, 2012.
CSc 2310 Principles of Programming (Java) Dr. Xiaolin Hu.
Invitation to Computer Science, Java Version, Second Edition.
1 Knowledge Management, Representation and Reasoning Specialism Advice Knowledge Management, Representation and Reasoning Specialism Advice MSc in AI Jessica.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 2, 2012.
Acadia Institute for Teaching and Technology1 Creating a Balanced Course.
English Opportunity By Ron Rogge. Problem - Solution 1.The initial impetus for this idea came from my observation that students lost English ability over.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 4, 2013.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 26, 2012.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 January 28, 2013.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Grade Book Database Presentation Jeanne Winstead CINS 137.
Meeting Management/Planning. Today Go over basics of meeting management Introduce key elements of creating a plan.
Feature Engineering Studio March 1, Let’s start by discussing the HW.
What is ALEKS? Who uses ALEKS? How is ALEKS being used? Teacher Feedback Recommendations.
"I pledge allegiance to the flag of the United States of America, and to the Republic for which it stands: one Nation under God, indivisible, with Liberty.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Feature Engineering Studio September 9, Welcome to Feature Engineering Studio Design studio-style course teaching how to distill and engineer features.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 6, 2013.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 25, 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Systems Analyst (Module V) Ashima Wadhwa. The Systems Analyst - A Key Resource Many organizations consider information systems and computer applications.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Dr. Ed Lovitt – Director Distance Learning Johnson County Community College Requiring an Orientation for New Online Students.
Welcome to MM212! Unit 1 Seminar. MM212 Unit 1 Seminar Agenda Welcome and Syllabus Review Classifying Numbers Operations with Real Numbers Arithmetic.
ON LINE TOPIC Assessment.  Educational assessment is the process of documenting, usually in measurable terms, knowledge, skills, attitudes and beliefs.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
PeerWise Student Instructions
as presented on that date, with special formatting removed
CSc 1302 Principles of Computer Science II
Special Topics in Educational Data Mining
HUDK5199: Special Topics in Educational Data Mining
Core Methods in Educational Data Mining
Detecting the Learning Value of Items In a Randomized Problem Set
Big Data, Education, and Society
Neil T. Heffernan, Joseph E. Beck & Kenneth R. Koedinger
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Presentation transcript:

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 4, 2013

Announcement There has been a change in the course schedule – Explanation for why provided in class The biggest change – Assignment 2 is now due on February 13 instead of February 18 – If this creates specific difficulties for anyone, please me in the next 2 days to request an extension – I’ll grant an extension for any good reason

Announcement #2 If you are not subscribed to the Learning Analytics Seminar Series at TC – Last semester we had Taylor Martin, Neil Heffernan, and Ilya Goldin – This semester we will have Alex Bowers and Samuel Greiff And you want to be on the list Please me, and I’ll make sure you get added to the list

Announcement #3 I’ve had a request for more support in learning how to use Excel If there’s enough interest, I could arrange a special session on this… – Who would be interested? – I’ll pass around a sign-up sheet

Announcement #3 I’ve had a request for more support in learning how to use Excel If there’s enough interest, I could arrange a special session on this… – Who would be interested? – I’ll pass around a sign-up sheet

Announcement #4 Don’t worry about learning how to compile and run Java It’s not necessary all that much during the semester…

Today’s Class Performance Factors Analysis

What is the key goal of PFA?

Measuring how much latent skill a student has, while they are learning – Expressed in terms of probability of correctness, the next time the skill is encountered – No direct expression of the amount of latent skill, except this probability of correctness How likely is Bob to be able to perform correctly, the next time he sees skill 7?

What is the typical use of PFA? Assess a student’s knowledge of topic X Based on a sequence of items that are dichotomously scored – E.g. the student can get a score of 0 or 1 on each item Where the student can learn on each item, due to help, feedback, scaffolding, etc.

How does PFA differ from BKT?

BKT assesses latent knowledge as well as probability of correctness; PFA only assesses probability of correctness BKT only handles one skill per item (extensions can handle this – we’ll talk about this next week)

Key assumptions Each item may involve multiple latent skills or knowledge components – Different from BKT Each skill has success learning rate  and failure learning rate  There is also a difficulty parameter, but its semantics can vary – more on this later From these parameters, and the number of successes and failures the student has had on each relevant skill so far, we can compute the probability P(m) that the learner will get the item correct

Note The assumption that items may involve multiple skills is not seen in BKT Why might this be a good assumption? Why might this be a bad assumption?

Note The assumption that learning rates are different between cases involving success and failure is not seen in BKT Why might this be a good assumption? Why might this be a bad assumption?

Note PFA has no direct expression of the amount of latent skill, except a probability of correctness Why might this be beneficial? Why might this be non-optimal?

Simple PFA Not actually a variant that Pavlik uses, but will serve as a base to understand the more complex variants that he uses

Simple PFA

Let’s enter this into Excel Using pfa-modelfit-set1-v2.xlsx

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -2 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = 2 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = 0 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -50 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0 gamma(2,3) = 0 rho(1)=0 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0 gamma(2,3) = 0 rho(1)= -0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0.1 gamma(2,3) = 0.1 rho(1)=0 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA What parts of the parameter space are definitely degenerate? What parts of the parameter space are plausible? Are there any gray areas?

 Parameters Pavlik uses three different  Parameters – Item – Item-Type – Skill

 Parameters Pavlik uses three different  Parameters – Item – Item-Type – Skill How does the number of parameters change? – Also, data points/parameters ratio Is there a concern about over-fitting? – We’ll talk about cross-validation later in the semester

Homework Solutions Before we discuss fitting PFA models, let’s go through a few homework solutions I’ve picked a few particularly interesting solutions There’s not time to go through everyone’s solution, but you will get picked on in a future class

High-level comments Don’t save formulas in.csv – “Save as” to.xls or.xlsx

Cameron White Can you walk us through your solution on PFA?

Madeline Weiss Can you walk us through your solution on PFA?

Let’s fit a Simple PFA model to this data pfa-modelfit-set-v2.xlsx Students Let’s start with the simple baseline model We’ll use SSR (sum of squared residuals) as our goodness criterion Note that there are better ways to do this than in Excel, such as Expectation Maximization

Let’s fit the other 3 PFA variants to this data How does goodness of fit change?

Notes Jury is still out on whether PFA is better than other approaches

PFA.vs. BKT PFA beats BKT – Pavlik, Cen, & Koedinger (2009)  A’ = 0.01, 0.01, 0.02, 0.02 – Gong, Beck, & Heffernan (2010)  A’ = 0.01 – Pardos, Baker, Gowda, & Heffernan (in press)  A’ = 0.02 BKT beats PFA – Baker, Pardos, Gowda, Nooraei, & Heffernan (2011)  A’ = 0.03 – Pardos, Gowda, Baker, & Heffernan (2011)  Predicting post-test)  r = 0.24,  RMSE = 0.01

Final Thoughts PFA is a competitor for measuring student skill, which predicts the probability of correctness rather than latent knowledge Not yet used within Intelligent Tutoring Systems – Should it be?

Questions asked in assignments How is model degeneracy discovered? Through seeing different results of running different models or by seeing flaws in the model equation itself or both? When did the shift occur in EDM from making software responsive to student behavior to influence ITS to predictive of student behavior that occurs outside of the closed system in which the EDM data is being collected? I can understand the affordance of the 2-state model in knowledge tracing models. However, based on my experience as a teacher and a learner, I think there are more subtleties inherent in seeing knowledge as being just either learned or unlearned. How do we express and capture these subtleties in future EDM models?

Questions asked in assignments What literature in cognitive science and psychology would be helpful to read in understanding what can be measured by EDM algorithms and models? How does one doing EDM figure out when a model is too sensitive and if there is overfitting? Are there benchmarks to measure data analysis results against to gauge this?

PFA Questions? Comments?

Next Class Wednesday, February 6 Diagnostic Metrics Fogarty, J., Baker, R., Hudson, S. (2005) Case Studies in the use of ROC Curve Analysis for Sensor-Based Estimates in Human Computer Interaction. Proceedings of Graphics Interface (GI 2005), Russell, S., Norvig, P. (2010) Artificial Intelligence: A Modern Approach. Ch. 20: Learning Probabilistic Models.

The End