Modeling Student Knowledge Using Bayesian Networks to Predict Student Performance By Zach Pardos, Neil Heffernan, Brigham Anderson and Cristina Heffernan.

Slides:



Advertisements
Similar presentations
Mastery Learning is a style of assessment in which the student must demonstrate mastery of the assignment by correctly answering a certain number of problems.
Advertisements

Test Development.
The Research Consumer Evaluates Measurement Reliability and Validity
What is a Good Test Validity: Does test measure what it is supposed to measure? Reliability: Are the results consistent? Objectivity: Can two or more.
Advanced Topics in Standard Setting. Methodology Implementation Validity of standard setting.
MODULE 3 1st 2nd 3rd. The Backward Design Learning Objectives What is the purpose of doing an assessment? How to determine what kind of evidences to.
Effective Skill Assessment Using Expectation Maximization in a Multi Network Temporal Bayesian Network By Zach Pardos, Advisors: Neil Heffernan, Carolina.
Using an Intelligent Tutoring System to increase parent engagement in student learning By Zach Broderick, Christine O’Connor, Courtney Mulcahy, Cristina.
Computer Science Department Jeff Johns Autonomous Learning Laboratory A Dynamic Mixture Model to Detect Student Motivation and Proficiency Beverly Woolf.
The ASSISTment Project Trying to Reduce Bottom-out hinting: Will telling students how many hints they have left help? By Yu Guo, Joseph E. Beck& Neil T.
Towards Designing a User-Adaptive E-Learning System By Leena Razzaq, Neil Heffernan & Robert Lindeman This work-in-progress presents the groundwork for.
Addressing the Testing Challenge with a Web-Based E - Assessment System that Tutors as it Assesses Nidhi Goel Course: CS 590 Instructor: Prof. Abbott.
Conclusion Our prediction model did a good job at predict 8 th grade math proficiency. It can be used to estimate 10 th grade score fairly well, too. But.
Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.
Searching for Patterns: Sean Early PSLC Summer School 2007 Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance.
Using Mixed-Effects Modeling to Compare Different Grain-Sized Skill Models Mingyu Feng, Worcester Polytechnic Institute Neil T. Heffernan, Worcester Polytechnic.
Principles of High Quality Assessment
Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.
Standardized Test Scores Common Representations for Parents and Students.
Welcome to the TAYLOR ELEMENTARY SCHOOL Introduction to MCAS.
Determining the Significance of Item Order In Randomized Problem Sets Zachary A. Pardos, Neil T. Heffernan Worcester Polytechnic Institute Department of.
Developing Evaluation Instruments
Georgia Milestones End of Grade (EOG) Assessment Grades 3, 4, and 5 Whitney Shook, Assistant Principal/Testing Coordinator Lori Dunagan, Curriculum Support.
Crafting Effective Multiple Choice Questions A Professional Development Tutorial for Faculty and T.A.’s.
Assessing Students’ Performance Longitudinally: Item Difficulty Parameter vs. Skill Learning Tracking Mingyu Feng, Worcester Polytechnic Institute Neil.
Case Study – San Pedro Week 1, Video 6. Case Study of Classification  San Pedro, M.O.Z., Baker, R.S.J.d., Bowers, A.J., Heffernan, N.T. (2013) Predicting.
Reliability Presented By: Mary Markowski, Stu Ziaks, Jules Morozova.
The Formative Assessment Cycle Solve a selection of problems of a given skill Analysis Students are instantly told if their answers on ASSISTment are correct.
2013 Calendar Year Customer Survey Results
Welcome Cristina Heffernan: Adjunct instructor at WPI, ASSISTment Training Coordinator.
Chapter 1 Measurement, Statistics, and Research. What is Measurement? Measurement is the process of comparing a value to a standard Measurement is the.
THE SAT. What is the purpose of the SAT? Because the United States does not have the same education standards for all states, the SAT provides college.
Reliability vs. Validity.  Reliability  the consistency of your measurement, or the degree to which an instrument measures the same way each time it.
Based on Common Core.  South Carolina will discontinue PASS testing in the year This will be a bridge year for common core and state standards.
Woodman PAT scores Language Arts Acceptable standardExcellence EnrolledWritersProvince(writers)CBE writersEnrolledWritersProvince(writers)CBE.
Formative Assessment Cycle: a Strategy for Communication “Formative assessment is a planned process in which assessment-elicited evidence of students’
Estimating Component Availability by Dempster-Shafer Belief Networks Estimating Component Availability by Dempster-Shafer Belief Networks Lan Guo Lane.
A ssessment & E valuation. Assessment Answers questions related to individuals, “What did the student learn?” Uses tests and other activities to determine.
The Recognition of Prior learning (RPL) Chiangrai Industrial and Community Education College, Thailand OFFICE OF THE VOCATIONAL EDUCATION COMMISSION MINISTRY.
Auto Diagnosing: An Intelligent Assessment System Based on Bayesian Networks IEEE 2007 Frontiers In Education Conference- Global Engineering : Knowledge.
Basic Science Terms  Observation: using the five senses to gather information, which can be proven (facts)  Inference: an opinion based on facts from.
Validity Validity is an overall evaluation that supports the intended interpretations, use, in consequences of the obtained scores. (McMillan 17)
Vincent Briseno EDTC  In a technology-enhanced environment, the MTT demonstrates knowledge of:  Instructional design  Development  Assessment.
An ACT Overview. The ACT and the SAT are both meant to test your knowledge of the fundamentals of a high school education in the United States. Differences.
Newark Central School District Universal Pre-K Update of Enrollment and Progress.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 25, 2012.
Michigan Assessment Consortium Common Assessment Development Series Module 16 – Validity.
Formative Assessment with A free public service of.
Learn about research involving a free web based tutoring system and how it is used to support the formative assessment process in real classrooms. Worcester.
Standards-Based Tests A measure of student achievement in which a student’s score is compared to a standard of performance.
MATHS LESSON PLAN Equations and identity Teacher : Lucia Visciano ICS ‘Giampietro Romano’ – Torre del Greco.
Assessment in Education ~ What teachers need to know.
Core Methods in Educational Data Mining
MCAS-Alt Teacher Consultants
Assessment of Learning 1
How to interact with the system?
Basic Science Terms Observation: using the five senses to gather information, which can be proven (facts) Inference: an opinion based on facts from observations.
Reliability and Validity
Chapter 14 Assembling, Administering, and Appraising classroom tests and assessments.
Week 3 Class Discussion.
Using Bayesian Networks to Predict Test Scores
Math Milestones Information Constructed Response
Detecting the Learning Value of Items In a Randomized Problem Set
CS539: Project 3 Zach Pardos.
Addressing the Assessing Challenge with the ASSISTment System
SAT Math Overview.
Neil T. Heffernan, Joseph E. Beck & Kenneth R. Koedinger
How to interact with the system?
Factors in Tiering Degree of assistance and support
Presentation transcript:

Modeling Student Knowledge Using Bayesian Networks to Predict Student Performance By Zach Pardos, Neil Heffernan, Brigham Anderson and Cristina Heffernan To evaluate the predictive performance of various fine-grained student skill models in the ASSISTment tutoring system using Bayesian networks. Goal ASSISTment is a web-based assessment system for 8 th -10 th grade math that tutors students on items they get wrong. There are 1,443 items in the system. The system is freely available at Question responses from 600 students using the system during the school year were used. Each student completed around 260 items each. The Skill Models The skill models were created for use in the online tutoring system called ASSISTment, founded at WPI. They consist of skill names and associations (or tagging) of those skill names with math questions on the system. Models with 1, 5, 39 and 106 skills were evaluated to represent varying degrees of concept generality. The skill models’ ability to predict performance of students on the system as well as on a standardized state test was evaluated. The five skill models used: WPI-106: 106 skill names were drafted and tagged to items in the tutoring system and to the questions on the state test by our subject matter expert, Cristina. WPI-5 and WPI-39: 5 and 39 skill names drafted by the Massachusetts Department of Education. WPI-1: Represents unidimensional assessment. Background on ASSISTment Predicting student responses within the ASSISTment tutoring system The ASSISTment fine-grained skill models excel at assessment of student skills (see Ming Feng’s poster for a Mixed-Effects approach comparison) Accurate prediction means teachers can know when their students have attained certain competencies. 1. Skill probabilities are inferred from a student’s responses to questions on the system Bayesian Belief Network Student Test Score Prediction Process This work has been accepted for publication at the 2007 User Modeling Conference in Corfu, Greece. SponsorsCollaborators Result: The finer-grained the model, the better prediction accuracy. The finest-grained WPI-106 performed the best with an average of only 5.5% error in prediction of student answers within the system. Result: The finest-grained model, the WPI-106, came in 2 nd to the WPI-39 which may have performed better than the 106 because 50% of its skills are sampled on the MCAS Test vs. only 25% of the WPI-106’s. Predicting student state test scores Conclusions A Bayesian Network is a probabilistic machine learning method. It is well suited for making predictions about unobserved variables by incorporating prior probabilities with new evidence. Bayesian Networks Arrows represent associations of skills with question items. They also represent conditional dependence in the Bayesian Belief Network. Probability of Guess is set to 10% (tutor questions are fill in the blank) Probability of getting the item wrong even if the student knows it is set to 5% 2. Inferred skill probabilities from above are used to predict the probability the student will answer each test question correctly Probabilities are summed to generate total test score. Probability of Guess is set to 25% (MCAS questions are multiple choice) Probability of getting the item wrong even if the student knows it is set to 5%