Latent regression models. Where does the probability come from? Why isn’t the model deterministic. Each item tests something unique – We are interested.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Mark D. Reckase Michigan State University The Evaluation of Teachers and Schools Using the Educator Response Function (ERF)
1 Scaling of the Cognitive Data and Use of Student Performance Estimates Guide to the PISA Data Analysis ManualPISA Data Analysis Manual.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Bayesian Estimation in MARK
Sampling: Final and Initial Sample Size Determination
Flipping A Biased Coin Suppose you have a coin with an unknown bias, θ ≡ P(head). You flip the coin multiple times and observe the outcome. From observations,
Chapter 7 Introduction to Sampling Distributions
3.3 Toward Statistical Inference. What is statistical inference? Statistical inference is using a fact about a sample to estimate the truth about the.
Estimation A major purpose of statistics is to estimate some characteristics of a population. Take a sample from the population under study and Compute.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Today Today: Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Part 2b Parameter Estimation CSE717, FALL 2008 CUBS, Univ at Buffalo.
Point and Confidence Interval Estimation of a Population Proportion, p
Basics of Statistical Estimation. Learning Probabilities: Classical Approach Simplest case: Flipping a thumbtack tails heads True probability  is unknown.
SAMPLING DISTRIBUTIONS. SAMPLING VARIABILITY
1 Introduction to Biostatistics (PUBHLTH 540) Sampling.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Today Today: Chapter 8, start Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Random Variables and Probability Distributions
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.
Chapter 6: Sampling Distributions
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Objectives (BPS chapter 11) Sampling distributions  Parameter versus statistic  The law of large numbers  What is a sampling distribution?  The sampling.
Sampling distributions BPS chapter 11 © 2006 W. H. Freeman and Company.
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
1 SAMPLE MEAN and its distribution. 2 CENTRAL LIMIT THEOREM: If sufficiently large sample is taken from population with any distribution with mean  and.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Random Sampling, Point Estimation and Maximum Likelihood.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Chapter 7 Point Estimation
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
7.4 – Sampling Distribution Statistic: a numerical descriptive measure of a sample Parameter: a numerical descriptive measure of a population.
Determination of Sample Size: A Review of Statistical Theory
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 07: BAYESIAN ESTIMATION (Cont.) Objectives:
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Confidence Interval & Unbiased Estimator Review and Foreword.
Estimation. The Model Probability The Model for N Items — 1 The vector probability takes this form if we assume independence.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Sampling Theory and Some Important Sampling Distributions.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Chapter 6: Sampling Distributions
Claus H. Carstensen, Institute for Science Education IPN Kiel, Germany
Probability Theory and Parameter Estimation I
Goals of Statistics.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Ch3: Model Building through Regression
Item Analysis: Classical and Beyond
AP Statistics: Chapter 7
More about Posterior Distributions
Sampling Distribution
Sampling Distribution
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Population Proportion
Item Analysis: Classical and Beyond
Item Analysis: Classical and Beyond
How Confident Are You?.
Presentation transcript:

Latent regression models

Where does the probability come from? Why isn’t the model deterministic. Each item tests something unique – We are interested in the average of what the items assess Stochastic subject argument Random sampling of subjects

Two different models Random sampling Stochastic subject

A Random Effects (Sampling) Model -- 1 Part 1: A Model for the population equivalently

A Random Effects Model -- 2 Part 2: A Model for the item response mechanism Example: SLM (but can be any)

A Random Effects Model -- 3 Part 3: Putting them together The unknowns are: is not an unknown, it is variable of integration What is analysed is item response, what is estimated is item parameters and population parameters

Why do this? Solves some theoretical estimation problems – For non-Rasch models Provides better estimates of population characteristics

Problems with point estimates

Problem with discreteness For a 6-item test, there are only 7 possible ability estimates to assign to people, those getting a score of 0,1,2,3,4,5,6. (raw score is sufficient statistic for ability) Suppose we want to know where the 25th percentile point is. That is, 25% of the population are below this point. We need extrapolation.

The Resulting JML Ability Distribution Score 0 Score 1 Score 2 Score 3 Score 4 Score 5 Score 6 Proficiency on Logit Scale

Distribution for a six item test Score 0 Score 1 Score 2 Score 3 Score 4 Score 5 Score 6 Proficiency on Logit Scale

Traditional approach is a two-step analyses First estimate abilities Then compute population estimates such as mean, variance, percentiles using leads to biased results due to measurement error. In the case of the population variance, we can correct the bias (disattenuate) by multiplying by the reliability. But in other cases, it is less obvious how to correct for the bias caused by measurement error.

Distribution of Estimates is Discrete One ability for each raw score Ability estimates have a discrete distribution We imagine (and the model’s premise) is a continuous distribution The distribution of ability estimates is distorted by measurement error

Solutions Direct estimation of population parameters (directly via item responses, and not through the estimated abilities) Complicated analyses that take into account the error – Not always possible

MML: How it works — 1 Item Response Model for item i: Population Model (discrete)

MML: How it works — 2

The Implications — 1 If, then contains parameters – Note that no ability parameters are involved, only population parameters. Use maximum likelihood estimation method to estimate the item difficulty parameters and population parameters. Thus, we directly estimate population parameters through the item responses

Bayes Theorem

The Idea of Posterior Distribution If a student’s item response pattern is x then the posterior distribution is given by

The Idea of Posterior Distribution Instead of obtaining a point estimate for ability, there is now a (posterior) probability distribution incorporates measurement error for the uncertainty in the estimate.

The Resulting JML Ability Distribution Score 0 Score 1 Score 2 Score 3 Score 4 Score 5 Score 6 Proficiency on Logit Scale

Resulting MML Posterior Distributions Score 0 Score 1 Score 2 Score 3 Score 4 Score 5 Score 6 Proficiency on Logit Scale

MML EAP Estimates – an aside Score 0 Score 1 Score 2 Score 3 Score 4 Score 5 Score 6 Proficiency on Logit Scale

MML EAP Estimates – an aside Biased at the individual level Discrete scale, bias & measurement error leads to bias in population parameter estimates Requires assumptions about the distribution of proficiency in the population

Distribution for a six item test Score 0 Score 1 Score 2 Score 3 Score 4 Score 5 Score 6 Proficiency on Logit Scale

Estimating proportions below a point based up posterior distributions

More General Form of the Model Item response model Population model

The underlying population distribution is a mixture of two normal distributions, with different means ( and ). Population Not Normal E.g., sample consists of grades 5 and 8.

where x=0 if a student is in group 1 and x=1 if a student is in group 2. In this case, we estimate, and. Note that is the difference between the means of the two distributions. That is, group 1 has mean (as x=0), and group 2 has mean (as x=1). Latent Regression - 1

Latent Regression - 2 We call “x” a “regressor”, or a “conditioning variable”, or a “background variable”. We can generalise to include many conditioning variables.