THE WEIGHTING GAME Ciprian M. Crainiceanu Thomas A. Louis Department of Biostatistics

Slides:



Advertisements
Similar presentations
Introduction Simple Random Sampling Stratified Random Sampling
Advertisements

Properties of Least Squares Regression Coefficients
Point Estimation Notes of STAT 6205 by Dr. Fan.
The Simple Regression Model
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 7. Statistical Estimation and Sampling Distributions
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
CHAPTER 7.1 Confidence Intervals for the mean. ESTIMATION  Estimation is one aspect of inferential statistics. It is the process of estimating the value.
Model Assessment and Selection
Model Assessment, Selection and Averaging
The General Linear Model. The Simple Linear Model Linear Regression.
Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
Regression with One Explanator
3.3 Toward Statistical Inference. What is statistical inference? Statistical inference is using a fact about a sample to estimate the truth about the.
The Mean Square Error (MSE):. Now, Examples: 1) 2)
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Simple Linear Regression
Today Today: Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Part 2b Parameter Estimation CSE717, FALL 2008 CUBS, Univ at Buffalo.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Copyright © 2006 Pearson Addison-Wesley. All rights reserved. Lecture 5: Regression with One Explanator (Chapter 3.1–3.5, 3.7 Chapter 4.1–4.4)
Statistical Background
STAT262: Lecture 5 (Ratio estimation)
Chapter 7 Estimation: Single Population
FIN357 Li1 The Simple Regression Model y =  0 +  1 x + u.
Lecture 3 Properties of Summary Statistics: Sampling Distribution.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be.
Aim: How do we find confidence interval? HW#9: complete question on last slide on loose leaf (DO NOT ME THE HW IT WILL NOT BE ACCEPTED)
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
Chapter 7 Sampling and Point Estimation Sample This Chapter 7A.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Chapter 8 : Estimation.
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Term 4, 2006BIO656--Multilevel Models 1 PROJECTS ARE DUE By midnight, Friday, May 19 th Electronic submission only to Please.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Chapter1: Introduction Chapter2: Overview of Supervised Learning
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Machine Learning 5. Parametric Methods.
Class 5 Estimating  Confidence Intervals. Estimation of  Imagine that we do not know what  is, so we would like to estimate it. In order to get a point.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
LECTURE 13: LINEAR MODEL SELECTION PT. 3 March 9, 2016 SDS 293 Machine Learning.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Confidence Intervals and Sample Size
STATISTICAL INFERENCE
Some General Concepts of Point Estimation
Sampling Distributions
Bias and Variance of the Estimator
Ratio and regression estimation STAT262, Fall 2017
Random Sampling Population Random sample: Statistics Point estimate
CONCEPTS OF ESTIMATION
Simple Linear Regression
Sampling Distributions
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Applied Statistics and Probability for Engineers
Presentation transcript:

THE WEIGHTING GAME Ciprian M. Crainiceanu Thomas A. Louis Department of Biostatistics

2 Oh formulas, where art thou?

3 Why does the point of view make all the difference?

4 Getting rid of the superfluous information

5 How the presentation could have started, but didn’t: Proof that statisticians can speak alien languages Let ,P) be a probability space, where  is a measurable space and P:  ↦ [0,1] is a probability measure function from the  - algebra  It is perfectly natural to ask oneself what a  -algebra or  -field is. Definition. A  -field is a collection of subsets  of the sample space  with Of course, once we mastered the  -algebra or  -field concept it is only reasonable to wonder what a probability measure is Definition. A probability measure P has the following properties Where do all these fit in the big picture? Every sample space is a particular case of probability space and weighting is intrinsically related to sampling

6 Why simple questions can have complex answers? Question: Question: What is the average length of in-hospital stay for patients? Complexity: Complexity: The original question is imprecise. New question: New question: What is the average length of stay for: –Several hospitals of interest? –Maryland hospitals? –Blue State hospitals? …

7 “Data” Collection & Goal Survey, conducted in 5 hospitals Hospitals are selected n hospital patients are sampled at random Length of stay (LOS) is recorded Goal: Estimate the population mean

8 Procedure Compute hospital specific means “Average” them –For simplicity assume that the population variance is known and the same for all hospitals How should we compute the average? Need a (good, best?) way to combine information

9 DATA Hospital # sampled n hosp Hospital size % of Total size: 100  hosp Mean LOS Sampling variance  2 /  2 /  2 /  2 /  2 /15 Total

10 Weighted averages Weighting strategy Weights x100 MeanVariance Ratio Equal Inverse variance Population Examples of various weighted averages: Variance using inverse variance weights is smallest

11 What is weighting? (via Constantine)            

12 What is weighting? The EssenceThe Essence: a general (fancier?) way of computing averages There are multiple weighting schemes Minimize variance by using inverse variance weights Minimize bias for the population mean by using population weights (“survey weights”) Policy weights “My weights,”...

13 Weights and their properties Let (  1,  2,  3,  4,  5 ) be the TRUE hospital-specific LOS Then estimates If  1 =  2 =  3 =  4 =  5 =    i  i ANY set of weights that add to 1 estimate   . So, it’s best to minimize the variance But, if the TRUE hospital-specific E(LOS) are not equal –Each set of weights estimates a different target –Minimizing variance might not be “best” –An unbiased estimate of    sets  w i =  i General idea Trade-off variance inflation & bias reduction

14 Mean Squared Error General idea Trade-off variance inflation & bias reduction (Estimate - True) 2 MSE = Expected(Estimate - True) 2 Variance + Bias 2 = Variance + Bias 2 Bias is unknown unless we know the  i (the true hospital-specific mean LOS) But, we can study MSE ( , w,  ) Consider a true value of the variance of the between hospital means Study BIAS, Variance, MSE for various assumed values of this variance

15 Mean Squared Error Consider a true value of the variance of the between hospital means T =  (  i -  * ) 2 Study BIAS, Variance, MSE for optimal weights based on assumed values (A) of this variance When A = T, MSE is minimized Convert T and A to fraction of total variance

16 The bias-variance trade-off X is assumed variance fraction Y is performance computed under the true fraction

17 Summary Much of statistics depends on weighted averages Choice of weights should depend on assumptions and goals trustIf you trust your (regression) model, –Then, minimize the variance, using “optimal” weights –This generalizes the equal  s case worryIf you worry about model validity (bias for    –Buy full insurance, by using population weights –You pay in variance (efficiency) –Consider purchasing only what you need Using compromise weights

18 Statistics is/are everywhere!

19 EURO: our short wish list