A theory of transmission bias

Presentation transcript:

A theory of transmission bias. Assume that people report correctly what they know. The comparison of the data from clergy and others shows that whatever the errors are, they are consistent. Shortly before his untimely death, Peter Killworth proposed the following: (1) Instead of assuming that people are inaccurate, assume that they report correctly what they know. After all, whatever the errors are in the model’s predictions, we see from slide 16, which we repeat in the next slide, that those errors are consistent. 1

Whatever the errors are in the model’s predictions, we see from slide 16 that those errors are consistent. 2

Most Americans know a Christopher. It’s likely that you know at least one Christopher. That is, the probability of knowing NO Christophers is close to zero. Twins are likely to be underreported. But what’s the truth? How can we draw the curve on that jagged diagram so that the true values are represented? From the graph in slide 43, we see that Americans are very likely to know at least one person named Christopher. We also see that twins are probably underreported. The population of twins is very large (about 1 in 125 births), but about 30% of Americans reported in our surveys that they did not know anyone who has a twin. The problem is, we don’t know what the truth is. We’d like to be able to re-draw the graph in slide 43 so that the true values are represented, not just what people report. 3

Suppose people report accurately. In other words, given the structure of that diagram, we decided to trust our informants and assume that they are reporting correctly what they know. It’s just that what they know is incorrect. That jaggedy curve doesn’t tell us where the curve would be if people responded honestly to correct information instead of honestly to incorrect information. To find that out, instead of assuming inaccurate informants, suppose we assume that people are accurate in their reporting and that the error lies in what they know. 4

This means adjusting the x-axis rather than the y-axis. Suppose that widows don’t tell half the people they know about their being a widow. The 0.13 on the x-axis would remain the same, but the number that people would be responding to would be 0.13/2. To make the x-axis the effective size of that population, we would slide it to the left while the y-axis would remain the same. Widows are 0.13 of the population in the U.S. … Suppose that widows only tell half the people they know that they are widows. Then some people who report that they don’t know any widows would be incorrect, but would still be reporting correctly what they know. To adjust for this, we would slide the x-axis in the graph to the left while keeping the y-axis the same. 5
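As a rough numerical sketch of this adjustment (not part of the original slides), suppose each of the people a respondent knows is, independently, a recognized member of the subpopulation with probability equal to the effective fraction. The network size of 20 and the names `transmission_rate`, `effective_fraction` and `prob_know_none` are illustrative assumptions; only the 0.13 and the "half" come from the slide.

```python
def effective_fraction(true_fraction, transmission_rate):
    """Share of the population that others can actually recognize as members."""
    return true_fraction * transmission_rate


def prob_know_none(fraction, network_size):
    """P(respondent knows no members), assuming independent binomial draws."""
    return (1.0 - fraction) ** network_size


true_fraction = 0.13     # widows, as stated on the slide
transmission_rate = 0.5  # widows tell only half the people they know
network_size = 20        # hypothetical, chosen only for illustration

eff = effective_fraction(true_fraction, transmission_rate)

# The reported share (y-axis) is unchanged; only the x-axis position moves left.
print("x-axis position, true size:     ", true_fraction)
print("x-axis position, effective size:", eff)

# Under this simple model, a smaller effective size means more honest "none" answers.
print("P(know none) at true size:     ", prob_know_none(true_fraction, network_size))
print("P(know none) at effective size:", prob_know_none(eff, network_size))
```

The point of the sketch is only that halving the transmission rate halves the x-axis value that respondents are effectively answering about, while their answers themselves stay as reported.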

Of course, we have no idea what the transmission error might be; that’s what we tried in vain to get with weightings. We only know that if the numbers remain the same on the y-axis and we make up the effective sizes on the x-axis, the jaggedy line would go away. How big an adjustment should we make to the x-axis? We don’t know; that’s what we could not find out with any of the weightings. 6

Killworth did this analytically by satisfying certain mathematical properties. We know the probability of knowing none and also of knowing just one person in a subpopulation. These have to be related mathematically, which leads to a well-defined set of values for the effective subpopulation. We can then compute the predicted distribution of c. This next diagram shows that we may be on the right track.
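To illustrate how the probability of knowing none pins down an effective subpopulation size, here is a simplified sketch. It is not Killworth's derivation: it assumes every respondent has the same network size (290 here is a hypothetical value) and a simple binomial model, whereas the slides refer to a distribution of c. The 30% figure for twins comes from the survey result quoted earlier; treating "1 in 125" as the population fraction is also an assumption.

```python
def effective_fraction_from_none(p_none, network_size):
    """Invert P(know none) = (1 - p)^c to recover the effective fraction p."""
    return 1.0 - p_none ** (1.0 / network_size)


def prob_know_one(fraction, network_size):
    """P(know exactly one member) under the same binomial assumption."""
    return network_size * fraction * (1.0 - fraction) ** (network_size - 1)


p_none = 0.30                 # share who reported knowing no one with a twin
network_size = 290            # hypothetical common network size c
assumed_true_fraction = 1 / 125  # assumption: slide's "1 in 125" read as a fraction

eff = effective_fraction_from_none(p_none, network_size)

print("effective fraction implied by P(know none):", eff)
print("implied share of the assumed true fraction:", eff / assumed_true_fraction)

# Under this model P(know none) also fixes P(know exactly one), which is the
# sense in which the two probabilities "have to be related mathematically".
print("implied P(know exactly one):", prob_know_one(eff, network_size))
```

If the observed share of people who know exactly one member disagrees with the implied value, the single-network-size assumption is too crude, which is one reason a full treatment has to work with the distribution of c.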