As a data user, it is imperative that you understand how the data has been generated and processed…

Slides:



Advertisements
Similar presentations
Inference about a Population Proportion
Advertisements

Probability models- the Normal especially.
Sampling techniques as applied to environmental and earth sciences
Math 20: Foundations FM20.6 Demonstrate an understanding of normal distribution, including standard deviation and z-scores. FM20.7 Demonstrate understanding.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Statistical Concepts (continued) Concepts to cover or review today: –Population parameter –Sample statistics –Mean –Standard deviation –Coefficient of.
7-2 Estimating a Population Proportion
Understanding sample survey data
BA 427 – Assurance and Attestation Services
Full time and part time employment Coventry population in employment by gender Source: Annual Population Survey, Office for National Statistics
Determining the Size of
GS/PPAL Section N Research Methods and Information Systems A QUANTITATIVE RESEARCH PROJECT - (1)DATA COLLECTION (2)DATA DESCRIPTION (3)DATA ANALYSIS.
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Inferential Statistics
AM Recitation 2/10/11.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Determining Sample Size
RESEARCH A systematic quest for undiscovered truth A way of thinking
IB Business and Management
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
POSC 202A: Lecture 9 Lecture: statistical significance.
 The situation in a statistical problem is that there is a population of interest, and a quantity or aspect of that population that is of interest. This.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Topic 5 Statistical inference: point and interval estimate
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
PARAMETRIC STATISTICAL INFERENCE
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Source: Annual Population Survey, Office for National Statistics. Full time and part time employment Coventry population.
Section 2 Part 2.   Population - entire group of people or items for which we are collecting data  Sample – selections of the population that is used.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Making Inferences. Sample Size, Sampling Error, and 95% Confidence Intervals Samples: usually necessary (some exceptions) and don’t need to be huge to.
Managerial Economics Demand Estimation & Forecasting.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Determining the Size of a Sample 1 Copyright © 2014 Pearson Education, Inc.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
Section 10.1 Confidence Intervals
AP Statistics Section 10.1 C Determining Necessary Sample Size.
Foundations of Sociological Inquiry Statistical Analysis.
Employment, unemployment and economic activity Coventry working age population by ethnicity Source: Annual Population Survey, Office for National Statistics.
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Review of Basic Statistical Concepts. Normal distribution Population Mean: μ and Standard Deviation: σ.
Population size * Population size doesn’t matter as long as…
Chapter 19 Confidence intervals for proportions
Inference: Probabilities and Distributions Feb , 2012.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Sample Size Mahmoud Alhussami, DSc., PhD. Sample Size Determination Is the act of choosing the number of observations or replicates to include in a statistical.
Chapter 13 Understanding research results: statistical inference.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Review Statistical inference and test of significance.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
GOVT 201: Statistics for Political Science
And distribution of sample means
This will help you understand the limitations of the data and the uses to which it can be put (and the confidence with which you can put it to those.
ESTIMATION.
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Confidence Intervals Accelerated Math 3.
BUSINESS MATHEMATICS & STATISTICS.
Lecture Slides Elementary Statistics Twelfth Edition
Estimating Population Parameters Based on a Sample
Presentation transcript:

As a data user, it is imperative that you understand how the data has been generated and processed…

This will help you understand the limitations of the data and the uses to which it can be put (and the confidence with which you can put it to those uses).

Ipsos MORI’s technical note  Available on the survey website  Contents: – Sampling methodology – Data collection – Data processing – Data weighting – Statistical reliability…

An intro to sampling  Not feasible to question all of the people in the population we are interested in i.e. all residents in a local authority area (this would be a ‘census’) Population… Sampling is making an inference about a… …from a……Sample

We use sampling in our everyday lives! Do you need to eat the whole pot to see if it is correctly seasoned? Or Will just tasting it be enough? TASTING IT - PROVIDED IT’S WELL STIRRED!

NHTS uses probability sampling…  That is every sampling unit (an address) has a known and non- zero probability of selection –we know the size of the population – the total no. of addresses – we are drawing from –we know the chance of selection that applies to every individual unit – an address – within the population (“1 in n”).  This does not apply to quota sample surveys which are the most commonly used approach in market research.  Random probability sampling is theoretically purer –it is less prone to non-response bias –all statistical reliability tests assume random sampling

Statistical reliability  A sample survey produces figures which are estimates of the ‘truth’: –that is, we are drawing inferences about the entire population based on a sample (the ‘true’ figure would be based on a survey of the entire population i.e. a census)  Statistical reliability is a statement about those estimates and the confidence we have about them in relation to the ‘truth’  Reliability is sometimes referred to as “confidence intervals”, “margins of error” or “sampling tolerances”  It is determined by: –the percentage –the sample size on which the percentage is based –the level of confidence we want to apply – it is usual to test at the 95% confidence level –the effect of any weights applied

Confidence intervals and sample size  Larger samples provide more accurate data  But to achieve double the reliability we need to quadruple the sample size  For example, the following figures are for a 50% finding: Sample size Confidence interval + 14%10%7%5%

Some examples  For an NHTS survey based on 400 responses and in relation to a finding that 50% are fairly satisfied with a service: –“We are 95% confident that our sample percentage is reliable to plus or minus 5%.”  Put another way: –“Out of every 100 surveys we conduct where we see these figures, in 95 of them we would be right and the true figure lies within that range, in 5 of them we would be wrong and the true figure would not lie in that range.” –“The chances are 95 in 100 that this result would not vary by more than 5 percentage points from the ‘true’ result, that which would be found had the entire population responded i.e. the ‘true’ result would be between 45% and 55%.”  Another example, based on 100 responses: –“We are 95% confident that our sample percentage is reliable to plus or minus 10%. The ‘true’ result would be between 40% and 60%.”

Statistical tests should not be used for sample sizes less than 50 – in fact, any estimates derived from small samples should be considered, at best, indicative.

A full list of confidence intervals for a range of sample sizes and percentages is provided in Ipsos MORI’s Technical note.

Statistical significance between %s  When we extend this to look at two figures (survey estimates), we are usually interested in the likelihood of the difference between the two figures being ‘real’: –In other words, how confident are we that the difference reflects what we would have found if we had surveyed the entire population?  The issue may apply either to two figures from one sample or to a comparison of figures in two different samples for example: –comparing the views of men and women –comparing the views of one authority’s residents with another's –comparing attitudes among one authorities’ residents and how they have changed between 2009 and 2010  The answer is obtained using a test of statistical significance

Statistical significance: some examples  For two samples based on 1,000 responses and with a difference of 5 percentages points (50% vs 55%) –The confidence interval is +4. –“We are 95% confident that this is a statistically significant finding.”  With the same %s based on 500 vs 500 responses –The confidence interval is +6. –“We are 95% confident that this is not a statistically significant finding”.  With the same %s based on 100 vs 100 responses –The confidence interval is +14. –“We are 95% confident that this is not a statistically significant finding”.

This means that there will be different confidence intervals involved in comparisons of survey %s for local authority A vs B, and comparisons of C vs D (assuming that A-D received different numbers of responses).

There can still be merit in reporting on, and using findings, which are not statistically significant but caution should be exercised the smaller the sample size. We cannot use these findings with the same degree of confidence.

A full list of confidence intervals for different sample size comparisons and percentages is provided in Ipsos MORI’s Technical note.