MPA 630: Data Science for Public Management November 1, 2018

Slides:



Advertisements
Similar presentations
We Can Read About Mixing Colors
Advertisements

Overview of Inferential Statistics
In this chapter we introduce the idea of hypothesis testing in general, and then we look at the specifics for a hypothesis test for a single population.
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
© red ©
Sampling Distributions What is a sampling distribution?
Topic 7 Sampling And Sampling Distributions. The term Population represents everything we want to study, bearing in mind that the population is ever changing.
Target population-> Study Population-> Sample 1WWW.HIVHUB.IR Target Population: All homeless in country X Study Population: All homeless in capital shelters.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
AP Statistics Chapter 9 Notes.
Chi-Square Test.
Section Estimating a Proportion with Confidence Objectives: 1.To find a confidence interval graphically 2.Understand a confidence interval as consisting.
The Scientific Method:
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
The table below gives the pretest and posttest scores on the MLA listening test in Spanish for 20 high school Spanish teachers who attended an intensive.
By.  Are the proportions of colors of each M&M stated by the M&M company true proportions?
Color Distribution A block BrownYellowOrangeRedGreenBlue GreenYellowOrangeRedPurple M&M Skittles.
Sampling Distributions & Sample Means Movie Clip.
Confidence Intervals INTRO. Confidence Intervals Brief review of sampling. Brief review of the Central Limit Theorem. How do CIs work? Why do we use CIs?
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Estimation: Confidence Intervals SECTION 3.2 Confidence Intervals (3.2)
Statistics 26 Comparing Counts. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Chapter 9 Estimation using a single sample. What is statistics? -is the science which deals with 1.Collection of data 2.Presentation of data 3.Analysis.
Chapter 9 Lesson 9.1 Estimation Using a Simple Sample 9.1: Point Estimation.
Bias and Variability Lecture 27 Section 8.3 Wed, Nov 3, 2004.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Sampling Analysis. Statisticians collect information about specific groups through surveys. The entire group of objects or people that you want information.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Check your understanding: p. 684
CHAPTER 11 Inference for Distributions of Categorical Data
11.1 Chi-Square Tests for Goodness of Fit
Sampling Distributions and Estimators
Business Statistics, 4th by Ken Black
CHAPTER 8 Estimating with Confidence
Chapter 16: Sample Size “See what kind of love the Father has given to us, that we should be called children of God; and so we are. The reason why the.
This will help you understand the limitations of the data and the uses to which it can be put (and the confidence with which you can put it to those.
Business Statistics, 4th by Ken Black
Target for Today Know what can go wrong with a survey and simulation
CHAPTER 8 Estimating with Confidence
Chi-Square X2.
CHAPTER 12 Sample Surveys.
M & M color Analysis By Ellen Zimmer.
Ch. 8 Estimating with Confidence
Chi-Square Test.
CHAPTER 8 Estimating with Confidence
Chi-Square Test.
Sampling Distribution
Sampling Distribution
CHAPTER 8 Estimating with Confidence
Section 7.7 Introduction to Inference
CHAPTER 11 Inference for Distributions of Categorical Data
Can I color yellow?. Can I color yellow?
Section 7.1 Sampling Distributions
Chi-Square Test.
CHAPTER 8 Estimating with Confidence
Sample Size.
What Color is it?.
Chapter 7 Sampling Distributions
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 8 Estimating with Confidence
CHAPTER 11 Inference for Distributions of Categorical Data
Sampling: How to Select a Few to Represent the Many
Chapter 26 Part 2 Comparing Counts.
Sampling Distributions
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 8 Estimating with Confidence
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Data Collection and Sampling Techniques
Inference for Distributions of Categorical Data
Presentation transcript:

MPA 630: Data Science for Public Management November 1, 2018 SAMPLING MPA 630: Data Science for Public Management November 1, 2018 Fill out your reading report on Learning Suite

Sampling with computers PLAN FOR TODAY Exam 2 Sampling vocabulary Sampling in real life Sampling with computers

EXAM 2

SAMPLING VOCABULARY

DEFINING THE POPULATION A collection of things in the world Population parameter Something we want to know about the population

COUNTING THE POPULATION Census Count every single thing in the whole population Sampling Select parts of the population and count those

Sample statistic or point estimate MEASURE THE SAMPLE Sample statistic or point estimate The population parameter, but for the sample Uses the hat sign; p-hat

IS THE SAMPLE GOOD? Representativeness Bias and randomness Does the sample look like the population? Bias and randomness Does every part of the population have the same chance of being sampled? Generalizability Is p-hat a good guess of p?

WHY EVEN DO THIS? Censuses are expensive and often impossible If a sample is taken at random… …it will be unbiased and representative… …and the sample estimates can generalize to the whole population (within a confidence interval)

WHAT IF *YOU* AREN’T COUNTED? Sampling gets us accurate estimates of population parameter—even if samples seem small! Statistical power

SAMPLING IN REAL LIFE

M&M SAMPLING Define the population Count the population What thing are we counting? Define the population What parameter are we measuring? Count the population Census or sample? Measure the sample What is our p-hat? Is the sample representative? Is the sample good? Is the sample biased? Is p-hat a good guess?

https://commons.wikimedia.org/wiki/File:Plain-M%26Ms-Pile.jpg

THE TRUE P Plant City Blue Brown Green Orange Red Yellow CLV Cleveland, OH 20.7% 12.4% 19.8% 20.5% 13.1% 13.5% HKP Hackettstown, NJ 25.0% 12.5% “Our color blends were selected by conducting consumer preference tests, which indicate the assortment of colors that pleased the greatest number of people and created the most attractive overall effect.” https://blogs.sas.com/content/iml/2017/02/20/proportion-of-colors-mandms.html https://qz.com/918008/the-color-distribution-of-mms-as-determined-by-a-phd-in-statistics/ “Each large production batch is blended to those ratios and mixed thoroughly. However, since the individual packages are filled by weight on high-speed equipment, and not by count, it is possible to have an unusual color distribution”

Bigger sample size = better sampling IMPROVING P What can we do to get a better estimate of the whole population of M&Ms? More samples? Bigger samples? Bigger sample size = better sampling

SAMPLING WITH COMPUTERS