Sampling Plans Copyright (c) 2008 by The McGraw-Hill Companies. This spreadsheet is intended solely for educational purposes by licensed users of LearningStats.

Slides:



Advertisements
Similar presentations
Statistics for Managers Using Microsoft® Excel 5th Edition
Advertisements

Sampling.
Sampling Methods Copyright (c) 2003 by The McGraw-Hill Companies. This material is solely for educational use by licensed users of LearningStats. It may.
Why sample? Diversity in populations Practicality and cost.
Chapter 7 Selecting Samples
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
Ch 8 ==> Statistics Is Fun!
Sampling Chapter 1. EQT 373 -L2 Why Sample? Selecting a sample is less time-consuming than selecting every item in the population (census). Selecting.
Sampling Techniques 19 th and 20 th. Learning Outcomes Students should be able to design the source, the type and the technique of collecting data.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Basic Business Statistics
Statistics Terminology. What is statistics? The science of conducting studies to collect, organize, summarize, analyze, and draw conclusions from data.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Chapter 1 Introduction and Data Collection
ThiQar college of Medicine Family & Community medicine dept
Session Six Jeff Driskell, MSW, PhD
Chapter 14 Sampling PowerPoint presentation developed by:
Sampling.
Marketing Research Aaker, Kumar, Leone and Day Eleventh Edition
Dr. Unnikrishnan P.C. Professor, EEE
Sampling Why use sampling? Terms and definitions
Social Research Methods
Sampling.
Dr J Frost S3: Chapter 2 – Sampling Dr J Frost
Part III – Gathering Data
Sampling Population: The overall group to which the research findings are intended to apply Sampling frame: A list that contains every “element” or.
SAMPLING Purposes Representativeness “Sampling error”
Section 5.1 Designing Samples
Chapter 10 Samples.
Graduate School of Business Leadership
Sampling And Sampling Methods.
SAMPLING (Zikmund, Chapter 12.
Meeting-6 SAMPLING DESIGN
Statistics Section 1.2 Identify different methods for selecting a sample Simulate a random process Review: quantitative and qualitative variables, population.
Sampling: Design and Procedures
CHAPTER 12 Sample Surveys.
Social Research Methods
Class Discussion on Sampling
Sampling: Design and Procedures
Defining and Collecting Data
Keller: Stats for Mgmt & Econ, 7th Ed
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
The “Big Picture” of Statistics—What is the Purpose?
Sampling Lecture 10.
1.2 Sampling LEARNING GOAL
Chapter 1 The Where, Why, and How of Data Collection
Statistics Workshop Tutorial 1
Chapter 1: Statistics.
Chapter 1 The Where, Why, and How of Data Collection
Statistics · the study of information Data · information
SAMPLING (Zikmund, Chapter 12).
Business Statistics: A First Course (3rd Edition)
Sampling Chapter 6.
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
Chapter 7 Sampling and Sampling Distributions
Defining and Collecting Data
Social Research Methods
Week 9 Sampling a population
The Where, Why, and How of Data Collection
Defining and Collecting Data
Data Collection and Experimental Design
Defining and Collecting Data
Sampling Methods and the Central Limit Theorem
Chapter 1 The Where, Why, and How of Data Collection
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
Presentation transcript:

Sampling Plans Copyright (c) 2008 by The McGraw-Hill Companies. This spreadsheet is intended solely for educational purposes by licensed users of LearningStats. It may not be copied or resold for profit.

Why Sample? Lower cost Improved accuracy Destructive testing, such as That's right – a sample can be both cheaper and more accurate than an attempted census! The 2000 U.S. Census cost more and was less accurate because Congress rejected a plan to use scientific sampling in hard-to-count locations.* Lower cost Improved accuracy Destructive testing, such as Light bulb life (hours until failure) 5 mph crash car bumper tests (dollars of damage) Perishable data (e.g., fresh fish inspection) Timely data is required (e.g., political polls) Large, dispersed population (e.g., seat belt usage) * Source: http://www.census2000.org/facts/experts.html

Why Census? Computer database exists When the population is small or when the data are already in a computer database, why take a sample? But in auditing a computer record, we may still have to check physical records, so sampling still may be appropriate. Why Census? Computer database exists Population is small and accessible Legal requirements U.S. Census (national headcount) Bank cash (end-of-day cash drawer balance) Money is no constraint (rare) Time is no constraint (rare)

Objectives Descriptive (to get a general overview) Percent of HMO patients who arrive late Your mean commute time to work Inferential (to estimate a key parameter) Mean insurance claim size for angioplasty Six-month stent failure rate

Inference Population m = $58,222 s = $6,771 N = 125,000 Sample Question What is the average salary of a computer security specialist? We sample because the population is large and hard to reach. Population m = $58,222 s = $6,771 N = 125,000 Sample = $57,545 s = $6,958 n = 100

Sampling Plans - I Simple random sample Systematic sample List of population items must be available Random numbers are used to choose items Each item has same chance of being selected Systematic sample Continuous process non-enumerable population (no list) Choose every Kth item (e.g., every 10th voter at poll exit) Use a random starting point (e.g., the 8th voter) Unbiased unless data are in non-random order

Sampling Plans - II Stratified sample Cluster sample Each stratum is a defined population sub group (e.g., male/female) May be many strata (e.g., gender, race, occupation) Weight sample estimates by strata size (strata % must be known) Corrects for possible under-representation of groups Cluster sample Like stratification except based on geography (e.g., school districts) Two-stage is common (random cluster, random items in cluster) Reduces travel cost for in-person interviews

Alas – many key business decisions are made this way! Sampling Plans - III Judgment sample Experts in the field select the sample (e.g., which firms) Utilizes domain knowledge of experts (e.g., software engineers) May avoid wasting time on atypical or unimportant respondents But introduces subjectivity Convenience sample Asking co-workers opinions "because they're handy" Using a data set that happens to exist already Subjective Unknowable biases Alas – many key business decisions are made this way!

Simple Random Sample Data are the GPAs of college freshmen in a classroom. Choose 10 at random by picking random rows and columns. Sampled GPAs are highlighted. How well do the sample statistics and histograms match the population? Note If you chose the sample, you might try to avoid adjacent items, or try to cover every row or column. That would not be random.

Systematic Sample Data are the GPAs of college freshmen in a classroom. Choose 10 at random by picking every 12th student starting in row 5, column 1 and going down and across. Sampled GPAs are highlighted. How well do the sample statistics and histograms match the population? Note Although this sample happens to give poor estimates, the systematic method is not to blame unless the population items were arranged in a certain way (e.g., every 12th student is in the honors program). In this case, it's just sampling error.

Cluster Sample Should we sample without replacement? Data are the ages of concert goers. Each cluster is a zip code. There are 11 clusters. Choose 3 clusters at random, then select 3 persons from each selected cluster. Sampled ages are highlighted. How well do the sample statistics and histograms match the population?

Bottom Line Do a cost/benefit before sampling Define the purpose before you plunge Expert advice can help with The sampling plan The sample sizes required Analysis of the sample