Sampling methods AP Statistics Chapter 12.

Slides:



Advertisements
Similar presentations
AP Statistics C5 D2 HW: p.287 #25 – 30 Obj: to understand types of samples and possible errors Do Now: How do you think you collect data?
Advertisements

Why sample? Diversity in populations Practicality and cost.
Sampling Sampling Big Questions – Main Ideas 1)Should you include everyone or just a sample? 2)Probability versus Non-probability 3)How large? Response.
1-3 Data Collection and Sampling Techniques Surveys are the most common method of collecting data. Three methods of surveying are: 1) Telephone surveys.
Part III – Gathering Data
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Definitions Population: the entire group to which we wish to project our findings Sample: the subgroup that is actually measured Unit of analysis: that.
Copyright © 2008 Pearson Education, Inc.. Slide 1-2 Chapter 1 The Nature of Statistics Section 1.3 Other Sampling Designs.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
Chapter 1 Getting Started What is Statistics?. Individuals vs. Variables Individuals People or objects included in the study Variables Characteristic.
Honors Stats 3 Day 5. Do Now 1) Round 2 Match game! Turn all the cards over and match the examples to the bias 2) Check your HW: Questions??
Random Sampling All statistical sampling designs have in common the idea that chance not human choice is used to select the sample. [notes for days 2 and.
SAMPLING Purposes Representativeness “Sampling error”
is used to select the sample.
AC 1.2 present the survey methodology and sampling frame used
Chapter 12 Sample Surveys.
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Sample Surveys.
Section 4.2 Random Sampling.
Chapter 12 Sample Surveys
Sampling Why use sampling? Terms and definitions
Part III – Gathering Data
Sampling Population: The overall group to which the research findings are intended to apply Sampling frame: A list that contains every “element” or.
SAMPLING Purposes Representativeness “Sampling error”
Section 5.1 Designing Samples
Josie Burridge & Alyssa Pennacchi
Chapter 10 Samples.
Population: the entire group of individuals that we want information about   Census: a complete count of the population Sample: A part of the population.
Statistics – Chapter 1 Data Collection
Experimental Design, Data collection, and sampling Techniques
Probability and Statistics
Graduate School of Business Leadership
Sample Surveys Chapter 11.
Probability & Statistics Chapter 10
SAMPLING (Zikmund, Chapter 12.
Week 6 Lecture 1 Chapter 10. Sample Survey.
CHAPTER 12 Sample Surveys.
Chapter 12 Sample Surveys
Questionnaire Sampling and Terms.
Jada Hardy & Malakai Miller
statistics Specific number
Chapter 12 part 1 Sample surveys.
Understandable Statistics
Section 5.1 Designing Samples
Who do I ask, what do I ask them, what does that tell me?
guarantees that each group is represented in the sample
Introduction to Statistics
statistics Specific number
Chapter 12 Sample Surveys.
Chapter 1 The Where, Why, and How of Data Collection
WARM – UP Use LINE 5 of the random digit table. 30. The World Series.
Use your Chapter 1 notes to complete the following warm-up.
Designing Samples Section 5.1.
Chapter 1 The Where, Why, and How of Data Collection
SAMPLING (Zikmund, Chapter 12).
Chapter 5: Producing Data
MATH 2311 Section 6.1.
Chapter 5: Producing Data
Sample Surveys Idea 1: Examine a part of the whole.
Chapter 5 Producing Data.
Sampling Chapter 6.
Chapter 4: Designing Studies
Sampling Techniques Statistics.
Chapter 8 SAMPLING and SAMPLING METHODS
The Where, Why, and How of Data Collection
CS639: Data Management for Data Science
EQ: What is a “random sample”?
Surveys How to create one.
Chapter 1 The Where, Why, and How of Data Collection
Presentation transcript:

Sampling methods AP Statistics Chapter 12

is used to select the sample. Random Sampling All statistical sampling designs have in common the idea that chance not human choice is used to select the sample. So real quick… some basic ideas for this class… for a random sample, we have to take the human element out of the selection process, and let CHANCE do the choosing. When humans choose, invariably, bias comes into the picture and screws things up.

Randomize – let chance do the choosing! Randomization can protect you against -factors that you know are in the data -factors you are not even aware of Randomizing makes sure that on the average the sample “looks” like the population. Let’s say you’re a chef prepping a BIGGGGG pot of soup for a bunch of hungry people. At the last minute, if you add in a bunch of salt, what are you going to do before you taste it? (duh, stir the pot) But WHY? (to help ensure that your sample is REPRESENTATIVE of the pot, in terms of the way it tastes). Representative… that’ll be a HUGE word in statistics this year.

Simple Random Sampling Summary of Sampling Methods Simple Random Sampling every individual has an equal chance of being selected every set of n individuals has an equal chance of being selected *most basic & fundamental type of sampling! So these are in your notes packet, but let’s talk about these briefly. That 2nd bullet – “every set of n individuals has an equal probability…”, you should put STARS by that and note that THIS IS THE DEFINING CHARACTERISTIC of a SRS. (by the way, simple random is the ONLY sampling type that can be abbreviated by “SRS”)

Summary of Sampling Methods Stratified Sampling divide population into strata (homogenous layers, subpopulations) take SRS from each strata Cluster sampling divide population into clusters (each cluster should be representative of population) Randomly select one (or more) cluster(s) Take a CENSUS of the selected cluster(s) These two get mixed up all the time… both involve splitting up the population into groups. But stratified involves taking a random subset from EACH (and EVERY) group… while cluster involves taking a random selection of group(s), and then taking EVERYONE in the selected groups (not every group will be selected in a cluster!!!)

Summary of Sampling Methods Systematic Sampling Randomly select a starting point, then take (for example) every 10th (or 20th, or 5th, etc.) subject... Multistage Sampling Randomness is involved at more than one stage Be careful not to confuse with CLUSTER sampling For systematic, make sure you select a RANDOM starting point (for instance, using numbers in a hat, then pick one and start with that number person…). The textbook does NOT make a distiction of needing the random starting point, but this is required for AP.

OR Describe how to select a SRS of 5 students from a group of 27: Assign each student a unique number from 1 – 27 Use a RNG (on a calculator/computer) to generate 5 UNIQUE numbers from that range (repeated numbers will be ignored). The 5 students who have their numbers drawn will… OR Assign each student a unique number from 1 – 27 Write the numbers 1 – 27 on slips of paper, and put them in a hat. Stir the slips to mix them. Without looking, draw 5 slips of paper from the hat WITHOUT REPLACEMENT. The 5 students who have their numbers drawn will… When explaining the random selection process, these are acceptable ways of describing the process. Make sure you state that we ignore repeats (or state “unique” numbers), and also state what to do with the numbers. Slips of paper in a hat have been acceptable in the past, but AP seems to be pushing more towards modern methods (computer or calculator with a RNG).

Types of data – Numerical vs Categorical Numerical: Does it make sense to take an average? Catergorical: Cannot take an average, but we CAN take a proportion (or percentage) of… Name Job Type Age Gender Race Salary Zip Code Jose Cedillo Technical 27 Male Hispanic 52,300 90630 Amanda Childers Clerical 42 Female White 27,500 90521 Tonia Chen Management 51 Asian 83,600 90629

Identify the following Population (of interest): A research group wishes to know the mean GPA of all 2600(ish) students at Podunk High School. To estimate this, they take a random sample of 189 students that are enrolled in Pre-AP/AP math classes, and pull those records. The mean GPA of the students in the sample is 3.38. According to the school registrar, the GPA of all 2600(ish) students at Podunk High School is 3.09. Identify the following Population (of interest): Parameter of interest: Sampling frame: Sample: (WHO are we interested in?) ALL students at PHS (WHAT are we interested in?) Mean GPA of ALL students at PHS (who had a CHANCE of being selected?) All students enrolled in Pre-AP/AP Math (who was actually selected?) The 189 students. GPA is numerical data: 3.09 – this number is the PARAMETER (refers to the population) 3.38 – this number is the STATISTIC (refers to the sample)

Identify the following Population (of interest): A neighborhood interest group wants to know what proportion of households in Austin watch the TV show “Dancing with the Comets.” They select a random sample of 59 houses from Northwest Austin, and find that 35.6% of those families watch the program regularly. Local ratings indicate that about 22% of all households watch “Dancing with the Comets” on a regular basis. Identify the following Population (of interest): Parameter of interest: Sampling frame: Sample: Households in Austin (probably ALL of Austin) What proportion of households in Austin watch “DWTC” Households in Northwest Austin The 59 houses that were selected. This is categorical data (think: The answer is Yes/No. 22% or 0.22 – this number is the PARAMETER (refers to the population) 35.6% or 0.356 – this number is the STATISTIC (refers to the sample)

STOP!!! So… stop here and just do Jellyblubbers for the rest of day 2. Because for day 3, the goal is to get through all of Powerpoint 1.3.