COLLECTING STATISTICAL DATA

Slides:



Advertisements
Similar presentations
Economics 105: Statistics Review #1 due next Tuesday in class Go over GH 8 No GH’s due until next Thur! GH 9 and 10 due next Thur. Do go to lab this week.
Advertisements

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 16 Mathematics of Normal Distributions 16.1Approximately Normal.
Drawing Samples in “Observational Studies” Sample vs. the Population How to Draw a Random Sample What Determines the “Margin of Error” of a Poll?
Bush's lead gets smaller in poll By Susan Page, USA TODAY WASHINGTON — President Bush leads Sen. John Kerry by 8 percentage points among likely voters,
The eternal tension in statistics.... Between what you really really want (the population) but can never get to...
Calculating sampling error Actually, we’ll calculate what is known (technically) as the confidence interval. This is the “+ or =” number that you hear.
Copyright ©2009 Cengage Learning 1.1 Day 3 What is Statistics?
Statistical Inference: Which Statistical Test To Use? Pınar Ay, MD, MPH Marmara University School of Medicine Department of Public Health
C1, L2, S1 Design Method of Data Collection Surveys and Polls Experimentation Observational Studies.
What is statistics? Statistics is the science of dealing with data.
–Population: The collection of objects or individuals. N-value: The number of individuals in the population. N-value: The number of individuals.
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Homework Discussion Read pages Page 467: 17 – 20, 25 – 27, 61, 62, 63, 67 See if you can find an example in your life of a survey that might.
Homework Read pages Page 467: 1 – 16, 29 – 34, 37, 38, 59.
Statistical Inference: Making conclusions about the population from sample data.
Final 2004 Presidential Election Gallup Poll Rhajiv Ratnatunga STATS 1000 Seminar 21.
Pitfalls of Surveys. The Literary Digest Poll 1936 US Presidential Election Alf Landon (R) vs. Franklin D. Roosevelt (D)
Making Inferences. Sample Size, Sampling Error, and 95% Confidence Intervals Samples: usually necessary (some exceptions) and don’t need to be huge to.
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
SECTION 4.1. INFERENCE The purpose of a sample is to give us information about a larger population. The process of drawing conclusions about a population.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 13 Collecting Statistical Data 13.1The Population 13.2Sampling.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 13 Collecting Statistical Data 13.1The Population 13.2Sampling.
Bias in Survey Sampling. Bias Due to Unrepresentative Samples A good sample is representative. This means that each sample point represents the attributes.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
LOOKING AT SOME BASICS Can you tell the difference?
Qualitative vs. Quantitative & Displaying Data. “There are three kinds of lies: lies, damned lies, and statistics.” –Benjamin Disraeli ( ) & popularized.
Chapter 2 Section 2 Surveys, Samples, and Populations Obj: Explain the survey method and the importance of proper sampling techniques.
Chapter 12 Sample Surveys.
Political Science 30: Political Inquiry
Chapter 1: Statistics, Data and Statistical Thinking
Sources of Error In Sampling
Chapter 8: Estimating with Confidence
Sampling Why use sampling? Terms and definitions
Estimating with Confidence: Means and Proportions
Chapter 1: Statistics, Data, and Statistical Thinking
1. Estimation ESTIMATION.
Introduction to Statistics
The Diversity of Samples from the Same Population
Statistics – The science of collectiong, organizing, analyzing, and interpreting data in order to make decisions. Data – Consists of information coming.
Confidence Intervals: Sampling Distribution
Bias On-Level Statistics.
Objectives Estimate population means and proportions and develop margin of error from simulations involving random sampling. Analyze surveys, experiments,
CHAPTER 12 Sample Surveys.
Inference for Sampling
Introductory Statistical Language
Sampling Distribution of a Sample Proportion
Chapter 4 Sampling Design.
Sampling Population – any well-defined set of units of analysis; the group to which our theories apply Sample – any subset of units collected in some manner.
How do I study different sampling methods for collecting data?
Chapter 7 Special Topics
Introduction to Statistics
Confidence Intervals: The Basics
EXAMPLE: The weight of a can of Coca Cola is supposed to have mean = 12 oz with std. dev.= 0.82 oz. The product is declared underweight if it weighs.
1.1 What Is/Are Statistics?
Statistics Workshop Tutorial 1
Chapter 1 Statistics: The Art and Science of Learning from Data
Chapter 10: Estimating with Confidence
Section 5.1 Designing Samples
Chapter 5: Producing Data
1.) Come up with 10 examples of how statistics are used in the real life. Be specific and unique. 2.) Video.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Confidence Intervals & Polls
Chapter 8: Estimating with Confidence
What do Samples Tell Us Variability and Bias.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 1: Statistics, Data and Statistical Thinking
Samples and Populations
Presentation transcript:

COLLECTING STATISTICAL DATA When information is packaged in numerical form, it is called data. (page 447) Statistics is the science of dealing with data. This includes gathering data, organizing data, interpreting data, and understanding data. (page 447)

The group of individuals or objects to which a statistical statement refers is called the population. (page 448) The N-value is an accurate head count of every member of a population. (page 448) Do not confuse the N-value with the population itself.

Determining the N-Value of a population When populations are small and accessible, one can actually get an exact N-value by simply counting “heads”. Census: Collecting data from every member of the population (page 450)

The N-value N=430 is only an estimate. Collecting data from a selected subgroup of a population and then using those data to draw a conclusion and make statistical inferences about the entire population is called conducting a survey. The subgroup of the population from which the data is collected is called a sample. (page 452)

THE CAPTURE-RECAPTURE METHOD: ESTIMATING THE N-VALUE OF A POPULATION BY SAMPLING (page 460) STEP 1. The Capture: Capture (choose) a sample of size , tag (mark, identify) the animals (objects, people), and release them back into the general population. STEP 2. The Recapture: After a certain period of time, capture a new sample of size , and take an exact head count of the tagged individuals (i.e., those that were also in the first sample). Let’s call this number k. The ratio is approximately equal to the ratio . From this we get: .

Example. You want to estimate how many fish there are in a small pond Example. You want to estimate how many fish there are in a small pond. Suppose you capture 500 fish, tag them, and throw them back in the pond. After a couple of days, you go back to the pond and capture 120 fish, of which 30 are tagged. Give an estimate of the N-value of the fish population in the pond.

Page 459 Statisticians use the term statistic to describe any kind of numerical information drawn from a sample. A statistic is always an estimate for some unknown measure, called a parameter, of the population.

Sampling error (page 459) is the difference between a parameter and a statistic used to estimate that parameter.

Page 459 In surveys, chance error is the result of sampling variability : the fact that two different samples are likely to give two different statistics, even when the samples are chosen using the same sampling method. Sample bias is the result of having a poorly chosen sample.

The critical issues are: a. Finding a sample that is representative of the population, and b. Determining how big the sample should be. If n is the sample size and N is the population size then n/N is called the sampling rate. This is usually expressed as a percentage. A sampling rate of x% indicates that the sample is x% of the population. (page 529) Choosing a good sample of a reasonable size is more important that the sampling rate.

Consider the M&M example Assume there are 1,500 M&M’s in the container Describe the population of the survey Describe the sample for this survey Give the sample statistic for the number of M&M’s in this survey Give the parameter of the survey Give the sampling proportion for the survey Give the sampling error Give the sampling error, expressed as a percent Is the sampling error found a result of sampling variability or sampling bias? Explain.

A public opinion poll (page 453) is a special kind of a survey in which the members of the sample provide information by answering specific questions from an “interviewer”.

Example In order to estimate how effective Mr. Evans is in his Core 120 class, he gives 5 students in his section a survey on which they rate his/her efficiency on a scale of 1-5. he chooses students from each section in the following way: one that is getting an “A”, one that is getting a “B”, one that is getting a “C”, etc… The scores reported were 4, 4, 5, 4, 4. At the end of the semester, he gives the survey to all students in the class and finds that the average rating is 4.45

Describe the population of the survey Describe the sample for this survey Give the sample statistic (if it is given) for the average rating of Mr. Evans Give the parameter (if it is given) for the average rating of Mr. Evans Give the sampling proportion (rate) for the survey Give the sampling error Give the sampling error, expressed as a percent Is the sampling error found a result of sampling variability or sampling bias? Explain. What could be done with this sample to eliminate (or minimize) the sampling error.

A public opinion poll (page 453) is a special kind of a survey in which the members of the sample provide information by answering specific questions from an “interviewer”. The critical issues are: a. Finding a sample that is representative of the population, and b. Determining how big the sample should be.

Bush's lead gets smaller in poll By Susan Page, USA TODAY WASHINGTON — President Bush leads Sen. John Kerry by 8 percentage points among likely voters, the latest USA TODAY/ CNN/Gallup Poll shows. That is a smaller advantage than the president held in mid-September but shows him maintaining a durable edge in a race that was essentially tied for months. Results based on likely voters are based on the sub sample of 758 survey respondents deemed most likely to vote in the November 2004 General Election. The margin of sampling error is ±4 percentage points.

1936 Literary digest poll 1936 - Great Depression Presidential election between Democrat Franklin D. Roosevelt and Republican Alfred Landon. Literary digest runs a poll before election Telephone lists, professional organizations, magazine subscriptions Created a list of 10,000 names From 2.4 million respondents: Landon 57% Roosevelt 43% Actual results Landon 38% Roosevelt 62%

Homework Read pages 452 - 461 Page 467: 1 – 16, 29 – 34, 37, 38, 59