UNIT 1 • INFERENCES AND CONCLUSIONS FROM DATA

Slides:



Advertisements
Similar presentations
QBM117 Business Statistics Introduction to Statistics.
Advertisements

Chapter 7: Data for Decisions Lesson Plan
Chapter 10: Sampling and Sampling Distributions
© 2003 Prentice-Hall, Inc.Chap 1-1 Business Statistics: A First Course (3 rd Edition) Chapter 1 Introduction and Data Collection.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
© 2002 Prentice-Hall, Inc.Chap 1-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 1 Introduction and Data Collection.
Beginning the Research Design
Part III: Inference Topic 6 Sampling and Sampling Distributions
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Basic Business Statistics (8th Edition)
SAMPLING METHODS. Reasons for Sampling Samples can be studied more quickly than populations. A study of a sample is less expensive than studying an entire.
Sampling Concepts Population: Population refers to any group of people or objects that form the subject of study in a particular survey and are similar.
Sampling Designs and Sampling Procedures
A P STATISTICS LESSON 9 – 1 ( DAY 1 ) SAMPLING DISTRIBUTIONS.
Sample Design.
Introduction to Statistical Inferences
Chapter 5 Sampling Distributions
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
CHAPTER 12 – SAMPLING DESIGNS AND SAMPLING PROCEDURES Zikmund & Babin Essentials of Marketing Research – 5 th Edition © 2013 Cengage Learning. All Rights.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Review: Two Main Uses of Statistics 1)Descriptive : To describe or summarize a collection of data points The data set in hand = all the data points of.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
1.1 Chapter One What is Statistics?. 1.2 What is Statistics? “Statistics is a way to get information from data.”
Introduction In medicine, business, sports, science, and other fields, important decisions are based on statistical information drawn from samples. A sample.
Part III Gathering Data.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Chapter 12 Sample Surveys
Chapter 7: Sampling and Sampling Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Two THE DESIGN OF RESEARCH.
1 Chapter Two: Sampling Methods §know the reasons of sampling §use the table of random numbers §perform Simple Random, Systematic, Stratified, Cluster,
Gile Sampling1 Sampling. Fundamental principles. Daniel Gile
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Stat 1510: Sampling Distributions
Introduction Previous lessons demonstrated the use of the standard normal distribution. While distributions with a mean of 0 and a standard deviation of.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
Data Collection & Sampling Dr. Guerette. Gathering Data Three ways a researcher collects data: Three ways a researcher collects data: By asking questions.
Introduction Suppose that some students from the junior class will be chosen to receive new laptop computers for free as part of a pilot program. You hear.
Organization of statistical investigation. Medical Statistics Commonly the word statistics means the arranging of data into charts, tables, and graphs.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
1 of 29Visit UMT online at Prentice Hall 2003 Chapter 1, STAT125Basic Business Statistics STATISTICS FOR MANAGERS University of Management.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 13 Collecting Statistical Data 13.1The Population 13.2Sampling.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Introduction Previous lessons demonstrated the use of the standard normal distribution. While distributions with a mean of 0 and a standard deviation of.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Selecting a Sample. outline Difference between sampling in quantitative & qualitative research.
Types of method Quantitative: – Questionnaires – Experimental designs Qualitative: – Interviews – Focus groups – Observation Triangulation.
Sampling & Simulation Chapter – Common Sampling Techniques  For researchers to make valid inferences about population characteristics, samples.
Chapter 31 What Do Samples Tell Us?. Chapter 32 Thought Question 1 During a medical exam, the doctor measures your cholesterol two times. Do you think.
Statistics 19 Confidence Intervals for Proportions.
Sampling Design and Procedure
Formulation of the Research Methods A. Selecting the Appropriate Design B. Selecting the Subjects C. Selecting Measurement Methods & Techniques D. Selecting.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Statistical analysis.
Statistical analysis.
Experimental Design, Data collection, and sampling Techniques
Graduate School of Business Leadership
Chapter 5 Sampling Distributions
CHAPTER 11: Sampling Distributions
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
Chapter 5 Sampling Distributions
Presentation transcript:

UNIT 1 • INFERENCES AND CONCLUSIONS FROM DATA Lesson 3: Populations Versus Random Samples and Random Sampling Instruction Common Core Georgia Performance Standards MCC9–12.S.IC.1★ MCC9–12.S.IC.2★ Essential Questions How is a sample different from a population? Why are samples used in research? How and why are samples used in research? What are the advantages and disadvantages of using a simple random sample compared to using other methods of sampling? 1.3.1: Differences Between Populations and Samples

Vocabulary Bias, Biased Sample, Chance Variation, Cluster Sample, Combination, Convenience Sample, Factorial, Inference, Parameter, Population, Random Number Generator, Random Sample Reliability, Representative Sample, Sample, Sampling Bias, Sampling Error, Simple Random Sample, Statistics, Stratified sample, Systematic sample, Validity 1.3.1: Differences Between Populations and Samples

Introduction In medicine, business, sports, science, and other fields, important decisions are based on statistical information drawn from samples. A sample is a subset of the population. The wise selection of samples often determines the success of those who use the information. It is possible that one sample is more reliable to predict an election or justify a new medical procedure, while other samples are simply not reliable. Some conclusions based on statistical samples are little more than guesses, and some are reckless conclusions in life-or-death matters; in many cases, it all comes down to whether the sample selected is genuinely random. 1.3.1: Differences Between Populations and Samples

Key Concepts The word statistics has two different but related meanings. On a basic level, a statistic is a measure of a sample that is used to estimate a corresponding measure of a population (all of the people, objects, or phenomena of interest in an investigation). A statistic is a number used to summarize, describe, or represent something about a sample drawn from a larger population; the statistic allows us to make predictions about that population. A measure of the population that we are interested in is a parameter, a numerical value that represents the data in a set. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued We use different notation for sample statistics and population parameters. For example, the symbol for the mean of a population is μ, the Greek letter mu, whereas the symbol for the mean of a sample population is , pronounced “x bar.” The symbol for the standard deviation of a population is σ, the lowercase version of the Greek letter sigma; the symbol for the standard deviation of a sample population is s. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued Though the formulas for the mean of a population and the mean of a sample population are essentially the same, the formula for the standard deviation of a sample population is slightly different from the formula for the standard deviation of a population. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued For a population, the formula is , with σ representing the standard deviation of the population and μ representing the mean of the population. For a sample, the formula is , with s representing the standard deviation of the sample and representing the mean of the sample. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued When using a graphing calculator to find standard deviations of data sets, it is important to recognize whether the data set is a population or a sample so that the proper measure of standard deviation is selected. On a higher level, the field of statistics concerns the science and mathematics of describing and making inferences about a population from a sample. An inference is a conclusion reached upon the basis of evidence and reasoning. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued How well a statistic computed from a sample describes a population depends greatly upon the quality of the sampling method(s) used. First, the sample must be representative of the population. A representative sample is a sample in which the characteristics of the people, objects, or items in the sample are similar to the characteristics of the population. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued Samples that represent a population well can provide valuable information about that population. In research, it may be impractical to gather information about an entire population because of time, money, availability, privacy, and many other issues. In these cases, representative sampling may provide researchers with an efficient way to gather information and make decisions. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued In addition to the need for sampling to be representative, it must also produce reliable measures. Reliability refers to the degree to which a study or experiment performed many times would have similar results. When small samples are used, there is often great variability and little consistency among the statistics that are found. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued By increasing sample size, the variability in many sample statistics (such as means, standard deviations, and proportions) can be reduced, resulting in improved reliability and greater consistency of results. Statistical reasoning often involves making decisions based on limited information. In particular, when a population of interest is too large or expensive to study, a carefully chosen sample is used. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued One of the most important things that a researcher should understand about a population is the amount of chance variation, or sampling error, that is present in the measures of interest in that population. Chance variation is a measure showing how precisely a sample reflects the population, with smaller sampling errors resulting from large samples and/or when the data clusters closely around the mean. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued If a population is small enough, then parameters (such as measures of average, variation, or proportions) can be measured directly. There is no need for sampling in these cases. For example, if a teacher wants to know the mean grade for a recent test, he can calculate the mean of the entire class. If a population is large, or if it is impractical to measure all members of a population, then estimates are made from samples. The accuracy and reliability of the estimates depends on the quality of the sampling procedures used. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued In general, estimates of a population based on data from large samples are more reliable than estimates from small samples. In estimating the mean of a population, a sample size greater than 30 is recommended. In some cases, the sample size is much larger. In estimating proportions, a larger sample is desirable. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued Validity is the degree to which the results obtained from a sample measure what they are intended to measure. The validity of inferences made about a population depends greatly on the amount of bias, or lack of neutrality, in sampling procedures. A biased sample is a sample in which some members of the population have a better chance of inclusion in the sample than others. 1.3.1: Differences Between Populations and Samples

Key Concepts, continued An estimate made using a sample that is biased is likely to be inaccurate even if a large sample is used. For example, if a publisher wants to determine the percent of readers who prefer printed books to e-books, interviewing 100 people shopping at a bookstore may yield biased results, since those people are more likely to be deliberately seeking out printed books instead of e-books for a variety of reasons (they prefer printed books, they don’t own e-readers, they lack Internet access, etc.). 1.3.1: Differences Between Populations and Samples

Common Errors/Misconceptions not recognizing that results from an experiment or an observational study with a small sample size are unreliable not recognizing that samples that are biased can lead to misleading results even if numerical calculations are accurate not understanding that some of the variation in samples can be attributed to chance variation/sampling error 1.3.1: Differences Between Populations and Samples

Guided Practice Example 2 High levels of blood glucose are a strong predictor for developing diabetes. Blood glucose is typically tested after fasting overnight, and the test result is called a fasting glucose level. A doctor wants to determine the percentage of his patients who have high glucose levels. He reviewed the glucose test results for 25 patients to determine how many of them had a fasting glucose level greater than 100 mg/dL (milligrams per deciliter). He recorded each patient’s fasting glucose level in a table, shown on the next slide. 1.3.1: Differences Between Populations and Samples

Patient glucose levels in mg/dL Guided Practice: Example 2, continued Identify the population, parameter, sample, and statistic of interest in this situation, and then calculate the percent of patients in the sample with a fasting glucose level above 100 mg/dL. Patient glucose levels in mg/dL 99.9 105.4 131.8 79.7 66.6 116.7 111.5 98.1 86.4 76.4 105.8 107.0 95.7 87.6 99.1 75.4 106.2 89.2 72.4 58.9 86.8 66.0 53.6 88.1 1.3.1: Differences Between Populations and Samples

The population is all patients of this doctor. Guided Practice: Example 2, continued Identify the population in this situation. The population is all patients of this doctor. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 2, continued Identify the parameter in this situation. The parameter is the percent of patients with a fasting glucose level greater than 100 mg/dL. 1.3.1: Differences Between Populations and Samples

The sample is the 25 patients whose blood tests the doctor reviewed. Guided Practice: Example 2, continued Identify the sample in this situation. The sample is the 25 patients whose blood tests the doctor reviewed. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 2, continued Identify the statistic of interest in this situation. The statistic of interest is the percent of patients in the sample with a fasting glucose level greater than 100 mg/dL. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 2, continued Calculate the statistic of interest. To calculate the percent of patients in the sample with a fasting glucose level greater than 100 mg/dL, use the fraction , where x represents the number of patients with a fasting glucose level greater than 100 mg/dL and n represents the number of patients in the sample. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 2, continued From the table, it can be seen that 7 of the values are greater than 100, so x = 7. The total number of patients in the sample is 25, so n = 25. Fraction Substitute 7 for x and 25 for n, and then solve. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 2, continued Of the patients in the sample, 0.28 or 28% had a fasting glucose level greater than 100 mg/dL. Note: It is important to recognize that this may be an inaccurate estimate because the patients in the sample may not be representative of the entire population of the doctor’s patients. ✔ 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 2, continued http://www.walch.com/ei/00627 1.3.1: Differences Between Populations and Samples

Guided Practice Example 3 Data collected by the National Climatic Data Center from 1971 to 2000 was used to determine the average total yearly precipitation for each state. The table on the next slide shows the mean yearly precipitation for a random sample of 10 states and each state’s ranking in relation to the rest of the states, where a ranking that’s closer to 1 indicates a higher mean yearly precipitation. Use the sample data to estimate the total rainfall in all 50 states for the 30-year period from 1971 to 2000. Identify the population, parameter, sample, and statistic of interest in this situation. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Ranking State Mean yearly precipitation (in inches) 5 Florida 54.5 8 Arkansas 50.6 12 Kentucky 48.9 28 Ohio 39.1 35 Kansas 28.9 38 Nebraska 23.6 39 Alaska 22.5 41 South Dakota 20.1 43 North Dakota 17.8 46 New Mexico 14.6 1.3.1: Differences Between Populations and Samples

The population is all 50 states. Guided Practice: Example 3, continued Identify the population in this situation. The population is all 50 states. 1.3.1: Differences Between Populations and Samples

The parameter is the total rainfall from 1971 to 2000. Guided Practice: Example 3, continued Identify the parameter in this situation. The parameter is the total rainfall from 1971 to 2000. 1.3.1: Differences Between Populations and Samples

The sample is the 10 randomly selected states. Guided Practice: Example 3, continued Identify the sample in this situation. The sample is the 10 randomly selected states. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Identify the statistic of interest in this situation. The statistic of interest is the mean yearly precipitation for the sample. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Calculate the statistic of interest. To calculate the mean yearly precipitation for the sample, first find the total mean yearly precipitation in the sample states. To do this, find the sum of the mean yearly precipitation of each state. mean yearly precipitation = 22.5 + 50.6 + 54.5 + 28.9 + 48.9 + 23.6 + 14.6 + 17.8 + 39.1 + 20.1 = 320.6 The total mean yearly precipitation for the 10 sample states is 320.6 inches. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Next, use this value to estimate the total precipitation in all 50 states for 1 year. Create a proportion, as shown on the next slide; then, solve it for the unknown value. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Substitute known values. 10x = 50(320.6) Cross-multiply to solve for x. 10x = 16,030 Simplify. x = 1603 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Based on the data from 1971 to 2000, the estimated total precipitation in all 50 states for 1 year during this time frame is 1,603 inches per year. Use this value to estimate the total precipitation in all 50 states for this period of 30 years. 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued Multiply the precipitation in all 50 states (1,603) for 1 year by 30. 1603(30) = 48,090 The estimated total rainfall in all 50 states for the 30-year period from 1971 to 2000 is 48,090 inches. ✔ 1.3.1: Differences Between Populations and Samples

Guided Practice: Example 3, continued http://www.walch.com/ei/00628 1.3.1: Differences Between Populations and Samples