Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Experiments Versus.

Similar presentations


Presentation on theme: "Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Experiments Versus."— Presentation transcript:

1 Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Experiments Versus Obvervational Studies Inference: From Sample to Population Confidence Intervals 1 Mathematical Literacy in Today’s World, 8th ed. For All Practical Purposes © 2009, W.H. Freeman and Company

2 Chapter 7: Data for Decisions Sampling 2 Statistics  The science of collecting, organizing, and interpreting data.  How is the data produced? Sampling and experiments. Sampling  Gather information about a large group of individuals. Time, cost, and inconvenience forbid contacting every individual. Instead, gather information about only part of the group in order to draw conclusions about the whole. Population – The entire group of individuals about which we want information. Sample – Part of the population from which we actually collect information used to draw conclusions about the whole.

3 Chapter 7: Data for Decisions Bad Sampling Methods Bad Sampling Methods  If personal choice is involved in selecting the sample, the following could happen: Results could become biased. The sample may not be a true representation of the population. 3 Bias – The design of a statistical study that systematically favors a certain outcome. 1.Convenience Samples Interviewer chooses the sample from individuals close at hand (easiest to reach). Example: Mall surveys 2.Voluntary Response Sample People who choose themselves by responding to a general appeal. People with strong opinions are most likely to respond; can cause bias. Examples: Online polls, call-ins, write-ins.

4 Chapter 7: Data for Decisions Simple Random Samples Simple Random Sample (SRS)  An SRS of size n consists of n individuals from the population chosen in such a way that every set of n individuals has an equal chance to be the sample actually selected.  Choosing a sample by chance avoids bias by giving all individuals an equal chance to be chosen (a good sampling method). Examples of SRS  Draw names from a hat: Place all the names of the people in the population into a hat and draw out a handful (the sample). Slow and inconvenient  Use the table of random digits: A more efficient way of randomly selecting the sample without bias. For smaller samples, tables of random digits are used. For larger samples, computers do the random digit sampling. 4

5 Chapter 7: Data for Decisions Simple Random Sample Two Steps in Choosing a Simple Random Sample 1.Give each member of the population a numerical label of the same length. Example: 100 items can be labeled with two digits 01, 02, …, 99, 00 2.To choose the random sample, select a line in the digit table. 5 A table of random digits – A list of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these two properties: 1.Each entry in the table is equally likely to be any of the 10 digits from 0 through 9. 2.The entries are independent of one another. That is, knowledge of one part of the table gives no information about the other part. For a sample size of n, start reading off numbers of length of the labels until n individuals are selected from the population. When selecting the n individuals for the sample from the random digits table: 1. Do not use any group of digits not used as a label. 2. Do not use any repeats.

6 Chapter 7: Data for Decisions Simple Random Sample Using the Random Digit Table 6 Section Taken From a Random Digits Table 101 19223 95034 05756 28713 96409 12531 42544 82853 102 73676 47150 99400 01927 27754 42648 82425 36290 103 45467 71709 77558 00095 32863 29485 82226 90056 104 52711 38889 93074 60227 40011 85848 48767 52573 Example:  A group of 70 people were labeled 01, 02, 03, …, 69, 70.  In the random digits table, line 104 was selected and three lucky winners were selected.  Reading off two digit labels from line 104… 52 was selected first, 71 was skipped over (because it is not in the range of labels), 13 was chosen, then 88, 89 and 93 were skipped over (out of range), and 07 was chosen. Winners: 52,13,and 07

7 Example Use Table 7.1 to form a random sample of 5 students at Acme College from the entire population of 2350 AC students. Start on line 102.

8 Solution Assign each AC student a four digit numerical label, 0001–2350, making sure that no label is assigned twice. Read strings of 4 consecutive digits, ignoring repetitions and unassigned strings, until 5 assigned labels are obtained. The students with those labels are the sample.

9 Chapter 7: Data for Decisions Cautions About Sample Surveys Sample surveys of large populations require the following:  A good sampling design (can be done with SRS)  An accurate and complete list of the population  Participation of all individuals selected for the sample  A question posed that is neutral and clear Bias can occur due to the following:  Problems with obtaining accurate and complete population list Undercoverage – Occurs when some groups in the population are left out of the process of choosing the sample. Example: Homeless, prison inmates, students in dormitories, etc.  Problems with getting 100% participation of sampled people Nonresponse – Occurs when an individual chosen for the sample cannot be contacted or refuses to participate.  Problems with posing a misleading or confusing question 9

10 Chapter 7: Data for Decisions Experiments Observation versus Experiments  Observational Study – Example: sample survey Observes individuals and measures variable of interest but does not attempt to influence the response. Purpose is to describe some group or situation.  Experiment Deliberately imposes some treatment on individuals in order to observe their responses. Purpose is to study whether the treatment causes a change in the response. Examining Cause and Effect Between Variables  Experiments are the preferred method for examining the effect of one variable on another.  By imposing specific treatment of interest and controlling other influences, we can pin down cause and effect. 10

11 Chapter 7: Data for Decisions Experiments Uncontrolled Experiment  When it is not possible to control outside factors that can influence the outcome.  Confounding – The variables, whether part of a study or not, are said to be confounded when their effects on the outcome cannot be distinguished from each other. 11 Apply a treatment Influences by outside effects Observe or measure the response Randomized Comparative Experiment (helps confounding)  The outside effects and confounding variables act on all groups.  An experiment to compare two or more treatments in which people, animals, or things are assigned to treatments by chance. Randomized – The subjects are assigned to treatments by chance. Comparative – Compares two or more treatments.

12 Chapter 7: Data for Decisions Experiments versus Observational Studies Statistical Significance  An observed effect is statistically significant if it is so large that it is unlikely to occur just by chance in the absence of a real effect in the population from which the data were drawn. Example: The connection between smoking and lung cancer is statistically significant. Control Group  A group of experimental subjects who are given a standard treatment or no treatment at all (such as a placebo). Placebo Effect  The effect of a dummy treatment (such as an inert pill in a medical experiment) on the response of the subjects.  The tendency to respond favorably to any treatment. Double-Blind Experiments  An experiment in which neither the experimental subjects nor the persons who interact with them know which treatment each subject received. This helps to eliminate possible influences or biases between the subjects and workers — everyone kept “blind.” 12

13 Example A teacher has the following students in her class. Adam, Faiz, Kevin, Patty, Victoria, Billy, Gwen, Leo, Quinn, Wally, Cassy, Heidi, Mary, Rachel, Xavier, Daniel, Iliana, Nadia, Sarah, Yaffa, Edwin, Jacob, Ottis, Thomas, Zeki The teacher wants to determine if giving students extra reading assignments has any effect on a math test that includes word problems. The 25 students in the class are given a math pre-test. 13 students will be given the extra reading assignment and 12 will not (control group). Each group will be given a math post-test and the results will be compared. Outline the design of the experiment and begin on line 104 from table 7.1 to determine the 12 students to be the control group.

14 Solution: 01 Adam 06 Faiz 11 Kevin 16 Patty 21 Victoria 02 Billy 07 Gwen 12 Leo 17 Quinn 22 Wally 03 Cassy 08 Heidi 13 Mary 18 Rachel 23 Xavier 04 Daniel 09 Iliana 14 Nadia 19 Sarah 24 Yaffa 05 Edwin 10 Jacob 15 Ottis 20 Thomas 25 Zeki Step 1: Give each student a label of the same numerical length.

15 Step 2: Use the table starting on line 104, looking at groups of digits of length 2. The random labels in order are 01, 02, 07, 08, 13, 14, 15, 17, 18, 19, 21, and 25. Thus, the control group is Adam, Billy, Gwen, Heidi, Mary, Nadia, Ottis, Quinn, Rachel, Sarah, Victoria, and Zeki.

16 The placebo effect is a special kind of confounding in which a patient responds favorably to any treatment, even a placebo (fake treatment). To avoid the placebo effect and any possible bias on the part of the experimenters, use a double-blind experiment, so neither subjects nor investigators know which treatment an individual is receiving. (When one increases the number of factors in an experiment, the number of groups also increase. Forming a table is often helpful to show how many groups and the type of groups that are needed.)

17 Example: Describe the subject groups for a combined test of a high- potassium supplement and a multi-vitamin supplement on smokers’ blood pressure. Solution: Form four equal groups (if possible) using a random procedure. Give the groups treatments according to the following table: The experiment should be double-blind, so no participant knows who belongs to which group.

18 Chapter 7: Data for Decisions Inference: From Sample to Population Statistical Inference  When the sample was chosen at random from a population, we can infer conclusions about the wider population from these data.  Statistical inference works only if the data come from random samples or a randomized comparative experiment.  Parameter is a number that describes the population. A parameter is a fixed number (in practice we do not know its value).  A statistic is a number that describes a sample. The value of a statistic is known when we have taken a sample, but it can change from sample to sample. Example: A random sample of 2500 people was chosen from the population and asked a question: “Do you like getting new clothes but find shopping for clothes frustrating and time consuming?” 1650 people agreed. Sample statistic, p = 1650/2500 = 0.66 = 66% Infer that 66% of the population agrees. 18 ^

19 Chapter 7: Data for Decisions Inference: From Sample to Population Sampling Distribution  The distribution of values taken by the statistic in all possible samples of the same size from the same population.  For a fixed number of trials, a distribution with larger sample sizes will have less variation and the values will lie closer to the mean. 19 1000 SRSs of size 2500 from the same population (less variable than samples of size 100) 1000 SRSs of size 100 from the same population

20 Chapter 7: Data for Decisions Inference: From Sample to Population Sample Proportion  Choose a SRS of size n from a large population that contains population proportion p of successes: count of successes in the sample Sample proportion of successes, p = n  Then… Shape: For large sample sizes, the sampling distribution of is approximately normal. Center: The mean of the sampling distribution of is p. Spread: The standard deviation of the sampling distribution is: p (1 – p ) Standard deviation is  n For the shopping example… 0.60 ( 1 - 0.60) With a mean p = 0.6 and n = 2500, stand. dev. is  2500 = 0.0098 20 ^ ^

21 Example 1 Calculate the standard deviation of given that p = 0.125 and n = 2153.

22 Chapter 7: Data for Decisions Confidence Intervals The 68-95-99.7 Rule  68% of the observations fall within ± 1 standard deviation of the mean.  95% of the observations fall within ± 2 standard deviations of the mean.  99.7% of the observations fall within ± 3 standard deviations of the mean. 95% Confidence Interval An interval obtained from the sample data by a method that in 95% of all samples will produce an interval containing the true population parameter A 95% confidence interval for p is p (1 – p)  n where p is the proportion of successes in the sample and the margin of error is 2  p (1 – p ) /n  This recipe is only approximately correct, but it is quite accurate when the sample size n is large. 22 ^ ^^ ^ ^ ^ p ± 2

23 Example In a fall semester student poll, 450 potential spring semester students are asked if they have decided which classes they would take. Suppose that 212 say “yes.” a) Estimate the proportion of students who have decided about their spring classes. b) Find a 95% confidence interval for this estimate.

24 Solution: A) Sample proportion of students: B) The 95% confidence interval for this estimate can be calculated as follows.

25 Example 2 In a political poll, 10,134 potential voters are asked if they have decided yet which candidate they will vote for in the next election. Suppose that 6215 say “yes.” a) Estimate the proportion of decided voters. b) Find a 95% confidence interval for this estimate.

26 Solution: a) The sample proportion of voters is therefore as follows. b) The 95% confidence interval for this estimate can be calculated as follows.


Download ppt "Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Experiments Versus."

Similar presentations


Ads by Google