Download presentation
Presentation is loading. Please wait.
Published byEleanore Booth Modified over 9 years ago
1
© aSup-2007 Inference about Means and Mean Different 1 PART III Inference about Means and Mean Different
2
© aSup-2007 Inference about Means and Mean Different 2 Chapter 8 INTRODUCTION TO HYPOTHESIS TESTING
3
© aSup-2007 Inference about Means and Mean Different 3 The Logic of Hypothesis Testing It usually is impossible or impractical for a researcher to observe every individual in a population Therefore, researchers usually collect data from a sample and then use the sample data to answer question about the population Hypothesis testing is statistical method that uses sample data to evaluate a hypothesis about the population
4
© aSup-2007 Inference about Means and Mean Different 4 The Hypothesis Testing Procedure 1. State a hypothesis about population, usually the hypothesis concerns the value of a population parameter 2. Before we select a sample, we use hypothesis to predict the characteristics that the sample have. The sample should be similar to the population 3. We obtain a sample from the population (sampling) 4. We compare the obtain sample data with the prediction that was made from the hypothesis
5
© aSup-2007 Inference about Means and Mean Different 5 PROCESS OF HYPOTHESIS TESTING It assumed that the parameter μ is known for the population before treatment The purpose of the experiment is to determine whether or not the treatment has an effect on the population mean Known population before treatment μ = 30 TREATMENT Unknown population after treatment μ = ?
6
© aSup-2007 Inference about Means and Mean Different 6 EXAMPLE It is known from national health statistics that the mean weight for 2-year-old children is μ = 26 pounds and σ = 4 pounds The researcher’s plan is to obtain a sample of n = 16 newborn infants and give their parents detailed instruction for giving their children increased handling and stimulation NOTICE that the population after treatment is unknown
7
© aSup-2007 Inference about Means and Mean Different 7 STEP-1: State the Hypothesis H 0 : μ = 26(even with extra handling, the mean at 2 years is still 26 pounds) H 1 : μ ≠ 26(with extra handling, the mean at 2 years will be different from 26 pounds) Example we use α =.05 two tail
8
© aSup-2007 Inference about Means and Mean Different 8 STEP-2: Set the Criteria for a Decision Sample means that are likely to be obtained if H 0 is true; that is, sample means that are close to the null hypothesis Sample means that are very unlikely to be obtained if H 0 is false; that is, sample means that are very different from the null hypothesis The alpha level or the significant level is a probability value that is used to define the very unlikely sample outcomes if the null hypothesis is true
9
© aSup-2007 Inference about Means and Mean Different 9 The location of the critical region boundaries for three different los -1.961.96 -2.58 2.58 -3.303.30 α =.05 α =.01 α =.001
10
© aSup-2007 Inference about Means and Mean Different 10 STEP-3: Collect Data and Compute Sample Statistics After obtain the sample data, summarize the appropriate statistic σ M = σ √n√n z = M - μ σMσM NOTICE that the top f the z-scores formula measures how much difference there is between the data and the hypothesis The bottom of the formula measures standard distances that ought to exist between the sample mean and the population mean
11
© aSup-2007 Inference about Means and Mean Different 11 STEP-4: Make a Decision Whenever the sample data fall in the critical region then reject the null hypothesis It’s indicate there is a big discrepancy between the sample and the null hypothesis (the sample is in the extreme tail of the distribution)
12
© aSup-2007 Inference about Means and Mean Different 12 HYPOTHESIS TEST WITH z A standardized test that are normally distributed with μ = 65 and σ = 15. The researcher suspect that special training in reading skills will produce a change in scores for individuals in the population. A sample of n = 25 individual is selected, the average for this sample is M = 70. Is there evidence that the training has an effect on test score? LEARNING CHECK
13
© aSup-2007 Inference about Means and Mean Different 13 FACTORS THAT INFLUENCE A HYPOTHESIS TEST The size of difference between the sample mean and the original population mean The variability of the scores, which is measured by either the standard deviation or the variance The number of score in the sample σ M = σ √n√n z = M - μ σMσM
14
© aSup-2007 Inference about Means and Mean Different 14 DIRECTIONAL (ONE-TAILED) HYPOTHESIS TESTS Usually a researcher begin an experiment with a specific prediction about the direction of the treatment effect For example, a special training program is expected to increase student performance In this situation, it possible to state the statistical hypothesis in a manner that incorporates the directional prediction into the statement of H 0 and H 1
15
© aSup-2007 Inference about Means and Mean Different 15 A psychologist has developed a standardized test for measuring the vocabulary skills of 4- year-old children. The score on the test form a normal distribution with μ = 60 and σ = 10. A researcher would like to use this test to investigate the hypothesis that children who grow up as an only child develop vocabulary skills at a different rate than children in large family. A sample of n = 25 only children is obtained, and the mean test score for this sample is M = 63. LEARNING CHECK
16
© aSup-2007 Inference about Means and Mean Different 16 Chapter 9 INTRODUCTION TO t STATISTIC
17
© aSup-2007 Inference about Means and Mean Different 17 THE t STATISTIC: AN ALTERNATIVE TO z In the previous chapter, we presented the statistical procedure that permit researcher to use sample mean to test hypothesis about an unknown population Remember that the expected value of the distribution of sample means is μ, the population mean
18
© aSup-2007 Inference about Means and Mean Different 18 The statistical procedure were based on a few basic concepts: 1.A sample mean (M) is expected more or less to approximate its population mean ( μ ). This permits us to use sample mean to test a hypothesis about the population mean. 2.The standard error provide a measure of how well a sample mean approximates the population mean. Specially, the standard error determines how much difference between M and μ is reasonable to expect just by chance.
19
© aSup-2007 Inference about Means and Mean Different 19 The statistical procedure were based on a few basic concepts: 3.To quantify our inferences about the population, we compare the obtained sample mean (M) with the hypothesized population mean ( μ ) by computing a z- score test statistic
20
© aSup-2007 Inference about Means and Mean Different 20 THE t STATISTIC: AN ALTERNATIVE TO z The goal of the hypothesis test is to determine whether or not the obtained result is significantly greater than would be expected by chance.
21
© aSup-2007 Inference about Means and Mean Different 21 THE PROBLEM WITH z-SCORE A z-score requires that we know the value of the population standard deviation (or variance), which is needed to compute the standard error In most situation, however, the standard deviation for the population is not known In this case, we cannot compute the standard error and z-score for hypothesis test. We use t statistic for hypothesis testing when the population standard deviation is unknown
22
© aSup-2007 Inference about Means and Mean Different 22 Introducing t Statistic σ M = σ √n√n Now we will estimates the standard error by simply substituting the sample variance or standard deviation in place of the unknown population value S M = s √n√n Notice that the symbol for estimated standard error of M is S M instead of σ M, indicating that the estimated value is computed from sample data rather than from the actual population parameter
23
© aSup-2007 Inference about Means and Mean Different 23 z-score and t statistic σ M = σ √n√n z = M - μ σMσM S M = s √n√n t = M - μ SMSM
24
© aSup-2007 Inference about Means and Mean Different 24 The t Distribution Every sample from a population can be used to compute a z-score or a statistic If you select all possible samples of a particular size (n), then the entire set of resulting z-scores will form a z-score distribution In the same way, the set of all possible t statistic will form a t distribution
25
© aSup-2007 Inference about Means and Mean Different 25 The Shape of the t Distribution The exact shape of a t distribution changes with degree of freedom There is a different sampling distribution of t (a distribution of all possible sample t values) for each possible number of degrees of freedom As df gets very large, then t distribution gets closer in shape to a normal z-score distribution
26
© aSup-2007 Inference about Means and Mean Different 26 HYPOTHESIS TESTS WITH t STATISTIC The goal is to use a sample from the treated population (a treated sample) as the determining whether or not the treatment has any effect Known population before treatment Unknown population after treatment μ = 30μ = ? TREATMENT
27
© aSup-2007 Inference about Means and Mean Different 27 HYPOTHESIS TESTS WITH t STATISTIC As always, the null hypothesis states that the treatment has no effect; specifically H 0 states that the population mean is unchanged The sample data provides a specific value for the sample mean; the variance and estimated standard error are computed t = sample mean (from data) Estimated standard error (computed from the sample data) population mean (hypothesized from H 0 ) -
28
© aSup-2007 Inference about Means and Mean Different 28 A psychologist has prepared an “Optimism Test” that is administered yearly to graduating college seniors. The test measures how each graduating class feels about it future. The higher the score, the more optimistic the class. Last year’s class had a mean score of μ = 19. a sample of n = 9 seniors from this years class was selected and tested. The scores for these seniors are as follow: 192423271920272118 On the basis of this sample, can the psychologist conclude that this year’s class has a different level of optimism than last year’s class? LEARNING CHECK
29
© aSup-2007 Inference about Means and Mean Different 29 STEP-1: State the Hypothesis, and select an alpha level H 0 : μ = 19(there is no change) H 1 : μ ≠ 19(this year’s mean is different) Example we use α =.05 two tail
30
© aSup-2007 Inference about Means and Mean Different 30 STEP-2: Locate the critical region Remember that for hypothesis test with t statistic, we must consult the t distribution table to find the critical t value. With a sample of n = 9 students, the t statistic will have degrees of freedom equal to df = n – 1 = 9 – 1 = 8 For a two tailed test with α =.05 and df = 8, the critical values are t = ± 2.306. The obtained t value must be more extreme than either of these critical values to reject H 0
31
© aSup-2007 Inference about Means and Mean Different 31 STEP-3: Obtain the sample data, and compute the test statistic Find the sample mean Find the sample variances Find the estimated standard error S M Find the t statistic S M = s √n√n t = M - μ SMSM
32
© aSup-2007 Inference about Means and Mean Different 32 STEP-4: Make a decision about H 0, and state conclusion The obtained t statistic (t = -4.39) is in the critical region. Thus our sample data are unusual enough to reject the null hypothesis at the.05 level of significance. We can conclude that there is a significant difference in level of optimism between this year’s and last year’s graduating classes t(8) = -4.39, p<.05, two tailed
33
© aSup-2007 Inference about Means and Mean Different 33 The critical region in the t distribution for α =.05 and df = 8 Reject H 0 Fail to reject H 0 -2.3062.306
34
© aSup-2007 Inference about Means and Mean Different 34 DIRECTIONAL HYPOTHESES AND ONE-TAILED TEST The non directional (two-tailed) test is more commonly used than the directional (one- tailed) alternative On other hand, a directional test may be used in some research situations, such as exploratory investigation or pilot studies or when there is a priori justification (for example, a theory previous findings)
35
© aSup-2007 Inference about Means and Mean Different 35 A fund raiser for a charitable organization has set a goal of averaging at least $ 25 per donation. To see if the goal is being met, a random sample of recent donation is selected. The data for this sample are as follows: 20503025152040501020 LEARNING CHECK
36
© aSup-2007 Inference about Means and Mean Different 36 The critical region in the t distribution for α =.05 and df = 9 Reject H 0 Fail to reject H 0 1.883
37
© aSup-2007 Inference about Means and Mean Different 37 Chapter 10 THE t TEST FOR TWO INDEPENDENT SAMPLES
38
© aSup-2007 Inference about Means and Mean Different 38 OVERVIEW Single sample techniques are used occasionally in real research, most research studies require the comparison of two (or more) sets of data There are two general research strategies that can be used to obtain of the two sets of data to be compared: ○ The two sets of data come from the two completely separate samples (independent-measures or between-subjects design) ○ The two sets of data could both come from the same sample (repeated-measures or within subject design)
39
© aSup-2007 Inference about Means and Mean Different 39 Do the achievement scores for students taught by method A differ from the scores for students taught by method B? In statistical terms, are the two population means the same or different? Unknown µ =? Sample A Unknown µ =? Sample B Taught by Method A Taught by Method B
40
© aSup-2007 Inference about Means and Mean Different 40 THE HYPOTHESES FOR AN INDEPENDENT-MEASURES TEST The goal of an independent-measures research study is to evaluate the mean difference between two population (or between two treatment conditions) H 0 : µ 1 - µ 2 = 0 (No difference between the population means) H 1 : µ 1 - µ 2 ≠ 0 (There is a mean difference)
41
© aSup-2007 Inference about Means and Mean Different 41 THE FORMULA FOR AN INDEPENDENT- MEASURES HYPOTHESIS TEST In this formula, the value of M 1 – M 2 is obtained from the sample data and the value for µ 1 - µ 2 comes from the null hypothesis The null hypothesis sets the population mean different equal to zero, so the independent-measures t formula can be simplifier further t = sample mean difference estimated standard error population mean difference - = M 1 – M 2 S (M1 – M2)
42
© aSup-2007 Inference about Means and Mean Different 42 THE STANDARD ERROR To develop the formula for S (M1 – M2) we will consider the following points: Each of the two sample means represent its own population mean, but in each case there is some error S M = s2s2 n √ S M1-M2 = s12s12 n1n1 √ s22s22 n2n2 +
43
© aSup-2007 Inference about Means and Mean Different 43 POOLED VARIANCE The standard error is limited to situation in which the two samples are exactly the same size (that is n 1 – n 2 ) In situations in which the two sample size are different, the formula is biased and, therefore, inappropriate The bias come from the fact that the formula treats the two sample variance
44
© aSup-2007 Inference about Means and Mean Different 44 POOLED VARIANCE for the independent-measure t statistic, there are two SS values and two df values) S P 2 = SS n S M1-M2 = s12s12 n1n1 √ s22s22 n2n2 +
45
© aSup-2007 Inference about Means and Mean Different 45 HYPOTHESIS TEST WITH THE INDEPENDENT-MEASURES t STATISTIC In a study of jury behavior, two samples of participants were provided details about a trial in which the defendant was obviously guilty. Although Group-2 received the same details as Group-1, the second group was also told that some evidence had been withheld from the jury by the judge. Later participants were asked to recommend a jail sentence. The length of term suggested by each participant is presented. Is there a significant difference between the two groups in their responses?
46
© aSup-2007 Inference about Means and Mean Different 46 THE LENGTH OF TERM SUGGESTED BY EACH PARTICIPANT Group-1 scores:4 4 3 2 5 1 1 4 Group-2 scores:3 7 8 5 4 7 6 8 There are two separate samples in this study. Therefore the analysis will use the independent-measure t test
47
© aSup-2007 Inference about Means and Mean Different 47 STEP-1: State the Hypothesis, and select an alpha level H 0 : μ 1 - μ 2 = 0 (for the population, knowing evidence has been withheld has no effect on the suggested sentence) H 1 : μ 1 - μ 2 ≠ 0 (for the population, knowledge of withheld evidence has an effect on the jury’s response) We will set α =.05 two tail
48
© aSup-2007 Inference about Means and Mean Different 48 STEP-2: Identify the critical region For the independent-measure t statistic, degrees of freedom are determined by df = n 1 + n 2 – 2 = 8 + 8 – 2 = 14 The t distribution table is consulted, for a two tailed test with α =.05 and df = 14, the critical values are t = ± 2.145. The obtained t value must be more extreme than either of these critical values to reject H 0
49
© aSup-2007 Inference about Means and Mean Different 49 STEP-3: Compute the test statistic Find the sample mean for each group M 1 = 3 and M 2 = 6 Find the SS for each group SS 1 = 16 and SS 2 = 24 Find the pooled variance, and S P 2 = 2.86 Find estimated standard error S (M1-M2) = 0.85
50
© aSup-2007 Inference about Means and Mean Different 50 STEP-3: Compute the t statistic t = M 1 – M 2 S (M1 – M2) = -3 0.85 = -3.53
51
© aSup-2007 Inference about Means and Mean Different 51 STEP-4: Make a decision about H 0, and state conclusion The obtained t statistic (t = -3.53) is in the critical region on the left tail (critical t = ± 2.145). Therefore, the null hypothesis is rejected. The participants that were informed about the withheld evidence gave significantly longer sentences, t(14) = -3.53, p<.05, two tails
52
© aSup-2007 Inference about Means and Mean Different 52 The critical region in the t distribution for α =.05 and df = 14 Reject H 0 Fail to reject H 0 -2.1452.145
53
© aSup-2007 Inference about Means and Mean Different 53 LEARNING CHECK The following data are from two separate independent-measures experiments. Without doing any calculation, which experiment is more likely to demonstrate a significant difference between treatment A and B? Explain your answer. EXPERIMENT AEXPERIMENT B Treatment ATreatment BTreatment ATreatment B n = 10 M = 42M = 52M = 61M = 71 SS = 180SS = 120SS = 986SS = 1042
54
© aSup-2007 Inference about Means and Mean Different 54 A psychologist studying human memory, would like to examine the process of forgetting. One group of participants is required to memorize a list of words in the evening just before going to bed. Their recall is tested 10 hours latter in the morning. Participants in the second group memorized the same list of words in he morning, and then their memories tested 10 hours later after being awake all day. LEARNING CHECK
55
© aSup-2007 Inference about Means and Mean Different 55 LEARNING CHECK The psychologist hypothesizes that there will be less forgetting during less forgetting during sleep than a busy day. The recall scores for two samples of college students are follows: Asleep ScoresAwake Scores 151314 15131412 1615161514131112 1615171413 1214
56
© aSup-2007 Inference about Means and Mean Different 56 Sketch a frequency distribution for the ‘asleep’ group. On the same graph (in different color), sketch the distribution for the ‘awake’ group. Just by looking at these two distributions, would you predict a significant differences between two treatment conditions? Use the independent-measures t statistic to determines whether there is a significant difference between the treatments. Conduct the test with α =.05 LEARNING CHECK
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.