SW388R6 Data Analysis and Computers I Slide 1 One-sample T-test of a Population Mean Confidence Intervals for a Population Mean.

Slides:



Advertisements
Similar presentations
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
Advertisements

Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
SW388R6 Data Analysis and Computers I Slide 1 Paired-Samples T-Test of Population Mean Differences Key Points about Statistical Test Sample Homework Problem.
One-sample T-Test of a Population Mean
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Strategy for Complete Regression Analysis
Assumption of normality
AP Statistics – Chapter 9 Test Review
Outliers Split-sample Validation
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 9_part I ( and 9.7) Tests of Significance.
Detecting univariate outliers Detecting multivariate outliers
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Chi-square Test of Independence
Multiple Regression – Assumptions and Outliers
Multiple Regression – Basic Relationships
8/2/2015Slide 1 SPSS does not calculate confidence intervals for proportions. The Excel spreadsheet that I used to calculate the proportions can be downloaded.
Hypothesis Testing Using The One-Sample t-Test
Regression Analysis We have previously studied the Pearson’s r correlation coefficient and the r2 coefficient of determination as measures of association.
Assumption of Homoscedasticity
Testing Assumptions of Linear Regression
Logistic Regression – Complete Problems
SW388R7 Data Analysis & Computers II Slide 1 Assumption of normality Transformations Assumption of normality script Practice problems.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
Mann-Whitney and Wilcoxon Tests.
SW388R6 Data Analysis and Computers I Slide 1 Chi-square Test of Goodness-of-Fit Key Points for the Statistical Test Sample Homework Problem Solving the.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Sampling Distribution of the Mean Problem - 1
SW318 Social Work Statistics Slide 1 Estimation Practice Problem – 1 This question asks about the best estimate of the mean for the population. Recall.
Slide 1 SOLVING THE HOMEWORK PROBLEMS Simple linear regression is an appropriate model of the relationship between two quantitative variables provided.
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
SW388R7 Data Analysis & Computers II Slide 1 Logistic Regression – Hierarchical Entry of Variables Sample Problem Steps in Solving Problems.
8/23/2015Slide 1 The introductory statement in the question indicates: The data set to use: GSS2000R.SAV The task to accomplish: a one-sample test of a.
AM Recitation 2/10/11.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Estimation and Confidence Intervals
SW388R7 Data Analysis & Computers II Slide 1 Assumption of Homoscedasticity Homoscedasticity (aka homogeneity or uniformity of variance) Transformations.
9/18/2015Slide 1 The homework problems on comparing central tendency and variability extend the focus central tendency and variability to a comparison.
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
Chi-Square Test of Independence Practice Problem – 1
Stepwise Multiple Regression
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
110/10/2015Slide 1 The homework problems on comparing central tendency and variability extend our focus on central tendency and variability to a comparison.
SW388R7 Data Analysis & Computers II Slide 1 Logistic Regression – Hierarchical Entry of Variables Sample Problem Steps in Solving Problems Homework Problems.
6/2/2016Slide 1 To extend the comparison of population means beyond the two groups tested by the independent samples t-test, we use a one-way analysis.
SW388R6 Data Analysis and Computers I Slide 1 Independent Samples T-Test of Population Means Key Points about Statistical Test Sample Homework Problem.
SW388R7 Data Analysis & Computers II Slide 1 Hierarchical Multiple Regression Differences between hierarchical and standard multiple regression Sample.
6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
11/4/2015Slide 1 SOLVING THE PROBLEM Simple linear regression is an appropriate model of the relationship between two quantitative variables provided the.
11/16/2015Slide 1 We will use a two-sample test of proportions to test whether or not there are group differences in the proportions of cases that have.
11/19/2015Slide 1 We can test the relationship between a quantitative dependent variable and two categorical independent variables with a two-factor analysis.
Chi-square Test of Independence
SW388R7 Data Analysis & Computers II Slide 1 Hierarchical Multiple Regression Differences between hierarchical and standard multiple regression Sample.
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
SW388R6 Data Analysis and Computers I Slide 1 One-way Analysis of Variance and Post Hoc Tests Key Points about Statistical Test Sample Homework Problem.
SW318 Social Work Statistics Slide 1 Percentile Practice Problem (1) This question asks you to use percentile for the variable [marital]. Recall that the.
SW388R6 Data Analysis and Computers I Slide 1 Percentiles and Standard Scores Sample Percentile Homework Problem Solving the Percentile Problem with SPSS.
SW388R7 Data Analysis & Computers II Slide 1 Detecting Outliers Detecting univariate outliers Detecting multivariate outliers.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
1/5/2016Slide 1 We will use a one-sample test of proportions to test whether or not our sample proportion supports the population proportion from which.
© Copyright McGraw-Hill 2004
SW388R6 Data Analysis and Computers I Slide 1 Comparing Central Tendency and Variability across Groups Impact of Missing Data on Group Comparisons Sample.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
Statistical hypothesis Statistical hypothesis is a method for testing a claim or hypothesis about a parameter in a papulation The statement H 0 is called.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of linearity Strategy for solving problems Producing outputs for evaluating linearity Assumption.
Assumption of normality
Multiple Regression – Split Sample Validation
Presentation transcript:

SW388R6 Data Analysis and Computers I Slide 1 One-sample T-test of a Population Mean Confidence Intervals for a Population Mean

SW388R6 Data Analysis and Computers I Slide 2 Problem 1 Based on the dataset GSS2000.SAV, is the following statement true, false, or an incorrect application of a statistic? Use 0.05 as the level of significance. For the population represented by this sample of survey respondents, the mean for the variable "HIGHEST YEAR OF SCHOOL COMPLETED" is greater than the mean of found in a previous research study. 1. True 2. True with caution 3. False 4. Incorrect application of a statistic

SW388R6 Data Analysis and Computers I Slide 3 Request the statistics to evaluate normality The one-sample t-test of a population mean assumes that the test variable is normally distributed. To evaluate this assumption, we need to compute the skewness and kurtosis of the distribution.

SW388R6 Data Analysis and Computers I Slide 4 Select the variable to evaluate for normality First, move the variable "educ" to the list of variables.

SW388R6 Data Analysis and Computers I Slide 5 Request the skewness and kurtosis First, mark the checkboxes for kurtosis and skewness in the Distribution panel. Clear the checkboxes for the other statistics to cut down on the amount of output. Second. click on the continue button to close the Options dialog box.

SW388R6 Data Analysis and Computers I Slide 6 Complete the request to evaluate normality Click on the OK button to complete the request.

SW388R6 Data Analysis and Computers I Slide 7 Statistical output to evaluate normality However, since the sample size of 269 is at least 30, the central limit theorem states that the sampling distribution of statistics will follow a normal distribution, and the use of the statistical test with this variable is appropriate. The skewness of HIGHEST YEAR OF SCHOOL COMPLETED for the sample (-0.137) is within the range for normality (-1.0 to +1.0). The kurtosis of HIGHEST YEAR OF SCHOOL COMPLETED for the sample (1.246) is outside the range for normality (-1.0 to +1.0). This condition violates the assumption of normality.

SW388R6 Data Analysis and Computers I Slide 8 Request the one-sample t-test To compute a one-sample t- test of a population mean in SPSS, select the Compare Means | One-Sample T-Test command from the Analyze menu.

SW388R6 Data Analysis and Computers I Slide 9 Select the variable for the one-sample t-test Second, click on the arrow button to move the variable "educ" to the list of Test Variables. First, highlight the variable "educ" to use in the t-test.

SW388R6 Data Analysis and Computers I Slide 10 Enter the value for the population mean First, enter the value for the population mean specified in the problem (12.82) in the Test Value text box. Second, with the variable selected and the test value entered, click on the OK button to complete the request.

SW388R6 Data Analysis and Computers I Slide 11 The SPSS Output for the One-Sample T-test If the two-tailed hypothesis test is the one implied by the problem, we can computer the “Sig. (2-tailed)” probability to the level of significance stated in the problem. The SPSS output is for a two-tailed test for the research hypothesis that the mean of the population represented by the sample (13.12) is not is not equal to the population mean which we specified (12.82).

SW388R6 Data Analysis and Computers I Slide 12 Computing a one-tailed probability In this problem, the research hypothesis states that the population mean is greater than We need to derive the one-tailed probability from the “Sig. (2-tailed)” output. We do this by dividing the two-tailed probability by 2. For this output, the one-tailed probability is: / 2 = 0.048

SW388R6 Data Analysis and Computers I Slide 13 The direction of the computed one-tailed test There are two possible one-tailed tests: one for a greater than relationship and another for a less than relationship. Which of these hypotheses is associated to the probability of which we just calculated. The one-tailed probability of is associated with the comparison of the sample mean (13.12) to the specified population mean (12.82). Since the sample mean (13.12) is larger than the population mean (12.82), the probability of is the probability for a greater than relationship, which is the relationship stated in the problem.

SW388R6 Data Analysis and Computers I Slide 14 Interpret the output for the one-sample t-test The probability of the test statistic for this problem is We interpret this as the probability that we could draw a sample with a mean of or greater from a population that has a mean of Since this probability is less than or equal to the level of significance of 0.05, we reject the null hypothesis and conclude that the analysis supports the research hypothesis. Based on the one-sample t-test, the population mean for the variable "HIGHEST YEAR OF SCHOOL COMPLETED" is greater than The answer to the question is true.

SW388R6 Data Analysis and Computers I Slide 15 The probability for the other one-tailed test The one-tailed probability of is associated with the comparison of the sample mean (13.12) to the specified population mean (12.82). Since the sample mean (13.12) is larger than the population mean (12.82), the probability of is the probability for a greater than relationship, which is the relationship stated in the problem. Suppose the problem had stated the opposite relationship: “For the population represented by this sample of survey respondents, the mean for the variable "HIGHEST YEAR OF SCHOOL COMPLETED" is less than the mean of found in a previous research study.” The probability for the less than relationship is the probability below the right tail: 1.0 – =

SW388R6 Data Analysis and Computers I Slide 16 A caution about the probabilities for one- tailed tests While most of the time we would expect to do a one-tailed test that corresponds to the relationship between the sample mean and the population mean, sometimes we do test a hypothesis in the opposite direction. To detect these occasions, we need to compare the direction of a one-tailed test implied in the problem to the probability actually computed by SPSS.

SW388R6 Data Analysis and Computers I Slide 17 Problem 2 Based on the dataset GSS2000.SAV, is the following statement true, false, or an incorrect application of a statistic? Use 0.01 as the level of significance. For the population represented by this sample of survey respondents, the mean for the variable "TOTAL FAMILY INCOME" is less than the mean of found in a previous research study. 1. True 2. True with caution 3. False 4. Incorrect application of a statistic

SW388R6 Data Analysis and Computers I Slide 18 Solution 2 The one-sample t-test of a population mean requires that the variable be interval. The variable "TOTAL FAMILY INCOME" is ordinal, which does not meet this requirement. The answer to the question is incorrect application of a statistic.

SW388R6 Data Analysis and Computers I Slide 19 Problem 3 Based on the dataset GSS2000.SAV, is the following statement true, false, or an incorrect application of a statistic? Use 0.05 as the level of significance. For the population represented by this sample of survey respondents, the mean for the variable "AGE OF RESPONDENT" is different from the mean of found in a previous research study. 1. True 2. True with caution 3. False 4. Incorrect application of a statistic

SW388R6 Data Analysis and Computers I Slide 20 Request the statistics to evaluate normality The one-sample t-test of a population mean assumes that the test variable is normally distributed. To evaluate this assumption, we need to compute the skewness and kurtosis of the distribution.

SW388R6 Data Analysis and Computers I Slide 21 Select the variable to evaluate for normality First, move the variable "educ" to the list of variables.

SW388R6 Data Analysis and Computers I Slide 22 Request the skewness and kurtosis First, mark the checkboxes for kurtosis and skewness in the Distribution panel. Clear the checkboxes for the other statistics to cut down on the amount of output. Second. click on the continue button to close the Options dialog box.

SW388R6 Data Analysis and Computers I Slide 23 Complete the request to evaluate normality Click on the OK button to complete the request.

SW388R6 Data Analysis and Computers I Slide 24 Statistical output to evaluate normality The assumption of normality required by the one-sample t-test of a population mean is satisfied. The skewness of HIGHEST YEAR OF SCHOOL COMPLETED for the sample (0.595) is within the range for normality (-1.0 to +1.0). The kurtosis of HIGHEST YEAR OF SCHOOL COMPLETED for the sample (-0.351) is outside the range for normality (-1.0 to +1.0). This condition violates the assumption of normality.

SW388R6 Data Analysis and Computers I Slide 25 Request the one-sample t-test To compute a one-sample t- test of a population mean in SPSS, select the Compare Means | One-Sample T-Test command from the Analyze menu.

SW388R6 Data Analysis and Computers I Slide 26 Enter the specifications for the t-test Second, enter the value for the population mean specified in the problem (43.80) in the Test Value text box. Third, with the variable selected and the test value entered, click on the OK button to complete the request. First, highlight the variable “age" to use in the t-test.

SW388R6 Data Analysis and Computers I Slide 27 The probability of the test statistic The research hypothesis implied by the the problem is a difference between sample and population mean – a non-directional, two-tailed test. In this situation, we use the two-tailed probability output by SPSS,

SW388R6 Data Analysis and Computers I Slide 28 Decision for the one-sample t-test The probability of the test statistic for this problem is We interpret this as the probability that we could draw a sample with a mean as large as from a population with a mean of Since this probability is less than or equal to the level of significance of 0.05, we reject the null hypothesis and conclude that the analysis supports the research hypothesis. Based on the one-sample t-test, the population mean for the variable "HIGHEST YEAR OF SCHOOL COMPLETED" is different from The answer to the question is true.

SW388R6 Data Analysis and Computers I Slide 29 Steps in solving one-sample t-test problems The following is a guide to the decision process for answering One-sample T-test homework problems: Is the level of measurement requirement satisfied? Is the assumption of normality satisfied? Is the probability of the test statistic less than the level of significance? Incorrect application of a statistic Add caution if the question turns out to be true Yes False True Yes No

SW388R6 Data Analysis and Computers I Slide 30 Problem 4 Based on the dataset GSS2000.SAV, is the following statement true, false, or an incorrect application of a statistic? Use 0.05 as the level of significance. We can be 95% confident that the interval from 3.14 to 3.90 contains the population mean for the variable "NUMBER OF BROTHERS AND SISTERS". 1. True 2. True with caution 3. False 4. Incorrect application of a statistic

SW388R6 Data Analysis and Computers I Slide 31 Request the statistics to evaluate normality The one-sample t-test of a population mean assumes that the test variable is normally distributed. To evaluate this assumption, we need to compute the skewness and kurtosis of the distribution.

SW388R6 Data Analysis and Computers I Slide 32 Select the variable to evaluate for normality First, move the variable “sibs" to the list of variables.

SW388R6 Data Analysis and Computers I Slide 33 Request the skewness and kurtosis First, mark the checkboxes for kurtosis and skewness in the Distribution panel. Clear the checkboxes for the other statistics to cut down on the amount of output. Second. click on the continue button to close the Options dialog box.

SW388R6 Data Analysis and Computers I Slide 34 Complete the request to evaluate normality Click on the OK button to complete the request.

SW388R6 Data Analysis and Computers I Slide 35 Statistical output to evaluate normality The skewness of NUMBER OF BROTHERS AND SISTERS for the sample (2.391) is outside the range for normality (-1.0 to +1.0). The kurtosis of NUMBER OF BROTHERS AND SISTERS for the sample (8.700) is outside the range for normality (-1.0 to +1.0) However, since the sample size of 269 is at least 30, the central limit theorem states that the sampling distribution of statistics will follow a normal distribution, and the use of the statistical test with this variable is appropriate.

SW388R6 Data Analysis and Computers I Slide 36 Request the confidence interval To compute a confidence interval in SPSS, select the Descriptive Statistics | Explore… command from the Analyze menu. Note: the confidence interval on the SPSS One-Sample T-Test output is the confidence interval for the difference between the sample mean and the population mean, not the confidence interval for the population mean, which we need for this problem.

SW388R6 Data Analysis and Computers I Slide 37 Select the variable for the analysis Second, mark the Statistics option button to tell SPSS that we only want the Statistics output. Third, click on the Statistics... Button to specify the statistics we want. First, move the variable “sibs” to the Dependent List.

SW388R6 Data Analysis and Computers I Slide 38 Specify the confidence interval Second, type the size of the confidence interval that we want to compute. The size of the confidence interval is the level of significance specified in the problem subtracted from – 0.05 = 0.95 or 95% Third, click on the Continue Button to complete the specification. The confidence interval is part of the Descriptives output. First, mark the checkbox for Descriptives.

SW388R6 Data Analysis and Computers I Slide 39 Complete the request for a confidence interval Click on the OK button to complete the request for a confidence interval for the variable “sibs.”

SW388R6 Data Analysis and Computers I Slide 40 The confidence interval for a mean The 95% confidence interval is from 3.14 to The answer to the question is true.

SW388R6 Data Analysis and Computers I Slide 41 Alternative ways to phrase a confidence interval  It is very likely that the interval from 3.14 to 3.90 contains the population mean for the variable "NUMBER OF BROTHERS AND SISTERS".  We strongly believe that the interval from 3.14 to 3.90 contains the population mean for the variable "NUMBER OF BROTHERS AND SISTERS".  The probability is 0.95 that the interval from 3.14 to 3.90 contains the population mean for the variable "NUMBER OF BROTHERS AND SISTERS".  We can be 95% confident that the interval from 3.14 to 3.90 contains the population mean for the variable "NUMBER OF BROTHERS AND SISTERS". These statements are different ways of stating a confidence interval.

SW388R6 Data Analysis and Computers I Slide 42 Alternative ways to phrase a confidence interval The probability is 0.95 that the population mean for the variable "NUMBER OF BROTHERS AND SISTERS" lies within the interval from 3.14 to This is an incorrect statement of a confidence interval. The probability value applies to the interval and not to the population mean. The population mean is a fixed value, even though we don't know what that value is. If a homework problem were phrased this way, it would be false.

SW388R6 Data Analysis and Computers I Slide 43 Steps in solving confidence interval problems - 1 The following is a guide to the decision process for answering One-sample T-test homework problems: Is the level of measurement requirement satisfied? Is the assumption of normality satisfied? Incorrect application of a statistic Add caution if the question turns out to be true Yes No

SW388R6 Data Analysis and Computers I Slide 44 Steps in solving confidence interval problems - 2 The following is a guide to the decision process for answering One-sample T-test homework problems: Is the statement of the confidence interval phrased correctly, and are the confidence interval values correct? False True Yes No