Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Comparing One Sample to its Population
Ethan Cooper (Lead Tutor)
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Review: What influences confidence intervals?
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Cal State Northridge  320 Andrew Ainsworth PhD Hypothesis Tests: One Sample Mean.
Topics: Inferential Statistics
Chapter 7 Sampling and Sampling Distributions
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
4. Multiple Regression Analysis: Estimation -Most econometric regressions are motivated by a question -ie: Do Canadian Heritage commercials have a positive.
PSY 307 – Statistics for the Behavioral Sciences
Intro to Statistics for the Behavioral Sciences PSYC 1900
Chapter 8 Estimation: Single Population
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Inference about a Mean Part II
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 11: Inference for Distributions
Lecture 7 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Getting Started with Hypothesis Testing The Single Sample.
June 23, 2008Stat Lecture 13 - One Mean1 Inference for a Population Mean Confidence Intervals and Tests with unknown variance and Two- sample Tests.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Review As sample size increases, the distribution of sample means A.Becomes wider and more normally shaped B.Becomes narrower and more normally shaped.
AM Recitation 2/10/11.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
1 G Lect 5b G Lecture 5b A research question involving means The significance test approach »The problem of s 2 »Student’s t distribution.
Review for Exam 2 (Ch.6,7,8,12) Ch. 6 Sampling Distribution
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Sampling Distribution of the Mean Central Limit Theorem Given population with and the sampling distribution will have: A mean A variance Standard Error.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ μ.
Single Sample Inferences
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
Statistics 101 Chapter 10. Section 10-1 We want to infer from the sample data some conclusion about a wider population that the sample represents. Inferential.
Estimates and Sample Sizes Lecture – 7.4
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Statistics for the Behavioral Sciences Second Edition Chapter 11: The Independent-Samples t Test iClicker Questions Copyright © 2012 by Worth Publishers.
H1H1 H1H1 HoHo Z = 0 Two Tailed test. Z score where 2.5% of the distribution lies in the tail: Z = Critical value for a two tailed test.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Week 6 October 6-10 Four Mini-Lectures QMM 510 Fall 2014.
Estimating a Population Mean
Chapter 8 Parameter Estimates and Hypothesis Testing.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Monday, October 22 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Mystery 1Mystery 2Mystery 3.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Class 5 Estimating  Confidence Intervals. Estimation of  Imagine that we do not know what  is, so we would like to estimate it. In order to get a point.
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 9 Introduction to the t Statistic
Outline Sampling Measurement Descriptive Statistics:
Lecture Nine - Twelve Tests of Significance.
Assumptions For testing a claim about the mean of a single population
Sampling Distributions and Estimation
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Review: What influences confidence intervals?
Monday, October 19 Hypothesis testing using the normal Z-distribution.
Estimates and Sample Sizes Lecture – 7.4
Introduction to the t Test
Presentation transcript:

Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample

Hypothesis Testing with Means To this point, we have discussed the use of hypothesis testing in general. To this point, we have discussed the use of hypothesis testing in general. We have also examined the use of correlation and regression techniques to determine if there is a linear relation between two variables. We have also examined the use of correlation and regression techniques to determine if there is a linear relation between two variables. Over the next few weeks, we will focus on hypothesis tests involving differences among means. Over the next few weeks, we will focus on hypothesis tests involving differences among means.

Hypothesis Testing with Means Comparisons among means represents a common analysis when conducting experiments. Comparisons among means represents a common analysis when conducting experiments. Individuals are assigned to groups Individuals are assigned to groups Nominal scales Nominal scales Control and treatment groups Control and treatment groups Goal is to determine if the differences between group means are likely due to sampling error or not. Goal is to determine if the differences between group means are likely due to sampling error or not.

Hypothesis Testing with Means We will begin with the case of comparing results of a single sample to that of known parameters. We will begin with the case of comparing results of a single sample to that of known parameters. For example, comparing results on intelligence, language development, etc., to known population norms For example, comparing results on intelligence, language development, etc., to known population norms The null hypothesis will take the form where we postulate a specific value: The null hypothesis will take the form where we postulate a specific value: After this, we will move to the more common tack of comparing two groups to one another. After this, we will move to the more common tack of comparing two groups to one another.

An Example We know that in the population, the mean IQ = 100 with a sd = 15. We know that in the population, the mean IQ = 100 with a sd = 15. Let’s say that a new supplement is being marketed as a pill capable of increasing intelligence. Let’s say that a new supplement is being marketed as a pill capable of increasing intelligence. To examine this claim, we could have a sample of 100 individuals take the pill for a month and then give them an IQ test. To examine this claim, we could have a sample of 100 individuals take the pill for a month and then give them an IQ test. Let’s say the score from this sample is 105. Let’s say the score from this sample is 105. Did the pill work? Did the pill work?

An Example To answer this question, we will have to examine the sampling distribution of the mean to determine the standard error. To answer this question, we will have to examine the sampling distribution of the mean to determine the standard error. Even if the null is true, sample means would be expected to vary around the population mean of 100. Even if the null is true, sample means would be expected to vary around the population mean of 100. The Central Limit Theorem allows us to obtain information about the nature of the sampling distribution for any variable. The Central Limit Theorem allows us to obtain information about the nature of the sampling distribution for any variable.

Central Limit Theorem Given a population mean and variance, the sampling distribution of the mean will have a mean and variance defined as below. The distribution will also approach the normal distribution as N increases. Given a population mean and variance, the sampling distribution of the mean will have a mean and variance defined as below. The distribution will also approach the normal distribution as N increases. The CLT thus allows us to The CLT thus allows us to calculate the mean and sd of the sampling distribution, but also dictates that it will be normal with larger N.

Central Limit Theorem If the variable is normally distributed in the population, the sampling distribution will be normal regardless of N. If the variable is normally distributed in the population, the sampling distribution will be normal regardless of N. Even if the population distribution is very non- normal (e.g., rectangular, skewed), the sampling distribution will approach normality as N increases. Even if the population distribution is very non- normal (e.g., rectangular, skewed), the sampling distribution will approach normality as N increases. Why? Why?

N=3 N=10

Back to the Example Sampling distribution for our example: Sampling distribution for our example: Given that we know the population variance, we can calculate a z-score for our group. Given that we know the population variance, we can calculate a z-score for our group.

Testing a Sample Mean When  is Unknown: One-Sample t Test More frequently, it may be the case that we know the norms for a population in terms of means, but do not know the variance. More frequently, it may be the case that we know the norms for a population in terms of means, but do not know the variance. We must estimate the population variance from the sample variance. We must estimate the population variance from the sample variance. In these cases, we cannot use the z-scores to calculate probabilities, as the sampling distribution of sample sd’s is positively skewed. In these cases, we cannot use the z-scores to calculate probabilities, as the sampling distribution of sample sd’s is positively skewed. On average, this implies that we will often underestimate the population variance, which will result in lower estimates of the standard error. On average, this implies that we will often underestimate the population variance, which will result in lower estimates of the standard error. What problem will this cause with hypothesis testing? What problem will this cause with hypothesis testing? Increases in Type I errors Increases in Type I errors

The sample sd is an unbiased estimator of the population sd, but any single sample sd is likely to underestimate the population sd. The sample sd is an unbiased estimator of the population sd, but any single sample sd is likely to underestimate the population sd. Standard error calculations using the sample sd will usually produce probability values that are too low (i.e., z scores that are too high). Standard error calculations using the sample sd will usually produce probability values that are too low (i.e., z scores that are too high).

Student’s t Distribution To remedy this fact, we use the t distribution to calculate probabilities instead of the normal distribution. To remedy this fact, we use the t distribution to calculate probabilities instead of the normal distribution. The t distribution is a family of sampling distributions whose shape varies as a function of sample size. The t distribution is a family of sampling distributions whose shape varies as a function of sample size. As sample size gets larger, the t distribution approaches the normal. As sample size gets larger, the t distribution approaches the normal. Smaller samples produce “fatter” tails. Smaller samples produce “fatter” tails.

Student’s t Distribution t distributions represent deviations as t’s rather than z’s. t distributions represent deviations as t’s rather than z’s. t’s are simply “z-scores” calculated using a standard error based on the sample variance. t’s are simply “z-scores” calculated using a standard error based on the sample variance.

Student’s t Distribution Degrees of Freedom refer to the number of information items free to vary. Degrees of Freedom refer to the number of information items free to vary. When calculating the variance in a sample, we use the sample mean. Thus, N-1 items are allowed to vary. When calculating the variance in a sample, we use the sample mean. Thus, N-1 items are allowed to vary. Once we know the sample mean and N-1 items, the final item is contrained. Once we know the sample mean and N-1 items, the final item is contrained. Thus, the shape of t distributions vary with degrees of freedom. Thus, the shape of t distributions vary with degrees of freedom. Applet Applet

Back to the Example Let’s go back to the IQ example, except we’ll now assume that we don’t know the population sd, but the sample sd=20. Let’s go back to the IQ example, except we’ll now assume that we don’t know the population sd, but the sample sd=20. Does a score of 105 in our sample suggest increased intelligence? Does a score of 105 in our sample suggest increased intelligence? To answer this question, we find the critical t value for 99 df’s. To answer this question, we find the critical t value for 99 df’s. t is significant, but with less probability than using the normal distribution. t is significant, but with less probability than using the normal distribution. What would happen if we used a smaller sample (N=11) What would happen if we used a smaller sample (N=11) Assuming that the standard error was 11 as well. Assuming that the standard error was 11 as well.

Factors that Affect the Magnitude of t Size of the obtained difference between means Size of the obtained difference between means Larger difference = larger t Larger difference = larger t Magnitude of sample variance Magnitude of sample variance Larger variance = smaller t Larger variance = smaller t Sample size Sample size Larger sample size = larger t Larger sample size = larger t

Confidence Limits on the Mean A specific estimate for a parameter value is termed a point estimate. A specific estimate for a parameter value is termed a point estimate. We can also set confidence intervals. We can also set confidence intervals. A range of values with an associated probability that the true parameter value is contained within the range. A range of values with an associated probability that the true parameter value is contained within the range.

Confidence Limits on the Mean Let’s say we want to know the 95% confidence interval for the population mean IQ as estimated from our sample. Let’s say we want to know the 95% confidence interval for the population mean IQ as estimated from our sample. We know the sample mean and standard error (i.e., sd of sampling distribution for t for 99 df’s). We know the sample mean and standard error (i.e., sd of sampling distribution for t for 99 df’s). Note that CI does not include 100. Note that CI does not include 100. Our sample must represent a different population. Our sample must represent a different population.