Comparing Samples. Last Time I talked about what could go wrong in an experiment where you compared a sample mean against a population with a known population.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Statistical Decision Making
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Lecture 13: Review One-Sample z-test and One-Sample t-test 2011, 11, 1.
S519: Evaluation of Information Systems
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Hypothesis Testing Is It Significant?. Questions What is a statistical hypothesis? What is the null hypothesis? Why is it important for statistical tests?
Introduction to Analysis of Variance (ANOVA)
The t-test Inferences about Population Means when population SD is unknown.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Review As sample size increases, the distribution of sample means A.Becomes wider and more normally shaped B.Becomes narrower and more normally shaped.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
II.Simple Regression B. Hypothesis Testing Calculate t-ratios and confidence intervals for b 1 and b 2. Test the significance of b 1 and b 2 with: T-ratios.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Comparing Means From Two Sets of Data
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
T-test Mechanics. Z-score If we know the population mean and standard deviation, for any value of X we can compute a z-score Z-score tells us how far.
Copyright © 2010 Pearson Education, Inc. Chapter 23 Inference About Means.
Education 793 Class Notes T-tests 29 October 2003.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
Modern Languages Row A Row B Row C Row D Row E Row F Row G Row H Row J Row K Row L Row M
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Statistical Power The ability to find a difference when one really exists.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
Stats Lunch: Day 4 Intro to the General Linear Model and Its Many, Many Wonders, Including: T-Tests.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
IS 4800 Empirical Research Methods for Information Science Class Notes March 13 and 15, 2012 Instructor: Prof. Carole Hafner, 446 WVH
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
Two-Sample Hypothesis Testing. Suppose you want to know if two populations have the same mean or, equivalently, if the difference between the population.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Descriptive Statistics Used to describe a data set –Mean, minimum, maximum Usually include information on data variability (error) –Standard deviation.
1 The t-distribution General comment on z and t
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Inferential Statistics Psych 231: Research Methods in Psychology.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 9 Introduction to the t Statistic
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
The ability to find a difference when one really exists.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Basic t-test 10/6.
Physics- atmospheric Sciences (PAS) - Room 201
Review Guppies can swim an average of 8 mph, with a standard deviation of 2 mph You measure 100 guppies and compute their mean What is the standard error.
Hypothesis Testing Is It Significant?.
Central Limit Theorem, z-tests, & t-tests
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2018 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Comparing Groups.
Chi-square and F Distributions
I. Statistical Tests: Why do we use them? What do they involve?
INTEGRATED LEARNING CENTER
Chapter 22 Inference About Means.
One sample problems Statistics 2126.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2018 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Psych 231: Research Methods in Psychology
Statistical Inference for the Mean: t-test
Presentation transcript:

Comparing Samples

Last Time I talked about what could go wrong in an experiment where you compared a sample mean against a population with a known population mean and standard deviation. You will build a sampling (null) distribution and set an alpha level using the population values. The population has a fixed amount of variability (SD) but the variability in the sample statistics is affected by the sample size. The smaller the sample size, the more variability in sample statistics.

Example with SE of Means SAS EG example of simulating a population Draw a single sample, get the mean Get many means Calculate the mean and SD from these means Compare vs. theoretical distribution

Critical Cut Points Given the hypothetical mean, standard deviation and sample size, you then determine what is such an usual sample that you would reject your null hypothesis (the null distribution).

Alpha and Beta Again If your sample data came from a different population, you will guess that the data for this (sub) population is centered around your sample mean and the distribution will not completely overlap the null distribution. The part of the alternative distribution (area under the curve) which does not overlap the null distribution is the power.

Graphical Example Here is an R example of cut points in the theoretic distribution and how the alternate distribution overlaps with the null distribution:

Comparing Means In reality, you will almost never have a known population mean and standard deviation and compare your sample against that. You will likely have a hypothetical population mean and you will want to see if your sample was likely to have come from the set of sample means distributed around that hypothetical population mean. Conceptually it is the same task but the shape of the sampling distribution is different when you don’t know the population SD.

Gossett described the function that describes the distribution for when you are comparing means and estimating the population SD from the sample. He figured it out while working at a brewery that would not let him publish under his own name so he published it under the name Students and called the distribution T. (Was he thinking tea?) The T distribution describes the samples when you don’t know the population standard deviation. There is extra uncertainty and that is manifested as a wider (and fatter- tailed) looking distribution.

Student’s T T with 5 df

Asymptotic T As your sample size gets bigger the T distribution looks more and more like a Z distribution. N of 30 is essentially indistinguishable from a Z.

Calculate It To do the t-test is trivially easy. First load the data into an analysis package. Graph it and then do the one sample t-test. See the example SAS Enterprise Guide project. The formula for the statistic sure looks familiar…

Two Samples If you have two samples, the formula gets a bit more complicated. Instead of using a single sample to get the guess for the population variability, you have two and if the samples are not of the same size, you want to put more trust (weight) in the larger sample.

Estimated Variance Basically you take the weighted average, with a tweak to the denominator to consider you are estimating population parameters in the formula.

The T-Statistic

Paired samples? What is your variance like if you sample the same person before and after a treatment relative to if you sampled two different people? Smaller

ANOVA To compare three or more groups you will want to use a method called ANOVA. Analysis of variance is baffling when you first see the algebra because you are looking for differences in group means by comparing variances.

How ANOVA Works Begin by looking at the overall variability in your data vs. the overall mean. Then look at the variability in your data if you compare relative to the subgroups. If there is no meaningful effect of the treatments, the overall variability will look like the variability relative to the subgroups.

Reduced Variance With the T or Z distributions you get excited if your sample mean is far from the proposed population mean. Here, you get excited if the ratio of the two variances is far from 1. You need a distribution that can describe the ratio of two variances. That distribution is the F. It has a parameter to describe the number of subjects in the two halves of the fraction.