Confidence Intervals & Effect Size. Outline of Today’s Discussion 1.Confidence Intervals 2.Effect Size 3.Thoughts on Independent Group Designs.

Slides:



Advertisements
Similar presentations
Inferential Statistics
Advertisements

1 One Tailed Tests Here we study the hypothesis test for the mean of a population when the alternative hypothesis is an inequality.
Confidence Intervals, Effect Size and Power
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Review Find the mean square (MS) based on these two samples. A.26.1 B.32.7 C.43.6 D.65.3 E M A = 14 M B = Flood.
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
PSY 307 – Statistics for the Behavioral Sciences
Behavioural Science II Week 1, Semester 2, 2002
Sample Size Determination
Tuesday, October 22 Interval estimation. Independent samples t-test for the difference between two means. Matched samples t-test.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Introduction to Analysis of Variance (ANOVA)
The t-test Inferences about Population Means when population SD is unknown.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Standard Error of the Mean
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Hypothesis Testing:.
Overview Definition Hypothesis
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
Estimation of Statistical Parameters
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
The Single-Sample t Test Chapter 9. The t Distributions >Distributions of Means When the Parameters Are Not Known >Using t distributions Estimating a.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
R. G. Bias | School of Information | SZB 562BB | Phone: | i 1 INF397C Introduction to Research in Information Studies.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Chapter Eight: Using Statistics to Answer Questions.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 6 Putting Statistics to Work.
Inferential Statistics Significance Testing Chapter 4.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Handout Seven: Independent-Samples t Test Instructor: Dr. Amery Wu
Outline of Today’s Discussion 1.The Distribution of Means (DOM) 2.Hypothesis Testing With The DOM 3.Estimation & Confidence Intervals 4.Confidence Intervals.
Hypothesis Testing. Outline of Today’s Discussion 1.Logic of Hypothesis Testing 2.Evaluating Hypotheses Please refrain from typing, surfing or printing.
Outline of Today’s Discussion 1.Introduction to Non-Linear Regression 2.Statistical Significance: Decision Errors 3.Statistical Significance: Effect Size.
Descriptive Statistics. Outline of Today’s Discussion 1.Central Tendency 2.Dispersion 3.Graphs 4.Excel Practice: Computing the S.D. 5.SPSS: Existing Files.
Outline of Today’s Discussion 1.Regression Analysis: Introduction 2.An Alternate Formula For Regression 3.Correlation, Regression, and Statistical Significance.
Independent Samples T-Test. Outline of Today’s Discussion 1.About T-Tests 2.The One-Sample T-Test 3.Independent Samples T-Tests 4.Two Tails or One? 5.Independent.
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
Chapter 13 Understanding research results: statistical inference.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
Chapter Seven Point Estimation and Confidence Intervals.
Inferential Statistics Psych 231: Research Methods in Psychology.
Chapter 10: The t Test For Two Independent Samples.
Chapter 9 Introduction to the t Statistic
Dependent-Samples t-Test
INF397C Introduction to Research in Information Studies Spring, Day 12
Effect Size.
Central Limit Theorem, z-tests, & t-tests
Significance and t testing
Inferences about Population Means
Hypothesis Testing.
Psych 231: Research Methods in Psychology
Analysis and Interpretation of Experimental Findings
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
15.1 The Role of Statistics in the Research Process
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Presentation transcript:

Confidence Intervals & Effect Size

Outline of Today’s Discussion 1.Confidence Intervals 2.Effect Size 3.Thoughts on Independent Group Designs

The Research Cycle Real World Research Representation Research Results Research Conclusions Abstraction Data Analysis MethodologyGeneralization *** 1. Observational 2. Survey 3. Experimental

Part 1 Confidence Intervals A.K.A. How Big is Your Error Bar?

Confidence Intervals “ A picture is worth a thousand … p-values! ” (say it with me)

Confidence Intervals Which graph makes a more convincing case for Drug X, and why?

Confidence Intervals In some graphs, the error bars reflect the range (min to max).

Confidence Intervals In some graphs, the error bars reflect the inter-quartile range.

Confidence Intervals In some graphs, the error bars reflect one standard deviation.

Confidence Intervals In some graphs, the error bars reflect one standard error (of the mean).*

Confidence Intervals In still other graphs, the error bars reflect a confidence interval. *

Confidence Intervals 1.Standard Error (of the Mean) – The standard deviation of the “distribution of means” (D.O.M). The standard deviation describes the average extent to which a RAW SCORE (that’s one raw score) deviates from the mean of the distribution of raw scores. The standard error describes the average extent to which a SAMPLE MEAN (that’s the mean of one sample) deviates from the mean of the distribution of means (DOM).

Three Kinds of Distributions There are three kinds of distributions A.The distribution of the population of individuals B.The distribution of a sample C.The distribution of means (of samples) Critical Thinking Question: Why is the D.O.M. so ‘ skinny ’ ? Confidence Intervals

Main Points on the D.O.M Q: Why would we want to use the standard deviation of the D.O.M.? A: So we can put a mean in context! This is similar to the rationale for knowing the SD of a distribution of raw scores…whether we have a raw score or a mean we want some CONTEXT. Confidence Intervals

Main Points on the D.O.M Example: Your new drug is given to a sample of depressed patients. Subsequently, the sample’s mean mood score is 25, whereas the mean for the population of all depressed people is 20. Did our drug have a significant effect? IT DEPENDS!!!! If the D.O.M has a standard deviation of 10 units, then our sample is not so different from the D.O.M. mean. Our drug isn’t so special. If the D.O.M. has a standard deviation of 1 unit, then our sample mean is very different from the D.O.M. mean. Our drug is hot stuff!!! Confidence Intervals

Main Points on the D.O.M The standard error IS the standard deviation of the distribution of means (DOM). We can estimate the standard deviation of the DOM from a sample. To do so, we use the equation S.E. = SD sample / sqrt( n ). Confidence Intervals Please memorize this formula!

Confidence Intervals 1.Confidence Interval – A range of values assumed, with a specified degree of confidence (i.e., probability), to include a population parameter (usually the mean). 2.Example 1: We might be, say, 95% confident that the mean height in our room is in the range between 5’ 7’’ and 5’ 9’’. 3.Example 2: We might be, say, 99% confident that the mean height in our room is in the range between 5’ 6’’ and 5’ 10’’. 4.Critical Thinking Question: Why is the 99% confidence interval wider than the 95% confidence interval?

Confidence Intervals 1.Each confidence interval has an upper bound, and a lower bound. 2.The upper & lower bounds depend on - The mean - The standard error [ s.d. / sqrt(n) ] - The confidence level (95% versus 99%) 3.The confidence level is determined by the critical value of ‘t’ (the number to beat)…

Confidence Intervals 1.If we want a 95% confidence interval, we’ll need to find ‘t’ critical value at  = If we want a 99% confidence interval, we’ll need to find ‘t’ critical value at  = Upper Bound = Mean + (t crit * S.E.) 4.Lower Bound = Mean - (t crit * S.E.)

Confidence Intervals 1.Practice Item 1: Assume that a sample in your experiment has the following features: Mean = 10 S.D. = 8 n = 16 D.F. = 15 tcrit(15) = 2.13 at 0.05 alpha level 2.Compute the 95% confidence interval.

Confidence Intervals 1.Practice Item 1: Assume that a sample in your experiment has the following features: Mean = 10 S.D. = 8 n = 16 D.F. = 15 tcrit(15) = 2.95 at 0.01 alpha level 2. Compute the 99% confidence interval.

Confidence Intervals 1.To summarize, researchers can make their error bars equal to confidence intervals, instead of the standard deviation. 2.The researchers might then say: “We are 95% confident that the population mean falls between (upper bound) and (lower bound).” 3.Larger confidence levels have larger confidence intervals.

Part 2 Effect Size

Effect Size & Meta-Analysis There is Trouble in Paradise (Say it with me)

Effect Size & Meta-Analysis 1.One major problem with Null Hypothesis Testing (i.e., inferential statistics) is that the outcome depends on sample size. 2.For example, a particular set of scores might generate a non-significant t-test with n=10. But if the exact same numbers were duplicated (n=20) the t-test suddenly becomes “significant”.

Effect Size & Meta-Analysis 1.Effect Size – The magnitude of the influence that the IV has on the DV. 2.Effect size does NOT depend on sample size! (“And there was much rejoicing!”)

Effect Size & Meta-Analysis 1.A commonly used measure of effect size is Cohen’s d. 2.Conventions for Cohen’s d: d = 0.2 small effect d = 0.5 medium effect d = 0.8 large effect

Effect Size & Meta-Analysis 1.A statistically significant effect is said to be a ‘reliable effect’… it would be found repeatedly if the sample size were sufficient. 2.Statistically significant effects are NOT LIKELY due to chance. 3.An effect can be statistically significant, yet ‘puny’. 4.There is an important distinction between statistical significance, and practical significance…

Effect Size Examples that distinguish effect size and statistical significance…. 1.Analogy to a Roulette Wheel – An effect can be small, but reliable. 2.Anecdote about the discovery of the planet Pluto -An effect can be small, but reliable. 3.Anecdote about buddy’s doctoral thesis, “Systematic non-linearities in the production of time intervals”. 4.Denison versus “Other” in S.A.T. scores.

Effect Size & Meta-Analysis 1.Potential Pop Quiz Question – Using two sentences, generate your own novel example of a meta-analysis. 2.Potential Pop Quiz Question – In your own words, explain how Cohen’s d can be helpful in a meta- analysis.

Part 3 Thoughts on Independent Groups Designs

From Shaughnessy, Zechmeister, Zechmeister (2012)

Independent Groups Designs 1.Potential Pop Quiz Question – As we’ve seen, inferential statistics can address the issue of reliability. Statistically significant effects are ‘reliable effects’. What is the ultimate test of an experiment’s reliability? (One word will do.) 2.Potential Pop Quiz Question – In your own words, explain what a conceptual replication is. Use an example of your own, or from the readings.

Independent Group Designs 1.Potential Pop Quiz Question – In your own words, explain what a matched group design is, and when it can be advantageously used. 2.Potential Pop Quiz Question – As we’ve noted many times, the scientific method has 4 goals. Which goal or goals can be met by a natural groups design, and which cannot? Explain your reasoning.