Effect Size.

Slides:



Advertisements
Similar presentations
Confidence Intervals, Effect Size and Power
Advertisements

Review Find the mean square (MS) based on these two samples. A.26.1 B.32.7 C.43.6 D.65.3 E M A = 14 M B = Flood.
Ethan Cooper (Lead Tutor)
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
Behavioural Science II Week 1, Semester 2, 2002
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Understanding Research Results. Effect Size Effect Size – strength of relationship & magnitude of effect Effect size r = √ (t2/(t2+df))
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Lecture 7 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Chapter 9 Hypothesis Testing.
Hypothesis Testing Using The One-Sample t-Test
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
Overview Definition Hypothesis
Chapter 8 Introduction to Hypothesis Testing
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
1.State your research hypothesis in the form of a relation between two variables. 2. Find a statistic to summarize your sample data and convert the above.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Please hand in homework on Law of Large Numbers Dan Gilbert “Stumbling on Happiness”
Chapter 8 Single Sample Tests Part II: Introduction to Hypothesis Testing Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social &
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter Seven Point Estimation and Confidence Intervals.
Chapter 9 Introduction to the t Statistic
Dr.Theingi Community Medicine
And distribution of sample means
9.3 Hypothesis Tests for Population Proportions
Dependent-Samples t-Test
More on Inference.
Hypothesis Testing.
ESTIMATION.
Effect Size 10/15.
Independent-Samples t-test
How does the Unpaired t-Test work?
Inference and Tests of Hypotheses
Review Ordering company jackets, different men’s and women’s styles, but HR only has database of employee heights. How to divide people so only 5% of.
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
1. Estimation ESTIMATION.
Three Views of Hypothesis Testing
Review Nine men and nine women are tested for their memory of a list of abstract nouns. The mean scores are Mmale = 15 and Mfemale = 17. The mean square.
Political Research & Analysis (PO657) Session V- Normal Distribution, Central Limit Theorem & Confidence Intervals.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
More on Inference.
Ch. 8 Estimating with Confidence
Statistical Inference: One- Sample Confidence Interval
Introductory Statistics: Exploring the World through Data, 1e
Review: What influences confidence intervals?
Hypothesis Testing.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Extra Brownie Points! Lottery To Win: choose the 5 winnings numbers
Presentation transcript:

Effect Size

Effect Size If there's an effect, how big is it? How different is m from m0, or mA from mB, etc.? Separate from reliability Inferential statistics measure effect relative to standard error Tiny effects can be reliable, with enough power Danger of forgetting about practical importance Estimation vs. inference Inferential statistics convey confidence Estimation conveys actual, physical values Ways of estimating effect size Raw difference in means Relative to standard deviation of raw scores

Direct Estimates of Effect Size Goal: estimate difference in population means One sample: m - m0 Independent samples: mA – mB Paired samples: mdiff Solution: use M as estimate of m One sample: M – m0 Independent samples: MA – MB Paired samples: Mdiff Point vs. interval estimates We don't know exact effect size; samples just provide an estimate Better to report a range that reflects our uncertainty Confidence Interval Range of effect sizes that are consistent with the data Values that would not be rejected as H0

Computing Confidence Intervals CI is range of values for m or mA – mB consistent with data Values that, if chosen as null hypothesis, would lead to |t| < tcrit One-sample t-test (or paired samples): Retain H0 if Therefore any value of m0 within tcritSE of M would not be rejected i.e. m0 m0 m0 m0 M – tcritSE tcritSE M + tcritSE M M

Example Reject null hypothesis if: Confidence Interval:

Formulas for Confidence Intervals Mean of a single population (or of difference scores) M ± tcritSE Difference between two means (MA – MB) ± tcritSE Effect of sample size Increasing n decreases standard error Confidence interval becomes narrower More data means more precise estimate Always use two-tailed critical value p(|tdf| > tcrit) = a/2 Confidence interval has upper and lower bounds Need a/2 probability of falling outside either end

Interpretation of Confidence Interval Pick any possible value for m (or mA – mB) IF this were true population value 5% chance of getting data that would lead us to falsely reject that value 95% chance we don’t reject that value For 95% of experiments, CI will contain true population value "95% confidence" Other levels of confidence Can calculate 90% CI, 99% CI, etc. Correspond to different alpha levels: confidence = 1 – a Leads to different tcrit: t.crit = qt(alpha/2,df,low=FALSE) Higher confidence requires wider intervals (tcrit increases) Relationship to hypothesis testing If m0 (or 0) is not in the confidence interval, then we reject H0 m0 tcritSE M – tcritSE M + tcritSE M M

Standardized Effect Size Interpreting effect size depends on variable being measured Improving digit span by 2 more important than for IQ Solution: measure effect size relative to variability in raw scores Cohen's d Effect size divided by standard deviation of raw scores Like a z-score for means Samples d One Independent Paired True Estimated d

Meaning of Cohen's d How many standard deviations does the mean change by? Gives z-score of one mean within the other population (negative) z-score of m0 within population z-score of mA within Population B pnorm(d) tells how many scores are above other mean (if population is Normal) Fraction of scores in population that are greater than m0 Fraction of scores in Population A that are greater than mB mB mA ds m m0 ds

Cohen's d vs. t t depends on n; d does not Samples Statistic One Independent Paired d t t depends on n; d does not Bigger n makes you more confident in the effect, but it doesn't change the size of the effect