Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Inferential Statistics
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Inference Sampling distributions Hypothesis testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Significance and probability Type I and II errors Practical Psychology 1 Week 10.
Statistical Issues in Research Planning and Evaluation
Winslow Homer: “On The Stile” INFERENTIAL PROBLEM SOLVING Hypothesis Testing and t-tests Chapter 6:
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) What is the uncertainty.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Making Inferences for Associations Between Categorical Variables: Chi Square Chapter 12 Reading Assignment pp ; 485.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
What z-scores represent
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Chapter 6 Hypotheses texts. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Understanding Statistics in Research
Chapter 8: Hypothesis Testing and Inferential Statistics What are inferential statistics, and how are they used to test a research hypothesis? What is.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Statistical hypothesis testing – Inferential statistics I.
Inferential Statistics
Hypothesis Construction Claude Oscar Monet: The Blue House in Zaandam, 1871.
AM Recitation 2/10/11.
Overview Definition Hypothesis
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Statistical Power The ability to find a difference when one really exists.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Comparing Two Population Means
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Individual values of X Frequency How many individuals   Distribution of a population.
Beak of the Finch Natural Selection Statistical Analysis.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Hypotheses tests for means
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Hypothesis Testing State the hypotheses. Formulate an analysis plan. Analyze sample data. Interpret the results.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
Welcome to MM570 Psychological Statistics
Sampling Distributions Statistics Introduction Let’s assume that the IQ in the population has a mean (  ) of 100 and a standard deviation (  )
Welcome to MM207 Unit 7 Seminar Dr. Bob Hypothesis Testing and Excel 1.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Agenda n Probability n Sampling error n Hypothesis Testing n Significance level.
HYPOTHESIS TESTING.
Unit 5 – Chapters 10 and 12 What happens if we don’t know the values of population parameters like and ? Can we estimate their values somehow?
Hypothesis Testing: Preliminaries
Chapter 8: Hypothesis Testing and Inferential Statistics
Hypothesis Testing: Hypotheses
Chapter 6 Making Sense of Statistical Significance: Decision Errors, Effect Size and Statistical Power Part 1: Sept. 18, 2014.
CHAPTER 12 Inference for Proportions
CHAPTER 12 Inference for Proportions
Sample Mean Compared to a Given Population Mean
Sample Mean Compared to a Given Population Mean
Is this quarter fair?. Is this quarter fair? Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of.
Type I and Type II Errors
Presentation transcript:

Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific papers 2)Working on Results sections: text, tables and figures

We collect data to test or “falsify” a null hypothesis Null hypothesis: Male and female fish are of similar length Approach: Measure lengths of males and females Data: Length measurements Attributes: Mean, standard deviation, sample size Statistical test:two-sample t-test Output:Probability that the male and female length measurements are drawn from the same population of data (i.e., are similar).

Length (cm) Frequency Male mean = 6.5, SD = 0.94, n = 60 Female mean = 7.5, SD = 0.94, n = 60 t = 5.57, df = 118, P < 0.001

In this case we can be very confident that the lengths of males and females differ. That is, there is a less than 0.1% chance that their distributions were drawn from the same pool of data. However, this assumes that the fish were collected in a random (non-biased manner), and so the sampling methods are important. Regardless, how would our confidence change is the mean values were the same but the data were more variable?

Length (cm) Frequency Male mean = 6.5, SD = 2.73, n = 60 Female mean = 7.5, SD = 2.73, n = 60 t = 2.00, df = 118, P = 0.02

In this case we can still be confident that the lengths of males and females differ as the statistical test indicates that there is a less than 2% chance that their distributions were drawn from the same pool of data. Suppose we increase the variation in the data even more but still keep the means the same?

Length (cm) Frequency Male mean = 6.5, SD = 4.19, n = 60 Female mean = 7.5, SD = 4.19, n = 60 t = 1.31, df = 118, P = 0.10

By convention among scientists, a probability of 5% or less that the distributions are different when in fact they are the same is acceptable. Thus such results are referred to as “statistically significant”. This error is referred to as “alpha” or “Type I” error: the probability of falsely rejecting a null hypothesis that is true, in reality. If the probability is greater than 5% we typically conclude that the difference was not “statistically significant”. This does not mean that the distributions are necessarily the same!

Male mean = 6.5, SD = 4.19, n = 60 Female mean = 7.5, SD = 4.19, n = 60 t = 1.31, df = 118, P = 0.10 Suppose we take the last data set and double it, without making any other changes? Male mean = 6.5, SD = 4.19, n = 120 Female mean = 7.5, SD = 4.19, n = 120 t = 1.86, df = 238, P = 0.03

More on the importance of sample size If you were gambling, at what point would you suspect that a coin that was being flipped was not “honest”? 3 heads out of 5 tosses? 6 heads out of 10 tosses? 12 heads out of 20 tosses? 30 heads out of 50 tosses? 60 heads out of 100 tosses?

Contingency and probability Are men and women equally likely to be right and left handed? How will sample size influence our confidence in a result? If you want to detect a certain effect, how big should your sample be? P > 0.30 P < 0.05 P = 0.07 P = 0.20

Beta or Type II error: There is another kind of error, referred to as beta or Type II error: the probability of accepting the null hypothesis when in fact it is false. This is not always reported but whenever you see a conclusion that there was “no effect”, you should immediately ask whether the sampling was sufficient to detect an effect. It is easy to make a Type II error if: 1)the sample size was small 2)the effect was subtle 3)the data were highly variable

The probability of making a Type II error is calculated with something called “power analysis” – how powerful was our sampling in terms of the ability to detect an effect? Typically, there is a phrase more or less like this: “Power analysis indicated that these samples provided an 80% chance of detecting a 5% difference in mean length.” This allows the reader to drawn his or her own conclusions as to whether this is an acceptable risk of error.

Hypothesis testing and the burden of proof H 0 : “Null Hypothesis” usually means “no effect” or “no difference” H 0 is either true or false. The data might allow you to reject it, with some chance that it is actually true. Alternatively, the data might not allow you to reject it, but in fact it could be false. Ho is trueHo is False Ho rejectedType I error 5% at alpha = 0.05 NO ERROR Ho is not rejected NO ERRORType II error Beta

Why is this important? The null hypothesis is typically “no effect” and it must be falsified. The alternative would be to assume an effect until it was proved that there was none. In matters of public health, this may means that the burden of proof is on the physicians to show that something is harmful. Consider the process of proving that cigarette smoking contributes to the risk of lung cancer (really, rejecting the null hypothesis that there is no effect). Likewise, in environmental matters, if we use the “falsify the null hypothesis” approach, we assume no harm until harm is shown. This may take a long time, especially if the harm has a long latency period or is complicated by other factors.

In-class activity: Examine published papers to see how statistical terms are used and integrated into the text. How might this have changed over the years? Why?

In-class activity: Students work on Results sections of their papers.

Assignment: 1)Plan your paper’s Discussion. 2)Study the Discussion in the published paper. Read pages 89 – 94 in the book.