Assessing the Evidence: Statistical Inference Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Objectives 10.1 Simple linear regression
Lecture 6 Outline – Thur. Jan. 29
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Inference Sampling distributions Hypothesis testing.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Stat 301 – Day 28 Review. Last Time - Handout (a) Make sure you discuss shape, center, and spread, and cite graphical and numerical evidence, in context.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Chapter 7 Sampling and Sampling Distributions
Statistics 201 – Lecture 23. Confidence Intervals Re-cap 1.Estimate the population mean with sample mean Know sample mean is unbiased estimator for 
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
5-3 Inference on the Means of Two Populations, Variances Unknown
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Overview Definition Hypothesis
Statistical Analysis Statistical Analysis
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
- Interfering factors in the comparison of two sample means using unpaired samples may inflate the pooled estimate of variance of test results. - It is.
More About Significance Tests
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Chapter 8 Introduction to Hypothesis Testing
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
A Broad Overview of Key Statistical Concepts. An Overview of Our Review Populations and samples Parameters and statistics Confidence intervals Hypothesis.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Quick and Simple Statistics Peter Kasper. Basic Concepts Variables & Distributions Variables & Distributions Mean & Standard Deviation Mean & Standard.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Medical Statistics as a science
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
How confident are we in the estimation of mean/proportion we have calculated?
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Introduction to Statistical Inference Jianan Hui 10/22/2014.
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
Copyright © 2011 Pearson Education, Inc. Putting Statistics to Work.
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
A review of key statistical concepts. An overview of the review Populations and parameters Samples and statistics Confidence intervals Hypothesis testing.
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
The t-distribution William Gosset lived from 1876 to 1937 Gosset invented the t -test to handle small samples for quality control in brewing. He wrote.
Week 13a Making Inferences, Part III t and chi-square tests.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Student’s t test This test was invented by a statistician WS Gosset ( ), but preferred to keep anonymous so wrote under the name “Student”. This.
P-values and statistical inference Dr. Omar Aljadaan.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Confidence Intervals and Hypothesis Testing Mark Dancox Public Health Intelligence Course – Day 3.
Review Statistical inference and test of significance.
Statistical Significance or Hypothesis Testing. Significance testing Learning objectives of this lecture are to Understand Hypothesis: definition & types.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Presentation transcript:

Assessing the Evidence: Statistical Inference Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research

Objectives of Session Understand idea of inference Understand idea of inference Confidence interval approach Confidence interval approach Significance testing Significance testing Briefly - Some simple tests Briefly - Some simple tests

Statistical Inference The aim is to draw conclusions (INFER) from the specific (sample) to the more general (population). Are differences between groups chance occurrences or do they represent statistically significant results (I.e. real differences)?

Extrapolating from the sample to population Illustrations Ian Christie, Orthopaedic & Trauma Surgery, Copyright 2002 University of Dundee

Two approaches: confidence intervals and hypothesis testing Confidence Intervals Random variability means that there is statistical (random) variation around any summary statistic: Mean, proportion, difference between means, etc

Confidence intervals Uncertainty expressed as a Confidence Interval: Summary statistic  constant x standard error e.g. For 95% CI constant = 1.96 from Normal distribution

Confidence intervals For a percentage the standard error is given by: So for p = 35%, se = 4.8%, where n = 100

Confidence intervals Consider a prevalence of 35% for the uptake of statins for secondary prevention of MI from one practice p = 35%, 95% CI = 35% ± 1.96x 4.8% p = 35%, 95% CI = 35% ± 1.96x 4.8% = 25.6% to 44.4% = 25.6% to 44.4%

Confidence intervals For a mean the standard error is given by: where s is the standard deviation of the distribution

Confidence intervals Consider a mean cholesterol measurement of 5.4 mmol/l for a group of 100 patients with type 2 diabetes and standard deviation s = 1.1 mmol/l  = 5.4, 95% CI = 5.4 ± 1.96x 1.1/√100  = 5.4, 95% CI = 5.4 ± 1.96x 1.1/√100 = 5.2 to 5.6 mmol/l = 5.2 to 5.6 mmol/l

Confidence intervals Confidence intervals give estimation of precision of summary statistic Precise Precise Imprecise Major determinant of precision is sample size

Illustrations Ian Christie, Orthopaedic & Trauma Surgery, Copyright 2002 University of Dundee

Confidence intervals Warning! Confidence intervals are usually interpreted in a Bayesian way even though used a frequentist method to estimate The probability of the true value lying within the confidence interval is NOT, repeat NOT 95% Bayesian confidence intervals are called CREDIBLE INTERVALS and probability of true value lying in credible interval IS 95%

Frequentist Confidence Interval means that with repeat samples… 95% Confidence interval means that 1 in 20 of proportions from repeat samples would be outside confidence interval Study Sample & 95% CI RepeatSamples… …..ad infinitum

To put it another way the 95% Confidence Interval is… …one of many that could be constructed with the assurance that 95% of the time the true value of the parameter would be included Sample & 95% CI RepeatSamples… …..ad infinitum

Statistical Inference: Hypothesis testing Are differences between groups chance occurrences or do they represent statistically significant results (I.e. real differences)? The process of inference starts from a neutral position – Null Hypothesis

Statistical Inference: Hypothesis testing The null hypothesis (H 0 ) is usually set to ‘there is no difference’  Collect data and carry out hypothesis tests  Accept or reject the null hypothesis

Legal analogy Hypothesis testing Legal trial Hypothesis test Defendant assumed innocent until proved guilty Null hypothesis assumes no difference between groups

Legal analogy Hypothesis testing Legal trial Hypothesis test Examine evidence Calculate test statistic based on evidence from sample data

Legal analogy Hypothesis testing Legal trial Hypothesis test 1.Accept evidence proves guilt 2.Evidence does not prove guilt ‘not proven’ 1.Accept significant difference between groups 2.Insufficient evidence to reject H 0

Legal Analogy Hypothesis Testing No statistical significance not same as No difference Illustrations Ian Christie, Orthopaedic & Trauma Surgery, Copyright 2002 University of Dundee

Statistical Inference: Hypothesis testing The test statistic generally consists of: Summary statistic – H 0 value Standard error of summary e.g.Test that the mean is different to zero: t = Mean – 0 Se(Mean) Se(Mean)

Statistical Inference: Hypothesis testing The test statistic is then compared with tabulated values of a distribution (e.g Normal distribution, t-distribution) Assuming the null hypothesis is true, what is the probability of obtaining the actual observed value of the test statistic, t? How likely is the value of t, to have occurred by chance alone?

Statistical Inference: Hypothesis testing Assuming the null hypothesis is true, what is the probability of obtaining the actual observed or greater value of the test statistic, t? Using distribution Of t which is similar to a Normal distribution this probability can be Obtained in figure as p = %

Statistical Inference: Hypothesis testing If probability of the occurrence of the observed value < 5% or p < 0.05 then this is unlikely to be a chance finding Result is declared statistically significant Fortunately most statistical software (e.g. SPSS) will carry out the test you request and give p-values (SPSS labels as ‘Sig’)

Two group hypothesis testing We will consider three common tests: 1. t-test for difference between two means 2. Chi-squared test (  ) for difference between two proportions 2. Chi-squared test (  2 ) for difference between two proportions 3. Logrank test for difference between two groups median survival All are easily carried out in SPSS

Are practices with access to community hospitals further away on average from general hospitals? No access Access to CH n=17 Mean = 8.68 km SD = km Se (mean) = 2.89 n=10 Mean = km SD = 5.68 km Se (mean) = 1.79

Example t-test 1 and 2 refer to the two groups N is the number in each group bar refers to the mean and X bar refers to the mean and s p is the pooled standard deviation

Example t-test t= / t= / t= With 25 degrees of freedom from t-tables p = and so the difference of is highly statistically significant

Consider a recent RCT Rimonabant vs. placebo to reduce body weight in obese people (BMI > 30kg/m 2 ) Rimonabant vs. placebo to reduce body weight in obese people (BMI > 30kg/m 2 ) Rimonabant (20 mg daily) inhibits affects of cannabinoid agonists which in turn affects energy balance Rimonabant (20 mg daily) inhibits affects of cannabinoid agonists which in turn affects energy balance Mean reduction in body weight at one year was 6.6kg vs. 1.8 kg (rimonab vs. plac) Mean reduction in body weight at one year was 6.6kg vs. 1.8 kg (rimonab vs. plac) Difference was 4.7 kg (95% CI 4.1, 5.4) Difference was 4.7 kg (95% CI 4.1, 5.4) By end of year 2 mean weight was back to start! By end of year 2 mean weight was back to start!

Are practices with access to community hospitals more likely to have training status? 12 (71%) 4 (40%) 5 (29%) 6 (60%) NoYes No Training Status Training status Community Hospital

Are practices with access to community hospitals more likely to have training status? Is the difference in proportionsIs the difference in proportions 60% - 29% = 31% well within the realms of chance or a statistically significant finding? Null hypothesis Difference = 0Null hypothesis Difference = 0 Use chi-squared (  2 ) test for significance of difference Use chi-squared (  2 ) test for significance of difference

Pearson Chi-Squared Test NoYes No training status Training status a b cd Comm. Hosp. a+cb+d a+b c+d N

Pearson Chi-Squared Test where N = a+b+c+d and |ad – bc| means take the positive value of the calculation

Pearson Chi-Squared Test = 2.44 with 1 degree of freedom df = (no. rows – 1) x (no. columns – 1) P = which is not statistically significant

More complicated analyses Introduced simple two-group tests Introduced simple two-group tests Results of more complicated analyses are expressed in the same way Results of more complicated analyses are expressed in the same way Summary statistic and 95% confidence interval Summary statistic and 95% confidence interval Usually p-value is also stated but often implicit from the confidence interval Usually p-value is also stated but often implicit from the confidence interval Beware spurious significance e.g. Beware spurious significance e.g. p = (3 d.p. are enough) p = (3 d.p. are enough) ‘Importance’ refers to size of difference ‘Importance’ refers to size of difference An ‘important’ result can be statistically non- significant An ‘important’ result can be statistically non- significant

Sacred 5% level 5% level is arbitrary 5% level is arbitrary Practical choice before computer era to make tables easier to construct Practical choice before computer era to make tables easier to construct Are p = and p = different? Are p = and p = different? In past researchers tended to only present p- values In past researchers tended to only present p- values Now emphasis is on size of effect and 95% CI Now emphasis is on size of effect and 95% CI Unfortunately, Editors still influenced by p- values leading to publication bias Unfortunately, Editors still influenced by p- values leading to publication bias

Summary Do not get carried away by p-values Do not get carried away by p-values Interpretation requires knowledge of area to put into context, but also understanding of what the tests do Interpretation requires knowledge of area to put into context, but also understanding of what the tests do A p-value close to 5% is approaching significance and may suggest it is worth investigating A p-value close to 5% is approaching significance and may suggest it is worth investigating The size of the effect may be more clinically or scientifically important The size of the effect may be more clinically or scientifically important