Lecture 2: Statistical Overview Elizabeth Garrett Child Psychiatry Research Methods Lecture Series.

Slides:



Advertisements
Similar presentations
Chapter 7 Hypothesis Testing
Advertisements

ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Hypothesis Testing making decisions using sample data.
Departments of Medicine and Biostatistics
Estimation of Sample Size
Sample Size Estimation
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Data Freshman Clinic II. Overview n Populations and Samples n Presentation n Tables and Figures n Central Tendency n Variability n Confidence Intervals.
Data Analysis Statistics. Inferential statistics.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Inference about Population Parameters: Hypothesis Testing
Sample Size and Statistical Power Epidemiology 655 Winter 1999 Jennifer Beebe.
Sample Size Determination Ziad Taib March 7, 2014.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Inferential Statistics
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Sample size calculation
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Overview Definition Hypothesis
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Dr Mohammad Hossein Fallahzade Determining the Size of a Sample In the name of God.
Multiple Choice Questions for discussion
Gil Harari Statistical considerations in clinical trials
More About Significance Tests
Comparing Two Population Means
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
How to Teach Statistics in EBM Rafael Perera. Basic teaching advice Know your audience Know your audience! Create a knowledge gap Give a map of the main.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
An Introduction to Statistics and Research Design
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Sample Size Considerations for Answering Quantitative Research Questions Lunch & Learn May 15, 2013 M Boyle.
Medical Statistics as a science
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
Going from data to analysis Dr. Nancy Mayo. Getting it right Research is about getting the right answer, not just an answer An answer is easy The right.
Chapter Eight: Using Statistics to Answer Questions.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun Practice 3
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Sample Size Determination
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Descriptive and Inferential Statistics Descriptive statistics The science of describing distributions of samples or populations Inferential statistics.
Chapter 13 Understanding research results: statistical inference.
Introdcution to Epidemiology for Medical Students Université Paris-Descartes Babak Khoshnood INSERM U1153, Equipe EPOPé (Dir. Pierre-Yves Ancel) Obstetric,
Statistical Significance or Hypothesis Testing. Significance testing Learning objectives of this lecture are to Understand Hypothesis: definition & types.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Sample size.
Presentation transcript:

Lecture 2: Statistical Overview Elizabeth Garrett Child Psychiatry Research Methods Lecture Series

Two Types of Statistics Descriptive Statistics –Uses sample statistics (e.g. mean, median, standard deviation) to describe the sample and the population from which it was drawn. –Not “decision” oriented –Pilot studies are descriptive Statistical Inference –Inference: The act of passing from statistical sample data to generalizations …. usually with calculated degrees of certainty. – Key elements: Ô sample Ô generalizations Ô certainty – Often used for making decisions: Ô drug works or it doesn’t Ô ADHD is genetically inherited or it isn’t

Example 1: “Viral Exposure and Autism” (Deykin and MacMahon, 1979) Hypothesis: –Direct exposure to or clinical illness with measles, mumps, or chicken pox may play a causal role in autism.

Example 2: “Neurobiology of Attention in Fetal Alcohol Syndrome” (Lockhart, 2001?) Hypotheses: (1) The neurobiological basis of problems in response inhibition and motor impersistence in children with FAS is related to abnormalities in the “anterior” frontostriatal network. (2) The neurobiological basis of problems in orienting/shifting attention in children with FAS is related to abnormalities in the “posterior” parietal network.

4 Statistical Plan 4.1 Primary outcome(s) 4.2 Statistical analysis 4.3 Sample size justification

4 Statistical Plan 4.1 Primary outcome(s) Common Problem: Primary outcome variable not defined!

Defining Primary Outcome Variables Continuous –MRI volumes –fMRI activation levels –blood pressure –response time –number of voxels activated –cost of hospital visit –neurobehavioral test score

Categorical Nominal Binary (two categories) –gene carrier status (as diagnosed by….) –measles (as diagnosed by….) –ADHD (as diagnosed by….) Polychotomous (more than two unordered categories) –region of activation Ordinal –severity score (see BPI) –symptom rating –“on a scale of 1 to 5….”

Example 1: Primary outcomes Disease history of measles mumps chicken pox Example 2: Primary outcomes MRI volumes of corpus collosum caudate cerebellar vermis parietal lobes frontal lobe

4 Statistical Plan 4.1 Primary outcomes - Be clear about each variable and how it is measured. - NOT okay to say “our primary outcome variable is cognition.” - It IS okay to say “our primary outcome variable is cognition as measured by the WISC-III.” - Multiple outcomes are okay: e.g. MRI volumes and cognitive tests can both be primary outcomes.

4 Statistical Plan 4.1 Primary outcome(s) 4.2 Statistical analysis - How are you going to answer specific aims using primary outcome variable?

Commonly seen statistical methods in analysis plans: –t-test –confidence interval –Chi-square test –Fisher’s exact test –linear regression –logistic regression –Wilcoxon rank sum test –ANOVA –GEE

Key Idea: Data Reduction Statistics is the art/science of summarizing a large amount of information by just a few numbers and/or statements. Examples: pvalue = 0.01 OR = 5.0 prevalence = 0.20  0.05

Example 1: Recall aim: To compare measles history in autistic versus non-autistic kids. Methods: –Odds ratio: Quantifies risk of disease in two exposure groups –Confidence interval: Answers “What is reasonable range for true odds ratio?” –Fisher’s exact test: Answers “Is the risk the same in the two exposure groups?”

Statistical Analysis “We will measure the risk of autism associated with measles using an odds ratio. Significance will be assessed by Fisher’s exact test and a 95% confidence interval will be calculated.”

Example 2: Recall aim: To compare MRI volumes in FAS kids and controls. Methods: –Two-sample t-test: Answers “are the mean volumes in the two groups different?” –95% confidence interval: Answers “what is the estimated difference in volumes in the two groups, approximately?”

Statistical Analysis “To answer the specific aims, we will compare the caudate volumes in the FAS group to those in the control group using a two sample t-test. We will also estimate a 95% confidence interval to provide a reasonable range of the difference in mean volumes in the two groups.”

4 Statistical Plan 4.1 Primary outcome(s) 4.2 Statistical analysis - Data reduction is key: How are you going to combine information from all patients to answer scientific question? - Specific methods need to be designated. - Study design often changes after statistical issues are considered!

4 Statistical Plan 4.1 Primary outcome(s) 4.2 Statistical analysis 4.3 Sample size justification - Do you have enough subjects to answer the question, but not too many so that you are efficient (in terms of money and risks)?

Power and Sample Size Considerations All about precision! (Recall Craig last time) Intuition: –the more individuals, the better your estimate –the more individuals, the less variability in your estimate –the more individuals, the more precise your estimate –but, how precise need your estimate be? Example 1: –Odds ratio of measles for autism: 3.7 –Interpretation: Babies exposed to measles prenatally or in early infancy are at 3.7 times the risk for autism compared to children who are unexposed. –Strong result?

Three Theoretical Outcomes 95% confidence intervals

Actual Result from Study 95% Confidence interval: (0.97, 14.2) Fisher’s exact pvalue = 0.12

Magnitude versus Significance Magnitude of finding: How big is the odds ratio? Statistical significance of the finding: Is the odds ratio different than 1? Clinical significance of the finding: Is the size of the estimated odds ratio worth worrying about? Autism and Measles: –exposure to measles is rare –need a lot of subjects to show significant difference!

Justifying sample size in a study design Hypothesis testing: H o : OR=1 H a : OR=3 Which is a more reasonable conclusion? Issues: type 1 error (  ) type 2 error (  ) HoHo HaHa

Type I and II Errors Type I error (  ): –The probability that we reject Ho given that it is true –The probability that we find an association between measles and autism when, in truth, one does not exist. Type II error (  ): –The probability that we reject Ha given that it is true –The probability that we find no association between measles and autism when, in truth, one does exist. Note: Power = 1 - 

Sample size dictates overlap Small samples Large samples Scenario 1: Scenario 2:

Decision Rule Before study is completed, you know what you need to observe to find evidence for OR=1 or OR=3 Scenario 1: If observed OR > 3.6, then conclude that there IS an association Scenario 2: If observed OR > 1.6, then conclude that there IS an association.

Type I Error: alpha Alpha usually predetermined = 0.05

Type II Error: beta Beta is figured out conditional on alpha. If sample size is small, beta will be big If sample size is big, beta will be small  = 0.60  =0.02

Power: 1- beta Power is 1 - beta. If sample size is small, power will be small If sample size is large, power will be large Power = 0.40 Power = 0.98

Power/Sample Size Estimate Kids with Autism: N = 608 Kids without Autism: N = 1216 “ Using Fisher’s exact test, we have 80% power with alpha = 0.05 to detect an odds ratio of 3 if we enroll 608 children with autism and 1216 normal controls. This assumes that 3% of autistic children have been exposed to measles and 1% of the controls have been exposed.”

Sample Size Table (80% power, alpha 0.05)

Example 2: FAS and controls How many FAS children and controls do we need to detect a significant difference in MRI volumes? From previous research we can estimate (i.e. guess): –Volumes of cerebellar vermis in FAS kids are approximately 400. –It would be interesting if FAS kids had volumes 10% or more less than normal controls (i.e. 400 versus 450).

Sample size needed depends on overlap between FAS and control kids. FAS control FAS

Two sample t-test Same general approach as the odds ratio Define  = difference in mean volumes = control mean - FAS mean H 0 :  = 0 H a :  = 50 Same thing: which hypothesis is more reasonable based on our data? Note: Based on previous research, we can estimate that the standard deviaion of volumes is 70.

What if N = 100 (50 per group)? Alpha = 0.05 Beta = 0.06

Power/Sample Size Options For power = 80%, alpha = FAS and 32 controls For power = 90%, alpha = FAS and 43 controls “To achieve 80% power with a type I error of 5%, we require 32 FAS kids and 32 controls. This will allow us to detect a 10% difference in mean MRI volumes of cerebellar vermis (400 versus 450, respectively) assuming standard deviations of 70 in each group.”

4 Statistical Plan 4.1 Primary outcome(s) 4.2 Statistical analysis 4.3 Sample size justification -Explain justification in terms of statistics. Saying “we are confident that 10 subjects will provide….” is not sufficient.

General Biostatistics References Practical Statistics for Medical Research. Altman. Chapman and Hall, Medical Statistics: A Common Sense Approach. Campbell and Machin. Wiley, 1993 Principles of Biostatistics. Pagano and Gauvreau. Duxbury Press, Fundamentals of Biostatistics. Rosner. Duxbury Press, 1993.