Experimental Design Dr. Anne Molloy Trinity College Dublin.

Slides:



Advertisements
Similar presentations
Analysis by design Statistics is involved in the analysis of data generated from an experiment. It is essential to spend time and effort in advance to.
Advertisements

LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Estimation of Sample Size
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Final Review Session.
Lecture 9: One Way ANOVA Between Subjects
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
One-way Between Groups Analysis of Variance
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Today Concepts underlying inferential statistics
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Sample Size Determination
Sampling Distributions & Point Estimation. Questions What is a sampling distribution? What is the standard error? What is the principle of maximum likelihood?
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 14 Inferential Data Analysis
Introduction to the design (and analysis) of experiments James M. Curran Department of Statistics, University of Auckland
Descriptive Statistics
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
Hypothesis Testing in Linear Regression Analysis
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Sampling Distribution of the Mean Central Limit Theorem Given population with and the sampling distribution will have: A mean A variance Standard Error.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Understanding the Variability of Your Data: Dependent Variable Two "Sources" of Variability in DV (Response Variable) –Independent (Predictor/Explanatory)
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
PARAMETRIC STATISTICAL INFERENCE
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
Step 3 of the Data Analysis Plan Confirm what the data reveal: Inferential statistics All this information is in Chapters 11 & 12 of text.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Section 10.1 Confidence Intervals
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Medical Statistics as a science
Retain H o Refute hypothesis and model MODELS Explanations or Theories OBSERVATIONS Pattern in Space or Time HYPOTHESIS Predictions based on model NULL.
RESEARCH & DATA ANALYSIS
T tests comparing two means t tests comparing two means.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Sample Size Determination
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Descriptive and Inferential Statistics Descriptive statistics The science of describing distributions of samples or populations Inferential statistics.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Inferential Statistics Psych 231: Research Methods in Psychology.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 11: Between-Subjects Designs 1.
MEASURES OF CENTRAL TENDENCY Central tendency means average performance, while dispersion of a data is how it spreads from a central tendency. He measures.
More on Inference.
Sample Size Determination
Psych 231: Research Methods in Psychology
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
Hypothesis testing using contrasts
Understanding Results
Power, Sample Size, & Effect Size:
More on Inference.
Psych 231: Research Methods in Psychology
Inferential Statistics
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Introductory Statistics
Presentation transcript:

Experimental Design Dr. Anne Molloy Trinity College Dublin

Ethical Approach to Animal Experimentation Replace Reduce Refine Reduce Good Experimental Design Appropriate Statistical Analysis

Good Experimental Design is Essential in Experiments using Animals The systems under study are complex with many interacting factors High variability can obscure important differences between treatment groups: – Biological variability (15-30% CV in animal responses) – Experimental imprecision (up to 10% CV). Confounding variables between treatment groups can affect your ability to interpret effects. – Is a difference due to the treatment or a secondary effect of the treatment? (e.g. weight loss, lack of appetite)

Do a Pilot Study and Generate Preliminary Data for Power Calculations. Observational study -not an experiment; an experience (RA Fisher ) Observational; generates data to give you the average magnitude and variability of the measurements of interest Gives background information on the general feasibility of the project (essentially validates the hypothesis) Allows you to get used to the system you will be working with and get information that might improve the design of the main study

Dealing with Subject Variation Choose genetically uniform animals where possible Avoid clinical and sub-clinical disease Standardize the diet and environment - house under optimal conditions Uniform weight and age (else choose a randomized block design) Replicate a sufficient number of times. – Increases the confidence of a genuine result – Allows outliers to be detected.

Some issues to think about before you set out to test your hypothesis What is the best treatment structure to answer the question? – Scientifically – Economically What type of data are being collected? – Categorical, numerical (discrete or continuous), ranks, scores or ratios. This will determine the statistical analysis to be used How many replicates will be needed per group? – Too many: wasteful; diminishing additional information – Too few: Important effects can be rejected as non- significant

Choosing the Correct Design How many treatments (independent variables)? – e.g. Dose Response over Time How many outcome measurements (dependent variables) – Aim for the maximum amount of informative data from each experiment – (but power for one) Are there important confounding factors that should be considered? – Gender, age – Dose Response over Time x Gender x Age Complex experiments with more treatment groups generally allow reduction in the number of animals per group. Continuous numerical type data generally require smaller sample sizes than categorical data

Types of Study Design Completely randomized study (basic type) – Random not haphazard sampling Randomized block design: e.g. stratify by weight or age. (removes systematic sources of variation) Factorial Design: e.g examine two or more independent variables in one study Crossover, sequential, repeated measures, split plot, latin square designs Can greatly reduce the number of animals required – ANOVA type analysis is essential

Example: You want to examine the effect of two well known drugs on tissue bio- markers Control Drug 1 Experiment 1 Control Drug 2 Experiment 2 ControlDrug 2Drug 1 Reduces animals by the number of controls in one experiment

Identify the Experimental Unit SalineDrugSalineDrug Control DietExperimental Diet Defines the independent unit of replication Cage; animal; tissue Sometimes pseudoreplication is unavoidable – so be aware of effect limitations

Power and Sample Size Calculation in the Design of Experiments What is the likelihood that the statistical analysis of your data will detect a significant effect given that the experimental treatment truly has an effect? POWER How many replicates will be needed to allow a reliable statistical judgement to be made? SAMPLE SIZE

The Information You Need What is the variability of the parameter being measured? What effect size do you want to see? What significance level to you want to detect (commonly use minimum of p=0.05)? What power do you want to have (commonly use 0.80) This information is used to calculate the sample size

Variability of the Parameter An estimate of central location: “About how much?” (e.g. the mean value) An estimate of variation: “How spread out?” (e.g. the standard deviation)

An experiment: Testing the difference between two means In an experiment we often want to test the difference between two means where the means are sample estimates based on small numbers. It is easier to detect a difference if: – The means are far apart – There is a low level of variability between measurements – There is good confidence in the estimate of the mean

Plasma Cysteine (µmol/L) Mean 235: SD 22.9: Mean 236: SD 22.3 Mean 236: SD 23.8 SEM= SD/  N SEM=22.9/  20 =5.12SEM=22.3/  50 =3.15 SEM=23.8/  2500 =0.48 Mean 235: SD 21.4 SEM=21.4/  500 =0.96 Means and SDs are about the same! Coefficient of Variation (CV) (SD/Mean)% = 10.1%

Mean = 236 SD = SD = 48 (approx) 3SD = 71 (approx) About 95% of results are between 236 ± 48 i.e. 188 and 284 About 99.7% of results are between 236 ± 71 i.e. 165 and 307 We can make the same predictions for a sample mean using SEM instead of SD

Having confidence in the estimate of the mean value Mean 235: SEM= SEMs = We can be 95% confident that true mean of the population will fall between and This is a sample. We don’t know the ‘true’ mean of the population The sample mean is our best guess of the true population mean (µ) but with a small sample there is much uncertainty and we need a wide margin of error

The effect of increasing numbers Mean 235: SD=22.9 Mean 235: SD=21.4  20 = 4.47 SEM=5.12  500 = SEM=0.96 Number of samples is increased 25 times Standard error is decreased by  25 = 5 times 95% CI of the mean is 5 times narrower

Sample size considerations: Viewpoint 1 Fix the sample size at six replicates per group and CV at 10% The significance depends on the effect size Effect you want in treated group Student’s t-test 50% differenceP< %P= %P= %P= %P= groups – control and treated 6 replicates per group; CV of the assay 10% Cut-off for a significant result P=0.05 ( Mean of treated outside the 95% CI of the controls) P=0.05

Sample size considerations: Viewpoint 2 Fix the effect size at 25% difference and CV at 10%. The significance depends on the number of replicates Number of replicates per group Student’s t-test 6P= P= P= P=0.03 2P= % difference expected; CV of the assay 10% Cut-off for a significant result P=0.05 P=0.05

Sample size considerations: Viewpoint 3 Fix the effect size at 25% and number of replicates at 6. The significance depends on the variability of the data (CV) CV of the AssayStudent’s t-test 10%P= %P= %P= %P= %P= % difference expected; 6 replicates per group Cut-off for a significant result P=0.05 P=0.05

Summary: The underlying issues in demonstrating a significant effect The size of the effect you are interested in seeing – Big – e.g. 50% difference will be seen with very few data points – Small - major considerations The precision of the measurement – Low CVs - few replicates needed – High CVs – multiple replicates

How do we interpret a non- significant result? A.There is no difference between the groups B.There is a difference but we didn’t see it (because of low numbers, SD too wide, etc.) The decision to reject or not reject the Null Hypothesis can lead to two types of error.

Interpreting a Statistical Test Correct Decision α (Type 1) Error (p value) Reject H(o) Declare that the treatment has an effect Significant β (Type II) Error Correct Decision Do not reject H(o) Declare that the treatment has no effect Not Significant The Null is false The treatment has an effect The Null is true The treatment has no effect DECISION RESULTS Reality Evidence from the experiment

β-Errors and the overlap between two sample distributions Mean sample A Continuous data range 95% CI of mean A95% CI of mean B Mean sample B Miss an effect: β-error See an effect: POWER

Some Power Calculators leads to Java applets for power and sample size calculations. Direct into Java applet site

General Formula r = 16 (CV/d) 2 r= No of replicates CV= coefficient of variation (SD/mean) (as a percent) d=difference required (as a percent) Valid for a Type I error =5% and Type II error =80%.

Some General Comments on Statistics All statistical tests make assumptions – They assume independent data points –ignore this at your peril! – They assume that the data are a good representation of the wider experimental series under study – Some assumptions are very specific to the test being carried out

Final Thoughts Ideally, to minimise the sample number, use equal numbers of control and treated animals. Ethically, if an experiment is particularly stressful, lower numbers may be desired in the treated group. This requires use of more animals overall to gain equivalent power – but can be justified. Remember - Statistical tests assume that the experiment has been done on a random sample from the complete population of similar items and that each result is an independent event. This is often not the case in laboratory research. Statistical logic is only part of the data interpretation. Scientific judgement and common sense are essential.

Dealing with Experimental Variation Randomization – Essential! – Ensures that the remaining inescapable differences are spread among all the treatment groups – Minimises potential bias – Provides a reliable estimate of the true variability – “Control treatment” must be one of the randomized arms of the experiment

Power Considerations You know the variability of the parameter being measured What effect size do you want to see? You need a minimum significance level of p=0.05 What power do you want to have (commonly use 0.80)