Center for Biofilm Engineering Marty Hamilton Professor Emeritus of Statistics Montana State University Statistical design & analysis for assessing the.

Slides:



Advertisements
Similar presentations
Inference for Regression
Advertisements

Objectives (BPS chapter 24)
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Significance Tests Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
The Two Factor ANOVA © 2010 Pearson Prentice Hall. All rights reserved.
Design of Engineering Experiments - Experiments with Random Factors
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Independent Sample T-test Formula
Part I – MULTIVARIATE ANALYSIS
Chapter 7 Analysis of ariance Variation Inherent or Natural Variation Due to the cumulative effect of many small unavoidable causes. Also referred to.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
6.4 One and Two-Sample Inference for Variances. Example - Problem 26 – Page 435  D. Kim did some crude tensile strength testing on pieces of some nominally.
Every achievement originates from the seed of determination. 1Random Effect.
PSY 307 – Statistics for the Behavioral Sciences
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Two-Sample Testing: Small Samples Problem 9.15: Bear gallbladder is used in Chinese medicine to treat inflammation. Due to the difficulty of obtaining.
13 Design and Analysis of Single-Factor Experiments:
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
ISE 352: Design of Experiments
Calculations of Reliability We are interested in calculating the ICC –First step: Conduct a single-factor, within-subjects (repeated measures) ANOVA –This.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
Laws of Logic and Rules of Evidence Larry Knop Hamilton College.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
Ch7 Inference concerning means II Dr. Deshi Ye
Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference.
1 Design of Engineering Experiments Part 2 – Basic Statistical Concepts Simple comparative experiments –The hypothesis testing framework –The two-sample.
Hypothesis Testing CSCE 587.
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
ANOVA (Analysis of Variance) by Aziza Munir
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Copyright © 2004 Pearson Education, Inc.
Design of Engineering Experiments Part 2 – Basic Statistical Concepts
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
Jeopardy Opening Robert Lee | UOIT Game Board $ 200 $ 200 $ 200 $ 200 $ 200 $ 400 $ 400 $ 400 $ 400 $ 400 $ 10 0 $ 10 0 $ 10 0 $ 10 0 $ 10 0 $ 300 $
An alternative approach to testing for a linear association The Analysis of Variance (ANOVA) Table.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
Diploma in Statistics Design and Analysis of Experiments Lecture 2.11 Design and Analysis of Experiments Lecture Review of Lecture Randomised.
DOX 6E Montgomery1 Design of Engineering Experiments Part 9 – Experiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
ANOVA: Analysis of Variance.
Lack of Fit (LOF) Test A formal F test for checking whether a specific type of regression function adequately fits the data.
1 Analysis of Variance & One Factor Designs Y= DEPENDENT VARIABLE (“yield”) (“response variable”) (“quality indicator”) X = INDEPENDENT VARIABLE (A possibly.
Multiple regression. Example: Brain and body size predictive of intelligence? Sample of n = 38 college students Response (Y): intelligence based on the.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Center for Biofilm Engineering Al Parker, Biostatistician Standardized Biofilm Methods Research Team Montana State University Statistical methods for analyzing.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
ISMT253a Tutorial 1 By Kris PAN Skewness:  a measure of the asymmetry of the probability distribution of a real-valued random variable 
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
Analysis of Variance STAT E-150 Statistical Methods.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Difference Between Two Means.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
Week 101 Test on Pairs of Means – Case I Suppose are iid independent of that are iid. Further, suppose that n 1 and n 2 are large or that are known. We.
Chapter 9 Lecture 3 Section: 9.3. We will now consider methods for using sample data from two independent samples to test hypotheses made about two population.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
F73DA2 INTRODUCTORY DATA ANALYSIS ANALYSIS OF VARIANCE.
The Differences in Ticket Prices for Broadway Shows By Courtney Snow I wanted to find out whether there was a significant difference in the price of musicals,
Analysis of variance approach to regression analysis … an (alternative) approach to testing for a linear association.
Six Easy Steps for an ANOVA 1) State the hypothesis 2) Find the F-critical value 3) Calculate the F-value 4) Decision 5) Create the summary table 6) Put.
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Comparing Three or More Means
Lecture #28 Thursday, December 1, 2016 Textbook: 16.1
Presentation transcript:

Center for Biofilm Engineering Marty Hamilton Professor Emeritus of Statistics Montana State University Statistical design & analysis for assessing the efficacy of instructional modules CS 580 April 24, 2006

Why Statistics? Provide convincing results Improve communication “...I do not mean to suggest that computers eliminate stupidity---they may in fact encourage it.” Robert P. Abelson, in Statistics as Principled Argument (cited on Rocky Ross’s CS 580 home page)

What is Statistics? Data Design Uncertainty assessment

Statistical Thinking Data Design Uncertainty assessment

Data:Choosing the quantity to measure Reliable test of knowledge Quantitative response

Statistical thinking Data Design Uncertainty assessment

After-treatment score A student used the modules, then scored 80% on the test Conclusion: modules have high efficacy

Data:Choosing the quantity to measure Reliable tests of knowledge: before-treatment test after-treatment test Quantitative response: difference in test scores, after-treatment minus before-treatment

After-treatment score High Low Test score After

Before- and after-treatment scores High Low After Before Test score Response

Difference between before- and after-treatment scores A student used the modules, then scored 50 points higher on the after- treatment test than on the before treatment test (Response = 50). Conclusion: modules have high efficacy

Anticipating criticism: “natural” improvement High Low After Before Test score without the treatment Response

Anticipating criticism Before/after observations for just the “treated” student may not accurately represent the treatment effect May need treated and untreated students (i.e., a control)

Control or comparison The control can be either a negative control or positive control A student taking a conventional classroom lecture/recitation course would provide a positive control or comparison (placebo) (best conventional)

Difference scores for each of 12 students, 6 per group Treated group Control group Difference (after – before) Of practical importance?

Study design Before and after test scores for each student in both the treated and control groups

Good study design Control or comparison Replication Randomization Anticipate criticism

Data: 20 students per group (randomly assigned?) TreatmentResponse C C C C C C C C C C C C C C C C C C C C TreatmentResponse T T T T T T T T T T T T T T T T T T T T

Analysis via Minitab 14.Minitab: FirstStudy_CS580.MTW Show data layout... matrix Stat > Basic Statistics > Display Descriptive Statistics... Ask for individual value plot Stat > Basic Statistics > 2 Sample t... Minitab output Two-Sample T-Test and CI: Response, Treatment Two-sample T for Response Treatment N Mean StDev SE Mean C T Difference = mu (C) - mu (T) Estimate for difference: % CI for difference: ( , ) T-Test of difference = 0 (vs not =): T-Value = P-Value = DF = 38 Both use Pooled StDev = Null hypothesis: true mean response for Treatment = true mean response for Control Conclusions: 1. Reject the null hypothesis because it is discredited by the data (p-value < 0.001) 2. 95% confident that the treatment mean response is between 38.6 and 61.5 larger than the true control mean response 3. Is this efficacy repeatable?

Analysis via Minitab 14 (more)

Minitab: SixStudies_CS580.MTW Show data layout... matrix Stat > Tables > Descriptive Statistics Minitab output Tabulated statistics: Replicate, Treatment Rows: Replicate Columns: Treatment C T All All Cell Contents: Response: Mean Count

Analysis via Minitab 14 (more)

Stat > ANOVA > General Linear Model... Minitab output General Linear Model: Response versus Treatment, Replicate Factor Type Levels Values Treatment fixed 2 C, T Replicate(Treatment) random 12 1, 2, 3, 4, 5, 6, 1, 2, 3, 4, 5, 6 Analysis of Variance for Response, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Treatment Replicate(Treatment) Error Total S = Variance Components, using Adjusted SS Estimated Source Value Replicate(Treatment) Variance among replicate studies Error Variance among students in same study and treatment added by Marty Total variance Repeatability Standard Deviation = 20.5 (single student) Repeatability Standard Deviation = 9.9 (mean of 20 treated students minus mean of 20 control students) Stat > Basic Statistics > Normality Test... of residuals provides an evaluation of key statistical assumption underlying the ANOVA

Analysis via Minitab 14 (more) Data copied from Tables output and pasted into the worksheet: Rep CntrlMean TrtMean Mean (Treatment minus Control) Stat > Basic Statistics > 1 sample t... analysis of 6 Means Conclusions: 1. Reject the null hypothesis because it is discredited by the data (p-value < 0.001) 2. Estimated difference in mean responses = % confident that the treatment mean response is between 36.9 and 54.9 larger than the true control mean response 4. 95% confident that the treatment mean response is at least 38.6 larger than the true control mean response 5. The efficacy measure is repeatable Note: this straightforward analysis of the six means, one mean for each of the 6 repeated studies, using a 1-sample t-test provides nearly the same results as does the ANOVA variance component analysis approach.

Trade-offs: What is the main source of variability? It is often more important to repeat the study than to expend time and materials finding a precise efficacy estimate for a single study.

Fin