Statistical Experimental Design

Slides:



Advertisements
Similar presentations
THE CENTRAL LIMIT THEOREM
Advertisements

Sampling: Final and Initial Sample Size Determination
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Sampling Distributions (§ )
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Testing Differences Among Several Sample Means Multiple t Tests vs. Analysis of Variance.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
12.5 Differences between Means (s’s known)
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Industrial Applications of Response Surface Methodolgy John Borkowski Montana State University Pattaya Conference on Statistics Pattaya, Thailand.
Lecture 13: Review One-Sample z-test and One-Sample t-test 2011, 11, 1.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Chapter 14 Inferential Data Analysis
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
ANOVA Greg C Elvers.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Quick and Simple Statistics Peter Kasper. Basic Concepts Variables & Distributions Variables & Distributions Mean & Standard Deviation Mean & Standard.
Human-Computer Interaction. Overview What is a study? Empirically testing a hypothesis Evaluate interfaces Why run a study? Determine ‘truth’ Evaluate.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Statistics Presentation Ch En 475 Unit Operations.
Experimental Psychology PSY 433 Appendix B Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Descriptive Statistics Used to describe a data set –Mean, minimum, maximum Usually include information on data variability (error) –Standard deviation.
8.2 Testing the Difference Between Means (Independent Samples,  1 and  2 Unknown) Key Concepts: –Sampling Distribution of the Difference of the Sample.
Two-Way (Independent) ANOVA. PSYC 6130A, PROF. J. ELDER 2 Two-Way ANOVA “Two-Way” means groups are defined by 2 independent variables. These IVs are typically.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
PSY 1950 Factorial ANOVA October 8, Mean.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
Sampling Theory and Some Important Sampling Distributions.
Statistics Presentation Ch En 475 Unit Operations.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
§2.The hypothesis testing of one normal population.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Normal Distributions. Probability density function - the curved line The height of the curve --> density for a particular X Density = relative concentration.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Sample Size Needed to Achieve High Confidence (Means)
Chapter 17 Estimation and Hypothesis Tests: Two Populations.
Differences Among Groups
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical Inferences for Variance Objectives: Learn to compare variance of a sample with variance of a population Learn to compare variance of a sample.
Inferential Statistics Psych 231: Research Methods in Psychology.
Sampling Distributions
Confidence Intervals Topics: Essentials Inferential Statistics
IEE 380 Review.
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Sample Mean Distributions
AP Statistics: Chapter 7
Statistical Experimental Design
Statistics Presentation
Random Sampling Population Random sample: Statistics Point estimate
Statistics Review ChE 477 Winter 2018 Dr. Harding.
Sampling Distribution
Sampling Distribution
Confidence Intervals Topics: Essentials Inferential Statistics
Chapter 6 Confidence Intervals.
CHAPTER 6 Statistical Inference & Hypothesis Testing
Confidence intervals for the difference between two means: Independent samples Section 10.1.
Psych 231: Research Methods in Psychology
Chapter 7: The Distribution of Sample Means
ENM 310 Design of Experiments and Regression Analysis Chapter 3
Sampling Distributions (§ )
Presentation transcript:

Statistical Experimental Design A Primer by H. B. Oblad (Bruce)

Getting Answers Easier - Overview The Old Method The Better Method Simple Statistics for the Lab Let’s Try It Out!

The Old Method Experiments one variable at a time in sequence. Effect of Temperature on Yield Pressure = 1000 psi Time = 20 min Yield, wt % Temperature, °C

Next Set of Experiments Effect of Pressure Temperature = 300 °C Time = 20 min Yield, wt % Pressure, psi

More Experiments Effect of Time on Yield Temperature = 300 °C Pressure = 1000 psi Yield, wt % Time, min

What Have We Learned? 13 Experiments in 3 Factors Pressure Time Temp

What combinations of conditions have we covered? What’s still unknown? Do we know anything about the repeatability of our lab technique? Are the responses straight or curved? Can we build a meaningful model that leads to a mechanism? Could we have done less work and gotten more information? Minor information about effects of factors. Know nothing about interactions.

A Smarter Way Pressure Time Temp

2-Level Factorial Design 8 Tests (XY X = levels, Y= factors) Now know what happens over a large experimental volume. Now know the effects of factors at two surfaces. Effect of factors tested 4 x each Some information about interactions between factors. Repeatability is still unknown. Curvature?

An Even Smarter Way Pressure Time Temp

2-Level Factorial Design w/ Center Points 11 Tests (3 cntr pts), 13 Tests (5 cntr pts) Now know the effects of factors at two surfaces and within the volume. More information about interactions between factors. Repeatability is now estimated or known. Curvature can be estimated. Predictive model is easy to create.

Box-Behnken Design 3 Factor, 3 Level A fractional factorial design Spherical, so extrapolation is less risky. 15 tests (3 cp), 17 tests (5 cp)

Simple Statistics Bell Curve = Normal Dist. = Gaussian Dist. Total population or very large sample Errors in lab methodology are assumed random and normally distributed except for time. Must randomize order to bury effect of time into error. Repeated tests may be pooled to estimate std. dev. and variance.

Bell Curve = Normal Dist. 68% of area is <>+/-1 std. dev. 94% of area us <>+/- 2 std. dev. 99% of area is <>+/- 3 std. dev.

Means Testing If the means and standard deviations of the measurements are equal, the things being measured are of the same population. Opposite is true also (null hyp.) Use Student’s t-test.

Means Testing If the means are the same, the things are of the same population. Use Welch’s t-test

Analysis of Variance (ANOVA) Variance (standard deviation2) of means of several sample groups is determined by F-test. Probability criterion is used for pass/fail or probability of F being equal is given.

Factors, Responses and Interactions Numeric Factors are variable inputs to a process e.g. feed rate, temperature, pressure, component concentration, knobs, levers Categorical Factors are discrete inputs e.g. catalyst type, feed material, operator Responses are effects of changes in factors e.g. Reaction rate increases w/ temp. Factors that affect each other are said to interact e.g. drinking, driving, vs drunken driving

Rubber Band Experiment What affects the distance traveled? Factors? How many? Numeric or categorical? Which design to use? Can we make a predictive model? Any interaction of factors? Can we understand the problem better?