Notes on Data Collection and Analysis Dale Weber PLTW EDD Fall 2009.

Slides:



Advertisements
Similar presentations
Exercise 7.5 (p. 343) Consider the hotel occupancy data in Table 6.4 of Chapter 6 (p. 297)
Advertisements

Chapter 12 Inference for Linear Regression
Applied Econometrics Second edition
Excel Notes Phys244/246 © 2007, B.J. Lieb. Calculating Velocity The velocity is calculated by entering the following: =(B3-B2) / (A3-A2). Then drag the.
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Appendix D Example n The procedure described in Appendix D is meant to determine a battery’s performance parameters from the data taken during a HPPC test.
Econ 140 Lecture 81 Classical Regression II Lecture 8.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
Regression Regression: Mathematical method for determining the best equation that reproduces a data set Linear Regression: Regression method applied with.
Chapter 12 Simple Regression
BA 555 Practical Business Analysis
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
1 MF-852 Financial Econometrics Lecture 6 Linear Regression I Roy J. Epstein Fall 2003.
Simple Linear Regression Analysis
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Analysis of Variance Introduction The Analysis of Variance is abbreviated as ANOVA The Analysis of Variance is abbreviated as ANOVA Used for hypothesis.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
Regression Basics For Business Analysis If you've ever wondered how two or more things relate to each other, or if you've ever had your boss ask you to.
Simple Linear Regression
Example of Simple and Multiple Regression
Hydrologic Statistics
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
1 4. Curve fitting On many occasions one has sets of ordered pairs of data (x 1,...,x n, y 1,...,y n ) which are related by a concrete function Y(X) e.g.
AGB 260: Agribusiness Information Technology Arrays and Array Formulas.
Chapter 8: Regression Analysis PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with Spreadsheets,
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Error Analysis, Statistics, Graphing and Excel Necessary skills for Chem V01BL.
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
Regression and Correlation Jake Blanchard Fall 2010.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
AGB 260: Agribusiness Information Technology Arrays and Array Formulas.
Part IV Significantly Different: Using Inferential Statistics
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Excel Workshop CHEM 2001, FALL Make some calculations Always begin a function with ‘=‘ Multiply X and Y Multiply X by 50 (2 methods) – Absolute.
Excel Workshop CHEM 2001, FALL Make some calculations Always begin a function with ‘=‘ Multiply X and Y Multiply X by 50 (2 methods) – Absolute.
Simple & Multiple Regression 1: Simple Regression - Prediction models 1.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Spreadsheet Data Tables Data Table 1 allows you to change one input variable’s value at a time and record the impact that the input assumption has on several.
Sensitivity Analysis A systematic way of asking “what-if” scenario questions in order to understand what outcomes could possibly occur that would affect.
Copyright © 2005 by Nelson, a division of Thomson Canada Limited 14-0 EXCEL CHAPTER 14 PHILIP BEDIENT.
Vitor Duarte Teodoro, April Organize a table for computations... Data for linear fitting These columns accept 100 pairs (x, y) Note: columns G to.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 12 1 MER301: Engineering Reliability LECTURE 12: Chapter 6: Linear Regression Analysis.
Error Analysis, Statistics, Graphing and Excel Necessary skills for Chem V01BL.
Correlation and Regression Ch 4. Why Regression and Correlation We need to be able to analyze the relationship between two variables (up to now we have.
Correlation and Regression Stats. T-Test Recap T Test is used to compare two categories of data – Ex. Size of finch beaks on Baltra island vs. Isabela.
PreCalculus 1-7 Linear Models. Our goal is to create a scatter plot to look for a mathematical correlation to this data.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Regression and Correlation of Data Summary
AGB 260: Agribusiness Data Literacy
EXCEL: Multiple Regression
CHAPTER 3 Describing Relationships
Multiple Regression Equations
REGRESSION (R2).
Correlation, Bivariate Regression, and Multiple Regression
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Ch12.1 Simple Linear Regression
Microsoft Office Illustrated
Multiple Regression.
Using Excel to Graph Data
Statistical Analysis Determining the Significance of Data
Using Excel to Graph Data
Model Adequacy Checking
Presentation transcript:

Notes on Data Collection and Analysis Dale Weber PLTW EDD Fall 2009

Things to Consider Experiment Planning Replication Randomization Blocking Data Analysis Strength of “Effects” – Individual Factors – Factor/Factor Interaction Modeling Linear Regression

Replication 1.Using mean of replicate data gives more precise results 2.Comparing mean to raw data gives an estimate of experimental error – Standard Deviation of data is commonly used – Also, can identify Outliers Typically 3 Replicates are considered sufficent

Equal Means 2x Variance Outliers 2 close pts - suggests dropping outliers - performing another experiment

Randomization and Blocking Want to “average out” the impact of extraneous factors Ex. Weather, pressure variation, cone smoothness, etc. Compile a list of all experiments to be performed (including replicates) Perform tests in random order Roll dice or use computer (Excel –RAND) to generate random sequence

Strength of Effects Montgomery, D.C. Design and Analysis of Experiments, Effect of A: Average of High A value minus Average of Low A value

Factor/Factor Interaction Montgomery, D.C. Design and Analysis of Experiments, Effect of A at Low B: = 30 Effect of A at High B: 12 – 40 = -28 Another way to view it Since the Effect of A depends on value of B: There is Interaction

Modeling Regression Model Measured output Random Noise Coefficients Mean Factor Values Interaction Term Can add other terms to model:and so on.

(Multiple) Linear Regression You know Linear Regression from using adding trend-lines to plots in Excel For multiple independent variables, need to use LINEST function in spreadsheet 1.Make table of model terms in columns with output in last column:

(Multiple) Linear Regression (2) 2.Enter LINEST Command in blank cell Measured Data Model Input Data (Exp Factor values and combos) Force const (    to 0? T = No F = Yes Calculate Fit Statistics Least Squares Fit Coefficients  ’s – in reverse order! R 2 – value (Goodness of Fit)

(Multiple) Linear Regression (3) 3.Drag LINEST cell and Fill i.Drag box needs as many Columns as factors and factor combos in the model + 1 ii.Drag box needs 5 Rows. 4.Press F2 to convert LINEST formula and Drag box to an array. 5.Press CTRL+SHIFT+ENTER to fill

(Multiple) Linear Regression (4) 6.Use Least Squares Model to make predictions Note: 1. There is no noise term in the fit model 2. A hat (^) signifies model estimate ANY QUESTONS? Don’t Forget: - LINEST Help File Handout - Montgomery Handout