Data Analysis II Anthony E. Butterfield CH EN 4903-1 "There is a theory which states that if ever anybody discovers exactly what the Universe is for and.

Slides:



Advertisements
Similar presentations
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
Advertisements

Objectives (BPS chapter 24)
Simple Linear Regression
Hypothesis testing Week 10 Lecture 2.
MF-852 Financial Econometrics
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Multiple regression analysis
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Sample size computations Petter Mostad
Final Jeopardy $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 LosingConfidenceLosingConfidenceTesting.
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Tuesday, October 22 Interval estimation. Independent samples t-test for the difference between two means. Matched samples t-test.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
T-test.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Inference about a Mean Part II
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
IENG 486 Statistical Quality & Process Control
Chapter 9 Hypothesis Testing.
5-3 Inference on the Means of Two Populations, Variances Unknown
AM Recitation 2/10/11.
Two Sample Tests Ho Ho Ha Ha TEST FOR EQUAL VARIANCES
II.Simple Regression B. Hypothesis Testing Calculate t-ratios and confidence intervals for b 1 and b 2. Test the significance of b 1 and b 2 with: T-ratios.
Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Data Analysis Examples Anthony E. Butterfield CH EN
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
- Interfering factors in the comparison of two sample means using unpaired samples may inflate the pooled estimate of variance of test results. - It is.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
1 Lecture 4 Main Tasks Today 1. Review of Lecture 3 2. Accuracy of the LS estimators 3. Significance Tests of the Parameters 4. Confidence Interval 5.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Chapter 7 Inferences Based on a Single Sample: Tests of Hypotheses.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
1 Econometrics (NA1031) Lecture 3 Interval Estimation and Hypothesis Testing.
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 13 Understanding research results: statistical inference.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 10: The t Test For Two Independent Samples.
Final project questions Review sessions tomorrow: 1:00-2:00 pm, 3:30-5:00 pm, 6:30-7:30 pm… all in SSC 107 You can start the exam as early as 8:30 am,
Chapter 9 Hypothesis Testing.
Regression Analysis AGEC 784.
AP Statistics Chapter 14 Section 1.
LECTURE 33: STATISTICAL SIGNIFICANCE AND CONFIDENCE (CONT.)
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Inference on Mean, Var Unknown
Statistical Methods For Engineers
Interval Estimation and Hypothesis Testing
Simple Linear Regression
Presentation transcript:

Data Analysis II Anthony E. Butterfield CH EN "There is a theory which states that if ever anybody discovers exactly what the Universe is for and why it is here, it will instantly disappear and be replaced by something even more bizarre and inexplicable. There is another theory which states that this has already happened.” ~ Douglas Adams, Hitchhiker's Guide to the Galaxy

Data Analysis II Review of Data Analysis I. Hypothesis testing. – Types of errors. – Types of tests. – Student’s T-Test Fit lines of lines to data.

Quick Review of PDFs and CDFs What is the probability of measuring a value between -0.5 and 1.5, with  =0 and  =1? What is the probability of measuring a value between -0.5 and 1.5 or between -2 and -1?

Hypothesis Testing How do we know if one hypothesis is more likely true over alternatives? Null Hypothesis (H 0 ) – The hypothesis to be tested to determine if it is true (often that the data observed are the result of random chance). Alternative Hypothesis (H i ) – A hypothesis that may be found to be the more probable source of the observations if the null hypothesis is not (often that the observations are the result of more than chance, a real effect).

Possible Types of Error in Tests Type I Error: – Rejecting a true hypothesis,  significance level  Type II Error: – Accepting a false hypothesis,  1-test’s power  Tradeoff between  and .

Testing Alternatives, Tail Tests One Tail (One-Sided) Test. – H 0 :  =  0. “Our new drug is no better than the old drug” H 1 :  >  0. “ Our new drug works better than the old one.” – H 0 :  =  0. “The catalytic converter is just as effective as it was when new.” H 1 :  <  0. “The catalytic converter has fowled.” Two Tail (Two-sided) Test. – H 0 :  =  0. “Our liquid is a Newtonian fluid.” H 1 :  ≠  0. “Our liquid is a non-Newtonian fluid.”

Student’s T-Test T-distribution : Used for small data sets, where the standard deviation is unknown. As the degrees of freedom, v, goes to ∞, the t-distribution becomes the normal distribution.

Student’s T-Test Can use to determine the likelihood of two means being the same. t

T Statistics Example The test statistic puts the data in question into a scale in which we can use the T-distribution. Is  a =  b, or  a ≠  b, or  a >  b, or  a <  b ?

T Statistics Example v = 38  ab = t = -1.53

Student’s T-Test Example Two sets of data, 10 measurements each, with different variances and with means separated by an increasing value. Note the error. What if we take 100 measurements?

Student’s T-Test for Our  Data Use t statistic and the CDF to find probability. Two-tailed test (P  2). Would need t=0.064 for 95% confidence.

Linear Fitting How to best fit a straight line, Y=b+mx, to data?

Coefficient of Determination ( R 2 ): The closer R 2 is to 1 the better the fit. Linear Fit Quality

Nonlinear Fits Linearized fits. – Prone to problems. Nonlinear fits. – Best for nonlinear equations. – End up with n nonlinear equations and n unknowns.

Fitting Example Equation: Linearized fit puts inordinate emphasis on data taken at larger values of x, in this case.

C.I. For Fitted Constants Method uses Student’s T-Test, residuals and Jacobian (Matrix of partial derivatives with respect to parameters for each data point). You may use a statistics program. For example: Matlab nlfit – get fit parameters, residuals, and Jacobian. nlparci – find the CI for parameters. nlpredici – find CI for predicted values. Open the functions, though, to see how they function (“>> open nlparci” and “>> help nlparci”).

C.I. For Fitted Constants, Example Put code for this example online, here. >> nlinfitex2 Fit to equation: y = b1 + b2 * exp(-b3 * x) x data y data b1 was 1.0, and is estimated to be: ± (95% CL) b2 was 2.0, and is estimated to be: ± (95% CL) b3 was 3.0, and is estimated to be: ± (95% CL)

Data Analysis Conclusions Data analysis is necessary to near any objective use of measurements. Must have a basic grasp on statistics. All data and calculated values should come with some confidence interval at some probability. You can reject data under some circumstances, but avoid them. Use Student’s T-Test and fitting techniques to judge if your data match theory.