MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Sampling: Final and Initial Sample Size Determination
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
MEGN 537 – Probabilistic Biomechanics Ch.4 – Common Probability Distributions Anthony J Petrella, PhD.
1 Choice of Distribution 1.Theoretical Basis e.g. CLT, Extreme value 2.Simplify calculations e.g. Normal or Log Normal 3.Based on data: - Histogram - Probability.
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Chapter 7 Sampling and Sampling Distributions
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
IEEM 3201 Two-Sample Estimation: Paired Observation, Difference.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Inference about a Mean Part II
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Inferences About Process Quality
BCOR 1020 Business Statistics
Review of normal distribution. Exercise Solution.
Chapter 4 Continuous Random Variables and Probability Distributions
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Mid-Term Review Final Review Statistical for Business (1)(2)
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Chi-squared Tests. We want to test the “goodness of fit” of a particular theoretical distribution to an observed distribution. The procedure is: 1. Set.
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Determination of Sample Size: A Review of Statistical Theory
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Probability = Relative Frequency. Typical Distribution for a Discrete Variable.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
ENGR 610 Applied Statistics Fall Week 4 Marshall University CITE Jack Smith.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Confidence Intervals for Variance and Standard Deviation.
Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5.
Mystery 1Mystery 2Mystery 3.
© Copyright McGraw-Hill 2004
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Point Estimates point estimate A point estimate is a single number determined from a sample that is used to estimate the corresponding population parameter.
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 9: One- and Two-Sample Estimation Problems: 9.1 Introduction: · Suppose we have a population with some unknown parameter(s). Example: Normal( ,
Section 6.4 Inferences for Variances. Chi-square probability densities.
9-1 ESTIMATION Session Factors Affecting Confidence Interval Estimates The factors that determine the width of a confidence interval are: 1.The.
MEGN 537 – Probabilistic Biomechanics Ch.4 – Common Probability Distributions Anthony J Petrella, PhD.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
ESTIMATION.
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Sampling Distributions and Estimation
Parameter, Statistic and Random Samples
CONCEPTS OF ESTIMATION
MEGN 537 – Probabilistic Biomechanics Ch
Discrete Event Simulation - 4
Chapter 6 Confidence Intervals.
Presentation transcript:

MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD

Determination of Distribution The underlying distribution can be established in one of the following ways: Drawing a frequency diagram Plotting the data on probability paper Conducting statistical tests known as goodness-of-fit tests for distribution

Probability Paper Gumbel (1954) N observations (X 1, X 2, X 3 …X N ) Arrange Data in increasing order ith value is plotted at the CDF of i/(N+1)

Probability Paper

Plotted versus Normal Dist

Goodness of Fit Question: Whether two independent samples come from identical continuous distributions? Dataset compared to the theoretical distribution Restated: Is the theoretical distribution an acceptable representation of the dataset? Chi Square based on PDF Kolmogorov-Smirnov based on the CDF

Based on error between the observed and assumed PDF of the distribution Methodology: Arrange N data points in increasing order Break data into m intervals Determine: n i – observed frequency of data points in interval “i” e i – theoretical Frequency of data points in interval “i” Chi-Square Test (  2 )

Methodology: Determine c 1- ,f  = Significance Level (usually between 1% and 10%) f = degrees of freedom = m – 1 – k m = # of intervals k = # of distribution parameters (= 2 for normal or lognormal) Obtain c 1- ,f from Appendix 3 The assumed distribution is acceptable at the significance level  if: Chi-Square Test (  2 ) NOTE: m should be > = 5 to obtain satisfactory results

Significance Level,  Significance level, , represents probability that any differences between sample and theoretical distribution are due to chance A higher value implies a more stringent requirement to accept proposed distribution, i.e., better agreement Values as low as 1% to 10% are common

Example (Haldar 5.2)

a) Uniform distributed random variables Ordinary graph paper can be prob. paper b)

Example (Haldar 5.2) c) f = m – 1 - k

Example (Haldar 5.5) Perform Chi-square test on the data from Problem 3.1 n = 30 data points Can the underlying distribution be accepted as normal at a 5% significance level? f = degrees of freedom = m – 1 – k m = # of intervals k = # of distribution parameters

Solution (Haldar 5.5a)

Kolmogorov-Smirnov (K-S) Test Based on the error between the observed and assumed CDF of the distribution Methodology: Arrange data in increasing order and assign index, m to each data point where m = 1,2,…,n Determine S n (x i ) = manual CDF: S n (x i ) = 0; x < x 1 S n (x i ) = m/n; x m ≤ x ≤ x m+1 S n (x i ) = 1;x ≥ x n Determine F X (x i ) = Assumed distribution

K-S Test Methodology: Determine D n = max| F x (x i ) – S n (x i ) | Determine D n   = Significance Level D n  value found in Appendix 4 The assumed distribution is acceptable at the significance level  if the maximum difference D n is less than or equal to the tabulated value of D n 

Example (Haldar 5.8) Perform K-S test on the data from Problem 3.1. Can the underlying distribution be accepted as normal at a 5% significance level?

Solution (Haldar, 5.8)

Parameter Estimation

Method of Moments Moments are statistical parameters of a dataset 1 st moment (mean = E(X)) 2 nd central moment (Var(X)) 3 rd central moment (skewness) Distribution parameters are derived from the moments PDF forms and parameters for distributions in Table 5.6 on page 118 All are based on first two moments, E(x) and Var(X)

Method of Maximum Likelihood

Interval Estimation Differences exist between expected values of populations and samples Distribution parameters (  ) are typically Estimated from samples Applied to populations Intervals estimate the range of possible values for the parameter to a specified level of confidence

Confidence Intervals Distributions can be linked to probability – making possible predictions and evaluations of the likelihood of a particular occurrence In a normal distribution, the number of standard deviations from the mean tells us the percent distribution of the data and thus the probability of occurrence

x = Mean  = Standard Deviation n = Sample Size (1 –  ) = Confidence Interval k  /2 = value of the standard normal variate (z) =  -1 (p) (found using Appendix 1) Interval Estimation for the Mean with Known Variance Two tailed interval!

Lower Confidence Limit for  Upper Confidence Limit for  Lower and Upper Confidence Limit for the Mean with Known Variance Each is a one tailed interval!

Interval Estimation for the Mean with Unknown Variance t  /2,n-1 = value of Student’s t distribution – found using Appendix 5 Standard normal distribution valid for… Known population variance Large n ( > 30) If n is small (< 10), s ≠  use Student’s t

Student’s t distribution f = n – 1 = DOF

Interval Estimation for Variance C ,n-1 = value of Chi Square distribution – found using Appendix 3