Sufficient statistics. The Poisson and the exponential can be summarized by (n, ). So too can the normal with known variance Consider a statistic S(Y)

Slides:



Advertisements
Similar presentations
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Advertisements

Logistic Regression Example: Horseshoe Crab Data
The General Linear Model. The Simple Linear Model Linear Regression.
STAT 270 What’s going to be on the quiz and/or the final exam?
Model Estimation and Comparison Gamma and Lognormal Distributions 2015 Washington, D.C. Rock ‘n’ Roll Marathon Velocities.
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.
Section 6.1 Let X 1, X 2, …, X n be a random sample from a distribution described by p.m.f./p.d.f. f(x ;  ) where the value of  is unknown; then  is.
Chapter 14 Analysis of Categorical Data
Estimation of parameters. Maximum likelihood What has happened was most likely.
Basics of Statistical Estimation. Learning Probabilities: Classical Approach Simplest case: Flipping a thumbtack tails heads True probability  is unknown.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Horng-Chyi HorngStatistics II127 Summary Table of Influence Procedures for a Single Sample (I) &4-8 (&8-6)
3-1 Introduction Experiment Random Random experiment.
Inferences About Process Quality
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
Simple Linear Regression and Correlation
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Generalized Linear Models
Logistic regression for binary response variables.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
1 Categorical Data (Chapter 10) Inference about one population proportion (§10.2). Inference about two population proportions (§10.3). Chi-square goodness-of-fit.
METU Informatics Institute Min 720 Pattern Classification with Bio-Medical Applications PART 2: Statistical Pattern Classification: Optimal Classification.
AS 737 Categorical Data Analysis For Multivariate
Regression Analysis (2)
Exponential and Chi-Square Random Variables
Chapter 5 Statistical Models in Simulation
Random Variables & Probability Distributions Outcomes of experiments are, in part, random E.g. Let X 7 be the gender of the 7 th randomly selected student.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
01/20151 EPI 5344: Survival Analysis in Epidemiology Maximum Likelihood Estimation: An Introduction March 10, 2015 Dr. N. Birkett, School of Epidemiology,
Multinomial Distribution
Bayesian Analysis and Applications of A Cure Rate Model.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
1 Lecture 13: Other Distributions: Weibull, Lognormal, Beta; Probability Plots Devore, Ch. 4.5 – 4.6.
Copyright © 2010 Pearson Addison-Wesley. All rights reserved. Chapter 6 Some Continuous Probability Distributions.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
1 Chi-squared Test (1) H 0 : population follows distribution Divide the observations into k bins O i = observed frequency in i-th bin E i = expected frequency.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
1 Standard error Estimated standard error,s,. 2 Example 1 While measuring the thermal conductivity of Armco iron, using a temperature of 100F and a power.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Brief Review Probability and Statistics. Probability distributions Continuous distributions.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Math 4030 Final Exam Review. Probability (Continuous) Definition of pdf (axioms, finding k) Cdf and probability (integration) Mean and variance (short-cut.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Categorical Data Analysis
Chapter 3: Uncertainty "variation arises in data generated by a model" "how to transform knowledge of this variation into statements about the uncertainty.
Log-linear Models Please read Chapter Two. We are interested in relationships between variables White VictimBlack Victim White Prisoner151 (151/160=0.94)
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Modeling and Simulation CS 313
Generalized Linear Models
Chapter 7: Sampling Distributions
Chapt 2. Variation How to: summarize/display random data appreciate variation due to randomness Data summaries. single observation y (number, curve,
Statistics 200b. Spring David Brillinger
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Categorical Data Analysis
ECE 7251: Signal Detection and Estimation
Applied Statistics and Probability for Engineers
Modeling Ordinal Associations Bin Hu
Presentation transcript:

Sufficient statistics. The Poisson and the exponential can be summarized by (n, ). So too can the normal with known variance Consider a statistic S(Y) Suppose that the conditional distribution of Y given S does not depend on , then S is a sufficient statistic for  based on Y Occurs iff the density of Y factors into a function of s(y) and  and a function of y that doesn't depend on  More Chapter 4

Example. Exponential IExp(  ) ~ Y E(Y) =  Var(Y) =  2 Data y 1,...,y n L(  ) =   -1 exp(-  y j /  ) l(  ) = -nlog(  ) -  y j /   y j /n is sufficient

maximum

=

Approximate 100(1-2  )% CI for  0 Example. spring data

Weibull.

Note. Expected information

Gamma.

Example. Bernoulli Pr{Y = 1} = 1 - Pr{Y = 0} =  0    1 L(  ) =   ^y i (1 -  )^(1-y i ) =  r (1 -  ) n-r l(  ) = rlog(  ) + (n-r)log(1-  ) r =  y j R =  Y j is sufficient for , as is R/n L(  ) factors into a function of r and a constant

Score vector  [ y j /  - (n-y j )/(1-  )] Observed information  [y j /  2 + (n-y j )/(1-  ) 2 ] M.l.e.

Cauchy. ICau(  ) f(y;  ) = 1/  (1+(y-  ) 2 ) E|Y| =  Var(Y) =  L(  ) =  1/(  (1+(y j -  ) 2 ) Many local maxima l(  ) = -  log(1+(y j -  ) 2 ) J(  ) = 2  ((1-(y j -  ) 2 )/(1+(y j -  ) 2 ) 2 I(  ) = n/2

Uniform. f(u;  ) = 1/  0 < u <  = 0 otherwise L(  ) = 1/  n 0 < y 1,..., y n <  = 0 otherwise

l(  ) becomes increasingly spikey E u(  ) = -1 i(  ) = - 

Logistic regression. Challenger data Ibinomials R j, m j,  j

Likelihood ratio. Model includes  dim(  ) = p true (unknown) value  0 Likelihood ratio statistic

Justification. Multinormal result If Y ~ N ( ,  ) then (Y-  ) T  -1 (Y-  ) ~  p 2

Uses. Pr[W(  0 )  c p (1-2  )]  1-2  Approx 100(1-2  )% confidence region

Example. exponential Spring data: 96 <  <335 vs. asymp normal approx 64 <  <273 kcycles

Prob-value/P-value. See (7.28) Choose T whose large values cast doubt on H 0 Pr 0 (T  t obs ) Example. Spring data Exponential E(Y) =  H 0 :  = 100?

Nesting  : p by 1 parameter of interest : q by 1 nuisance parameter Model with params (  0, ) nested within ( , ) Second model reduces to first when  =  0

Example. Weibull params ( ,  ) exponential when  = 1      How to examine H 0 :  = 1?

Spring failure times. Weibull

Challenger data. Logistic regression temperature x 1 pressure x 2  (  0,  1,  2 ) = exp{  }/(1+exp{  })  =  0 +  1 x 1 +  2 x 2 linear predictor loglike l(  0,  1,  2 ) =  0  r j +  1  r j x 1j +  2  r j x 2j - m  log(1+exp{  j }) Does pressure matter?

Model fit. Are labor times Weibull? Nest its model in a more general one Generalized gamma. Gamma for  =1 Weibull for  =1 Exponential for  =  =1

Likelihood results. max log likelihood: generalized gamma gamma Weibull gamma vs. generalized gamma - 2 log like diff: 2( ) =.94 P-value Pr 0 (  1 2 >.94) = Pr(|Z|>.969) = 2(.166) =.332

Chi-squared statistics. Pearson's chi-squared categories 1,...,k count of cases in category i: Y i Pr(case in i) =  i 0 <  i < 1  1 k  i =1 E(Y i ) = n  i var(Y i ) =  i (1 -  i )n cov(Y i,Y j ) = -  i  j n i  j E.g. k=2 case cov(Y,n-Y) = -var(Y) = -n  1  2  = { (  1,...,  k ):  1 k  i = 1, 0<  1,...,  k <1} dimension k-1

Reduced dimension possible? model  i ( ) dim( ) = p log like general model:  1 k-1 y i log  i + y k log[1-   k-1 ],  1 k y i = n log like restricted model: l( ) =  1 k-1 y i log  i ( ) + y k log[1-  1 ( )-...-  k-1 ( )]

likelihood ratio statistic: if restricted model true The statistic is sometimes written W = 2  O i log(O i /E i )   (O i - E i ) 2 /E i

Pearson's chi-squared.

Example. Birth data. Poisson? Split into k=13 categories [0,7.5), [7.5,8.5),...[18.5,24] hours O(bserved) E(xpected) P = 4.39 P-value Pr(  11 2 > 4.39) =.96

Two way contingency table. r rows and c columns n individuals Blood groups A, B, AB, O A, B antigens - substance causing body to produce antibodies group count model I model II O = 1 - A - B

Question. Rows and columns independent? W = 2  y ij log ny ij / y i. y.j with y i. =  j y ij ~  k-1-p 2 =  (r-1)c-1) 2 with k=rc p=(r-1)+(c-1) P =  (y ij - y i. y.j /n) 2 / (y i. y.j /n) ~  (r-1)(c-1) 2

Model 1 W = Pr(  1 2 > 17.66) = Pr(|Z| > 4.202) = 2.646E-05 P = Pr(  1 2 > 15.73) = Pr(|Z| > 3.966) = 7.309E-05 k-1-p = = 1 Model 2 W = 3.17 Pr(|Z| > 1.780) =.075 P = 2.82 Pr(|Z|>1.679) =.093

Incorrect model. True model g(y), fit f(y;  )

Example 1. Quadratic, fit linear

Example 2. True lognormal, but fit exponential

Large sample distribution.

Model selection. Various models: non-nested Ockham's razor. Prefer the simplest model

Formal criteria. Look for minimum

Example. Spring failure ModelpAICBIC M *769.9* M M M stress levels M 1 : Weibull - unconnected ,  at each stress level