The hazard function The hazard function gives the so-called “instantaneous” risk of death (or failure) at time t, assuming survival up to time t. Estimate.

Slides:



Advertisements
Similar presentations
Sampling Distributions (§ )
Advertisements

Chap 10: Summarizing Data 10.1: INTRO: Univariate/multivariate data (random samples or batches) can be described using procedures to reveal their structures.
TH EDITION LIAL HORNSBY SCHNEIDER COLLEGE ALGEBRA.
Time-Dependent Failure Models
Section 6.1 Let X 1, X 2, …, X n be a random sample from a distribution described by p.m.f./p.d.f. f(x ;  ) where the value of  is unknown; then  is.
Continuous Random Variables Chap. 12. COMP 5340/6340 Continuous Random Variables2 Preamble Continuous probability distribution are not related to specific.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Exponential distribution Reliability and failure rate (Sec )
1 Fundamentals of Reliability Engineering and Applications Dr. E. A. Elsayed Department of Industrial and Systems Engineering Rutgers University
Introduction Before… Next…
Use of Quantile Functions in Data Analysis. In general, Quantile Functions (sometimes referred to as Inverse Density Functions or Percent Point Functions)
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Topic 4 - Continuous distributions
1 Lecture 13: Other Distributions: Weibull, Lognormal, Beta; Probability Plots Devore, Ch. 4.5 – 4.6.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Logarithms The previous section dealt with exponential functions of the form y = a x for all positive values of a, where a ≠ 1. The horizontal.
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 7: Parametric Survival Models under Censoring STT
Sampling and estimation Petter Mostad
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 4: Data Plot STT
Chapter 4 Continuous Random Variables and Probability Distributions  Probability Density Functions.2 - Cumulative Distribution Functions and E Expected.
Survival Data survival time examples: –time a cancer patient is in remission –time til a disease-free person has a heart attack –time til death of a healthy.
Topic 19: Survival Analysis T = Time until an event occurs. Events are, e.g., death, disease recurrence or relapse, infection, pregnancy.
Chapter 4 Continuous Random Variables and Probability Distributions  Probability Density Functions.2 - Cumulative Distribution Functions and E Expected.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Logarithmic Functions Logarithms Logarithmic Equations Logarithmic Functions Properties of Logarithms.
Chapter 6 Continuous Random Variables Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Sampling Distribution of the Sample Mean
Chapter 6 The Normal Distribution and Other Continuous Distributions
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005
Modeling and Simulation CS 313
Lecture Slides Elementary Statistics Twelfth Edition
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen
Expectations of Random Variables, Functions of Random Variables
Introduction to Probability - III John Rundle Econophysics PHYS 250
Confidence Intervals and Sample Size
The Maximum Likelihood Method
Chapter 4 Continuous Random Variables and Probability Distributions
ASV Chapters 1 - Sample Spaces and Probabilities
STT : Biostatistics Analysis Dr. Cuixian Chen
The Exponential and Gamma Distributions
Chapter 4 Continuous Random Variables and Probability Distributions
12. Principles of Parameter Estimation
Normal Distribution.
Modeling and Simulation CS 313
Chapter 6: Random Variables
Section 11.1 Day 2.
Inference for Regression
STAT 206: Chapter 6 Normal Distribution.
Elementary Statistics
The Maximum Likelihood Method
Chapter 4 Section 1.
Sequences F.LE.1, 2, 5 F.BF.1, 2 A.SSE.1.a F.IF.1, 2, 3, 4
No notecard for this quiz!!
Inference for Proportions
Chapter 6: Random Variables
The normal distribution
Parametric Survival Models (ch. 7)
Statistics for Managers Using Microsoft® Excel 5th Edition
Regression Assumptions
Derivatives of Exponential and Logarithmic Functions
11. Conditional Density Functions and Conditional Expected Values
Sampling Distributions (§ )
11. Conditional Density Functions and Conditional Expected Values
2.3. Measures of Dispersion (Variation):
12. Principles of Parameter Estimation
HKN ECE 313 Exam 2 Review Session
Continuous Distributions
Regression Assumptions
The Normal Distribution
Presentation transcript:

The hazard function The hazard function gives the so-called “instantaneous” risk of death (or failure) at time t, assuming survival up to time t. Estimate h(t) by the quotient

The hazard function is also called the instantaneous failure rate or force of mortality or conditional mortality rate or age-specific failure rate. In a real sense it gives the risk of failure (death) per unit time over the progress of aging. We have seen f(y)=-d/dy(S(y)), and so that f, S, and h are all related and each can be obtained from the others. Hazard functions can be flat, increasing, decreasing, or more complex…

Consider a simple hazard function, the constant hazard h(y)= for all y≥0. Here we assume , where 0. We have seen that so if we evaluate this for h(y)= , we get Since f(y)=-d/dy(S(y)), we have the exponential probability density with parameter . This means the expected value is  and the variance is 2.

Definition 2.1 writes Y as Y ~ exp( with h(y)=1/ and notes that this is one of the most commonly used models for lifetime distributions. One reason is because of the “memoryless property” of the exponential distribution given on page 20: Theorem 2.1: If Y ~ exp( then for any y>0 and t>0 we have P(Y>y+t | Y>y)=P(Y>t) Note this says that given survival past y, the conditional probability of surviving an additional t is the same as the unconditional probability of surviving t. Thus there is no “aging” with an increased risk of dying… Go over Example 2.1 to see a picture of exponential data which would then have a constant hazard (of value =1/mean(Y)) Note from Theorem 2.2 that if Y ~ exp( , Y/ ~ exp(1). This tells us that if we multiply an exponential survival variable by a constant, the MTTF is correspondingly multiplied by the same constant.

How do we decided whether a set of survival data is following the exponential distribution? That the hazard is constant? Look over Example 2.1 - 200 randomly generated exponential variables with mean=100. Characteristic skewed distribution, sample mean=107.5, sample s.d.=106.1; (Recall that if Y~exp( then E(Y)=SD(Y)= . ) The sample stemplot and the sample mean and sd approximate the true shape, center and spread of the exponential. The estimated hazards (rightmost column) approx. .01 (1/100) - constant - see the formula on p.22 for getting these values… But another way to check the distribution is to compare the quantiles of the exponential distribution with the sample quantiles in a plot known as a qqplot. See R#3 for a way to compute the quantiles and do the plot… Recall that the p-th quantile of a distribution of a r.v. Y is the value Q s.t. P(Y<=Q)=p. So we must compute the quantiles of the theoretical distribution and compare them (smallest to smallest, next smallest to next smallest, etc.) to the sample quantiles.

Power Hazard: Note this is of the form (constant)yconstant and if 1 this reduces to the constant hazard we just considered. Note that and so We say in this case that Y ~ Weibull(. The mean and variance of Y are given in Theorem 2.5 in terms of the gamma function Note that Y ~ Weibull(1,exp(and if >1 the hazard function is increasing; if 1, the hazard is decreasing; if 1 then the hazard is constant (as in the exponential survival case) Note that if 2, the hazard is linear in y. Go over Example 2.2 on pages 24-25. Y ~ Weibull(2,sqrt(3)). Use R (gamma()) to compute the values of the Gamma function… Also in R, shape=alpha and scale=beta in qweibull.

To check graphically if a distribution is Weibull, do essentially a qqplot on the log-log scale (see section 4.4, p. 61-63). The key formulas are: (4.2) Now substitute the ordered data and take natural logs (4.2a) Take logs again… (4.2b) Write as a linear equation: (4.3) (4.3a) So plot the points in 4.4, look for a straight line and the slope will equal 1/ and the intercept will equal log()