Comprehensive Project By Melissa Joy.  Background Information on Probability  Intro to Fay’s Formula  Notation  Overview of the method behind Fay’s.

Slides:



Advertisements
Similar presentations
A Brief Introduction to Epidemiology - VII (Epidemiologic Research Designs: Demographic, Mortality & Morbidity Studies) Betty C. Jung, RN, MPH, CHES.
Advertisements

Probability & Certainty: Intro Probability & Certainty.
Part 9: Normal Distribution 9-1/47 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
Samuel Clark Department of Sociology, University of Washington Institute of Behavioral Science, University of Colorado at Boulder Agincourt Health and.
CHAPTER 6 Statistical Analysis of Experimental Data
Epidemiology Kept Simple
CHAPTER 5 Probability: What Are the Chances?
Probability & Certainty: Intro Probability & Certainty.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Adapted from Walch Education The conditional probability of B given A is the probability that event B occurs, given that event A has already occurred.
HaDPop Measuring Disease and Exposure in Populations (MD) &
Measuring Epidemiologic Outcomes
90% survival rate Stage 1 68% Survival rate Stage 2 59% survival rate Stage 3 5% If spread to brain Stage 4 Ages 0 to 19 is 0.7% Ages 20 to 34 is 7.1%
Prevention Strategies: Emerging Trends and Issues Kimberly E. Davis, Ph.D. July 17, 2010 Center for Cancer Research and Therapeutic Development.
Overview of All SEER-Medicare Publications Through 2012 Mark D. Danese, MHS, PhD July 24, 2012.
Virus and Cancer.
University of Minnesota Medical Technology Evaluation and Market Research Course: MILI/PUBH 6589 Spring Semester, 2012 Stephen T. Parente, Ph.D. Carlson.
1 Managerial Finance Professor Andrew Hall Statistics In Finance Probability.
Estimation of Various Population Parameters Point Estimation and Confidence Intervals Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology.
INCIDENCE AND SURVIVAL TRENDS OF COLORECTAL CANCER FROM 2002 TO 2011 BE Ansa; E Alema-Mensah; MD Claridy; JQ Sheats; B Fontenot, and SA Smith Georgia Regents.
Prostate Cancer Symposium An Educational Initiative For Patients, Spouses, Advocates and Healthcare Professionals The Impact of Prostate Cancer in New.
Prevalence The presence (proportion) of disease or condition in a population (generally irrespective of the duration of the disease) Prevalence: Quantifies.
Theory of Probability Statistics for Business and Economics.
1 Chapters 6-8. UNIT 2 VOCABULARY – Chap 6 2 ( 2) THE NOTATION “P” REPRESENTS THE TRUE PROBABILITY OF AN EVENT HAPPENING, ACCORDING TO AN IDEAL DISTRIBUTION.
Previous Lecture: Data types and Representations in Molecular Biology.
Managerial Decision Making Facilitator: René Cintrón MBA / 510.
LECTURE 15 THURSDAY, 15 OCTOBER STA 291 Fall
Slides last updated: October Ferlay J, Shin HR, Bray F, Forman D, Mathers C and Parkin DM. GLOBOCAN 2008 v2.0, Cancer Incidence and Mortality.
Chapter 7 LIFE TABLES AND POPULATION PROBLEMS
April 13, 2011 Back to Basics, 2011 POPULATION HEALTH : Vital & Health Statistics Presented by Robert Spasoff, MD Epidemiology & Community Medicine 1.
Is for Epi Epidemiology basics for non-epidemiologists.
Rates, Ratios and Proportions and Measures of Disease Frequency
Measures of Association and Impact Michael O’Reilly, MD, MPH FETP Thailand Introductory Course.
1 RES 341 RESEARCH AND EVALUATION WORKSHOP 4 By Dr. Serhat Eren University OF PHOENIX Spring 2002.
SCT The analysis of case cohort design in the presence of competing risks Melania Pintilie, Yan Bai, Lingsong Yun and David C. Hodgson Ontario Cancer.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Bayesian Estimation of Confidence Intervals for NAACCR Predicted Age-adjusted Incidence Rates Gentry White, J. Jackson-Thompson, Missouri Cancer Registry,
EPIDEMIOLOGY The study of the distribution and determinants of health related states and events in specified population and application of this.
Conditional Probability and Independence. Learning Targets 1. I can use the multiplication rule for independent events to compute probabilities. 2. I.
Foundational Concepts & Measures Monday, Sept 13.
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
Measures of Disease Frequency
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Breast cancer affects 1 in 8 women during their lives. 1 Population Statistics.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
01/20151 EPI 5344: Survival Analysis in Epidemiology Hazard March 3, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine,
STATISTICS 6.0 Conditional Probabilities “Conditional Probabilities”
Slides last updated: March Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray, F. GLOBOCAN.
Sets and Probability Chapter 7. Ch. 7 Sets and Probabilities 4-2 Basic Concepts of Probability 4-3 Addition Rule 4-4 Multiplication Rule 4-5 Multiplication.
EPI 5344: Survival Analysis in Epidemiology Week 6 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine, University of Ottawa 03/2016.
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Chapter5 Statistical and probabilistic concepts, Implementation to Insurance Subjects of the Unit 1.Counting 2.Probability concepts 3.Random Variables.
Probability and Probability Distributions. Probability Concepts Probability: –We now assume the population parameters are known and calculate the chances.
Overview of Incidence Data at the Virginia Cancer Registry Carolyn Halbert M.A.,.M.P.H. Statistical Analysis Coordinator Virginia Cancer Registry.
Conditional Probability and Independence
Presented by: Karen Miller
More Ontarians need to be screened for colorectal cancer (Sept. 2012)
College of Applied Medical Sciences
Intro to Probability Math 9.
POINT ESTIMATOR OF PARAMETERS
Measures of Morbidity.
Measures of Disease Occurrence
Epidemiological Measurements of health
Chapter 4 SURVIVAL AND LIFE TABLES
Statistics and Data Analysis
Measures of Disease Occurrence
The failure of breast cancer screening
South Carolina Statewide Cancer Prevention & Control Program
Presentation transcript:

Comprehensive Project By Melissa Joy

 Background Information on Probability  Intro to Fay’s Formula  Notation  Overview of the method behind Fay’s Formula  Breast cancer example using raw data  Table of age conditional breast cancer risk  Table of age conditional cancer risk (all sites)  Bibliography  Thank you’s

 Probability is the likelihood or chance that something will happen  Conditional Probability is the probability of some event A, given the occurrence of some other event B. ◦ It is written P(A|B) ◦ It is said “the probability of A, given B” ◦ P(A|B) = P(A ∩ B) P(B)

 Probability density function (pdf) is a function,f(x), that represents a probability distribution in terms of integrals.  The probability x lies in the interval [a, b] is given by ∫ a f (x) dx b

A(x,y): Age-conditional probability of getting cancer between x and y, given alive and cancer free up until age x Or equivalently, the probability that an individual of age x will get cancer in the next (y - x) years, given alive and cancer free up until age x Goal: Write A(x,y) in terms of data that is easily found and collected

Probability density functions: (For simplicity, these pdf’s will be constant so I will refer to them as probabilities)  λ: Failure rates  S: Survival rates Subscripts:  c: denotes incidence of cancer  d: denotes incidence of death from cancer  o: denotes death from other (non-cancer) related causes  An asterisk (*) signifies that the data implies that the individual was cancer free up until a particular age.

A(x,y): Age-conditional probability of getting cancer between x and y, given alive and cancer free up until age x A(x,y) = P(first cancer occurs between age x and y) P(alive and cancer free at age x given cancer free before) A(x,y) = ∫ x f c (a) da S* (x) Goal: Rewrite A(x,y) with no * terms f c (a): probability density function of the first occurrence of cancer happening at age a (a between x and y) S*(a): probability that the person is alive and cancer free at age x, given they are cancer free up until age x y Fay, Michael P. "Estimating Age Conditional Probability of Developing Disease From Surveillance Data." Population Health Metrics 2 (2004): Fay, Michael P., Ruth Pfeiffer, Kathleen A. Cronin, Chenxiong Le, and Eric J. Feuer. "Age-Conditional Probabilities of Developing Cancer." Statistics in Medicine 22 (2003):

It is true that f c (a) = λ c * (a) S* (a) P (first cancer occurs between age x and y) = ∫ x f c (a) da = ∫ x λ c * (a) S* (a) da f c (a): probability density function of the first occurrence of cancer happening at age a (a between x and y) λ c *(a): probability that the first cancer occurs at age a, given alive and cancer free up until age a S*(a): probability that the person is alive and cancer free at age a, given they are cancer free up until age a y A(x,y) = ∫ x f c (a) da S* (x) y y Starting with the Numerator Goal: Rewrite A(x,y) with no * terms A(x,y) = ∫ x λ c *(a) S*(a) da S* (x) y

It could be found that: λ c (a) = f c (a) S(x) λ c (a) = λ c * (a) S* (a) S(x) So by re-arranging the above equation we get λ c (a) S (a) = λ c * (a) S*(a) f c (a): probability density function of the first occurrence of cancer happening at age a (a between x and y) λ c (a): probability that the first cancer occurs at age a S(a): probability that the person is alive and cancer free at age a λ c *(a): probability that the first cancer occurs at age a, given alive and cancer free up until age a S*(a): probability that the person is alive and cancer free at age a, given they are cancer free up until age a A(x,y) = ∫ x λ c (a) S (a) da S* (x) y We can now rewrite the numerator without * terms Goal accomplished for the numerator! A(x,y) = ∫ x λ c *(a) S*(a) da S* (x) y

S* (x) = S c * (x) S o *(x) and we know S o *(x) = S o (x) Through a long series of calculations we find that: S c *(x) = 1 - ∫ 0 λ c (a) S d (a) da A(x,y) = ∫ x λ c (a) S (a) da S* (x) So we can rewrite the denominator as S* (x) = S o (a) {1 - ∫ 0 λ c (a) S d (a) da} y x x Goal: Rewrite A(x,y) with no * terms A(x,y) = ∫ x λ c (a) S (a) da S o (x) {1 - ∫ 0 λ c (a) S d (a) da} y x S*(a): probability that the person is alive and cancer free at age a, given they are cancer free up until age a S c *(a): probability that the person is cancer free at age a, given they are cancer free up until age a S o *(a): probability that the person did not die from non- cancer related causes at age a, given they are cancer free up until age a S o (a): probability that the person did not die from non-cancer related causes at age a S d (a): probability that the person did not die from cancer at age a λ c (a): probability that the first cancer occurs at age a S(a): probability that the person is alive and cancer free at age a

A(x,y): Age-conditional probability of getting cancer between x and y, given alive and cancer free up until age x A(x,y) = ∫ x λ c (a) S (a) da S o (x) {1 - ∫ 0 λ c (a) S d (a) da} y x A(x,y) = ∫ x f c (a) da S* (x) y We started from: Goal accomplished!

c : number of incidences of cancer ≈ 160 d: number of cancer caused deaths ≈ 20 o: number of deaths from other causes ≈ 1500 n : Mid-interval population ≈ 3 million Approximated SEER Data 2004 λ c (a)≈ c /n λ d (a) ≈ d /n λ o (a) ≈ o /n λ c (20) ≈ 160/3 million = λ d (20) ≈ 20/3 million = λ o (a) ≈ 1500/3 million = Let’s find the failure rates Failure rates are the probability that you will get cancer, die of cancer or die from other causes

 S c (20)= 1- λ c (20) =  S d (20)= 1- λ d (20) =  S o (20)= 1- λ o (20) =  S(20) = 1- {λ c (20) + λ o (20)} = Survival rates are the probability that the individual has not gotten cancer, died from cancer, or died from other causes. S (without a subscript) is the probability of being alive and cancer free.

A(x,y) = ∫ x λ c (a) ∙ S (a) da S o (x) {1 - ∫ 0 λ c (a) ∙ S d (a) da} d/topic_lifetime_risk.pdf A(20,30) = ∫ 20 λ c (20) ∙ S (20) da S o (20) {1 - ∫ 0 λ c (20) ∙ S d (20) da} = 10 λ c (20) ∙ S (20) S o (20) {1 – (20 λ c (20) ∙ S d (20) )} = = % What does this number mean? y x 30 20

Current Age+10 years+20 years+30 yearsEventually 00 %0 %0 %0.06 %12.28 % 100 %0.06 %0.48 %12.42 % %0.48 %1.89 %12.45 % %1.84 %4.24 %12.46 % %3.86 %7.04 %12.19 % %5.79 %8.93 %11.12 % %6.87 %8.76 %9.21 % %6.07 %-6.59 % % % Table from Surveillance, Epidemiology and End Results (SEER) database d/topic_lifetime_risk.pdf

Current Age+10 years+20 years+30 yearsEventually %0.33 %0.75 %40.93 % %0.60 %1.58 %41.33 % %1.42 %3.93 %41.39 % %3.55 %9.59 %41.49 % %8.77 %20.01 %41.35 % %18.27 %31.33 %40.67 % %27.71 %36.08 %38.13 % %29.07 % % % % Table from Surveillance, Epidemiology and End Results (SEER) database merged/topic_lifetime_risk.pdf

 Fay, Michael P. "Estimating Age Conditional Probability of Developing Disease From Surveillance Data." Population Health Metrics 2 (2004):  Fay, Michael P., Ruth Pfeiffer, Kathleen A. Cronin, Chenxiong Le, and Eric J. Feuer. "Age-Conditional Probabilities of Developing Cancer." Statistics in Medicine 22 (2003):  Ries LAG, Melbert D, Krapcho M, Mariotto A, Miller BA, Feuer EJ, Clegg L, Horner MJ, Howlader N, Eisner MP, Reichman M, Edwards BK (eds). SEER Cancer Statistics Review, , National Cancer Institute. Bethesda, MD, based on November 2006 SEER data submission, posted to the SEER web site,  "What Is Your Risk?." Your Disease Risk. (2005). Harvard Center For Cancer Prevention. 2 Oct 2007.

 Professor Lengyel  Professor Buckmire  Professor Knoerr  And… the entire Oxy math department THANK YOU!

Go to to calculate your risk and learn what could raise and lower your risk