Multivariate Analysis Project

Slides:



Advertisements
Similar presentations
Year 2 Objectives: Number 1 NUMBER AND PLACE VALUE Objective 1: Count on in steps of 2, 3, 5 and 10 from any number Count in 10s to 100 *1 Count in 2s.
Advertisements

Consecutive Integer Problems. What is a consecutive integer? Integer: natural numbers and zero Integer: natural numbers and zero Consecutive: one after.
1 Combining (with SQL) HRP223 – 2010 October 27, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
Prepared By: Eng.Ola M. Abd El-Latif
Lecture 4 Sept 8 Complete Chapter 3 exercises Chapter 4.
Lecture 4 Sept 7 Chapter 4. Chapter 4 – arrays, collections and indexing This chapter discusses the basic calculations involving rectangular collections.
FIGURES FOR CHAPTER 1 INTRODUCTION NUMBER SYSTEMS AND CONVERSION
Consecutive Numbers Algebra I.
February  Study & Abstract StudyAbstract  Graphic presentation of data. Graphic presentation of data.  Statistical Analyses Statistical Analyses.
Section 8.1/8.2 Matrix Solutions to Linear Systems Inconsistent and Dependent Systems.
Unit 5 Lecture 2 Error Control Error Detection & Error Correction.
Week 21 Conditional Probability Idea – have performed a chance experiment but don’t know the outcome (ω), but have some partial information (event A) about.
How to make tables in HTML By Daniel Arze. How do they do this?
Centers for Disease Control and Prevention National Center for Health Statistics ICE on Injury Statistics, Sept 2006 Margaret Warner, PhD Office of Analysis.
The length of a rectangle is 6 in. more than its width. The perimeter of the rectangle is 24 in. What is the length of the rectangle? What we know: Length.
NORMALIZATION. What is Normalization  The process of effectively organizing data in a database  Two goals  To eliminate redundant data  Ensure data.
Screening Administrative Data To Assess the Accuracy Of Present-on-Admission Coding Michael Pine, M.D., M.B.A. Michael Pine and Associates, Inc. Chicago,
 Life Expectancy is 180 th in the World.  Literacy Rate is 4 th in Africa.
3.11 More Expressions and Equation Goals: To set up and solve word problems.
Week 21 Rules of Probability for all Corollary: The probability of the union of any two events A and B is Proof: … If then, Proof:
Continue the sequence 1. ½, 1, 2, 4, 8, __, __, __ 2. __, 5, 9, 13, __, __, , 55, __, 15, __, __ 4. 8, 27, 103, __ 5 Minutes Remain.
SURVIVAL ANALYSIS PRESENTED BY: DR SANJAYA KUMAR SAHOO PGT,AIIH&PH,KOLKATA.
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Objective: Students will subtract integers using rules (11-4).
BITMAP INDEXES Barot Rushin (Id :- 108).
BINARY LOGISTIC REGRESSION
Quality of Electronic Emergency Department Data: How Good Are They?
Lec 3: Data Representation
1.1: Objectives Properties of Real Numbers
Today is Tuesday.
April 18 Intro to survival analysis Le 11.1 – 11.2
What is Probability? Quantification of uncertainty.
Consecutive Numbers Algebra I.
Warm Up Identify the slope and y-intercept of each equation. Then graph. 1. Y = -5X X + 5Y = X = Y = 12.
Goal: use counters to add integers
The probability of an event is a number between
Volume 1: Chronic Kidney Disease Chapter 3: Morbidity and Mortality
Samantha King, Jill Jones MD, Russ Waitman PhD
Consecutive Integer Problems
Dead Man Visiting Farrokh Alemi, PhD Narrated by …
Stratified Covariate Balancing Using R
Multiple logistic regression
Chapter 4 Linear Programming: The Simplex Method
Supplemental Digital Content Figure 1: Sample selection
GROUP BY & Subset Data Analysis
SQL for Predicting from Likelihood Ratios
SQL for Calculating Likelihood Ratios
Types of Joins Farrokh Alemi, Ph.D.
SQL for Cleaning Data Farrokh Alemi, Ph.D.
Receiver Operating Curves
Rank Order Function Farrokh Alemi, Ph.D.
Date Functions Farrokh Alemi, Ph.D.
Procedures Organized by Farrokh Alemi, Ph.D. Narrated by Yara Alemi
Cursors Organized by Farrokh Alemi, Ph.D. Narrated by Yara Alemi
Dead Patients Visiting
Lab 3 and HRP259 Lab and Combining (with SQL)
Indexing & Computational Efficiency
Minimal assignment of ICD-9 codes relating to malnutrition to a sample of malnourished, hospitalized patients Contributors: Jessica Moehling, BS; Anne.
Selecting the Right Predictors
Combining (with SQL) HRP223 – 2012 November 05, 2011
4 Years Milestone plan – (Company Name)
The probability of an event is a number between
Improving Overlap Based on Expected Value
Polynomial Functions 1 Definitions 2 Degrees 3 Graphing.
Chapter 4 SURVIVAL AND LIFE TABLES
How to find the nth rule for a linear sequence
The probability of an event is a number between
Finding the Least Common Denominator to Add/Subtract Fractions
1.8 Consecutive Integers Notes on Handout.
Incidence Incidence of a disease is: “the rate at which new cases occur in a population in a given period, usually in the past 12 months (one calendar.
Presentation transcript:

Multivariate Analysis Project Marla Surette

Overview Remove cases where: Create Training and Validation Sets Patients died before visits Diagnoses have no age or a negative age One person has more than 365 diagnoses in a year Create Training and Validation Sets Complete likelihood ratio for training set diagnoses and list of diagnoses Use training set to calculate likelihood ratio: Where each diagnosis that repeats at least 29 times Associated with each category of diagnosis Associated with 2nd and 3rd Occurrences of the same person that repeats at least 29 times Use validation set to calculate posterior odds of mortality Calculate sensitivity and Specificity

Removing Cases: Patients Dead Before Visits Step 2: Exclude cases found in step one by left joining Final Table with #AliveBFVisit Step 1: Identify patients who are alive before visits by using ‘WHERE AgeAtDeath<AgeAtDx’

Removing Cases: One person has more than 365 diagnoses in a year Step 1: Find cases where # of Dx is greater than 365 per one year Change AgeAtDx into an integer by using ‘cast’ command Find cases with greater than 365 by using ‘Count’ command with icd9 Step 2: Exclude cases where diagnoses is over 365 per one year Left join #DeadBFVisit & #NumDxLessYr ALSO add ‘AgeAtDx >0’ to remove patients where there is no AgeAtDx

Creating Training & Validation Sets Step 1: Create training set by collecting 80% randomly from #Data3, by using the ‘rand’ command Step 2: Select unique IDs from the training set by using ‘SELECT DISTINCT’ Step 3: Create validation set with remaining 20% of data found in #Data3 by left joining #Data3 and #TrainID, so that the validation table does not include the same IDs found in the training set.

Calculating Likelihood Ratios Basic Steps of Calculating Likelihood Ratios: Calculate number of patients dead and alive within table being used Select patients who died six months after diagnosis Select patients who did not die within six months after diagnosis Calculate the likelihood ratio by utilizing this equation: Patients who died within six months after diagnosis Dead Patients Patients who lived six months after diagnosis Alive Patients Likelihood Ratio =

Finding Cases that Occurred More than 29 Times and Excluding Those that Did Not Step One: Determine cases in which Dx occurs at least 29 times Use ‘Count’ command with AgeAtDx Step Two: Exclude cases in which Dx occurs less than 29 times Use ‘inner join’ command with Training set and #DxOver29

Finding Categories When the length of ICD9 is seven characters 2. Shorten ICD9 to six characters long 3. If length is less than 7 characters long, then keep ICD9 the same 4. Create new column called ICD92 which includes both changed and unchanged ICD9s

Finding Second & Third Occurrences Step 1: Use ‘Rank’ command to create a column that indicates number of occurrence e.g. 1st occurrence Rank = 1 Step 2: Create a table that only shows Ranks of 2 and 3, indicating 2nd and 3rd Occurrences.

Finding 2nd & 3rd Occurrences that Have Repeated over 29 times Step One: Count the number of times each ICD9 occurs in the table Step Two: Find ICD9 codes that have a count of at least 29 Step Three: Create a table that only has the information of those with ICD9s found in #Occur29

Calculating Posterior Odds of Mortality

Calculating Posterior Odds of Mortality Results: