1997 Summer Public Health Research Institute on Minority Health, UNC-CH Living Beyond Our “Means”: New Methods for Comparing Distributions Camara Phyllis.

Slides:



Advertisements
Similar presentations
Descriptive Statistics-II
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
BA 275 Quantitative Business Methods
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
1 Empirical and probability distributions 0.4 exploratory data analysis.
Analysis of Research Data
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Simple Linear Regression Analysis
REGRESSION AND CORRELATION
Linear Regression Example Data
Correlation and Regression Analysis
Introduction to Regression Analysis, Chapter 13,
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for regression - Simple linear regression
STA291 Statistical Methods Lecture 27. Inference for Regression.
2004/10/07Faculty Participation in CAT...1 Faculty Participation in the Center for the Advancement of Teaching, Xavier University of Louisiana.
Some Introductory Statistics Terminology. Descriptive Statistics Procedures used to summarize, organize, and simplify data (data being a collection of.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Biostatistics course Part 16 Lineal regression Dr. Sc. Nicolas Padilla Raygoza Department of Nursing and Obstetrics Division Health Sciences and Engineering.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Statistics for Business and Economics Chapter 10 Simple Linear Regression.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
CHAPTER 18: Inference about a Population Mean
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Statistics Workshop Tutorial 8 Normal Distributions.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
World Cornea Congress VI Michael K. Smolek & Panagiotis Kompotiatis Department of Ophthalmology LSU Eye Center of Excellence, New Orleans The authors.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Lecture 10: Correlation and Regression Model.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 2.
Statistics for Managers Using Microsoft® Excel 5th Edition
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
1 Chapter 23 Inferences About Means. 2 Inference for Who? Young adults. What? Heart rate (beats per minute). When? Where? In a physiology lab. How? Take.
BPS - 5th Ed. Chapter 231 Inference for Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
MARCH 18, 2014 DATA ANALYSIS. WHAT TO DO WITH DATA Take a look at your data Histogram Descriptive statistics Mean, mode, range, standard deviation/standard.
Copyright ©2000, Camara Jones The Impacts of Racism on Health Camara Phyllis Jones, MD, MPH, PhD Department of Health and Social Behavior Department of.
CHAPTER 10 Comparing Two Populations or Groups
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Statistics for Managers using Microsoft Excel 3rd Edition
Model validation and prediction
Inferences for Regression
Chapter 13 Simple Linear Regression
Statistics PSY302 Review Quiz One Fall 2018
Two Independent Samples
PENGOLAHAN DAN PENYAJIAN
Shape of Distributions
Chapter 3 Statistical Concepts.
Inference for Distributions
CHAPTER 12 More About Regression
Indicator Variables Response: Highway MPG
Statistics PSY302 Review Quiz One Spring 2017
Inferences for Regression
Presentation transcript:

1997 Summer Public Health Research Institute on Minority Health, UNC-CH Living Beyond Our “Means”: New Methods for Comparing Distributions Camara Phyllis Jones, MD, MPH, PhD Department of Health and Social Behavior Department of Epidemiology Harvard School of Public Health

Why Study Distributions? Preserve information on location, spread, shape Describe populations The Strategy of Preventive Medicine probability density systolic blood pressure

Overview Critique of current approaches Proposed methods –Projection plot –Projection spline –Iter-1 test Systolic blood pressure by “race”

Critique of Current Approaches Location Spread Shape Stable Test Proportions yes yes Moments: Mean yes - - yes yes Std deviation - yes - yes yes Skewness - - symm yes yes Kurtosis - - peak yes yes Boxplots yes yes symm yes - Histograms yes yes yes - yes Kernel densities yes yes yes - - Cum distrib fns yes yes - yes yes Q-Q plot yes yes yes yes -

Fitting a Line to the Quantile-Quantile Plot Covariance of quantiles Asymmetry of modeling

Covariance of Sample Quantiles For i < j, the covariance is p i * (1 - p j ) n * dens i * dens j

Asymmetry of Modeling

Modeling B as dependent on A: quant B = quant A SE of intercept = 3.78 SE of slope = 0.031

Asymmetry of Modeling (cont.) Modeling A as dependent on B: quant A = quant B SE of intercept = 7.17 SE of slope = Noteworthy: Slopes are not exact reciprocals Standard errors differ in magnitude

Symmetry with respect to the line y=x x quantiles y quantiles line y=x (A,B) (B,A)

Perpendicular distance of the point (A,B) from the line y=x A (A,B) B-A B A sqrt(2)

Projection of the point (A,B) on the line y=x (A,B) B AB B-A 2 A+B 2

Rotating the plot 45 degrees x quantiles y quantiles line y=x B-A sqrt(2) (A,B) (A+B, A+B) 2 2

Projection plot difference between corresponding quantiles B-A A+B 2 average of corresponding quantiles line dif=0

Example of a Knotted Linear Spline Y = b0 + b1 x + b2 xplus (knot1) + b3 xplus (knot2) slope = b1 + b2 + b3 slope = b1 + b2 slope = b1 knot 1 knot 2

Iterative Fit of a Knotted Linear Spline Iteration 6 Iteration 7 Iteration 5 Candidate knots Initial knots Iteration 1 Iteration 4 Iteration 3Iteration 2 ^ ^ ^ ^ ^ ^...

Highest Level of Difference Shape? If not shape: Spread? If neither shape nor spread: Location? None? Shape Spread Location None Segments Slope Intercept > 00 00

Interpretation of Projection Splines Number of segments – More than one SHAPE – One evaluate slope Slope – Differs from zero SPREAD – Zero evaluate intercept Intercept – Differs from zero LOCATION – Zero NONE

Synopsis Two groups Continuous outcome Graphical Comparison –Location –Spread –Shape Statistical test –Global –Level of difference

Design of NHANES 1 Probability survey All 50 states Direct examination Systolic blood pressure 2,178 “black” females 9,778 “white” females

P-values from the iter-1 test, with projection spline inferences Crude distributions p-value Level of difference Shape Spread

P-values from the iter-1 test, with projection spline inferences (cont.) Adjusted for body mass index p-value Level of difference Shape Spread Shape

P-values from the iter-1 test, with projection spline inferences (cont.) Age-shifted analysis p-value Level of difference bf 5-14 / wf bf / wf bf / wf bf / wf bf / wf bf / wf Shape Location -- Shape --

Systolic Blood Pressure by “Race” Same-age comparisons – No differences in childhood – Shape differences in middle age Age-shifted comparisons – Acceleration of age-dependence – Shift of entire distributions

Significance of Age-Shifting Blood pressure and age Social meaning of “race” Other health conditions

Hypotheses “Race” - associated differences in health outcomes in the U.S. are due to accelerated aging of the black population. Accelerated aging of the black population in the U.S. is due to racism.