Measuring Dietary Intake Raymond J. Carroll Department of Statistics Faculty of Nutrition and Faculty of Toxicology Texas A&M University

Slides:



Advertisements
Similar presentations
Statistical stuff happens Vioxx, placebo power and other wonders.
Advertisements

Chapter 8 The Rest of the Good Nutrition Line-up.
Section 10.2: Tests of Significance
Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
Chapter 7 Sampling and Sampling Distributions
Chapter 4: Basic Estimation Techniques
You will need Your text Your calculator
Abstract Is There a Relationship Between Dietary Intake, Food Preferences, and BMI in Obese, Hispanic Toddlers? A. Popejoy, N. DiMarco, E. Andersson, C.
Chi-Square and Analysis of Variance (ANOVA)
Panel Data Models Prepared by Vera Tabakova, East Carolina University.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Statistical Significance for 2 x 2 Tables Chapter 13.
Inferential Statistics
© The McGraw-Hill Companies, Inc., Chapter 12 Chi-Square.
Simple Linear Regression Analysis
Week 13 November Three Mini-Lectures QMM 510 Fall 2014.
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
Assessing Variables of Health Plan Authorization Turn-Around-Timelines Susan J. Dickerson Professor Edward Schumacher HCAI November 2012.
Correlation and regression Dr. Ghada Abo-Zaid
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Correcting for measurement error in nutritional epidemiology Ruth Keogh MRC Biostatistics Unit MRC Centre for Nutritional Epidemiology in Cancer Prevention.
Copyright (c) Bani Mallick1 Lecture 2 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #2 Population and sample parameters More on populations.
Measuring Dietary Intake Raymond J. Carroll Department of Statistics Faculty of Nutrition and Toxicology Texas A&M University
Introduction to Hypothesis Testing
Nonparametric Regression and Clustered/Longitudinal Data
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Model Selection in Semiparametrics and Measurement Error Models Raymond J. Carroll Department of Statistics Faculty of Nutrition and Toxicology Texas A&M.
Introduction to Hypothesis Testing Raymond J. Carroll Department of Statistics Faculty of Nutrition Texas A&M University
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Measuring Dietary Intake Raymond J. Carroll Department of Statistics Faculty of Nutrition and Faculty of Toxicology Texas A&M University
A New Model for Dietary Intake Instruments Based on Self- Report and Biomarkers Raymond J. Carroll Texas A&M University (
Today Concepts underlying inferential statistics
Sample Size Determination
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
Hypothesis Testing in Linear Regression Analysis
09/15/05William Wu / MS meeting1 Measurement error and measurement model with an example in dietary data.
Associations of Red Meat, Fat, and Protein Intake With Distal Colorectal Cancer Risk 100/5/31 鄒季臻 Nutr Cancer August ; 62(6): 701–709.
What is statistics really about? Neil Sheldon Royal Statistical Society Centre for Statistical Education & Manchester Grammar School.
Section 7.3 ~ Best-Fit Lines and Prediction Introduction to Probability and Statistics Ms. Young.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Topic 6.1 Statistical Analysis. Lesson 1: Mean and Range.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
MANAGEMENT AND ANALYSIS OF WILDLIFE BIOLOGY DATA Bret A. Collier 1 and T. Wayne Schwertner 2 1 Institute of Renewable Natural Resources, Texas A&M University,
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Chapter 2 Research in Abnormal Psychology. Slide 2 Research in Abnormal Psychology  Clinical researchers face certain challenges that make their investigations.
Associate Professor Arthur Dryver, PhD School of Business Administration, NIDA url:
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Correlation & Regression
Economic evaluation of health programmes Department of Epidemiology, Biostatistics and Occupational Health Class no. 19: Economic Evaluation using Patient-Level.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Status on statistical methods in dietary assessment and Multiple Source Method Heiner Boeing German Institute of Human Nutrition Potsdam- Rehbrücke Department.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Understanding Medical Articles and Reports Linda Vincent, MPH UCSF Breast SPORE Advocate September 24,
Chapter 21: More About Tests
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
ANOVA, Regression and Multiple Regression March
Example x y We wish to check for a non zero correlation.
Copyright (c) Bani Mallick1 STAT 651 Lecture 8. Copyright (c) Bani Mallick2 Topics in Lecture #8 Sign test for paired comparisons Wilcoxon signed rank.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Stats Methods at IC Lecture 3: Regression.
Statistics & Evidence-Based Practice
Unit 5 – Chapters 10 and 12 What happens if we don’t know the values of population parameters like and ? Can we estimate their values somehow?
Statistics in Clinical Trials: Key Concepts
Statistics and Data Analysis
Correlations: testing linear relationships between two metric variables Lecture 18:
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Measuring Dietary Intake Raymond J. Carroll Department of Statistics Faculty of Nutrition and Faculty of Toxicology Texas A&M University _________________________________________________________

I Still Cook Me in the kitchen, Yokohama (my birthplace), 1953 _________________________________________________________

Advertisement

College Station, home of Texas A&M University I-35 I-45 Big Bend National Park Wichita Falls, my hometown West Texas Palo Duro Canyon, the Grand Canyon of Texas Guadalupe Mountains National Park East Texas

What I am Not I know that potato chips are not a basic healthy food group. However, if you ask me a detailed question about nutrition, then I will ask Joanne LuptonNancy TurnerMeeyoung Hong _________________________________________________________

You are what you eat, but do you know who you are? This talk is concerned with a simple question. Will lowering her intake of fat decrease a womans chance of developing breast cancer? _________________________________________________________

Basic Outline Diet affects health. Many (not all!) studies though are not statistically significant. Focus: quality of the instruments used to measure diet Conclusion #1: The instruments are largely to blame. Conclusion #2: Expect studies to disagree _________________________________________________________

Evidence in Favor of the Fat- Breast Cancer Hypothesis Animal studies Ecological comparisons Case-control studies _________________________________________________________

International Comparisons _____________________________________________________________

Evidence against the Fat-Breast Cancer Hypothesis Prospective studies These studies try to assess a womans diet, then follow her health progress to see if she develops breast cancer The diets of those who developed breast cancer are compared to those who do not Only (?) 1 prospective study has found firm evidence suggesting a fat and breast cancer link, and 1 has a negative link _________________________________________________________

Prospective Studies NHANES (National Health and Nutrition Examination Survey): n = 3,145 women aged Nurses Health Study: n = 100,000+ Pooled Project: n = 300,000+ Norfolk (UK) study: n = 15,000+ _________________________________________________________

The Nurses Health Study, Fat and Breast Cancer _________________________________________________________ 60,000 women, followed for 10 years Prospective study Note that the breast cancer cases were announcing that they eat less fat Donna Spiegelman, the NHS statistician

Clinical Trials The lack of consistent (even positive) findings led to the Womens Health Initiative Approximately 40,000 women randomized to two groups: healthy eating and typical eating _________________________________________________________

WHI Diet Study Objectives _________________________________________________________

Prior Objections to WHI Cost ($415,000,000) Whether North Americans can really lower % Calories from Fat to 20%, from the current 38% Even if the study was successful, difficulties in measuring diet mean that we will not know what components led to the decrease in risk. _________________________________________________________

Change in Fat Calories Over Time _________________________________________________________ Women reported a decrease in fat- calories, but not to 20%

How do we measure diet in humans? 24 hour recalls Diaries Food Frequency Questionnaires (FFQ) _________________________________________________________ Walt Willett has a popular book and a popular FFQ

Food diaries Hot topic at NCI Only measures a few days diet, not typical diet A single 3-day diary finding a diet-cancer link is not universally scientifically acceptable Need for repeated applications Induces behavioral change?? _________________________________________________________

Typical (Median) Values of Reported Caloric Intake Over 6 Diary Days: WISH Study

The Food Frequency Questionnaire Do you remember the SAT? _________________________________________________________

The Pizza Question _________________________________________________________

The Norfolk Study with ~Diaries and FFQ _________________________________________________________ 15,000 women, aged 45-74, followed for 8 years 163 breast cancer cases Diary: p = FFQ: p = 0.229

Summary FFQ does not find a fat and breast cancer link 24 hour recalls and diaries are expensive They have found links, but in opposite directions Diaries also appear to modify behavior Question: do any of these things actually measure dietary intake? How well or how badly? These are statistical questions! _________________________________________________________

Do We Know Who We Are? Karl Pearson was arguably the 1 st great modern statistician Pearson chi-squared test Pearson correlation coefficient _________________________________________________________ Karl Pearson at age 30

Do We Know Who We Are? Pearson was deeply interested in self- reporting errors In 1896, Pearson ran the following experiment. For each of 3 people, he set up 500 lines of a set of paper, and had them bisected by hand _________________________________________________________ A gaggle of lines

Pearsons Experiment He then had an postdoc measure the error made by each person on each line, and averaged Dr. Lee spent several months in the summer of 1896 in the reduction of the observations _________________________________________________________ A gaggle of lines, with my bisections

Pearsons Personal Equations Pearson computed the mean error committed by each individual: thepersonal equations He found: the errors were individual. His errors were to the right, Dr. Lees to the left _________________________________________________________ Karl Pearson in later life

What Do Personal Equations Mean? Given the same set of data, when we are asked to report something, we all make errors, and our errors are personal In the context of reporting diet, we call this person-specific bias _________________________________________________________ Laurence Freedman of NCI, with whom I did the work

Model Details for Statisticians The model in symbols The existence of person-specific bias means that variance of true intake is less than one would have thought _________________________________________________________

Model Details for Statisticians The OPEN Study had the following measurements Two FFQ Two Protein biomarkers Two Energy biomarkers _________________________________________________________

Model Details for Statisticians The model in symbols Linear mixed model, fit by PROC MIXED _________________________________________________________

Attenuation The attenuation is the slope in the linear regression of X on Q _________________________________________________________

Relative Risk and Attenuation Start with a logistic model True relative risk Observed relative risk (regression calibration) _________________________________________________________

Relative Risk and Attenuation _________________________________________________________ AttenuationRelative Risk 1.0 (no meas. Error)

Our Hypothesis We hypothesized that when measuring Fat intake The personal equation, or person-specific bias, unique to each individual, is large and debilitating. The problem: the actual variability in American diets is much smaller than suspected. _________________________________________________________

Can We Test Our Hypothesis? We need biomarker data that are not much subject to the personal equation There is no biomarker for Fat There are biomarkers for energy (calories) and Protein We expect that studies are too small by orders of magnitude _________________________________________________________

Biomarker Data Calories and Protein: Available from NCIs OPEN study Results are surprising Victor Kipnis was the driving force behind OPEN _________________________________________________________

Sample Size Inflation There are formulae for how large a study needs to be to detect a doubling of risk from low and high Fat/Energy Diets These formulae ignore the personal equation We recalculated the formulae _________________________________________________________

Biomarker Data: Sample Size Inflation _________________________________________________________ If you are interested in the effect of calories on health, multiply the sample size you thought you needed by 11. For protein, by 4.5

Relative Risk _________________________________________________________ If high calories increases the risk of breast cancer by 100% in fact, and you change your intake dramatically, the FFQ thinks doing so increases the risk by 4% Result: It is not possible to tell if changing your absolute caloric intake, or your fat intake, or your protein intake will have any health effects

Relative Risk, Food Composition _________________________________________________________ If high protein (fat) increases the risk of breast cancer by 100%, your calories remain the same, you dramatically lower your protein (fat) intake, then FFQ thinks your risk increases by 20%- 30% Result: It is pretty difficult to tell if changing your food composition while maintaining your caloric intake will have any health effects

New Results The AARP Study: 250,000+ women, by far the greatest number in any single study Results according to rumor: Huge size statistical significance FFQ small measured increase in risk for dramatic behavioral change Statisticians dream: use Pearsons idea to get at the true increase in risk _________________________________________________________ A happy statistician dreaming about AARP

New Results The WHI Controls Study: 30,000+ women All with > 32% Calories from Fat via FFQ Diaries in a nested case- control study Highly significant fat effect in the diaries (RR in quantiles of 1.6) _________________________________________________________ A happy statistician doing field biology in Northwest Australia (the Kimberley)

Summary WHI, 2006, clinical trial My best case conjecture in 2005: Probably no statistically significant effects The p-value was 0.07, relative risk about 1.2 My best case conjecture in 2008 after further follow-up Statistically significant, modest effects _________________________________________________________

You are what you eat, but do you know who you are? Diet is incredibly hard to measure Even 100% increases in risk cannot be seen in large cohort studies with an FFQ If you read about a diet intervention, measured by a FFQ, and it achieves statistical significance multiple times: wow! _________________________________________________________

You are what you eat, but do you know who you are? Much work at NCI and WHI and EPIC on new ways of measuring diet EPIC (a multi-country study) may be a model, because of the wide distribution of intakes _________________________________________________________

What Was Done The OPEN analysis actually fit Protein and Energy together. We call this the Seemingly Unrelated Measurement Error Model Can get major gains in efficiency _________________________________________________________

SUMEM Gains in efficiency come from the correlations of the random effects _________________________________________________________