Show me the evidence! Discovery through exhaustive search of the evidence Isaac S. Kohane, MD, PhD Associate Professor of Health Sciences and Technology.

Slides:



Advertisements
Similar presentations
Regulation of Consumer Tests in California AAAS Meeting June 1-2, 2009 Beatrice OKeefe Acting Chief, Laboratory Field Services California Department of.
Advertisements

Predictive Analysis of Gene Expression Data from Human SAGE Libraries Alexessander Alves* Nikolay Zagoruiko + Oleg Okun § Olga Kutnenko + Irina Borisova.
Understanding Statistics in Research Articles Elizabeth Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management Assistant Professor,
Obtaining The Numbers Behind the Translational Imperative Harvard Medical School Center for Biomedical Informatics i2b2 National Center for Biomedical.
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
Estimation of Sample Size
From Bedside to Bench and Back
Genetics: Past, Present, and Future Robert M. Fineman, M.D., Ph.D. Web siteWeb site.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence November-December 2007.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence May–June 2009.
3 rd Summer School in Computational Biology September 10, 2014 Frank Emmert-Streib & Salissou Moutari Computational Biology and Machine Learning Laboratory.
Introduction to Molecular Epidemiology Jan Dorman, PhD University of Pittsburgh School of Nursing
MammaPrint, the story of the 70-gene profile
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
University of Utah Department of Human Genetics Pharmacogenomics Louisa A. Stark, Ph.D. Director.
AKI: The “Omics” Prasad Devarajan, MD Professor of Pediatrics and Developmental Biology University of Cincinnati College of Medicine Director, Nephrology.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
Critical Appraisal of Clinical Practice Guidelines
Genomics Alexandra Hayes. Genomics is the study of all the genes in a person, as well as the interactions of those genes with each other and a person’s.
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
Multiple Choice Questions for discussion
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Assessing Minority Participation in Clinical Trials: Setting Attainable Goals The Minority and Women Clinical Trials Recruitment Program Department of.
NAPHSIS Annual Meeting 2009 Electronic Health Records: Why are they important? Linette T Scott, MD, MPH Deputy Director Health Information and Strategic.
DEB BYNUM, MD AUGUST 2010 Evidence Based Medicine: Review of the basics.
Prediction-based Threshold for Medication Alert Yoshimasa Kawazoe 1 M.D., Ph.D., Kengo Miyo 1,2 Ph.D., Issei Kurahashi 2 Ph.D., Ryota Sakurai 1 M.D. Kazuhiko.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Journal Club/September 24, Swing et al. Television and video game exposure and the development of attention problems. Pediatrics 2010;126:
Bioinformatics and medicine: Are we meeting the challenge?
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
1 Jack London, PhD Research Professor, Cancer Biology Thomas Jefferson University Informatics Shared Resource Director Sidney Kimmel Cancer Center at Jefferson.
The Value of Tissue Banks to Drug and Dx Developers Barbara L. Handelin, Ph.D. Conflicts of Interest, Privacy/Confidentiality, and Tissue Repositories:
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Term 4, 2006BIO656--Multilevel Models 1 Midterm Open “book” and notes; closed mouth minutes to read carefully and answer completely  60 minutes.
Acknowledgements Contact Information Anthony Wong, MTech 1, Senthil K. Nachimuthu, MD 1, Peter J. Haug, MD 1,2 Patterns and Rules  Vital signs medoids.
Results Compliance with Breast Cancer Screening Guidelines in the HIV Clinic: A Quality Improvement Tool E. Patrozou M.D., E. Christaki M.D., L. Hicks.
Wisconsin Genomics Initiative W isconsin M edical R esearch T riangle Wisconsin Medical Discovery Triangle.
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
Clinical Decision on Harm. Clinical scenario or question Will laparoscopic hysterectomy increase post operative complications for our obese patient with.
Population Management What is that and why do I need to know? Elisha Brownfield, MD.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Validation and Refinement of a Prediction Rule to Identify Children at Low Risk for Acute Appendicitis Kharbanda AB, Dudley NC, Bajaj L, et al; Pediatric.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5.
Vitamin D: A New Frontier in Diabetes Management Contact Information: Background Acknowledgement Methods.
EBM --- Journal Reading Presenter :黃美琴 Date : 2005/10/27.
A Cost-Effectiveness Analysis of Maternal Genotyping to Guide Treatment in Postnatal Patients.
NAAT identified chlamydial infections: Enhanced sensitivity, reduced transmissibility? Presenter: Maria Villarroel, MA Authors: Maria A. Villarroel, MA.
Chips? SNPs? or PCR? What do we really want and what do we need? Heinz-Josef Lenz, MD Professor of Medicine Co-Director, Colorectal Center Co-Director,
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Case Control study. An investigation that compares a group of people with a disease to a group of people without the disease. Used to identify and assess.
Shubhangi Arora1; Eden Haverfield2; Gabriele Richard2; Susanne B
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
Biomedical Data Science for Precision Medicine
National Healthcare Science Week 2017
Gregory Cooper Professor of Biomedical Informatics Director, Center for Causal Discovery Vice Chair Research, Department of Biomedical Informatics.
Statistical Applications in Biology and Genetics
Introduction to translational and clinical bioinformatics Connecting complex molecular information to clinically relevant decisions using molecular.
Influenza Vaccine Effectiveness Against Pediatric Deaths:
Medication Use Pattern Mining for Childhood Pneumonia Using Six Year Inpatient Electronic Medical Records in a Shanghai Hospital, China Chunlei Tang, PhD1,2,3,
Genomes and Their Evolution
Endometriosis and ovarian cancer: potential benefits and harms of screening and risk- reducing surgery  Sun-Wei Guo, M.Med., Ph.D.  Fertility and Sterility 
Wisconsin Genomics Initiative
Gregory Cooper Professor of Biomedical Informatics Director, Center for Causal Discovery Vice Chair Research, Department of Biomedical Informatics.
ERRORS, CONFOUNDING, and INTERACTION
Selecting the Right Predictors
Research Techniques Made Simple: Interpreting Measures of Association in Clinical Research Michelle Roberts PhD,1,2 Sepideh Ashrafzadeh,1,2 Maryam Asgari.
Presentation transcript:

Show me the evidence! Discovery through exhaustive search of the evidence Isaac S. Kohane, MD, PhD Associate Professor of Health Sciences and Technology and Pediatrics, Harvard Medical School Co-Director HMS Center for Biomedical Informatics Director, Countway Library of Medicine, Harvard Medical School Director, Children’s Hospital Informatics Program?

One Very Old Biomarker: A Prismatic Example of the Failure of EBM 1410 original articles on PSA screening for cancer 179 review articles With 1000’s of gene products, how do we systematically address this problem?

Relevance Networks & Computational Challenges

Patients and cell lines are analyzed as cases Clinical parameters, laboratory tests, RNA expression, and susceptibility to anti-cancer agents are all example features of those cases Construction of Relevance Networks 1 Lab Test Lab Test Clinical Param RNA Expr J02923 Susceptibility to Anti-cancer Agent Patient, Cell Line, Time, etc

Construction of Relevance Networks 3 Perform a pairwise comparison between all features For each scatter plot, we fit a linear model and stored –Correlation coefficient r Every feature is completely connected to every other feature by a linear model of varying quality Lab Test r = 0.65 Susceptibility to Anti-cancer Agent Lab Test 2

r ^ 2 = r 2 * r / abs(r) Choose a threshold r ^ 2 to split the network Drop links with r ^ 2 under threshold Breaks the completely connected network into islands where connections are stronger than threshold Islands are what we call “relevance networks” Display graphically, with thick lines representing strongest links Construction of Relevance Networks 4 Lab Test 1 Lab Test 2 r^2r^2 Clinical Param 1 Expression of J02923 Clinical Param 2 Susceptibility to r^2r^2 r^2r^2 r^2r^2 r^2r^2

The New Pharmacology RNA expression in NCI 60 cell lines was determined using Affymetrix HU6000 arrays –5,223 known genes –1,193 expressed sequence tags The RNA expression data set and Anti-cancer susceptibility data set were merged, using the 60 cell lines the two tables had in common RNA ExpressionDrug Susceptibility 6,000 genes5,000 anti-cancer agents Common 60 Cell Lines

Genes and Anti-Cancer Agents Threshold r ^ 2 was networks 834 features out of 11,692 (7.1%) 1,222 links out of 68,345,586 (.0018%) Only one link between a gene and anti-cancer agent

Genes and Anti-Cancer Agents Elevated levels of J02923 (lymphocyte cytosolic protein-1, LCP1, L-plastin, pp65) is associated with increased sensitivity to Agent is 4-Thiazolidinecarboxylic acid, 3-[[6-[2-oxo-2-(phenylthio)ethyl]- 3-cyclohexen-1-yl]acetyl]-2 thioxo-, methyl ester, [1R-[1a(R*),6a]]- (9CI)) LCP1 is an actin-binding protein involved in leukocyte adhesion A role for LCP1 in tumorogenicity has been previously postulated Low level expression of LCP1 is thought to occur in most human cancer cell lines Other thiazolidine carboxylic acid derivatives are known to inhibit tumor cell growth Butte et al. PNAS 2000

RelNet: Live Patient ‘Phenotypic’ Data –After patient cohorts are identified through the electronic medical records analysis and they agree to participate in the ‘genomics part’ of the study (see Globe story ) –RelNet methodology is easily expandable to non- numeric, patient demographics and other phenotypic data variables identified from the electronic medical records part of the analysis –This will potentially increase the number of variables with thousands that will make the problem even more challenging

New taxonomy Discoveries samples: BDNF expressed at lower levels in data sets associated with aging BDNF previously shown to be decreased in aging skin. 9 other genes associated with aging.

Biomarkers as Continuum

The single gene case Single genomic test that is 99.9% sensitivity (true positive rate) and 99.9% specificity (true negative rate) for a rare treatable disease X –Works well in patient group that is suspected of disease (high prior) In the general population (disease 1 in 100,000) –In 10,000,000 patients –100 true positives, 10,000 false positives –Less than 1 in 1000 positives are true.

The economic case The comprehensiveness case Using a much much better test –sensitivity of 100% and a specificity of 99.99% –Only 10 in 100,000 false positives A chip with 10,000 independent tests Impanelling the Incidentalome

Averting the Incidentalome’s Threat Education of clinicians Informatics for decision support of sequencing of clinical testing Real evidence, real probabilities.

Example: PPAR  Pro12Ala and diabetes Estimated risk (Ala allele) Deeb et al. Mancini et al. Ringel et al. Meirhaeghe et al. Clement et al. Hara et al. Altshuler et al. Hegele et al. Oh et al. Douglas et al. All studies Lei et al. Hasstedt et al Sample size Ala is protective Mori et al. Overall P value = 2 x Odds ratio = 0.79 ( ) Courtesy J. Hirschhorn

Airways Disease Research at Brigham and Women’s Hospital China ASP CAMP NHW Costa Rica Hispanic China Asian Costa Rica 10 EP 60,000 subjects 20 studies Linkage Fine Mapping Candidate Genes Pharmacogenetics

Who? Health Care Utilization (Hospitalization, ED Visits) + Genes Clinical Factors 96,000 patients identified through i2b2 datamart

Thank you