Vanderbilt’s DNA Databank:

Slides:



Advertisements
Similar presentations
Meaningful Use and Health Information Exchange
Advertisements

Lecture 2 Strachan and Read Chapter 13
Maryam Nazir. Personal Genomics:  Branch of genomics concerned with the sequencing and analysis of the genome of an individual  Once sequenced, it can.
Genetic Analysis in Human Disease
CARe: The NHLBI’s Candidate Gene Association Resource.
Area 4 SHARP Face-to-Face Conference Phenotyping Team – Centerphase Project Assessing the Value of Phenotyping Algorithms June 30, 2011.
The NIHR Cambridge BioResource A means of correlating disease susceptibility genotypes with phenotypes Bringing together local people and leading research.
Reuse of Electronic Medical Records for Research Our architecture Two examples.
Re-Examination of the Design of Early Clinical Trials for Molecularly Targeted Drugs Richard Simon, D.Sc. National Cancer Institute linus.nci.nih.gov/brb.
Course Overview Personalized Medicine: Understanding Your Own Genome Fall 2014.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Standard 5: Patient Identification and Procedure Matching Nicola Dunbar, Accrediting Agencies Surveyor Workshop, 10 July 2012.
Is this Research? Exempt? Expedited?
Overview of the Synthetic Derivative April 16, 2010 Melissa Basford, MBA Program Manager – Synthetic Derivative.
NEW ENHANCEMENTS IN THE SYNTHETIC DERIVATIVE AND WHAT THAT MEANS FOR THE RESEARCHER Jacqueline Kirby June 7 th, 2013.
Luděk Bláha, PřF MU, RECETOX BIOMARKERS AND TOXICITY MECHANISMS 13 – BIOMARKERS Summary and final notes.
Vanderbilt’s DNA Databank: BioVU. Personalized Medicine Integration of genomic information into clinical decision making Personalized disease treatment.
Understanding Genetics of Schizophrenia
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Translational Research and Patient Safety in Europe TRANSFoRm: Requirements analysis for the learning healthcare system.
Human Research Protection Programs 1a: How to Navigate Human Subject Protection Regulations Sponsored by the American Society for Investigative Pathology.
Strong Heart Family Study Phase VI Genetics Center Aims October 8, 2009.
BioVU and the Synthetic Derivative Erica Bowton, PhD Program Manager, Personalized Medicine.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Butte Lab Journal Club 16 Aug 2010 Alexander A. Morgan.
Analyzing DNA Differences PHAR 308 March 2009 Dr. Tim Bloom.
Future Use of Stored Samples & Data and the NIH Policy on GWAS and dbGaP NIAID/DAIDS Dione Washington, M.S. -- ProPEP Sudha Srinivasan, Ph.D.-- TRP Tanisha.
PGPop: PharmacoGenomic discovery and replication in very large patient POPulations PGPop: SUMMARY PGPop was conceived as a network resource to provide.
Utilizing Science & Technology and Innovation for Development A Genome Wide Association Study for Type 2 Diabetes Susceptibility Gene and Treatment in.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Anticipated FY2016 Appropriations Agency$ Million NIH200 Cancer70 Cohort130 FDA10 Office of the Natl Coord. for Health IT (ONC) 5 TOTAL215 Mission: To.
1 Jack London, PhD Research Professor, Cancer Biology Thomas Jefferson University Informatics Shared Resource Director Sidney Kimmel Cancer Center at Jefferson.
Building the Electronic Data Infrastructure: Lessons from Indiana PROSPECT Paul Dexter, MD Chief Medical Information Officer, Wishard Health Services Regenstrief.
OVERVIEW OF THE SYNTHETIC DERIVATIVE June 29, 2012 Melissa Basford, MBA Program Manager – Synthetic Derivative.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
Pharmacogenetics & Pharmacogenomics Personalized Medicine.
CS177 Lecture 10 SNPs and Human Genetic Variation
EXPLORING DATA AND COHORT DISCOVERY IN THE SYNTHETIC DERIVATIVE.
Sample to Insight Alexander Kaplun, PhD Sep PGMD: a comprehensive pharmacogenomic database for personalized medicine and drug discovery.
Personalized Medicine Dr. M. Jawad Hassan. Personalized Medicine Human Genome and SNPs What is personalized medicine? Pharmacogenetics Case study – warfarin.
LEADING RESEARCH… MEASURES THAT COUNT Challenges of Studying Cardiovascular Outcomes in ADHD Elizabeth B. Andrews, MPH, PhD, VP, Pharmacoepidemiology and.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
One Health Information Exchange’s experience in responding to the changing landscape Funding: AHRQ Contract ; State of Tennessee; Vanderbilt.
1 Ethical issues in genomics research Bernard Lo, M.D. March 3, 2009.
Risk Prediction of Complex Disease David Evans. Genetic Testing and Personalized Medicine Is this possible also in complex diseases? Predictive testing.
SHARe. What is WHI-SHARe? SHARe – stands for SNP Health Association Resource. It is an NHLBI program for genome-wide association studies (GWAS). GWAS.
Health Services Research SIG October 2010 – September 2011 Newly established SIG (no previous WHI focus on health services research or claims data) About.
Leveraging Big Data and Electronic Health Records to Create a Data Fluent Culture for Cancer Medicine Mia Levy, MD, PhD October 8, 2015 Komen Big Data.
Enrollment and Monitoring Procedures for NCI Supported Clinical Trials Barry Anderson, MD, PhD Cancer Therapy Evaluation Program National Cancer Institute.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
VICTR Data Management CRC Research Skills Workshop Michael Assink April 6 th, 2012.
D4FF55A0-6B6F BF422A9BA9 Present by: Xiao Chen On December 7, 2015.
Use of Data from the Electronic Medical Record in Research Opportunities & Pit Falls Kristin West.
Kathleen Giacomini, Mark J. Ratain, Michiaki Kubo, Naoyuki Kamatani, and Yusuke Nakamura NIH Pharmacogenomics Research Network III & RIKEN Center for Genomic.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Regulatory Guidance for Genetic Testing. Three Specific Areas Laboratory tests Results of genetic testing – Clinical – Research GenomeWide Association.
Human Research Protection Program 101 March 20, 2007 Cincinnati, OH.
Uses of the NIH Collaboratory Distributed Research Network Jeffrey Brown, PhD for the DRN Team Harvard Pilgrim Health Care Institute and Harvard Medical.
Personal DNA Testing Melanie Wark Mackenzie Steen.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Brendan Burke and Kyle Steffen. Important New Tool in Genomic Medicine GWAS is used to estimate disease risk and test SNPs( the most common type of genetic.
Genomics and Disease Gene Identification. Is the Disease Genetic or Environmental.
Single Nucleotide Polymorphisms (SNPs
Clinical and Translational Research at Vanderbilt
Presentation for the SCTR Scientific Retreat on Aging Related Research
TOPMed Analysis Workshop Genetic Analysis Center Biostatistics Department University of Washington TOPMed Data Coordinating Center August 7-9, 2017 Introduction.
Discovery From Data Repositories H Craig Mak  Nature Biotechnology 29, 46–47 (2011) 2013 /06 /10.
Introduction to Genetic Association Studies
Presentation transcript:

Vanderbilt’s DNA Databank: BioVU

Personalized Medicine Integration of genomic information into clinical decision making Personalized disease treatment and also preventative therapies

Personalized Medicine A SNP is a single base-pair mutation that occurs at a specific site in the DNA sequence - occurs in at least 1% of the population SNPs are responsible for over 80% of the variation between two individuals; they are ideal for establishing correlations between genotype and phenotype As some SNPs predispose individuals to have a certain disease or trait or react to a drug in a different way, they will be highly useful in diagnostics and drug development

What is BioVU? The move towards personalized medicine requires very large sample sets for discovery and validation BioVU: biobank intended to support a broad view of biology and enable personalized medicine Contains de-identified DNA extracted from leftover blood after clinically-indicated testing of Vanderbilt patients who have not opted out Linked to Synthetic Derivative: de-identified EMR Current sample number: 116,551 105,910 adult samples 10,641 pediatric samples

The “synthetic derivative” Extract DNA John Doe A7CCF99DE65732…. eligible A7CCF99DE5732…. One way hash A7CCF99DE65732…. scrubbed John Doe The “synthetic derivative” (SD): can be updated

Synthetic Derivative vs. BioVU A7CDE6532 …. scrubbed A7CDE6532 …. scrubbed A7CDE6532…. + Synthetic Derivative BioVU ~1.9 million ~116,000

The Synthetic Derivative A Derivative of the EMR - information content reduced by ‘scrubbing’ identifiers Systematically shifted event dates Contains ~1.9 million records ~1 million with detailed longitudinal data averaging 100,000 bytes in size an average of 27 codes per record Records updated over time and are current through 9/31/09 Can be searched restricting to records for which DNA is available

Synthetic Derivative Data Types Narratives, such as: Clinical Notes Discharge Summaries History and Physicals Problem Lists Surgical Reports Progress Notes Letters Diagnostic Codes, Procedural Codes Forms (intake, assessment) Reports (pathology, ECGs, echocardiograms) Clinical Communications Lab Values and Vital Signs Medication Orders TraceMaster (ECGs)

BioVU Program Review

Sample accrual

BioVU Sample Management Samples are stored in a robotic sample storage system that allows for fully automated high speed sample picking. Currently genotyping on BioVU samples is being done in the DNA Resources Core here at Vanderbilt. RTS SmaRTStore

Validation in BioVU Sample handling algorithms Ancestry Gender match 1/384 gender mismatches Ancestry Characterize sample ancestry, assess usefulness of ‘race’ as defined in EMR Provide a panel of ancestry informative markers that define ancestry No significant difference between the concordance of self-report or observer-report with genetic ancestry Demonstration project – American Journal of Human Genetics Can known associations between genetic variants and common diseases be identified in the EMR?

The “demonstration project” Genotype “high-value” SNPs in the first 8,000 samples accrued. including SNPs associated by replicated genome-wide experiments with common diseases & traits Atrial fibrillation Crohn’s disease Multiple Sclerosis Rheumatoid arthritis Type II Diabetes Develop Natural Language Processing methods to identify cases and controls Are genotype-phenotype relations replicated?

First results 0.5 1.0 2.0 5.0 Odds Ratio gene / disease marker region Chr. 4q25 Atrial fibrillation rs10033464 Chr. 4q25 rs11805303 IL23R rs17234657 Chr. 5 Crohn's disease rs1000113 Chr. 5 rs17221417 NOD2 rs2542151 PTPN22 rs3135388 DRB1*1501 Multiple sclerosis rs2104286 IL2RA rs6897932 IL7RA rs6457617 Chr. 6 Rheumatoid arthritis rs6679677 RSBN1 rs2476601 PTPN22 rs4506565 TCF7L2 rs12255372 TCF7L2 rs12243326 TCF7L2 rs10811661 CDKN2B Type 2 diabetes rs8050136 FTO rs5219 KCNJ11 rs5215 KCNJ11 rs4402960 IGF2BP2 0.5 1.0 2.0 5.0 Odds Ratio

First results 0.5 1.0 2.0 5.0 Odds Ratio gene / disease marker region Chr. 4q25 Atrial fibrillation rs10033464 Chr. 4q25 rs11805303 IL23R rs17234657 Chr. 5 Crohn's disease rs1000113 Chr. 5 rs17221417 NOD2 rs2542151 PTPN22 rs3135388 DRB1*1501 Multiple sclerosis rs2104286 IL2RA rs6897932 IL7RA rs6457617 Chr. 6 Rheumatoid arthritis rs6679677 RSBN1 rs2476601 PTPN22 rs4506565 TCF7L2 rs12255372 TCF7L2 rs12243326 TCF7L2 rs10811661 CDKN2B Type 2 diabetes rs8050136 FTO rs5219 KCNJ11 rs5215 KCNJ11 rs4402960 IGF2BP2 0.5 1.0 2.0 5.0 Odds Ratio

Types of projects Discovery or validation of genotype-phenotype relations for disease susceptibility or drug responses Discovery of new disease/susceptibility genes  resequence in patients (obesity, Cushing's, susceptibility to infection, insomnia, pre-term birth) Access samples without disease X, or “normals” of specified ancestry, or old normals Phenome-wide association study (PheWAS): in development

Research Use Cases Retrospective chart reviews Rapid preliminary data for grant submissions Feasibility assessment Hypothesis generation

Examples of ICD-9 codes for rare diseases Example Rare Disease Number in SD Number in BioVU Microcephalus 1,070 85 Pica 115 22 Septicemic Plague 21 Pick’s Disease 45 8 Acromegaly and Gigantism 571 123 Ehlers-Danlos Syndrome 285 34 Narcolepsy without Cataplexy 438 76 Spina Bifida 1968 238 Stiff-Man Syndrome 82 17 Tourette Syndrome 667 Bell’s Palsy 2534 402 Bulimia Nervosa 919 88 Cushing’s 1443 298 Peyronies Disease 694 157 Wilson’s Disease 140 49 Meningioma 1444 355 Wegener’s 363 141

Data use agreement + IRB Approval Investigator query Data use agreement + IRB Approval cases controls + 20

Data use agreement + IRB Approval Investigator query Data use agreement + IRB Approval Manual Review cases controls + 21

Data use agreement + IRB Approval cases controls + Investigator query Data use agreement + IRB Approval Sample retrieval cases controls + 22

+ + cases Investigator query controls Sample retrieval cases controls Genotyping, genotype-phenotype relations Investigator query Data use agreement + IRB Approval Sample retrieval cases controls + 23

Data Use Agreement

Genotyping Data Accrual

Coronary Artery Disease Nationally Prevalent Diseases in the African American Population Disease BioVU Count Hypertension 1095 Type 2 Diabetes 714 Coronary Artery Disease 273 Kidney Disease 252 Asthma 210 Pneumonia 193 Stroke 133 Lupus 48 Lung cancer 21 Genotype data on 1786 AA subjects

BioVU Home page. Description of resource BioVU Home page. Description of resource. On the left, you can click for application instructions, status of your application, information about support for BioVU projects, and Record Counter to determine approximate number of records that meet specific criteria (useful for sample size estimations and power calculations when developing BioVU projects/proposals). Tools for assistance with grant language and also statistical analysis plan for BioVU proposals. Click “Genotyping Data” button for more information about current sample counts and genotyping data (page that comes up is on next slide).

BioVU Home page. Description of resource BioVU Home page. Description of resource. On the left, you can click for application instructions, status of your application, information about support for BioVU projects, and Record Counter to determine approximate number of records that meet specific criteria (useful for sample size estimations and power calculations when developing BioVU projects/proposals). Tools for assistance with grant language and also statistical analysis plan for BioVU proposals. Click “Genotyping Data” button for more information about current sample counts and genotyping data (page that comes up is on next slide).

Record Counter is on the BioVU homepage.

This is the screen when you log into Record Counter.

You can query by ICD-9 code, for example MS.

You can also query for medications as shown here.

It will instruct you to enter the medication as shown here with avonex as an example.

You can search for multiple medications as well.

For example, if you want to query the records for MS cases with interferon treatment, you can add the medications to the same Criteria Group as shown here for 3 different interferon medications. This results in a query for MS cases with either avonex, betaseron, OR rebif. This will pull MS records with an occurrence of any of the 3 medications. If you wanted to query for MS cases on all 3 medications, you would need to add each medication to a SEPARATE Criteria Group. This is just using “and” / “or” logic. If you want to query only cases with DNA samples in BioVU, you have to click “Limit by DNA availability”. When you have all your criteria entered, click “Execute Query”.

This will give you a readout of total number of cases in BioVU that meet your criteria with demographic breakdown.

This will give you a readout of total number of cases in BioVU that meet your criteria with demographic breakdown.

BioVU Home page. Description of resource BioVU Home page. Description of resource. On the left, you can click for application instructions, status of your application, information about support for BioVU projects, and Record Counter to determine approximate number of records that meet specific criteria (useful for sample size estimations and power calculations when developing BioVU projects/proposals). Tools for assistance with grant language and also statistical analysis plan for BioVU proposals. Click “Genotyping Data” button for more information about current sample counts and genotyping data (page that comes up is on next slide).

This will give you a readout of total number of cases in BioVU that meet your criteria with demographic breakdown.

When you click on “Application Instructions”, an explanation of the 5 different application options will appear. This depends on whether investigator will be requesting genotyping or not and also whether the investigator intends to apply for VICTR funding to support their BioVU project. The first option is just for SD users (the process will now be electronic).

An example of the web-based application process (this particular example is for an investigator who wants to request genotyping as well as apply for VICTR funding). Investigators will need IRB approval for their project, a 5 page research proposal, and a biosketch. They will also need to sign a Data Use Agreement. Once all of these are completed, they will complete the electronic application (in REDCap) where they will download all of their documents and complete application information. The link to VICTR resource request is also there (this process will not change – they will apply for VICTR funding as usual, however their application will not be processed until Shraddha receives the go ahead that the BioVU proposal has been approved).

For assistance with BioVU project development or application submission, there are office hours, information clinics, and studios. Investigators may also apply for VICTR funding.

BioVU Application Process Investigator(s) completes VICTR Resource Request Funding request reviewed by VICTR SRC BioVU Application Process Proposal reviewed by BioVU Review Committee BioVU program office contacts investigators with any necessary revisions if applicable Proposal Approved/ Access granted Investigator resubmits proposal if necessary Total Time: Data Requests: 4-6 weeks DNA Access: 8-12 weeks Funding Decision Proposal Review Process:

BioVU Genotyping Process Genotyped data analyzed by investigator Investigator selects cases and controls from Synthetic Derivative Investigator signals BioVU program to initiate sample selection BioVU notifies DNA resources core that samples are ready for selection and picking Samples are provided to appropriate lab and are genotyped Investigator and BioVU program receive genotype data BioVU Genotyping Process:

BioVU Requests 37 Total Requests 24 Approvals

FAQ “answers” SD access: “non-human subjects” IRB review (days) Current access costs: $4/sample Genotyping: Investigator-funded Consider VICTR as a funding source Genotyping/sequencing performed in VUMC Core Facilities Justification must be provided for outside genotyping, including quality control plans Genotype “redeposit” part of the data use agreement Anticipate 16,000 BioVU subjects will have GWAS-type genotyping data by fall 2011

Contact: Erica Bowton PhD Questions? Contact: Erica Bowton PhD BioVU Program Manager erica.bowton@vanderbilt.edu 322-1975