Discovery From Data Repositories H Craig Mak  Nature Biotechnology 29, 46–47 (2011) 2013 /06 /10.

Slides:



Advertisements
Similar presentations
Design and Implementation of a Web-Based Patient Portal Linked to an Ambulatory Care Electronic Health Record: Patient Gateway for Diabetes Collaborative.
Advertisements

PERSONALIZED MEDICINE: Planning for the Future You, Your Biomarkers and Your Rights.
Lecture 2 Strachan and Read Chapter 13
Applications of HGP Genetic testing Forensics. Testing for a pathogenic mutation in a certain gene in an individual that indicate a persons risk of developing.
Health IT Certificate Series. 2 Why Health IT? Health information technology (health IT) makes it possible for health.
What is an association study? Define linkage disequilibrium
Genetic Analysis in Human Disease
Reuse of Electronic Medical Records for Research Our architecture Two examples.
2 The Use of Health Information Technology in Physician Practices.
Genetic Epidemiology Lecture 13 PS Timiras. A Few Definitions GENOME: THE COMPLETE SET OF GENES OF AN ORGANISM GENOTYPE: THE GENETIC CONSTITUTION OF.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Human Genetics Overview.
PrimeSUITE’s Practice Management and Electronic Health Record Software
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Study Design Discussion The Ghost of Candidate Gene Past and the Ghost of Genome-wide Association Yet to Come Stephen S. Rich, Ph.D. Wake Forest University.
Chapter 2 Electronic Health Records
August 12, Meaningful Use *** UDOH Informatics Brown Bag Robert T Rolfs, MD, MPH.
Terminology in Health Care and Public Health Settings
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Butte Lab Journal Club 16 Aug 2010 Alexander A. Morgan.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Introduction to Public Health Informatics and their Applications August 27, 2015 Francis B Annor Georgia Department of Public Health.
PHDSC session Readiness of public health information systems to support Meaningful Use of EHRs through health information exchanges.
Medicaid HIT Program Jon Barley, Ph.D., Chief Bureau of Health Services Research Office of Ohio Health Plans
The potential to bring huge benefits to Patients..
Unit 1b: Health Care Quality and Meaningful Use Introduction to QI and HIT This material was developed by Johns Hopkins University, funded by the Department.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
This material was developed by Oregon Health & Science University, funded by the Department of Health and Human Services, Office of the National Coordinator.
Component 3-Terminology in Healthcare and Public Health Settings Unit 15-Overview/ Introduction to the EHR This material was developed by The University.
Component 3-Terminology in Healthcare and Public Health Settings Unit 17-Clinical Vocabularies This material was developed by The University of Alabama.
Wisconsin Genomics Initiative W isconsin M edical R esearch T riangle Wisconsin Medical Discovery Triangle.
Breaking the Code Information Technology, Genomics, and the Journey Toward Personalized Medicine.
Unit 3.02 Understanding Health Informatics.  Health Informatics professionals treat technology as a tool that helps patients and healthcare professionals.
SSHRC - ERA - SAGE Workshop Research on the Ethical, Legal and Societal Aspects of Human Genomics: North America Denise Avard PhD Director of Research.
Terminology in Health Care and Public Health Settings Unit 15 Overview / Introduction to the EHR.
This material was developed by Oregon Health & Science University, funded by the Department of Health and Human Services, Office of the National Coordinator.
Kathleen Giacomini, Mark J. Ratain, Michiaki Kubo, Naoyuki Kamatani, and Yusuke Nakamura NIH Pharmacogenomics Research Network III & RIKEN Center for Genomic.
Health IT Workforce Curriculum Version 1.0 Fall Networking and Health Information Exchange Unit 4a Basic Health Data Standards Component 9/Unit.
Terminology in Healthcare and Public Health Settings Electronic Health Records Lecture a – Introduction to the EHR This material Comp3_Unit15 was developed.
Moving Toward HITECH Healthcare EHR Adoption at the Dawn of a New Era
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Copyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. McGraw-Hill/Irwin Chapter 2 Clinical Information Standards – Unit 3 seminar Electronic.
Implementation of Electronic Health Records at Choctaw General Hospital Lisa Sims November 12, 2012.
3.02 Understand Health Informatics
Single Nucleotide Polymorphisms (SNPs
SNPs and complex traits: where is the hidden heritability?
SSHRC - ERA - SAGE Workshop Research on the Ethical, Legal and Societal Aspects of Human Genomics: North America Denise Avard PhD Director of Research.
Genomic Analysis: GWAS
Development of an interactive pipeline for Genome wide association analysis Falola Damilare & Adigun Taiwo – Covenant University Bioinformatics research.
Lesson 1- Introduction to Electronic Health Records
3.02 Understand Health Informatics
Genome Wide Association Studies using SNP
3.02 Understand Health Informatics
Electronic Health Information Systems
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Lesson 1- Introduction to Health Information Technology
3.02 Understand Health Informatics
Special Topics in Vendor-Specific Systems
Wisconsin Genomics Initiative
Medical genomics BI420 Department of Biology, Boston College
Fig. 3 Context-dependent inherited risk and the importance of intermediate phenotypes in clinical research. Context-dependent inherited risk and the importance.
Osteoarthritis year 2012 in review: genetics and genomics
Medical genomics BI420 Department of Biology, Boston College
Component 11 Unit 7: Building Order Sets
3.02 Understand Health Informatics
3.02 Understand Health Informatics
Osteoarthritis year 2012 in review: genetics and genomics
EHR Customization Services by CustomSoft India. Objective EHR customized by CustomSoft offers functionality for charting, problem lists, medication management,
Lesson 1- Introduction to Electronic Health Records
Presentation transcript:

Discovery From Data Repositories H Craig Mak  Nature Biotechnology 29, 46–47 (2011) 2013 /06 /10 吳柏誠 陳孝銓 朱唐廷 謝煒騏 李峻霆 徐振庭 葉騰遠

Introduction Electronic Health Records(EHR) Improve care coordination Reduce healthcare disparities Engage patients and their families Improve population and public health Ensure adequate privacy and security 2019/5/12

Electronic Health Record Systems 2019/5/12

Health Information Technology for Economic and Clinical Health Act (HITECH) 2019/5/12 The Health Information Technology for Economic and Clinical Health (HITECH) Act seeks to improve American health care delivery and patient care through an unprecedented investment in Health IT (HIT). Meaningful use The main components of Meaningful Use are: The use of a certified EHR in a meaningful manner, such as e-prescribing. The use of certified EHR technology for electronic exchange of health information to improve quality of health care. The use of certified EHR technology to submit clinical quality and other measures.

Standardized Administrative Billing Codes 2019/5/12

ICD9 2019/5/12 database

ICD10 2019/5/12

ICD International Classification Of Diseases WHO(1940) Pathogen、Pathology、 Clinical、 Anatomical location Latest version is ICD10 http://icd9cm.chrisendres.com/index.php 2019/5/12

Genome-wide Association Study 2019/5/12 Genome-wide Association Study 間單 準確的CSM

Genome-wide association study Basic idea: Screen and test the whole genome for associations with a disease Use genotyping technologies to assay100,000s of SNPs and relate them to disease GWAS rely on ”common disease, common variant (of SNP) Basic idea: A variant (of SNP) has to be common in order to cause a common disease However, many important disease- causing variants may be rarer than this,and are thus unlikely to be detected with the GWAS approach GWA studies typically identify common variants with small effect sizes (“common” means >1-5% of population must have the genetic variant )   2019/5/12

GWAS has typically 4 steps 1. Selection of large number of individuals with disease, and a comparison group 2. DNA genotyping and quality control (QC) of assays and data, check for genotyping errors 3. Statistical tests for association between the SNPs (those passing the quality control) and the disease (one test for each SNP, multiple testing…) 4. Replication of identified associations in an independent population sample and examination of functionality of identified SNPs 2019/5/12

Limitations to GWAS False-positive results (false associations found) False-negative results (true associations missed) Insensitivity to rare SNP variants Requirement for large sample sizes Genotyping errors 2019/5/12

( Phenome-wide association scan ) PheWAS ( Phenome-wide association scan ) ● What disease associations can we make with a given gene?  GWAS 2019/5/12 ● Main information sources are genomic databases and electronic medical records (EMR) ● <20 susceptibility genes variants are responsible for >50% of the disease What gene is associated with a given disease?(GWAS) Scientists believe that human disease results from complex interactions between genes and environmental risk factors, and that variants from a few (<20) susceptibility genes variants are responsible for >50% of this disease burden. Hence, given a specific limited set of genes (or polymorphisms) scientists can determine the associations with phenomic markers such as, CPT codes, medications, ICD-9 codes. ● Potential gene-disease associations could be dicovered

Introduction PheWAS use readily available billing code to replicate several known genotype-phenotype association and suggests novel possible association. Typical EMR system contain diverse data sources, including billing data , laboratory and imaging result, medication records and clinical documentation. Structured data V.S. Unstructured data. 2019/5/12

Method Selected 5 SNPs which are associated with 7 conditions: atrial fibrillation, coronary artery disease, carotid artery stenosis, multiple sclerosis, Crohn’s disease, rheumatoid arthritis, and systemic lupus erythematosus. Using ICD-9 coding to link genetic data to the phenome. 2019/5/12 coronary artery disease冠狀動脈疾病 atrial fibrillation心房顫動 carotid artery stenosis頸內動脈狹窄 multiple sclerosis多發性硬化症 Crohn’s disease克羅恩病 rheumatoid arthritis類風濕關節炎的 systemic lupus erythematosus系統性紅斑狼瘡

Null hypothesis(Ho): 擁有基因型’rs3135388’和擁有’MS’,’SLE’ 疾病的是不同組人 5.558 (p-value=2.77*10^-6) 2019/5/12 P-value(MS多發性硬化症)=2.77*10^-6 , P-value(SLE紅斑性狼蒼)=0.51(圖為0.29) 先行chi square test,再做假設檢定,設p-value=0.05,選擇可能情況中p-value最小的, 連結到MS,但卻無法連結到SLE,表示PheWAS並非一定成功。 藍虛線為bonferroni correction,是多重比較時為避免type1 error過高,而做的修正。 P-value = 0.05

Result The PheWAS was able to detect 4 out of the 7 known SNP-disease associations (p<0.05). The PheWAS picked up an additional 19 unknown associations between the SNPs and diseases 2019/5/12 1.預計從5個SNP找到7個已知的相對應疾病,但卻只成功找到四個。 2.有趣的是用phewas方式竟找到19個意料之外的答案,證明用此方法比起重統的GWAS更能找出新的SNP和disease對應關西。

EMR (ICD-9) regrouping analysis 2019/5/12 The ICD9 coding system describes diseases, signs and symptoms, injuries,poisonings, procedures and screening codes Since the ICD9 terminology was designed primarily for billing and administrative functions, we developed custom ‘case groups’ of ICD9 codes to better allow for large-scale genomic research involving ICD9 codes. we combined three-digit codes that represented common etiologies e.g. creating a single ‘arrhythmias’ code group

Thank You for Your Attention!!!!!! 2019/5/12