High dimensional genomic data, identifiability, and query-response Haixu Tang School of Informatics and Computing Indiana University, Bloomington.

Presentation transcript:


“Big Data” in Personal Genomics

Genomics is a key component of personalized medicine.
– Massive
  Large research-oriented projects: 1000 genomes to 10^6
  Genome sequencing for all newborns?
  Open data projects, e.g., the Personal Genome Project (PGP)
– Heterogeneous
  Genomic sequence (variations)
  Constant, dynamic monitoring: transcriptomics, proteomics, metabolomics, microbial communities, etc. (as demonstrated by iPOP)

Challenges in Personal Genomics

Analysis
– Personalized healthcare: detection of markers for diagnosis and treatment (pharmacogenomics)
– Research (secondary): discovery of markers
Sharing
– Personalized healthcare: sharing patient data among health practitioners; searching for successful treatment on similar patients (“patient like me”)
– Research (secondary): methodology development; validation of markers

Challenges: speed, storage, scalability, security
Solution: cloud, hybrid cloud, bring computing to the data!

Privacy Enhancing Technologies

Analysis
– Personalized healthcare: detection of markers for diagnosis and treatment (pharmacogenomics)
– Research (secondary): discovery of markers
Sharing
– Personalized healthcare: sharing patient data among health practitioners; searching for successful treatment on similar patients (“patient like me”)
– Research (secondary): methodology development; validation of markers

Cryptographic protocols: secure multiparty computation (SMC), homomorphic computation, functional encryption
Database security approaches: access control, query auditing, differential privacy
Ethics studies, informed consent, policy
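
Of the database security approaches listed on this slide, differential privacy is the easiest to illustrate in a few lines. The following is a minimal sketch, not taken from the slides, of the Laplace mechanism applied to one allele-count query. The function names, the 0/1/2 genotype encoding, and the sensitivity of 2 (one diploid participant changes the count by at most two alleles) are assumptions made for illustration.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_allele_count(genotypes, epsilon, rng=None):
    """Release a noisy allele count for a single SNP.

    genotypes: one value per participant, 0, 1 or 2 copies of the allele
               (an assumed encoding, not specified in the slides).
    epsilon:   privacy budget spent on this one query.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    rng = rng or random.Random()
    true_count = sum(genotypes)
    sensitivity = 2  # assumption: one diploid person shifts the count by <= 2
    return true_count + laplace_noise(sensitivity / epsilon, rng)


# Hypothetical cohort of six participants at one SNP.
cohort = [0, 1, 2, 1, 0, 2]
print(private_allele_count(cohort, epsilon=1.0, rng=random.Random(0)))
```

A smaller epsilon adds more noise, which is the security versus utility trade-off in its simplest form. Each additional released SNP spends more of the budget, which matters when millions of SNPs are involved.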

What is specific to genomic data?

Challenges
– Genome technologies evolve very fast!
– Genomic data are extremely high dimensional
  Millions of SNPs, easily identifiable
  Balance between data security and utility
– Not only the data, but also analysis results need to be protected
  Allele frequencies or test statistics (e.g., Homer’s attack)
Special properties
– Different dimensions are NOT independent
  Genetic structures (e.g., linkage disequilibrium)
– Specific genomic research focuses on a small number of dimensions (e.g., disease-associated SNPs)
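
To make the point about analysis results concrete, here is a rough sketch of the kind of membership test behind Homer’s attack: given only the allele frequencies published for a study pool, compare a target’s genotype with the pool and with a reference population across many SNPs. The per-SNP distance D_j = |Y_j - Pop_j| - |Y_j - M_j| and the one-sample t statistic follow the usual description of the attack. The simulation around it (independent SNPs, pool and reference of 20 people each, 10,000 SNPs) and all helper names are assumptions for illustration only.

```python
import math
import random
import statistics


def homer_statistic(individual, pool_freq, ref_freq):
    """t statistic over D_j = |Y_j - Pop_j| - |Y_j - M_j| across SNPs.

    individual: target allele frequencies per SNP (0, 0.5 or 1)
    pool_freq:  published allele frequencies of the study pool (M_j)
    ref_freq:   allele frequencies of a reference population (Pop_j)
    """
    d = [abs(y - pop) - abs(y - m)
         for y, m, pop in zip(individual, pool_freq, ref_freq)]
    sd = statistics.stdev(d)
    if sd == 0:
        raise ValueError("no variation across SNPs; statistic undefined")
    return statistics.fmean(d) / (sd / math.sqrt(len(d)))


def simulate_person(allele_freq, rng):
    # Hypothetical helper: two independent allele draws per SNP, ignoring
    # linkage disequilibrium, returned as dosage / 2.
    return [((rng.random() < p) + (rng.random() < p)) / 2 for p in allele_freq]


def mean_freq(people):
    # Per-SNP allele frequency of a group of simulated people.
    return [statistics.fmean(column) for column in zip(*people)]


rng = random.Random(0)
allele_freq = [rng.uniform(0.1, 0.9) for _ in range(10_000)]
pool = [simulate_person(allele_freq, rng) for _ in range(20)]
reference = [simulate_person(allele_freq, rng) for _ in range(20)]
outsider = simulate_person(allele_freq, rng)

pool_freq, ref_freq = mean_freq(pool), mean_freq(reference)
t_member = homer_statistic(pool[0], pool_freq, ref_freq)
t_outsider = homer_statistic(outsider, pool_freq, ref_freq)
print(f"member T = {t_member:.1f}, outsider T = {t_outsider:.1f}")
```

In this toy setting a pool member should score well above an outsider, because each SNP contributes a small but consistent signal and there are thousands of them. Real data violate the independence assumption made here, since neighbouring SNPs are correlated through linkage disequilibrium.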