LCA1 Erman Ayday, Jean Louis Raisaro and Jean-Pierre Hubaux Privacy-Enhancing Technologies for Medical Tests and Personalized Medicine Laboratory for Computer.

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

The Human Genome Project
CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Which Phenotypes Can be Predicted from a Genome Wide Scan of Single Nucleotide Polymorphisms (SNPs): Ethnicity vs. Breast Cancer Mohsen Hajiloo, Russell.
Maryam Nazir. Personal Genomics:  Branch of genomics concerned with the sequencing and analysis of the genome of an individual  Once sequenced, it can.
Personal genetics education project “Life With a Lethal Gene” New York Times, March 18, 2007 How did she make her decision to be tested? How did she feel.
1 Single Nucleotide Polymorphisms (SNP) Gary Jones SPE, Technology Center 1600 (703)
Direct to Consumer Testing Issues. Types of tests being sold Diagnostic tests – identify specific conditions Carrier testing - identify those who carry.
UTEPComputer Science Dept.1 University of Texas at El Paso Privacy in Statistical Databases Dr. Luc Longpré Computer Science Department Spring 2006.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Human Genetics Overview.
Inference of Genealogies for Recombinant SNP Sequences in Populations Yufeng Wu Computer Science and Engineering Department University of Connecticut
Introduction to Molecular Epidemiology Jan Dorman, PhD University of Pittsburgh School of Nursing
1 © 2012 Oracle Corporation – Proprietary and Confidential Big Data Challenge – From EMR’s to Translational Research Miroslav Končar, PhD Oracle Healthcare.
DbSNP: the NCBI database of genetic variation S. T. Sherry, M.H. Ward, M. Kholodov, J. Baker, L. Phan, E. M. Smigielski and K. Sirotkin, Nucleic Acids.
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
What is the Human Genome Project? Identify all the approximately 35,000 genes in human DNA Determine the sequences of the 3,000,000,000 bases ( = 200 phone.
Core Laboratory of Population Genetic Polymorphisms 族群遺傳核心實驗室 負責人:陳為堅 于明暉 National Taiwan University Center for Genomic Medicine 台大基因體醫學中心.
The Human Genome Project Jacob D. Schroeder Department of Chemistry and Chemical Engineering South Dakota School of Mines and Technology Rapid City, SD.
Pharmacogenomics. Developing drugs on the basis of individual genetic differences Tailoring therapies to genetically similar subpopulations results in.
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
Cluster-based SNP Calling on Large Scale Genome Sequencing Data Mucahid KutluGagan Agrawal Department of Computer Science and Engineering The Ohio State.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
m-Privacy for Collaborative Data Publishing
Internet2 Health Sciences Security Jere Retzer, OHSU March 7, 2001.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Holistic Privacy From Location Privacy to Genomic Privacy Jean-Pierre Hubaux With contributions from E. Ayday, M. Humbert, J.-Y. Le Boudec, J.-L. Raisaro,
A gene is a particular sequence (a string) of nucleotides on a particular site of a chromosome. It is made up of combinations of A, T, C, and G. These.
Personalized Medicine Dr. M. Jawad Hassan. Personalized Medicine Human Genome and SNPs What is personalized medicine? Pharmacogenetics Case study – warfarin.
High dimensional genomic data, identifiability, and query-response Haixu Tang School of Informatics and Computing Indiana University, Bloomington.
HUMAN GENOME PROJECT International effort of 13 years (1990 – 2003) Identified all the approximate 20,000 – 25,000 genes in human DNA Determined the sequences.
Rachel Liao, PhD Coordinator of the Clinical Working Group and the BRCA Challenge demonstration project for the Global Alliance for Genomics and Health.
SHARe. What is WHI-SHARe? SHARe – stands for SNP Health Association Resource. It is an NHLBI program for genome-wide association studies (GWAS). GWAS.
Human Genetics and Genetic Technology Using Genetic InformationUsing Genetic Information.
Time to Encrypt our DNA? Stuart Bradley Humbert, M., Huguenin, K., Hugonot, J., Ayday, E., Hubaux, J. (2015). De-anonymizing genomic databases using phenotypic.
m-Privacy for Collaborative Data Publishing
The International Consortium. The International HapMap Project.
Use of Data from the Electronic Medical Record in Research Opportunities & Pit Falls Kristin West.
The Human Genome Project
Kathleen Giacomini, Mark J. Ratain, Michiaki Kubo, Naoyuki Kamatani, and Yusuke Nakamura NIH Pharmacogenomics Research Network III & RIKEN Center for Genomic.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
The Human Genome Project
GenoGuard: Protecting Genomic Data against Brute-Force Attacks Zhicong Huang, Erman Ayday, Jacques Fellay, Jean-Pierre Hubaux, Ari Juels Presented by Chuong.
What did I inherit from my parents? Today’s Objectives: Review basic genetic concepts Create family health tree Identify patterns suggesting genetic link.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
BY S.S.SUDHEER VARMA (13NT1D5816)
Single Nucleotide Polymorphisms (SNPs
Nucleotide variation in the human genome
GenoGuard: Protecting Genomic Data against Brute-Force Attacks
Privacy-Preserving Clustering
Introduction to Direct-to-Consumer Genetic Testing
X Chromosome Inactivation
INFORMATION SECURITY The protection of information from accidental or intentional misuse of a persons inside or outside an organization Comp 212 – Computer.
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Direct-To-Consumer Genetic Testing
Real-time Protection for Open Beacon Network
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Applications of DNA Analysis
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
The Human Genome Project
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Discovery From Data Repositories H Craig Mak  Nature Biotechnology 29, 46–47 (2011) 2013 /06 /10.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

LCA1 Erman Ayday, Jean Louis Raisaro and Jean-Pierre Hubaux Privacy-Enhancing Technologies for Medical Tests and Personalized Medicine Laboratory for Computer communications and Applications 1 1. Introduction and Goals 2. Threat Model 3. Genomic Background 4. Proposed Solution 5. Implementation and Computational Complexity 6. Discussion  Protect the privacy of users’ genomic data.  Protect the privacy of medical unit’s confidential data.  Allow specialists to access only to the genomic data they need (or they are authorized for).  Keep the access time to a single patient’s genomic data to a few seconds. 6) Markers related to disease X and their contributions 5) “Check my susceptibility to disease X” and part of P’s secret key, x (2) 3) Encrypted variants 8) End-result or related variants 7) Homomorphic operations and proxy encryption Patient (P) Medical Center (MC) 1) Sample Certified Institution Curious SPU Malicious 3 rd party Storage and Processing Unit (SPU) 2) Sequencing and encryption 4) Part of P’s secret key, x (1) Leakage of genomic data:  A curious party at the Storage and Processing Unit (SPU), who tries to infer the genomic sequence of a patient from his stored data.  A malicious medical unit, who can be considered either as an attacker that hacks into the medical unit’s system or a disgruntled employee, who happens to access the medical unit’s database.  Store patients’ genomic data encrypted by their public keys at the Storage and Processing Unit (SPU).  Process the encrypted genomic data for medical tests and personalized medicine methods using homomorphic encryption and proxy encryption. A full genome sequence not only uniquely identifies each one of us; it also contains information about, our ethnic heritage, disease predispositions, and many other phenotypic traits. The human genome has approximately 3 billion letters. Encourage the use of genomic data, by the individual and by the medical unit, and accelerate the move of genomics into clinical practice. Paillier Encryption Homomorphic operations Proxy encryption Storage Paillier decryption 30 ms. per variant 10 sec. (10 variants) 2 ms. 500xΩ MB per patient 26 ms.  Intel Core i7-2620M CPU with 2.70 GHz processor.  Size of the security parameter: 1024 bits.  Real SNP profiles from 1000 Genomes Project.  Java programming language. Threat:  Revelation of predisposition to diseases, ethnicity, paternity, filiation, etc.  Genetic discrimination.  Denial of access to health insurance, mortgage, education, and employment.  Single Nucleotide Polymorphisms (SNPs): DNA variations, occurring when a single nucleotide differs between members of the same species.  Potential nucleotides for a SNP position are called alleles.  A disease susceptibility test is done by analyzing particular SNPs.  Each SNP contributes to the disease susceptibility in a different amount.  40 million approved SNPs in the human population.  Each patient carries around 4 million SNPs out of 40 million – real SNPs of the patient.  75 real SNPs enable the attacker to identify a person. Collaboration:  EPFL - School of Computer and Communication Sciences.  EPFL - School of Life Sciences.  University Hospital of Lausanne (CHUV).  Sophiagenetics.com.