Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach  Zhiyu Wan, Yevgeniy Vorobeychik, Weiyi Xia, Ellen Wright.

Presentation transcript:

Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach. Zhiyu Wan, Yevgeniy Vorobeychik, Weiyi Xia, Ellen Wright Clayton, Murat Kantarcioglu, Bradley Malin. The American Journal of Human Genetics, Volume 100, Issue 2, Pages 316-322 (February 2017). DOI: 10.1016/j.ajhg.2016.12.002. Copyright © 2017 American Society of Human Genetics.

Figure 1. A Comparison of Genomic Summary Data Sharing Policies for Participants in the SPHINX Program. The compared policies include (1) the single-nucleotide polymorphism (SNP) suppression policies, which rely only on hiding genomic regions (blue dots), (2) the existing SNP suppression policy, according to Sankararaman et al.'s approach (red circle), (3) the data use agreement (DUA) policy, which relies only on a legally enforceable contract (gold square), (4) the game theoretic policy, which allows for a combination of a DUA and SNP suppression in a Stackelberg framework (brown triangle), (5) the no-risk game theoretic policy, which ensures no attack is committed by the recipient (green outlined triangle), and (6) the no SNP suppression policy, which illustrates what transpires when no DUA or SNP suppression is applied (purple circle). Utility is directly related to the absolute difference between the minor allele frequencies of shared SNPs in the study and their known minor allele frequencies in the underlying reference population (a utility score of 1 is achieved when all SNPs are shared). Privacy is inversely related to risk, the likelihood that a recipient succeeds in compromising the privacy protection of targeted individuals (a privacy score of 1 is achieved when no attacks are successful, that is, when no risk exists). A higher payoff value represents a more desirable option. SPHINX, Sequence and Phenotype Integration Exchange.
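The utility and privacy scores described in the Figure 1 caption can be illustrated with a minimal Python sketch. The caption states only that utility is related to the absolute difference in minor allele frequencies (MAFs) and reaches 1 when all SNPs are shared, and that privacy is inversely related to risk. The normalization used below and the form privacy = 1 - risk are assumptions made for illustration; the function names and the sample frequencies are hypothetical and are not taken from the paper.

```python
def utility_score(study_maf, reference_maf, shared):
    """Share of the total |study - reference| MAF difference kept by shared SNPs.

    Assumption: utility is normalized by the total over all SNPs, so that
    sharing every SNP gives a score of 1 (as the caption requires).
    """
    diffs = [abs(s - r) for s, r in zip(study_maf, reference_maf)]
    total = sum(diffs)
    if total == 0:
        raise ValueError("study and reference MAFs are identical; score undefined")
    kept = sum(d for d, keep in zip(diffs, shared) if keep)
    return kept / total


def privacy_score(risk):
    """Assumed form: privacy = 1 - risk, so zero risk gives a score of 1."""
    return 1.0 - risk


# Hypothetical MAFs for three SNPs (illustrative values only).
study = [0.10, 0.30, 0.50]
reference = [0.20, 0.30, 0.10]

print(utility_score(study, reference, [True, True, True]))   # all SNPs shared
print(utility_score(study, reference, [True, True, False]))  # third SNP suppressed
print(privacy_score(0.0))                                    # no risk
```

Under this assumed normalization, suppressing a SNP whose study frequency differs strongly from the reference costs more utility than suppressing one that matches the reference.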

Figure 2. The Genomic Data Sharing Process. In this process, a genomic data sharing policy is made by the sharer (A), a recipient chooses to attack targets in the received data (B), and the overall payoffs that result are shown (C). SNP, single-nucleotide polymorphism; DUA, data use agreement.
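The three-step process in Figure 2 follows a leader-follower (Stackelberg) pattern: the sharer commits to a policy, the recipient responds, and payoffs follow. The sketch below is a toy illustration of that pattern in Python, not the authors' model. The `Policy` fields, the payoff formulas, the parameters `gain`, `cost`, and `loss`, and every number are hypothetical assumptions chosen only to show the mechanics.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    name: str
    utility: float       # assumed value of the shared data, in [0, 1]
    success_prob: float  # assumed chance that an attack on a target succeeds
    penalty: float       # assumed expected DUA penalty if the recipient attacks


def recipient_attacks(policy, gain, cost):
    """Step B: the recipient attacks only if the expected net benefit is positive."""
    return policy.success_prob * gain - policy.penalty - cost > 0


def sharer_payoff(policy, gain, cost, loss):
    """Step C: the sharer's utility minus the expected privacy loss if attacked."""
    attacked = recipient_attacks(policy, gain, cost)
    expected_loss = policy.success_prob * loss if attacked else 0.0
    return policy.utility - expected_loss


def best_policy(policies, gain, cost, loss):
    """Step A: the sharer anticipates the response and picks the best policy."""
    return max(policies, key=lambda p: sharer_payoff(p, gain, cost, loss))


# Hypothetical candidate policies (illustrative numbers, not results).
POLICIES = [
    Policy("no protection", utility=1.0, success_prob=0.6, penalty=0.0),
    Policy("SNP suppression only", utility=0.5, success_prob=0.05, penalty=0.0),
    Policy("DUA only", utility=1.0, success_prob=0.6, penalty=0.3),
    Policy("DUA + SNP suppression", utility=0.9, success_prob=0.3, penalty=0.3),
]

chosen = best_policy(POLICIES, gain=1.0, cost=0.1, loss=1.0)
print(chosen.name)
```

With these made-up numbers, a modest amount of suppression combined with a DUA penalty deters the attack while retaining most of the utility, which is the kind of trade-off the combined policy is meant to capture.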

Figure 3. Comparisons of Four Protection Policies for the SPHINX Program with a Varying Penalty against the Genomic Inference Attack. The compared policies include (1) the optimal game theoretic solution (brown lines), (2) the game theoretic solution that ensures no attack is successful (black lines), (3) the data use agreement (DUA) (yellow lines), and (4) the SNP suppression solution (blue lines) with no penalty. The overall payoff (the main graph on the right) is the result of combining (1) the privacy protection afforded to the targeted individuals (the upper graph on the left) and (2) the utility in the set of SNPs that are shared (the lower graph on the left). SPHINX, Sequence and Phenotype Integration Exchange.

Figure 4. Comparisons of Four Protection Policies for a Range of Genomic Data Sharing Programs with Varying Prior Probabilities against the Genomic Inference Attack. The compared policies include (1) the optimal game theoretic solution (brown bars filled with downward diagonal pattern), (2) the game theoretic solution that ensures no attack is successful (black bars with no fill), (3) the data use agreement (DUA) (gold bars filled with checkerboard pattern), and (4) the single-nucleotide polymorphism (SNP) suppression solution (blue bars with solid fill). The overall payoff (the main graph on the right) is the result of combining (1) the privacy protection afforded to the targeted individuals (the upper graph on the left) and (2) the utility in the set of SNPs that are shared (the lower graph on the left). PMI, Precision Medicine Initiative; MVP, Million Veteran Program; SPHINX, Sequence and Phenotype Integration Exchange; BioVU, de-identified biorepository of Vanderbilt University Medical Center; RDCRN, Rare Diseases Clinical Research Network.