Linkage Disequilibrium and Heritability of Copy-Number Polymorphisms within Duplicated Regions of the Human Genome  Devin P. Locke, Andrew J. Sharp, Steven.

Slides:



Advertisements
Similar presentations
Recurrent Reciprocal Genomic Rearrangements of 17q12 Are Associated with Renal Disease, Diabetes, and Epilepsy  Heather C. Mefford, Séverine Clauin, Andrew.
Advertisements

Mosaic Uniparental Disomies and Aneuploidies as Large Structural Variants of the Human Genome  Benjamín Rodríguez-Santiago, Núria Malats, Nathaniel Rothman,
Blanca E. Himes, Gary M. Hunninghake, James W. Baurley, Nicholas M
Comprehensively Evaluating cis-Regulatory Variation in the Human Prostate Transcriptome by Using Gene-Level Allele-Specific Expression  Nicholas B. Larson,
High-Resolution Genomic Profiling of Disseminated Tumor Cells in Prostate Cancer  Yu Wu, Jamie R. Schoenborn, Colm Morrissey, Jing Xia, Sandy Larson, Lisha.
Nonrecurrent 17p11.2p12 Rearrangement Events that Result in Two Concomitant Genomic Disorders: The PMP22-RAI1 Contiguous Gene Duplication Syndrome  Bo.
Tracy I. George, Joanna E. Wrede, Charles D. Bangs, Athena M
Genomic Patterns of Homozygosity in Worldwide Human Populations
Kendy K. Wong, Ronald J. deLeeuw, Nirpjit S. Dosanjh, Lindsey R
Microarray-Based Comparative Genomic Hybridization Using Sex-Matched Reference DNA Provides Greater Sensitivity for Detection of Sex Chromosome Imbalances.
Harvey A. Greisman, Noah G. Hoffman, Hye Son Yi 
Refinement and Discovery of New Hotspots of Copy-Number Variation Associated with Autism Spectrum Disorder  Santhosh Girirajan, Megan Y. Dennis, Carl.
Resolving the Breakpoints of the 17q21
Genetic-Variation-Driven Gene-Expression Changes Highlight Genes with Important Functions for Kidney Disease  Yi-An Ko, Huiguang Yi, Chengxiang Qiu, Shizheng.
Jianbin Wang, H. Christina Fan, Barry Behr, Stephen R. Quake  Cell 
Angela Leo, Andrew M. Walker, Matthew S
Diagnostic Genome Profiling in Mental Retardation
Volume 8, Issue 5, Pages (September 2014)
Volume 70, Issue 5, Pages (June 2011)
A complex phenotype with cystic renal disease
Michael Cullen, Stephen P
Customized Oligonucleotide Array-Based Comparative Genomic Hybridization as a Clinical Assay for Genomic Profiling of Chronic Lymphocytic Leukemia  Rachel.
High-Resolution Mapping of Genotype-Phenotype Relationships in Cri du Chat Syndrome Using Array Comparative Genomic Hybridization  Xiaoxiao Zhang, Antoine.
Volume 7, Issue 5, Pages (November 2016)
Model of segmental duplication Acceptor regions of the genome acquire segments of genomic material that range from 1–200 kb from disparate regions.
Microarray Techniques to Analyze Copy-Number Alterations in Genomic DNA: Array Comparative Genomic Hybridization and Single-Nucleotide Polymorphism Array 
The Fine-Scale and Complex Architecture of Human Copy-Number Variation
Genome-wide Transcriptome Profiling Reveals the Functional Impact of Rare De Novo and Recurrent CNVs in Autism Spectrum Disorders  Rui Luo, Stephan J.
Brian K. Maples, Simon Gravel, Eimear E. Kenny, Carlos D. Bustamante 
A Multiplex qPCR Gene Dosage Assay for Rapid Genotyping and Large-Scale Population Screening for Deletional α-Thalassemia  Wanjun Zhou, Ge Wang, Xuefeng.
Volume 7, Issue 1, Pages (April 2014)
A Gene Mutated in Nephronophthisis and Retinitis Pigmentosa Encodes a Novel Protein, Nephroretinin, Conserved in Evolution  Edgar Otto, Julia Hoefele,
Gene-Expression Variation Within and Among Human Populations
AZFc Deletions and Spermatogenic Failure: A Population-Based Survey of 20,000 Y Chromosomes  Steven G. Rozen, Janet D. Marszalek, Kathryn Irenze, Helen.
Volume 70, Issue 5, Pages (June 2011)
Jingjing Li, Xiumei Hong, Sam Mesiano, Louis J
Microdeletions of 3q29 Confer High Risk for Schizophrenia
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
Recurrent 10q22-q23 Deletions: A Genomic Disorder on 10q Associated with Cognitive and Behavioral Abnormalities  Jorune Balciuniene, Ningping Feng, Kelly.
Integrative Multi-omic Analysis of Human Platelet eQTLs Reveals Alternative Start Site in Mitofusin 2  Lukas M. Simon, Edward S. Chen, Leonard C. Edelstein,
Nonrecurrent 17p11.2p12 Rearrangement Events that Result in Two Concomitant Genomic Disorders: The PMP22-RAI1 Contiguous Gene Duplication Syndrome  Bo.
Bassem A. Bejjani, Lisa G. Shaffer 
Recurrent Reciprocal Genomic Rearrangements of 17q12 Are Associated with Renal Disease, Diabetes, and Epilepsy  Heather C. Mefford, Séverine Clauin, Andrew.
Catarina D. Campbell, Nick Sampas, Anya Tsalenko, Peter H
Mamoru Kato, Yusuke Nakamura, Tatsuhiko Tsunoda 
Defining Ploidy-Specific Thresholds in Array Comparative Genomic Hybridization to Improve the Sensitivity of Detection of Single Copy Alterations in Cell.
Copy-Number Variations Measured by Single-Nucleotide–Polymorphism Oligonucleotide Arrays in Patients with Mental Retardation  Janine Wagenstaller, Stephanie.
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Jeffrey A. Fawcett, Hideki Innan  Trends in Genetics 
High-Resolution Molecular Characterization of 15q11-q13 Rearrangements by Array Comparative Genomic Hybridization (Array CGH) with Detection of Gene Dosage 
Pier Francesco Palamara, Laurent C. Francioli, Peter R
Adaptive Evolution of UGT2B17 Copy-Number Variation
A DNA Replication Mechanism for Generating Nonrecurrent Rearrangements Associated with Genomic Disorders  Jennifer A. Lee, Claudia M.B. Carvalho, James.
High-Throughput Analysis of Subtelomeric Chromosome Rearrangements by Use of Array-Based Comparative Genomic Hybridization  Joris A. Veltman, Eric F.P.M.
Highly Punctuated Patterns of Population Structure on the X Chromosome and Implications for African Evolutionary History  Charla A. Lambert, Caitlin F.
Complete Haplotype Sequence of the Human Immunoglobulin Heavy-Chain Variable, Diversity, and Joining Genes and Characterization of Allelic and Copy-Number.
Pei-Lung Chen, Dimitrios Avramopoulos, Virginia K. Lasseter, John A
Mechanisms for Nonrecurrent Genomic Rearrangements Associated with CMT1A or HNPP: Rare CNVs as a Cause for Missing Heritability  Feng Zhang, Pavel Seeman,
Jared R. Kohler, David J. Cutler 
Volume 110, Issue 4, Pages (August 2002)
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
The Variant inv(2)(p11.2q13) Is a Genuinely Recurrent Rearrangement but Displays Some Breakpoint Heterogeneity  Ina Fickelscher, Thomas Liehr, Kathryn.
Worldwide Population Analysis of the 4q and 10q Subtelomeres Identifies Only Four Discrete Interchromosomal Sequence Transfers in Human Evolution  Richard.
A Definitive Haplotype Map as Determined by Genotyping Duplicated Haploid Genomes Finds a Predominant Haplotype Preference at Copy-Number Variation Events 
Annabel C. Whibley, Vincent Plagnol, Patrick S
Harrison Brand, Ryan L. Collins, Carrie Hanscom, Jill A
C. E. Browne, N. R. Dennis, E. Maher, F. L. Long, J. C. Nicholson, J
Development of a Novel Next-Generation Sequencing Assay for Carrier Screening in Old Order Amish and Mennonite Populations of Pennsylvania  Erin L. Crowgey,
Mapping of Small RNAs in the Human ENCODE Regions
Beyond GWASs: Illuminating the Dark Road from Association to Function
Presentation transcript:

Linkage Disequilibrium and Heritability of Copy-Number Polymorphisms within Duplicated Regions of the Human Genome  Devin P. Locke, Andrew J. Sharp, Steven A. McCarroll, Sean D. McGrath, Tera L. Newman, Ze Cheng, Stuart Schwartz, Donna G. Albertson, Daniel Pinkel, David M. Altshuler, Evan E. Eichler  The American Journal of Human Genetics  Volume 79, Issue 2, Pages 275-290 (August 2006) DOI: 10.1086/505653 Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 1 Array CGH profiles of aneuploid samples. Six samples profiled in this study produced array profiles indicative of an aneuploidy event that most likely occurred during the proliferation of the cell-line source of these samples. If the profile for an entire chromosome appeared affected, those data points were removed from further analysis. In most cases, only a single chromosome was affected. The red and green horizontal lines on each plot indicate the ±0.3 and ±0.5 log2 ratio thresholds, respectively, and are a visual aid for determining the extent of duplication or deletion, since our SD cutoff method typically results in a threshold in the ±0.2–0.4 range. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 2 Frequency spectrum of novel and previously identified segmental duplication–array variants. The frequencies of a total of 257 autosomal BAC variants observed in multiple individuals are shown, 149 previously described variants1–3,13 (A), and 108 novel variants (B). Sites are coded by HapMap sample subpopulation. Note that only 11 novel variants were observed with an ACNF >10%, indicating that the majority of large, common CNPs in duplicated regions have likely been described. No common variants (ACNF >10%) were population specific, suggesting that these CNVs either predate the dispersal of the assayed populations or are recurrent events (population-specific variants >2.5% ACNF are indicated with an asterisk [*]). Only variants observed in two or more individuals are shown. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 3 CNP ACNF and corroboration with other data sources. When stratified by frequency, the variants more commonly seen in the HapMap population samples were more often supported by other sources of CNP data, such as previous reports,1–3,13 and by our own comparison with oligonucleotide-array data. This is partly because of the nature of rare variants, in that they are less likely to be seen by other studies of small populations, but also implies that a proportion may be false-positive results and should be considered with more caution until sequence data are generated to fully document the variant in question. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 4 Frequency differences between gain and loss CNPs. Of the total set of CNPs that had been corroborated by additional experiments or previously published reports (n=194), the gain (n=66) and loss (n=66) CNPs were divided into bins on the basis of ACNF with a threshold of 10%. A significant reduction in the number of sequence-loss CNPs was observed in the bin of common variants. This bias against common copy-number losses suggests that deletions are subject to purifying selection. Similar observations have been reported for smaller deletion variants (median size ∼7.0 kb).14 The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 5 Fine structure variation of a common CNP. The CNP initially observed in BAC RP11-142H6 was observed as a common variant found in all subgroups of the HapMap population samples (ACNF 0.46). BAC-based array CGH results (top panel) for eight BAC clones spanning ∼1.5 Mb of chromosome 19 show continuous variability. Fine-tiling oligonucleotide array CGH profiles of the same individuals confirms subtle copy-number differences over the 156-kb interval encompassed by RP11-142H6. Note the close correlation between the log2 hybridization data from BAC-based array CGH and oligonucleotide microarray data. The recurrent pattern of log2 ratio data corresponds to a tandem organization of a 60-kb region composed of intrachromosomal segmental duplication (gray bars). Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 6 Oligonucleotide array data underlying CNPs detected via BAC array. Each plot displays the oligonucleotide hybridization data for a subset of samples tested. The BAC ID and build 34 interval for that BAC are indicated at the top left corner of each plot. The average log2 ratio determined by BAC array CGH is listed in parentheses within the graph key. The tracks depicting segmental duplications detected using a whole-genome depth of coverage statistic (WSSD) and known genes, extracted from the UCSC Genome Browser, are shown below each plot. Note that a three-point moving average of the oligonucleotide data is plotted. Avg. = average. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 7 Continuous versus discrete CNPs. A comparison of log2 relative hybridization intensity ratio values between primary and replicate experiment for CEU and YRI pedigree samples distinguished continuously (A) and discretely (B) variable CNPs. A discrete CNP was defined as one in which two, three, or four distinct, well-separated genotype classes could be distinguished. Note that, unlike for SNPs, the genotype of the offspring (YRI family 71) can be a novel log2 ratio category, presumably because of the additive effect of the inheritance of two alleles with high copy number. More-common scenarios, such as a parent and child sharing a genotype (YRI family 24), were also observed. For continuous distributions, the additive effect seen for discrete sites was also observed (CEU family 1341), as well as the case in which the offspring log2 ratio value appeared to be intermediate beween the parental values (CEU family 1444). Therefore, each CNP can have a complex inheritance pattern within any given set of trios. The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 8 LD at a continuous site of CNP variation. A, Multiple tracks (top to bottom) depicting the distribution of R2 for SNPs within a 200-kb region flanking the CNP variant, the ratio of observed:expected heterozygosity (HETobs/HETexp), the location of SNP assays (triangles), the position of segmental duplications (Seg Dups) (gray bars, with pairwise relationships indicated by joining lines), and the position of the BAC array CGH probe (dark gray box). The best available SNP, rs1208143, was selected on the basis of a permutation test of all SNPs within an interval encompassing 200 kb proximal and distal of the BAC coordinates. Note the reduced SNP density in the BAC interval, along with increased heterozygosity, likely because of the presence of segmental duplications.20B, The number of SNP assays per kilobase, plotted with respect to the percentage of similarity of segmental duplications. The trend clearly indicates a reduction in the number of SNP assays, with increasing homology in duplicated space. This trend implies that the regions most prone to complex rearrangement are less likely to have dense SNP coverage and thus are less apt to associate with a nearby SNP. In addition, methods for the identification of CNPs based on the analysis of SNP data have reduced power to detect structural variations in regions of segmental duplication.6,7,14 The American Journal of Human Genetics 2006 79, 275-290DOI: (10.1086/505653) Copyright © 2006 The American Society of Human Genetics Terms and Conditions