Alternative Splicing QTLs in European and African Populations

Slides:



Advertisements
Similar presentations
Alternative Splicing QTLs in European and African Populations Halit Ongen, Emmanouil T. Dermitzakis The American Journal of Human Genetics Volume 97, Issue.
Advertisements

Test for Goodness of Fit
The Structure of Common Genetic Variation in United States Populations
Katarzyna Bryc, Eric Y. Durand, J
Comparison of Clinical Targeted Next-Generation Sequence Data from Formalin-Fixed and Fresh-Frozen Tissue Specimens  David H. Spencer, Jennifer K. Sehn,
Comprehensively Evaluating cis-Regulatory Variation in the Human Prostate Transcriptome by Using Gene-Level Allele-Specific Expression  Nicholas B. Larson,
CYP3A Variation and the Evolution of Salt-Sensitivity Variants
Katarzyna Bryc, Eric Y. Durand, J
Disentangling the Effects of Colocalizing Genomic Annotations to Functionally Prioritize Non-coding Variants within Complex-Trait Loci  Gosia Trynka,
Christopher R. Cabanski, Vincent Magrini, Malachi Griffith, Obi L
Michael Cullen, Stephen P
Model-free Estimation of Recent Genetic Relatedness
Discovery and Characterization of piRNAs in the Human Fetal Ovary
Reliable Identification of Genomic Variants from RNA-Seq Data
Volume 19, Issue 3, Pages (April 2017)
Volume 146, Issue 6, Pages (September 2011)
Haplotype Estimation Using Sequencing Reads
Estimating Kinship in Admixed Populations
Identification and Validation of Genetic Variants that Influence Transcription Factor and Cell Signaling Protein Levels  Ronald J. Hause, Amy L. Stark,
Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion  Davide Marnetto, Federica Mantica, Ivan Molineris, Elena Grassi, Igor.
Brian K. Maples, Simon Gravel, Eimear E. Kenny, Carlos D. Bustamante 
Understanding Tissue-Specific Gene Regulation
Adrien Le Thomas, Georgi K. Marinov, Alexei A. Aravin  Cell Reports 
Mapping Whole-Transcriptome Splicing in Mouse Hematopoietic Stem Cells
Gene-Expression Variation Within and Among Human Populations
Biased Gene Conversion Skews Allele Frequencies in Human Populations, Increasing the Disease Burden of Recessive Alleles  Joseph Lachance, Sarah A. Tishkoff 
Widespread Inhibition of Posttranscriptional Splicing Shapes the Cellular Transcriptome following Heat Shock  Reut Shalgi, Jessica A. Hurt, Susan Lindquist,
Relationship between Deleterious Variation, Genomic Autozygosity, and Disease Risk: Insights from The 1000 Genomes Project  Trevor J. Pemberton, Zachary.
Volume 154, Issue 1, Pages (July 2013)
Evidence for Widespread Reticulate Evolution within Human Duplicons
Volume 3, Issue 1, Pages (July 2016)
Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes  Matthieu Deschamps, Guillaume Laval,
Variant Association Tools for Quality Control and Analysis of Large-Scale Sequence and Genotyping Array Data  Gao T. Wang, Bo Peng, Suzanne M. Leal  The.
Integrative Multi-omic Analysis of Human Platelet eQTLs Reveals Alternative Start Site in Mitofusin 2  Lukas M. Simon, Edward S. Chen, Leonard C. Edelstein,
Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project  Paul L. Auer, Alex.
Volume 173, Issue 1, Pages e9 (March 2018)
Biased Allelic Expression in Human Primary Fibroblast Single Cells
Xing Hua, Haiming Xu, Yaning Yang, Jun Zhu, Pengyuan Liu, Yan Lu 
CYP3A Variation and the Evolution of Salt-Sensitivity Variants
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways  Nathan Archer, Mark D. Walsh, Vahid Shahrezaei,
Daniel F. Tardiff, Scott A. Lacadie, Michael Rosbash  Molecular Cell 
Volume 14, Issue 7, Pages (February 2016)
Volume 17, Issue 12, Pages (December 2016)
Xin Li, Alexis Battle, Konrad J. Karczewski, Zach Zappala, David A
Robust Inference of Identity by Descent from Exome-Sequencing Data
Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS  Xin He, Chris K. Fuller, Yi Song, Qingying Meng, Bin Zhang,
Catarina D. Campbell, Nick Sampas, Anya Tsalenko, Peter H
Mamoru Kato, Yusuke Nakamura, Tatsuhiko Tsunoda 
Matthieu Foll, Oscar E. Gaggiotti, Josephine T
Volume 66, Issue 1, Pages e6 (April 2017)
Molecular Convergence of Neurodevelopmental Disorders
E. Wang, Y. -C. Ding, P. Flodman, J. R. Kidd, K. K. Kidd, D. L
A Clinically Validated Diagnostic Second-Generation Sequencing Assay for Detection of Hereditary BRCA1 and BRCA2 Mutations  Ian E. Bosdet, T. Roderick.
Diego Calderon, Anand Bhaskar, David A
Highly Punctuated Patterns of Population Structure on the X Chromosome and Implications for African Evolutionary History  Charla A. Lambert, Caitlin F.
An Efficient Multiple-Testing Adjustment for eQTL Studies that Accounts for Linkage Disequilibrium between Variants  Joe R. Davis, Laure Fresard, David A.
Complete Haplotype Sequence of the Human Immunoglobulin Heavy-Chain Variable, Diversity, and Joining Genes and Characterization of Allelic and Copy-Number.
A Fast, Powerful Method for Detecting Identity by Descent
Xiaoquan Wen, Yeji Lee, Francesca Luca, Roger Pique-Regi 
Wei Pan, Il-Youp Kwak, Peng Wei  The American Journal of Human Genetics 
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways  Nathan Archer, Mark D. Walsh, Vahid Shahrezaei,
Trevor J. Pemberton, Chaolong Wang, Jun Z. Li, Noah A. Rosenberg 
Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data  Zihuai.
Complex History of Admixture between Modern Humans and Neandertals
Genetic and Epigenetic Regulation of Human lincRNA Gene Expression
Xing Hua, Haiming Xu, Yaning Yang, Jun Zhu, Pengyuan Liu, Yan Lu 
Volume 11, Issue 7, Pages (May 2015)
Adiposity-Dependent Regulatory Effects on Multi-tissue Transcriptomes
Brian C. Verrelli, Sarah A. Tishkoff 
Integrated analysis of gene expression and copy number alterations.
Presentation transcript:

Alternative Splicing QTLs in European and African Populations Halit Ongen, Emmanouil T. Dermitzakis  The American Journal of Human Genetics  Volume 97, Issue 4, Pages 567-575 (October 2015) DOI: 10.1016/j.ajhg.2015.09.004 Copyright © 2015 The Authors Terms and Conditions

Figure 1 Schematic of the Altrans Algorithm (A) Overlapping exons are grouped into exon groups where identical exons belonging to multiple transcripts are treated as one unique entity. Two transcripts, shown as connected brown and green boxes, result in two exon groups and three exons shown as blue boxes. Next, the unique regions of each exon, depicted as light blue boxes and a subscript u followed by the level of the exon, are identified. Because E2 has a region that is not shared by any other exon, it is assigned a “level” of 1, and the reads aligning to E2u,1 can be unambiguously assigned to E2. E1 does not have a unique portion, and therefore the level 1 exon, E2, is removed from the exon group and the whole of E1 becomes a unique portion, shown as an empty blue box, with a level of 2. These unique regions are used when assigning mate pairs to links as shown with the red lines where the solid portions of the line are the sequenced mates and the dashed part represents the inferred insert. (B) The default method for calculating link coverage. Link coverage is necessary to normalize the observed counts for the length of the unique portions being linked and the insert size. The theoretical minimum and maximum insert sizes linking the two unique portions, represented as brown and green lines, respectively, are calculated and given the empirically determined insert size distribution, and the area under the curve between the minimum and maximum insert sizes is estimated. The link coverage equals the number of mate pairs linking the two unique portions over the ratio of this area to the area of the whole insert size distribution. (C) The degrees of freedom method for determining link coverage. Here given a read length and insert size of 3 and two exons that are 6 and 5 bases long, there are three mate pair alignments that can link these two exons. Therefore, the degrees of freedom refer to the theoretical number of positions where a mate pair (given, in this case, 3+3+3 = 9 bp long fragment size) exists that links these exons on the mRNA, shown as black lines. The link coverage is the number of mate pairs linking the exons over the degrees of freedom. (D) The equation to calculate F value for a link. (E) A worked example of calculation of the F values. First the coverage of E2 to E3 link (CE2 − E3) is determined from level 1 unique regions (CE2u,1 − E3u,1), which is then subtracted from the coverage attained from the pseudo-unique E1 to E3 link (CE1u,2 − E3u,1) in order to calculate the true E1 − E3 coverage (CE1 − E3). In the forward direction, E1 and E2 become primary exons and in the reverse direction E3 is the primary exon and the corresponding F values are calculated as shown. The American Journal of Human Genetics 2015 97, 567-575DOI: (10.1016/j.ajhg.2015.09.004) Copyright © 2015 The Authors Terms and Conditions

Figure 2 Simulation Results Using Flux Simulator, we ran six simulations with varying levels of unannotated transcripts. Subsequently, we ran quantifications with three methods with the known GENCODE v.12 annotation. We compared the simulated versus measured link quantifications via Spearman’s rank correlation. These comparisons are shown as colored solid lines. In order to produce a null random distribution for each method, we took the link quantifications for each gene, permutated these for 100 times within the links of this gene, and measured the correlation of these random assignments with the simulated ones. By using this sampling method stratified by genes, we account for the variability of number of isoforms per gene. These correlations for random assignments are shown as dashed lines. We observe that as the percentage of novel transcripts increase, the performance of Cufflinks and MISO suffer, whereas this is not the case for Altrans, which results in best quantifications with increased levels of unannotated transcripts. The American Journal of Human Genetics 2015 97, 567-575DOI: (10.1016/j.ajhg.2015.09.004) Copyright © 2015 The Authors Terms and Conditions

Figure 3 asQTL Discovery (A) The relative distance of asQTLs to the TSS versus the p value. (B) Mosaic plots of gene level sharing of asQTLs for each method at FDR = 1%. (C) The p value distributions of a variant-link pair tested in the other population for each method. From these p value distributions, the π1 statistic is calculated that estimates the proportion of true positives. The American Journal of Human Genetics 2015 97, 567-575DOI: (10.1016/j.ajhg.2015.09.004) Copyright © 2015 The Authors Terms and Conditions

Figure 4 Functional Enrichments of asQTLs Discovered by Altrans and Cufflinks All variants identified in separate populations are merged. The null (frequency and distance matched) is represented as the black horizontal line. The numbers above each bar are the −log10 p values of the enrichment, Altrans enrichment p value followed by Cufflinks p value. The American Journal of Human Genetics 2015 97, 567-575DOI: (10.1016/j.ajhg.2015.09.004) Copyright © 2015 The Authors Terms and Conditions