Geuvadis RNAseq UNIGE Genetic regulatory variants

Slides:



Advertisements
Similar presentations
Geuvadis main paper figures & tables TL September 26 Very preliminary!
Advertisements

Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
SHI Meng. Abstract The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants,
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
Gene Expression Levels Are a Target of Recent Natural Selection in the Human Genome Mol. Biol. Evol. 26(3):649– Journal Club
Causes of regulatory variation in the human genome
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Genetics of gene expression Stephen Montgomery montgomerylab.stanford.edu Stanford University School of Medicine.
Bioinformatics at Molecular Epidemiology - new tools for identifying indels in sequencing data Kai Ye
Regulatory variation and eQTLs Chris Cotsapas
Peter Tsai Bioinformatics Institute, University of Auckland
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
Genome-Wide Association Studies Xiaole Shirley Liu Stat 115/215.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Ongoing microRNA data analyses Geuvadis meeting July 2012 Marc Friedländer and Esther Lizano Xavier Estivill lab Programme for Genes and Disease Center.
Identification of obesity-associated intergenic long noncoding RNAs
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007.
Special Topics in Genomics Lecture 1: Introduction Instructor: Hongkai Ji Department of Biostatistics
Department of Biomedical Informatics Bioinformatics and Genetics Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared Resource.
Geuvadis RNAseq analysis at UNIGE Analysis plans
Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques Min Wenwen
Geuvadis WP4: RNA sequencing Progress, Aims and Data Tuuli Lappalainen University of Geneva Geuvadis Analysis Group Meeting, April 16, 2012, Geneva.
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
Supplemental Figure 1A. A small fraction of genes were mapped to >=20 SNPs. Supplemental Figure 1B. The density of distance from the position of an associated.
SCRIPPS GENOME ADVISER Galina Erikson Senior Bioinformatics Programmer The Scripps Translational Science Institute Scripps Translational Science Institute.
Encode variation analysis. Analysis goals Quantify genetic variation in ENCODE regions Detect selective constraint in ENCODE features Develop rules for.
Transcriptomics Sequencing. over view The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non coding RNA produced.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Geuvadis Analysis Meeting 16/02/2012 Micha Sammeth CNAG – Barcelona.
Lecture 15 Regulatory variation and eQTLs Chris Cotsapas 6.047/6.878/HST.507 Computational Biology: Genomes, Networks, Evolution.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Loss of Function variants and Transcription
1 Paper Outline Specific Aim Background & Significance Research Description Potential Pitfalls and Alternate Approaches Class Paper: 5-7 pages (with figures)
Supplemental Figure 1. False trans association due to probe cross-hybridization and genetic polymorphism at single base extension site. (A) The Infinium.
Current Data And Future Analysis Thomas Wieland, Thomas Schwarzmayr and Tim M Strom Helmholtz Zentrum München Institute of Human Genetics Geneva, 16/04/12.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Transcriptional Enhancers Looking out for the genes and each other Sridhar Hannenhalli Department of Cell Biology and Molecular Genetics Center for Bioinformatics.
Understanding GWAS SNPs Xiaole Shirley Liu Stat 115/215.
Common variation, GWAS & PLINK
Gil McVean Department of Statistics
Statistical Applications in Biology and Genetics
Functional Mapping and Annotation of GWAS: FUMA
Genetics of gene expression
Gene Hunting: Design and statistics
Post-GWAS and Mechanistic Analyses
The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans Science Volume 348(6235): May 8, 2015 Published by AAAS.
Linking Genetic Variation to Important Phenotypes
by Manuel A. Rivas, Matti Pirinen, Donald F
Relationship between Genotype and Phenotype
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Tuuli Lappalainen, Stephen B. Montgomery, Alexandra C
Eddie Park, Zhicheng Pan, Zijun Zhang, Lan Lin, Yi Xing 
Volume 162, Issue 5, Pages (August 2015)
Alternative Splicing QTLs in European and African Populations
Working in the Post-Genomic C. elegans World
Xin Li, Alexis Battle, Konrad J. Karczewski, Zach Zappala, David A
CaQTL analysis identifies genetic variants affecting human islet cis-RE use. caQTL analysis identifies genetic variants affecting human islet cis-RE use.
Genetic and Epigenetic Regulation of Human lincRNA Gene Expression
New approaches to personalized medicine for asthma: Where are we?
Gene Age Predicts the Strength of Purifying Selection Acting on Gene Expression Variation in Humans  Konstantin Y. Popadin, Maria Gutierrez-Arcelus, Tuuli.
by Manuel A. Rivas, Matti Pirinen, Donald F
Presentation transcript:

Geuvadis RNAseq analysis @ UNIGE Genetic regulatory variants Tuuli Lappalainen University of Geneva Geuvadis Analysis meeting II, July 11, 2012

Expression quantitative trait loci (eQTLs) Expression level T Genotypes Works very well in cis. Difficult in trans The same principle can be applied to any quantitative phenotype with a genomic locus Statistical power only for common variants

QC – mRNA quantifications

miRNA QC POP LAB

eQTLs in Geuvadis Trans-analysis of large deletions didn’t yield much… Pop N Genes with eQTL (FDR) Best eQTL indel (null 8.9%) CEU+GBR 161 2608 (5.1%) 375 (14.4%) TSI 92 1748 (7.7%) 242 (13.8%) FIN 89 1822 (7.3%) 255 (14.0%) YRI 77 2138 (6.3%) 242 (11.3%) EUR union 342 3898 NA ALL union 419 4895 Trans-analysis of large deletions didn’t yield much… TODO: Some methodological improvements Combine Europeans with a PC correction of pop structure Test exon versus transcript quantification

Splicing QTLs (sQTLs) in Geuvadis Pop N Genes with sQTL – transcript ratio Genes with sQTL – links CEU+GBR 161 121 (FDR 9.1%) 1251 forward (FDR 5.6%) 1077 reverse (FDR 6.6%) nonredundant: 1949 ALL union 419 274 NA E1 E2 E3 FRE1-E2 = 5 (RE1-E2) / 5 (RE1-E2) + 3 (RE1-E3) = 0.625 FRE1-E3 = 3 (RE1-E3) / 5 (RE1-E2) + 3 (RE1-E3) = 0.375 links or junctions? counts or fractions? ALTRANS method by Halit Ongen

Integrating transcriptome QTLs eQTLs for mRNA and miRNA exon/miRNA_quantification ~ snp + covariates sQTLs link/junction_ratio ~ snp + covariates link/junction quantification ~ snp + exon_quantification + covariates multiple tQTLs: for the same gene exon_quantification ~ snp2 + exon_eQTL_snp1 + covariates link/junction ratio ~ snp2 + exon_eQTL_snp1 + covariates targeted trans analysis exon quantification ~ mi(eQTL)_snp + covariates link/junction_ratio ~ mieQTL_snp + covariates

Functional annotation of eQTLs TODO: Direction of effect TF motifs, PWM scores Different eQTL frequencies Other tQTLs What’s the best way to tell if we have the causal variant or not? And how often do we seem to find it?

Allele specific expression Statistical testing for ASE What is the allelic ratio? Significantly different from 50-50? cis eQTL* coding SNP mRNA-sequencing T T G T T T T A C C C *or an epigenetic reason for higher expression of only one homolog in the studied cell population (e.g. imprinting)

Rare variants have higher effect sizes ASE analysis ~ REGULATORY VARIANT FREQUENCY 0 0.5 1 derived allele frequency power in eQTL analysis eQTL analysis – expected result Proper quantification of the effect?

Quantifying genetic effects to individual differences TODO: More work on the ASE difference analysis Variation within/between populations Rare variant ASE mapping

Can we predict functional effects of genetic variants? How likely is an unknown variant to have regulatory effects based on known priors? Gene expression ~ variant’s : distance from TSS + position in gene + functional annotation + allele frequency + conservation score + variant type… “gene expression” could be e.g. exon quantification or link ratio (Gaffney et al. 2012 Genome Biology) Does anyone have good experience of this type of modeling?

Acknowledgements The FunPopGen lab Manolis Dermitzakis Analysis Alfonso Buil Thomas Giger Halit Ongen Data processing Ismael Padioleau Alisa Yurovsky Technicians Deborah Bielsen Emilie Falconnet Alexandra Planchon Luciana Romano Stanford School of Medicine Stephen Montgomery The 1000 Genomes Consortium Functional Interpretation Group FUNDING European Union National Institute of Health Louis-Jeantet Foundation Academy of Finland Emil Aaltonen Foundation Swiss National Science Foundation NCCR