Post-GWAS and Mechanistic Analyses

Slides:



Advertisements
Similar presentations
What is neural stemness? Why is it important? What are the molecular signatures of neural stemness? What are the regulatory networks that control neural.
Advertisements

Imputation for GWAS 6 December 2012.
Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant.
Geuvadis RNAseq UNIGE Genetic regulatory variants
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Gene Regulation in Eukaryotic Cells. Gene regulation is complex Regulation, and therefore, expression of a gene is complex. Regulation of these genes.
20,000 GENES IN HUMAN GENOME; WHAT WOULD HAPPEN IF ALL THESE GENES WERE EXPRESSED IN EVERY CELL IN YOUR BODY? WHAT WOULD HAPPEN IF THEY WERE EXPRESSED.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
The Central Dogma & Data DNA mRNA Transcription Protei n Translation Metabolite Cellular processes Phenotype Embryology Organismal Biology Genetic Data.
Introduction to BioInformatics GCB/CIS535
Genomics tools to identify the molecular basis of complex traits Justin Borevitz Salk Institute naturalvariation.org.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
Identification of obesity-associated intergenic long noncoding RNAs
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Finish up array applications Move on to proteomics Protein microarrays.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
Affymetrix 2.0 miRNA arrays on lung tissue RNA n=3-4 mice/strain 92 differentially expressed miRNAs 38 miRNAs both differentially and highly expressed.
Trends Biomedical In silico. “Omics” a variety of new technologies help explain both normal and abnormal cell pathways, networks, and processes simultaneous.
Biostatistics-Lecture 19 Linkage Disequilibrium and SNP detection
Current Data And Future Analysis Thomas Wieland, Thomas Schwarzmayr and Tim M Strom Helmholtz Zentrum München Institute of Human Genetics Geneva, 16/04/12.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Regulating Gene Expression To accompany “Regulating Gene Expression” Packet -review packet reading, pictures, and questions.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Advances and challenges in computational modeling and statistical learning of biological systems Qi Liu Department of Biomedical Informatics Vanderbilt.
Genetics Journal Club Sumeet A. Khetarpal 10 December 2015.
Single Nucleotide Polymorphisms (SNPs
(3) Gene Expression Gene Expression (A) What is Gene Expression?
EQTLs.
Imputation Sarah Medland Boulder 2015.
Common variation, GWAS & PLINK
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
Lesson Four Structure of a Gene.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Lesson Four Structure of a Gene.
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Epigenetics 04/04/16.
Consideration for Planning a Candidate Gene Association Study With TagSNPs Shehnaz K. Hussain, PhD, ScM Epidemiology 243: Molecular.
upstream vs. ORF binding and gene expression?
Functional Mapping and Annotation of GWAS: FUMA
Introduction to Bioinformatics February 13, 2017
Gene Hunting: Design and statistics
Transcription and Gene Regulation
Case Study #2 Session 1, Day 3, Liu
Regulation of Gene Expression by Eukaryotes
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
Genome-wide Associations
Beyond GWAS Erik Fransen.
Linking Genetic Variation to Important Phenotypes
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Coordinately Controlled Genes in Eukaryotes
Regulation of Gene Expression
Department of Chemical Engineering
Agenda 3/16 Eukaryotic Control Introduction and Reading
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Genetics and Information
Principle of Epistasis Analysis
Medical genomics BI420 Department of Biology, Boston College
One SNP at a Time: Moving beyond GWAS in Psoriasis
A systems view of genetics in chronic kidney disease
The MultiOmics Explainer
Medical genomics BI420 Department of Biology, Boston College
GWAS-eQTL signal colocalisation methods
Integrative omic approaches for the study of host–pathogen interactions Integrative omic approaches for the study of host–pathogen interactions (A) Proteomic.
A, Proportion for different in silico predictions at the RNA level.
Presentation transcript:

Post-GWAS and Mechanistic Analyses Session 1, Day 4, Liu Post-GWAS and Mechanistic Analyses

I identified a SNP, now what? Congratulations! But…

More Questions A genotype-phenotype association does not mean a cause-effect relationship

Are You Sure? If you are dedicated to do some real science, every step down the road means time, money, labor, value, reputation and life! Validate the result carefully before you jump into any mechanistic study If you can’t do it, ask other people and collaborate.

You Only Identified a Marker! Is the top SNP what I should choose to go? Should I ignore the SNPs that do not reach the significance? P value is not a smart judgement! Did I miss any other SNP? Fine mapping Narrow down Top SNP Genotyped SNPs Untyped SNPs Unidentified SNPs Other type of variants

“Balloon” “Skyscraper”

Things To Do Imputation Sequencing LD Haplotype

Genotype Imputation Predicting or imputing genotypes that are not directly assayed in a sample of individuals “in silico genotyping” Missing data Untyped No cost Increase the possibility for identification

Application Before GWAS: increase chance After GWAS: fine mapping Alternative GWAS: using existing control data Meta-analysis: combining data

SNPs Patients ?

Process Marchini et al., 2011

Accuracy Howie, et al., Nature Genetics 44, 955–959 (2012)

Strategies SNP tagging-based approaches PLINK SNPMSTAT UNPHASED TUNA Simple multinomial model of haplotype frequencies Low accuracy but quick

Strategies Hidden markov model (HMM)-based approaches High accuracy Rare SNPs IMPUTE v1 IMPUTE v2 fastPHASE/BIMBAM MACH BEAGLE

IMPUTE v1, IMPUTE v2, SNPTEST http://www.stats.ox.ac.uk/~marchini/#software fastPHASE, BIMBAM http://stephenslab.uchicago.edu/software.html MACH, MACH2DAT, MACH2QTL http://www.sph.umich.edu/ csg/abecasis/MACH/BEAGLE http://www.stat.auckland.ac.nz/bbrowning/beagle/beagle.html SNPMSTAT http://www.bios.unc.edu/lin/software/SNPMStat/ GEDI http://dna.engr.uconn.edu/ PLINK http://pngu.mgh.harvard.edu/purcell/plink/ UNPHASED http://homepages.lshtm.ac.uk/frankdudbridge/software/ unphased/TUNA http://www.stat.uchicago.edu/wen/tuna/ ProbABEL http://mga.bionet.nsc.ru/yurii/ABEL/

Other Variants Known or unknown CNVs/nucleotide repeats/indel 1000 Genome (1KG) Other shared NGS data Re-sequencing

Other Factors Common disease: gene X environment Multiple genes

Other Factors Gene X Environment (GXE) Age, gender, diet, smoke, drink, etc

Other Factors Gene X Gene (GXG) Epistasis SNP1 SNP2 Disease X +

Other GWAS Approaches Prior Knowledge-based grouping PWAS Gene-based grouping PORCE

You Only Determined a “Locus”! Direction Transcription Translation Catalysis DNA RNA Protein Metabolites Genome Transcriptome Proteome Metabolome/Lipidome Clinical endpoint dysregulation Genetic effect Environmental effect

Which Gene? The closest? Gene desert mRNA vs other? Candidate variants Top SNP Genotyped SNPs Untyped SNPs Unidentified SNPs Other type of variants The closest? Gene desert mRNA vs other? Knowledge integration Candidate gene

How?—Functional Pathway How the SNP change the gene function? Quantitative regulation Regulatory region: TF binding, stability, miRNA targeting, methylation, etc Gene/protein expression eQTL, p-QTL, mQTL Qualitative regulation Gene function: AA change, frame shift, splicing, CNV gene deletion, etc enzyme activity substrate binding phosphorylation

Integration of Different Omics Transcription Translation Catalysis DNA RNA Protein Metabolites Genome Transcriptome Proteome Metabolome/Lipidome Clinical endpoint dysregulation Genetic effect Environmental effect QTL mapping

Why?—Signaling Pathway Proof-of-concept Cells: Gene knock-down, overexpression, CRISPR/CAS9 Model species: KO, TG, mice, yeast, etc. Signaling pathway: dependency test Gene 1 X Gene 2 Knockdown X Knockdown Omics Gene 3 Knockdown …. disease Gene 4 Knockdown

Tools SNPnexus SNAP GTExPortal http://snp-nexus.org/ https://www.broadinstitute.org/mpg/snap/ GTExPortal http://www.gtexportal.org/home/