Contribution of Epigenetic Variation to Expression Changes Among Tissues and Genotypes Steve Eichten – Springer Lab PAG iPlant Workshop 1/17/12.

Slides:



Advertisements
Similar presentations
Functional Genomics with Next-Generation Sequencing
Advertisements

Chromatin Immunoprecipitation and the Chip on Chip technique. Fredrik Fagerström Billai B E A- Core Facility for Bioinformatics and Expression Analysis.
Genetic Basis of Agronomic Traits Connecting Phenotype to Genotype Yu and Buckler (2006); Zhu et al. (2008) Traditional F2 QTL MappingAssociation Mapping.
Peter Tsai Bioinformatics Institute, University of Auckland
Lab 13: Association Genetics. Goals Use a Mixed Model to determine genetic associations. Understand the effect of population structure and kinship on.
Data Analysis for High-Throughput Sequencing
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
RNA-seq Analysis in Galaxy
Before we start: Align sequence reads to the reference genome
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007.
NGS Analysis Using Galaxy
An Introduction to RNA-Seq Transcriptome Profiling with iPlant
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
Customized cloud platform for computing on your terms !
Geuvadis RNAseq analysis at UNIGE Analysis plans
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Expression Analysis of RNA-seq Data
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Bioinformatics and OMICs Group Meeting REFERENCE GUIDED RNA SEQUENCING.
An Introduction to ENCODE Mark Reimers, VIPBG (borrowing heavily from John Stamatoyannopoulos and the ENCODE papers)
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
Genomics and High Throughput Sequencing Technologies: Applications Jim Noonan Department of Genetics.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
TopHat Mi-kyoung Seo. Today’s paper..TopHat Cole Trapnell at the University of Washington's Department of Genome Sciences Steven Salzberg Center.
RNA-seq workshop ALIGNMENT
An Introduction to RNA-Seq Transcriptome Profiling with iPlant.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA RNA-seq CHIP-seq DNAse I-seq FAIRE-seq Peaks Transcripts Gene models Binding sites RIP/CLIP-seq.
Silence of the Genes. Genetics The study of inheritance.
Epigenetics Heritable characteristics of the genome other than the DNA sequence Heritable during cell-division (mitosis) To a lesser extent also over generations.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
NextGen Pipeline: Enabling the Plant Science Community Tom Brutnell (lead), Steve Rounsley (co-lead), Matt Vaughn (Engagement Lead) Ed Buckler, Justin.
Sackler Medical School
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
Figure SOM1. Functional roles of the genes affected in zmet2-m1 mutants. Although the genes localized on the intracellular membranes were slightly over-represented.
RNA-Seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis is doing the.
Introduction to RNAseq
ACCELERATING CLINICAL AND TRANSLATIONAL RESEARCH Challenges in Bioinformatics R.W. Doerge Department of Statistics Department Agronomy.
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
An Introduction to RNA-Seq Transcriptome Profiling with iPlant (
The iPlant Collaborative
No reference available
Comparative transcriptomics of fungi Group Nicotiana Daan van Vliet, Dou Hu, Joost de Jong, Krista Kokki.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Case study: Saccharomyces cerevisiae grown under two different conditions RNAseq data plataform: Illumina Goal: Generate a platform where the user will.
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
Canadian Bioinformatics Workshops
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Introductory RNA-seq Transcriptome Profiling of the hy5 mutation in Arabidopsis thaliana.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
Canadian Bioinformatics Workshops
Introductory RNA-seq Transcriptome Profiling
Cancer Genomics Core Lab
Gil McVean Department of Statistics
Introductory RNA-Seq Transcriptome Profiling
Statistics in epigenomics
GENETICS A Conceptual Approach
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Material for today’s workshop is at:
GENETICS A Conceptual Approach
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Computational Pipeline Strategies
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Alignment and CNV analysis in cattle
Presentation transcript:

Contribution of Epigenetic Variation to Expression Changes Among Tissues and Genotypes Steve Eichten – Springer Lab PAG iPlant Workshop 1/17/12

Outline Questions we are interested in Using iPlant to assist in answering these questions Current and future analysis

Epigenetics & Gene Expression Heritable variation not due to sequence variation – DNA methylation, histone modification, etc – Classic examples: imprinting, paramutation Areas of research: Epigenomic variation across genotypes & development Relationship between genetic & epigenetic variation Role of epigenetic variation in phenotype (Chandler & Stam, 2004)

Epigenetic variation Across genotypes: – Maize Nested Association Mapping population (26 inbreds) Across development: – 5 tissues Embryo Endosperm Leaf Immature ear tassel

To answer these questions… Genome-wide methylation assessment – meDIP-chip (2.1M probe platform) – BS-seq (for B73 and Mo17 inbreds) Genome-wide expression assessment – RNA-seq (~20M reads x 25 inbreds) TechnologyLeafEmbryoEndospermEarTasselTOTAL meDIP-chip arrays BS-seq 179,109, ,109,340 reads RNA-seq 558,210,73242,581,92443,419,68085,671,33073,105, ,989,001 reads ~98,200,000,000 bases 209,949,399 array data points Bioinformatic knowhow Computational power and storage – iPlant!

Old way 1. Buy larger hard drives 2. Upgrade computers 3. Buy more hard drives 4. Gain basic understanding of UNIX / Perl / R / etc… 5. Lots of command line work / installation / troubleshooting - Array normalization / fastqc / tophat / cufflinks / DEGseq / …… 6. Realize you want it run analyses a different way. {GOTO 4} 7. Realize you really need more computing power to do what you want. {GOTO 2 or make some friends at a supercomputing institute} 8. Train others in your lab group how to do it! {Document everything and GOTO 1}

New way Move sequence data to iPlant DE Select apps for desired analysis Run software faster than you can locally Quickly adapt methods to find optimum Develop analysis pipeline for others to use Store large files for others in your lab group to access Larger storage + Faster Computing + Faster Training + Faster Adaptation + Faster Implementation = more user-friendly, more powerful computing

Applying iPlant to maize epigenomics fastq files Tophat aligner to pre-indexed maize reference genome Cufflinks transcript assembly against maize gene models iRODS file transfer to iPlant iPlant integrated Apps Download files from iPlant, display in IGV Assess read quality (FastQC) iPlant integrated Apps

Methylation variation in B73 and Mo17 Chromosome Most locations show similar methylation profiles Identified ~700 differentially methylated regions (DMRs) in B73 and Mo17 inbreds Eichten et al., PloS Genetics (B73 methylated, Mo17 not) (Mo17 methylated, B73 not)

Examples of expression changes correlated with epigenetic state Chromosome 6 Methylation inversely correlated with expression state Inbred-specific examples of methylation & expression variation Hundreds of genes correlated with epigenetic state

Methylation & Expression variation across NAM Chromosome 4 Putative targets of heritable epigenetic variation

Analysis of DNA methylation patterns across additional tissues and genotypes B73 embryo B73 endosperm B73 leaf Mo17 embryo Mo17 endosperm Mo17 leaf Ki11 leaf Mo18w leaf NC358 leaf Oh7b leaf DNA methylation patterns are generally quite similar among genotypes and tissues. However, there are ~1000 DMRs between any two genotypes. Variation frequently acts equally upon all tissues. Few Tissue specific DMRs and rarely conserved between genotypes

Wrap Up Epigenetics – Epigenetic variation exists in maize – Examples of gene expression states correlated with epigenetic state can be identified – Few tissue-specific methylation variants Utility of iPlant – Fast & remote location for storing large amounts of data – Fluid analysis of sequence data to develop transcript alignment and quantification

Springer Lab – Amanda Waters – Ruth Swanson-Wagner – Peter Hermanson – Nathan Springer iPlant & TACC – Matthew Vaughn NSF Thanks!