Microarray Experiment Design and Data Interpretation

Slides:



Advertisements
Similar presentations
Microarray Data Analysis Day 2
Advertisements

Microarray technology and analysis of gene expression data Hillevi Lindroos.
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Fuzzy K means.
Introduce to Microarray
Analysis of microarray data
with an emphasis on DNA microarrays
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Evolva Biotech SA Microarray and Macro opportunities for Discovery informatics Head of Informatics Mobile.
Clustering of DNA Microarray Data Michael Slifker CIS 526.
CDNA Microarrays MB206.
Data Type 1: Microarrays
Scenario 6 Distinguishing different types of leukemia to target treatment.
CS491JH: Data Mining in Bioinformatics Introduction to Microarray Technology Technology Background Data Processing Procedure Characteristics of Data Data.
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Gene Expression and Networks. 2 Microarray Analysis Supervised Methods -Analysis of variance -Discriminate analysis -Support Vector Machine (SVM) Unsupervised.
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Methodology U937 Human Immune Cells Control (No treatment) (n=4) Estrogen (5 uM) (n=4) 4-nonylphenol (5 uM) (n=4) Cultured Cells, RNA Isolation, RT (to.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Intelligent DNA Chips: Logical Operation of Gene Expression Profiles on DNA Computers Ji Yoon Park Dept. of Biochem Hanyang University Yasubumi Sakakibara,
Microarray: An Introduction
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
DNA Microarray. Microarray Printing 96-well-plate (PCR Products) 384-well print-plate Microarray.
Experimental summary Norwich DMSO treated versus Control Yeast
The Transcriptional Landscape of the Mammalian Genome
Using Web-Based Tools for Microarray Analysis
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Gene Expression Analysis
Assessing functional consequences of epigenetic modifications An Data Analysis Activity for Students This teacher slide set was created by Dana Haine,
GCN5 Regulates FGF Signaling and Activates Selective MYC Target Genes during Early Embryoid Body Differentiation  Li Wang, Evangelia Koutelou, Calley.
*Habibi N [1], Mustafa AS [1,2], Al-Shammari S [3], Shaheed F [1]
Functional Genomics in Evolutionary Research
Microarray Technology and Applications
Figure 1. Effect of acute TNF treatment on transcription in human SGBS adipocytes as assessed by RNA-seq and RNAPII ChIP-seq. Following 10 days in vitro.
Example of a DNA Array (note green, yellow red colors; also note that only part of the total array is depicted)
Day 2: Session 8: Questions and follow-up…. James C. Fleet, PhD
Overview Expression data basics Introduction Biological network data
1 Department of Engineering, 2 Department of Mathematics,
Volume 4, Issue 4, Pages e5 (April 2017)
Gene Chips.
Gene Expression Analysis and Proteins
Microarrays 1/31/2018.
1 Department of Engineering, 2 Department of Mathematics,
DNA Chip Data Interpretation Tools: Genmapp & Dragon View
Altered microRNA expression in stenoses of native arteriovenous fistulas in hemodialysis patients  Lei Lv, MD, Weibin Huang, MD, Jiwei Zhang, MD, Yaxue.
1 Department of Engineering, 2 Department of Mathematics,
Loyola Marymount University
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
Volume 129, Issue 3, Pages (September 2005)
Dimension reduction : PCA and Clustering
Transcriptional Landscape of Cardiomyocyte Maturation
Interpretation of Similar Gene Expression Reordering
Volume 10, Issue 1, Pages (January 2018)
Ashton Breitkreutz, Lorrie Boucher, Mike Tyers  Current Biology 
Volume 16, Issue 4, Pages (November 2004)
The BRD4 bromodomain is critical for expression of SASP genes.
Gene Expression Analysis
Microarray Data Analysis
Loyola Marymount University
The up-regulation of Galnt3 by Pi requires activation of the ERK pathway and induction of Egr1 and Etv5. The up-regulation of Galnt3 by Pi requires activation.
Maria S. Robles, Sean J. Humphrey, Matthias Mann  Cell Metabolism 
Loyola Marymount University
Loyola Marymount University
Volume 26, Issue 24, Pages (December 2016)
Data Type 1: Microarrays
Presentation transcript:

Microarray Experiment Design and Data Interpretation Susan Hester, Ph.D. Environmental Carcinogenesis Division Toxicogenomic Core Facility US EPA hester.susan@epa.gov 919-541-1320

Presentation Outline Traditional biology versus genomics Basics of genomics Data mining goals and approaches using parallel analyses -some examples Interpreting changes in gene expression to identify altered molecular pathways Evaluating pathway alterations in concert with traditional toxicology data for greater understanding of mode of action

Traditional Biology Measure one tree at a time Measure one element in 10-50 samples

“Omic” Biology Measure tens of thousands of elements in 2 to 4 samples Measure Forests (groups of trees)

Genomic research is a data-rich technology Microarrays are called chips or arrays Takes advantage of the natural property of DNA to pair with its complimentary strand One strand is built into the array and then is used as a probe for the complementary strand in the biologic sample The binding confirms the presence of mRNA or cDNA In the sample

Genomic Profiling-Find ”Significantly Changed Genes” From: All probesets Typical experiment is ~ 1M datapoints To: Reduce to a much smaller number of “meaningful genes”

Finding genes in samples-1st step 1 genechip cell location 1 genechip apply sample

2nd step Tagged DNA fragments that base pair will glow 2nd step shine light final image text file with gene intensities

Experimental Design Use adequate controls Sample collection Choose time-points and doses Hybridization schemes-1 or 2 colors

Data Quality and Data Mining RNA quality Scans Summary statistics

RNA quality: Agilent 2100 Bioanalyzer Measure RNA quality and quantity Uses small sample size and take minutes Good Quality RNA Degraded RNA Agilent Gel Image

QC Assessment of Scanned Slide Showing Good Dynamic Range of Signal Intensity Low background signal Poor scan Good scan

Summary Statistics for each array Raw gene intensity distribution for each array After normalization shows reduced variance max median min Grp 1 2 3 4 5 6

Example of with-in group outliers Example of 2 array outliers (high and low median values) Arrays

Goals of Data Mining genes” Reduce the large dataset by first exclude “unchanging genes” Early microarray papers used a simple “fold change” to find differences Most analyses now rely on statistical tests to identify changed genes-supervised versus unsupervised Find genes that distinguish the various biologic classes “significant genes”

General Approach: From many genes to a few 28,000 rat genes 34,000 mouse genes normalize data to compare across arrays analysis begins here supervised (prior knowledge) and unsupervised (no prior knowledge) T test, ANOVA, etc. PCA, KNN, clustering genes…now associate with gene name using databases to assign gene function characterize genes into pathways explore pathways by combining into networks

Array Image Inspection Confirms the Induction of Many Genes 1 uM As 50 uM As

Statistical Filter shows more significant genes at higher doses 1 uM As 50 uM As genes that have values>1.5 fold and significant p<0.05

Many Views of the Data Principal Component Analysis (PCA) Table of filtered genes Principal Component Analysis (PCA) Venn Diagrams-gene level Correlate Transcription with Functional Assays Map genes to pathways Venn Diagram-pathway level

Table view: Significantly Altered Genes by Chemical, Day and Dose in rat liver

Principal Component Analysis Identifies dose-response, if present Assess experiment Worth analyzing ? Identify outliers-bad chips Find samples with similar expression patterns What it does What it looks like: uses all samples and genes using statistics, reduces and plots the data helps visualize data in 2 or 3 planes (3D) What it tells groups samples or genes with similar profiles differentiates treatment or exposure groups

Principal Component Analysis Rat Liver

Numbers of Common and Unique Genes Over Time (High Dose)-rat liver

Dose response corresponds to functional assays Better description of dose response by genomics

Mapping genes to pathways Process p-Value # of genes Expressed # of genes in Pathway % Transcription of Retinoid-Target genes Cell signaling/Regulation of transcription 7.56E-09 68 125 54 Regulation activity of EIF2 Cell signaling/Translation regulation 5.86E-05 31 56 55 IGF-R signaling Growth and differentiation 4.57E-06 40 72 AKT signaling 9.50E-06 33 57 58 PTEN pathway 3.65E-05 Tryptophan metabolism Metabolic maps/Amino acid metabolism 3.99E-05 17 24 71 Cholesterol Biosynthesis Metabolic maps/Steroid metabolism 6.25E-06 16 22 82 GTP-XTP metabolism Metabolic maps/Nucleotide metabolism 4.58E-07 34 63 CTP/UTP metabolism 1.32E-05 60 ATP/ITP metabolism 1.49E-05 36 65

Pathway Venn Unique and common pathways over time

Pathway and network visualizations cellular molecular network metabolic transcription

Example of a molecular pathway with gene intensity values added Oxidative Phosphorylation pathway red=gene induced green=gene repressed rainbow=mixed ATPase Oxidoreductase NADH dehydrogenase succinate dehydrogenase complex cytochrome c oxidase subunit

Cellular pathway extracellular cytoplasmic nuclear Note c-Jun JNK1, ERK1 repression* nuclear Expression legend Green= decreased Red=increased Rainbow=mixed

Gene Network: One Transcription factor:

Network objects mapped to cellular localization

Conclusions Steps for a successful microarray experiment: Experiment design-focus your research question Data quality assessment Supervised and unsupervised analyses Integrating gene expression results with other phenotypic endpoints