Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian.

Slides:



Advertisements
Similar presentations
Dahlia Nielsen North Carolina State University Bioinformatics Research Center.
Advertisements

Introduction to Microarry Data Analysis - II BMI 730
Genomic Profiles of Brain Tissue in Humans and Chimpanzees II Naomi Altman Oct 06.
Microarray Normalization
Analysis and Interpretation of Microarray Data Michael F. Miles, M.D., Ph.D. Depts. of Pharmacology/Toxicology and Neurology and the Center for Study of.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
OHRI Bioinformatics Introduction to the Significance Analysis of Microarrays application Stem.
Statistics Tools in GeneSpring The Center for Bioinformatics UNC at Chapel Hill Jianping Jin Ph.D. Bioinformatics Scientist Phone: (919)
Detecting Differentially Expressed Genes Pengyu Hong 09/13/2005.
Getting the numbers comparable
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Microarray Data Preprocessing and Clustering Analysis
Gene Expression Data Analyses (3)
Differentially expressed genes
Introduction to Computational Biology Topics. Molecular Data Definition of data  DNA/RNA  Protein  Expression Basics of programming in Matlab  Vectors.
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Statistical Analysis of Microarray Data
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
1 Test of significance for small samples Javier Cabrera.
Introduction to microarray
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
Generate Affy.dat file Hyb. cRNA Hybridize to Affy arrays Output as Affy.chp file Text Self Organized Maps (SOMs) Functional annotation Pathway assignment.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
Different Expression Multiple Hypothesis Testing STAT115 Spring 2012.
Microarray Preprocessing
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Previous Lecture: Categorical Data Methods. Nonparametric Methods This Lecture Judy Zhong Ph.D.
Dr Paul Lewis Lecturer in Bioinformatics Lecturer in Bioinformatics Cardiff University Cardiff University Biostatistics & Bioinformatics Unit Biostatistics.
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Stat 565- Lecture 0 Introduction and Map of this Class.
Model-based analysis of oligonucleotide arrays, dChip software Statistics and Genomics – Lecture 4 Department of Biostatistics Harvard School of Public.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Other genomic arrays: Methylation, chIP on chip… UBio Training Courses.
Statistics for Differential Expression Naomi Altman Oct. 06.
Application of Class Discovery and Class Prediction Methods to Microarray Data Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
Statistical Analysis of Microarray Data By H. Bjørn Nielsen.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
Analyzing Expression Data: Clustering and Stats Chapter 16.
Microarray Data Analysis The Bioinformatics side of the bench.
Fig. S1 RNA quality check. Shown are the results from Bio-Rad Experion system analysis of 12 RNA samples used in the microarray studies. The RQI (RNA Quality.
Variability & Statistical Analysis of Microarray Data GCAT – Georgetown July 2004 Jo Hardin Pomona College
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Gene expression  Introduction to gene expression arrays Microarray Data pre-processing  Introduction to RNA-seq Deep sequencing applications RNA-seq.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Canadian Bioinformatics Workshops
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
Microarray Data Analysis Xuming He Department of Statistics University of Illinois at Urbana-Champaign.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
Functional Genomics in Evolutionary Research
Getting the numbers comparable
Presentation transcript:

Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian

Introduction Big volumes of microarray data generated from different technologies are too large to analyze by simple sorting in spreadsheets, or manually comparing, plotting as graphs. Big volumes of microarray data generated from different technologies are too large to analyze by simple sorting in spreadsheets, or manually comparing, plotting as graphs. Each type and/or platform of microarray has its own unique analysis features. Each type and/or platform of microarray has its own unique analysis features.

General Procedures Normalization: remove/reduce systematic (non-biological) variation between array- array, chip-chip. Try to equalized overall signals across array/chip to be compared. Normalization: remove/reduce systematic (non-biological) variation between array- array, chip-chip. Try to equalized overall signals across array/chip to be compared. Examples of normalization: whole chip, per gene, quantile, Lowess, dye swap, RMA,… Examples of normalization: whole chip, per gene, quantile, Lowess, dye swap, RMA,…

Cont. Comparative: Compare gene expression across two or more samples to determine significant differential expressed gene list. Comparative: Compare gene expression across two or more samples to determine significant differential expressed gene list. Example methods: Example methods: t-test ANOVA t-test ANOVA Fold change Fold change Rank order (MAS 5 etc.) Rank order (MAS 5 etc.) Permutation (SAM) Permutation (SAM)

Cont. Clustering: Identifies significant correlation in expression data across experiments/conditions. Clustering: Identifies significant correlation in expression data across experiments/conditions. Example method: Hierarchical clustering Hierarchical clustering k-means clustering k-means clustering Self-organizing maps Self-organizing maps ….. …..

Cont. Biological overlay: Identify functions for give genes; functional clusters of genes; hypothesis generation Biological overlay: Identify functions for give genes; functional clusters of genes; hypothesis generation Example method: Example method: Multi-database access (Source) Multi-database access (Source) Functional grouping (Gene Ontology, KEGG, GenMAPP) Functional grouping (Gene Ontology, KEGG, GenMAPP) PubMed Correlations (PubGene) PubMed Correlations (PubGene)

GenomeStudio A software tool for analyzing illumina gene expression data from scanned microarray images collected from the illumina BeadArray Reader. A software tool for analyzing illumina gene expression data from scanned microarray images collected from the illumina BeadArray Reader. Resulting BeadStudio files can be used by the 3 rd party analysis programs. Resulting BeadStudio files can be used by the 3 rd party analysis programs.

The normalization uses quantiles of sample intensities to fit smoothing B-splines. It’s a non-linear method. Different scaling factors are applied to different parts of the population of genes.

Differntial Expression Algorithm This is used to compare a group of samples to a reference group. Illumina custom: assumes that signal intensith is normally distributed among replicates. The variation has 3 components: biological, non-biological, and technical errors. Illumina custom: assumes that signal intensith is normally distributed among replicates. The variation has 3 components: biological, non-biological, and technical errors. Mann-Whitney: also called Wilcoxon rank-sum rest. It’s a non-parametric test for assessing whether two samples of observation come from the same distribution. Mann-Whitney: also called Wilcoxon rank-sum rest. It’s a non-parametric test for assessing whether two samples of observation come from the same distribution. T-test: T-test:

Output Files of BeadStudio XXXXXX_gene_profile: Intensity data and various quality scores reported at the gene level. Signals from probes for the same gene are combined to give a single value for the gene. XXXXXX_qcinfo :Intensity data for categories of experimental control probes. XXXXXX_gene_diff: Intensity determining if gene expression levels have changed between two experimental groups.

Cluster Analysis

System Controls Housekeeping Controls: The intactness of the biological specimen can be monitored by this. Housekeeping Controls: The intactness of the biological specimen can be monitored by this. Biotin Control: Successful secondary staining is indicated by a positive hybridization signal from these probe. Biotin Control: Successful secondary staining is indicated by a positive hybridization signal from these probe. Negative Controls: This represents measurement of background, non-specific binding or cross-hybridization. Negative Controls: This represents measurement of background, non-specific binding or cross-hybridization.

Cont. Controls for Hybridization: Controls for Hybridization: Cy3-Labeled Hyb control Cy3-Labeled Hyb control Low Stringency Hyb Control Low Stringency Hyb Control High Stringency Hyb Control High Stringency Hyb Control