Microarray analysis Golan Yona ( original version by David Lin )

Slides:



Advertisements
Similar presentations
Microarray Technique, Analysis, and Applications in Dermatology Jennifer Villaseñor-Park 1 and Alex G Ortega-Loayza 2 1 Department of Dermatology, University.
Advertisements

Chapter Six Nucleic Acid Hybridization: Principles & Applications 1.Preparation of nucleic acid probes: - DNA: from cell-based cloning or by PCR. Probe.
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
Introduction to Microarray Analysis and Technology Dave Lin - November 5, 2001.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
The Human Genome Project and ~ 100 other genome projects:
Chip arrays and gene expression data. Motivation.
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Bacterial Physiology (Micr430)
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
Introduce to Microarray
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
By Moayed al Suleiman Suleiman al borican Ahmad al Ahmadi
Analysis of microarray data
with an emphasis on DNA microarrays
Microarrays, RNAseq And Functional Genomics CPSC265 Matt Hudson.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
‘Omics’ - Analysis of high dimensional Data
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Library screening Heterologous and homologous gene probes Differential screening Expression library screening.
Microarray Technology
Microarray - Leukemia vs. normal GeneChip System.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
High throughput Protein Measurement Techniques Harin Kanani.
Genomics I: The Transcriptome
Topic intro slides More complete coverage of components involved in gene expression More information on expression technologies -what would the ideal chip.
DNA Microarrays: An Introduction Jochen Mueller
Computational Biology and Bioinformatics Lab. Songhwan Hwang Functional Genomics DNA Microarray Technology.
Lecture 7. Functional Genomics: Gene Expression Profiling using
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
DNA Gene A Transcriptional Control Imprinting Histone Acetylation # of copies of RNA? Post Transcriptional Processing mRNA Stability Translational Control.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Lecture 23 – Functional Genomics I Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Gene expression and DNA microarrays No lab on Thursday. No class on Tuesday or Thursday next week –NCBI training Monday and Tuesday –Feb. 5 during class.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Microarray: An Introduction
Microarray - Leukemia vs. normal GeneChip System.
The Basics of cDNA Microarray Technology
Microarray Technology and Applications
Introduction to cDNA Microarray Technology
Introduction to Microarrays.
Presentation transcript:

Microarray analysis Golan Yona ( original version by David Lin )

What makes one cell different from another? liver vs. brain Cancerous vs. non-cancerous Treatment vs. control

There are about 100,000 genes in mammalian genome each cell expresses only ~15,000 of these genes genes can be expressed at a different level Gene expression can be measured by #copies of mRNA/cell 1-5 copies/cell - “rare” (~30% of all genes) copies/cell - “moderate” 200 copies/cell and up - “abundant”

What makes one cell different from another? Which genes are expressed How much of each gene is expressed Traditional biology: Try and find genes that are differentially expressed Study the function of these genes Find which genes interact with your favorite gene Extremely time consuming! Impractical for gene mining!

Microarrays Massively parallel analysis of gene expression screen an entire genome at once find not only individual genes that differ, but groups of genes that differ. find relative expression level differences Shifting the interest from analysis of single molecules to large complexes and networks Effective for Functional analysis Identify regulatory networks and cellular procedures Tune medical diagnosis and treatment

The technology Measure interactions between mRNA-derived target molecules and genome-derived probes. Based on old technique Many flavors- majority are of two essential varieties 1. cDNA Arrays (discussed today) printing on glass slides, miniaturization, throughput fluorescence based detection 2. Affymetrix Arrays in situ synthesis of oligonucleotides. Other arrays – protein arrays, combinatorial chemistry

The process BUILDING THE CHIP MASSIVE PCR PCR PURIFICATION and PREPERATION PREPARING SLIDESPRINTING PREPARING RNA CELL CULTURE And HARVEST RNA ISOLATION cDNA PRODUCTION HYBING THE CHIP POST PROCESSING ARRAY HYBRIDIZATION PROBE LABELING DATA ANALYSIS

BUILDING THE CHIP MASSIVE PCR PCR PURIFICATION and PREPERATION PREPARING SLIDES PRINTING POST PROCESSING Full yeast genome = 6,500 reactions IPA precipitation +EtOH washes well format Polylysine coating for adhering PCR products to glass slides The arrayer: high precision spotting device capable of printing 10,000 products in 14 hrs, with a plate change every 25 mins Chemically converting the positive polylysine surface to prevent non- specific hybridization

Micro Spotting pin

Practical problems Surface chemistry: uneven surface may lead to high background. Dipping the pin into large volume -> pre-printing to drain off excess sample. Spot variation can be due to mechanical difference between pins. Pins could be clogged during the printing process. Spot size and density depends on surface and solution properties. Pins need good washing between samples to prevent sample carryover.

HYBING THE CHIP ARRAY HYBRIDIZATION PROBE LABELING 1) Two RNA samples (from two different tissues) are labelled with Cy3 or Cy5 dyes via a chemical coupling to AA-dUTP. 2) Cy3 and Cy5 RNA samples are simultaneously hybridized to chip. 3) Ratio measurements are determined via quantification of emission values. Data analysis starts…..

TACAGAA RNA ATGTC TT CCAACCTATGG T T Cy5-dUTP GGTTGGATACC cDNA cDNA synthesis + or Cy3-dUTP Dye conjugated nucleotide Direct labeling of RNA

Gene 1 Gene N

Practical problems

Pre-processing issues Definition of what a real signal is: what is a spot, and how to determine what should be included in the analysis? How to determine background local (surrounding spot) vs. global (across slide) How to correct for dye effect How to correct for spatial effect e.g. print-tip, others How to correct for differences between slides e.g. scale normalization

Data representation Each spot on the array corresponds to one gene. If E1 is the expression of the gene in experiment 1 and E2 is the expression of the gene in experiment 2 then the spot is converted to a number representing the ratio E1/E2 or log(E1/E2)

Data analysis 1) Identify individual genes that are over or under expressed 2) Identify groups of genes that are co- expressed 3) Reconstruct the gene networks that underlie the observed expression levels

1) Analysis of individual genes Absolute values are misleading Need to establish the baseline in order to derive a measure of statistical significance for individual genes Define distributions over the whole array or a control group. Use mean/variance to determine the significance, normalization…statisticians like this stuff

2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i: (x1,x2,x3,…) This is an expression profile of a gene x1,x2,x3,.. are usually in log scale

Example: Yeast cell-cycle data Time Expression at time t Background Expression (control)

Each gene has its own profile Gene X Gene Y

How to measure similarity? Given two expression vectors U,V of dimension d 2

Problem with missing values Possible solutions: averaging, nearest- neighbor approach, EM approach, shuffling