Hybridization and data acquisition –Hybridization –Scanning –Image analysis –Background correction and filtering –Data transformation Methods for normalization.

Slides:



Advertisements
Similar presentations
Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
Advertisements

MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Chapter Six Nucleic Acid Hybridization: Principles & Applications 1.Preparation of nucleic acid probes: - DNA: from cell-based cloning or by PCR. Probe.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Image Quantitation in Microarray Analysis More tomorrow...
TIGR Spotfinder: a tool for microarray image processing
Introduction to DNA Microarray Technology Steen Knudsen, April 2005.
Getting the numbers comparable
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Slides preparation - e.g., gelatin or poly – lysine treatment of slides siliconization of coverslips by precipitation ( e.g., ethanol) by cross-linkage.
Gene Expression Data Analyses (2)
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Introduce to Microarray
GeneChip Hybridization. The following hybridization mix is prepared for each sample Fragmented cRNA 5ug 10 ul Control B2 Oligo1.7 ul 20x Eukaryotic Control.
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
SPH 247 Statistical Analysis of Laboratory Data. Two-Color Arrays Two-color arrays are designed to account for variability in slides and spots by using.
What Can You Do With qPCR?
Lecture 10: DNA Quantitation.  Purpose of DNA quantitation  Quantitation Methods  Interchelating dyes  Slot blot  qPCR ▪ Taqman (Life Technologies.
By Moayed al Suleiman Suleiman al borican Ahmad al Ahmadi
Analysis of microarray data
Microarray Preprocessing
1 Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright © 2009 Dan Nettleton.
Image Quantitation in Microarray Analysis More tomorrow...
Scanning and Image Processing -by Steve Clough. GSI Lumonics cDNA microarrays use two dyes with well separated emission spectra such as Cy3 and Cy5 to.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
IMAGE INFORMATICS SOLUTIONS Extracting Information From Images Array-Pro 4.5 Training, May 2003.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
1 Two Color Microarrays EPP 245/298 Statistical Analysis of Laboratory Data.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Microarray Technology
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
Agenda Introduction to microarrays
Microarray - Leukemia vs. normal GeneChip System.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Blotting techniques are based Complementarity and Hybridization Blotting techniques are used to answer questions like oHow do we find genes of interest.
Computational Biology and Bioinformatics Lab. Songhwan Hwang Functional Genomics DNA Microarray Technology.
Final project – Computational Biology RNA Quantification מגישים: מיכל סימון חיים בן שימול בהנחיית: יהודה ברודי ד"ר ירון שב-טל.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
Lecturer: David. * Reverse transcription PCR * Used to detect RNA levels * RNA is converted to cDNA by reverse transcriptase * Then it is amplified.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Disease Diagnosis by DNAC MEC seminar 25 May 04. DNA chip Blood Biopsy Sample rRNA/mRNA/ tRNA RNA RNA with cDNA Hybridization Mixture of cell-lines Reference.
Microarray Data Analysis The Bioinformatics side of the bench.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Overview of Hybridization, Stringency, and Genechip Processing.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Other uses of DNA microarrays
Microarray: An Introduction
Microarray - Leukemia vs. normal GeneChip System.
Volume 1, Issue 1, Pages (February 2002)
The Basics of Microarray Image Processing
Overview of Hybridization, Stringency, and Genechip Processing
Overview of Hybridization, Stringency, and Genechip Processing
Overview of Hybridization, Stringency, and Genechip Processing
Presentation transcript:

Hybridization and data acquisition –Hybridization –Scanning –Image analysis –Background correction and filtering –Data transformation Methods for normalization

Hybridization Hybridization of DNA microarrays can be done in two different ways. The “classical” approach –placing labeled –denatured target on a slide –carefully covering the slide with a coverslip Skillfulness: the coverslip needs to be level to prevent gradients in hybridization and avoid trapped air bubbles. –The slide is then placed in a humid chamber and incubated at the hybridization temperature. to prevent desiccation during hybridization hybridization temperature –The slides are washed to remove unspecific bound target. More stringent washing steps are performed at the end of the washing procedure. –decreasing the ionic strength or increasing the washing temperature »Typical protocols use decreasing SSC buffer concentrations first with small concentrations of sodium dodecyl sulfate (SDS) »then without SDS. –The slides are finally dried by centrifugation. It is important to scan the arrays within several hours after hybridization because the fluorescence signal deteriorates with time.

Hybridization temperature hybridization temperature: –40 to 65°C for 5 to 12 h. –critical for oligonucleotide slides and has to be carefully optimized. start at a hybridization temperature 15°C below the mean melting temperature of the oligonucleotides used. –depends on the organism studied and the composition of the hybridization buffer saline sodium citrate (SSC) buffer with added detergent –Addition of Denhardt’s solution, sheared salmon sperm DNA, or tRNA reduces the background –The addition of formamide, dextran sulfate, or polyethylene glycol can improve binding of low-copy number transcripts

Hybridization Automatic array hybridization stations –They provide hassle-free hybridization and washing of the slides by running programmed protocols. –The results do not depend on the ability of the researcher and are very reproducible. –However, hybridization and washing conditions have to be fine-tuned in earlier experiments to the probes and slide chemistry used. They are therefore most adequate when a large number of arrays based on the same chemistry have to be handled identically. They are not well suited to deal with small number arrays based on varying chemistries.

Scanning The microarrays are scanned with microarray scanners. –Their appropriate driver and image analysis software determines the raw values. GenePix and ArrayVision software are examples of widely used softwares for image analysis and raw data acquisition. In principle, it is possible to scan standard-sized slides with any scanner. –Exceptions are a few slide types with a nonplanar surface that exclude confocal scanners. For successful data acquisition, a data file is needed that identifies the features and defines their dimensions and locations. –The GenePix array list (.gal) file format is often used for this purpose. The scanners mostly use lasers for exciting the surface of the hybridized microarray with a resolution of a few micrometers The resolution of scanning should be better than 10% of the spot size, that is, features of 150-μm size need to be scanned at least at 15 μm resolution. The fluorescence emitted from the dyes hybridized to the features is collected and quantified by photomultiplier tubes or charge-coupled device (CCD) cameras. Normally, the scanner generates gray-scale images of the fluorescence at 532 and 635 nm. The data are stored in a lossless tagged image file format (TIFF) that is used for quantification by image analysis.

A color depth of 2 byte is characteristic for most scanners, which means that each pixel can assume 65,535 different intensity levels. The sensitivity of the scanner has to be adjusted to ensure that most of the pixels in the picture do not saturate its dynamic range. It is convenient to roughly adjust the sensitivity of the scanner during a prescan so that constitutive controls result in roughly equal signals. For microarrays made from double-stranded DNA, this can also be done by spotting chromosomal DNA and adjusting these spots to a ratio of 1 during scanning.

Image analysis To quantify the fluorescence of the features via image analysis, pixels have to be assigned either to a spot or the background. This resulting boundary is often visualized in the software for acquiring raw values by a circle surrounding the feature and is then called a feature indicator. In most cases, the image analysis software allows the placement of the feature indicators in a semiautomatic or automatic manner. Even if an algorithm places the feature indicators automatically, it is advisable to manually control this placement. In real life, the spots might be irregular, or fluorescent impurities on the chip surface may confuse algorithms. It is common to define all pixels inside a feature indicator as foreground and all adjacent pixels within a radius of three times the feature diameter as the local background. The next step is the quantification of the image data by calculating the arithmetic mean or better median of the intensities of the foreground and background pixels. This resulting data are stored in form of a table. Common spreadsheet programs and all sorts of commercial and free software tools can be used for the next steps of data analysis.

Background correction and filtering The first and most important of these quality assessments is to exclude features with intensity smaller than the background or assign them a “floor” value, which is often the local or global background. –The assignment of a floor value allows interpretation of genes that are transcribed at one condition but are totally switched off at the other condition. –This is not uncommon with bacteria where some operons are specifically induced by an inducer but are not transcribed in its absence.

Data transformation The untransformed values of expression ratios have the disadvantage of treating up- and downregulated genes differently. That means that a fourfold upregulated gene has an expression ratio of 4, whereas a fourfold downregulated gene has an expression ratio of To circumvent this problem, the expression ratios are often handled as their logarithms to the base 2. This results in the values 2 and (−2) for a fourfold up- and downregulated gene, which has a number of practical advantages.

Methods for normalization Systematic biases –different amounts of RNA used for labeling –different incorporation efficiencies of the Cy-3 and Cy-5 dyes in the labeling protocols –different detection efficiencies of the dyes Method 1: Assumption: the total sum of intensities should be equal in both channels, and therefore, the ratio between them should be one. –A normalization factor is calculated from overall ratio, and ratios for all features are scaled accordingly. A somehow similar approach uses spotted chromosomal DNA for normalization, although this strategy is confined to microarrays made by spotting double-stranded DNA linear regression analysis the intensity-dependent locally weighted linear regression (LOWESS) normalization that corrects for intensity-dependent effects sometimes observed in the data. they imply statistically that there must be a downregulated gene for every upregulated one. –This assumption might be more appropriate from a statistical point of view with large eukaryotic genomes. –It is not necessarily correct for bacteria because the number of genes is much smaller.