GeneChips and Microarray Expression Data

Slides:



Advertisements
Similar presentations
Experiment Design for Affymetrix Microarray.
Advertisements

NASC Normalisation and Analysis of the Affymetrix Data David J Craigon.
Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
Bioconductor in R with a expectation free dataset Transcriptomics - practical 2012.
Modeling sequence dependence of microarray probe signals Li Zhang Department of Biostatistics and Applied Mathematics MD Anderson Cancer Center.
1. Principles and important terminology 2. RNA Preparation and quality controls 3. Data handling 4. Costs 5. Protocols 6. Information for collaboration.
Introduction to Microarray
Gene Expression Index Stat Outline Gene expression index –MAS4, average –MAS5, Tukey Biweight –dChip, model based, multi-array –RMA, model.
Microarray Normalization
Evaluation of Affymetrix array normalization procedures based on spiked cRNAs Andrew Hill Expression Profiling Informatics Genetics Institute/Wyeth-Ayerst.
DNA microarray and array data analysis
Identification of spatial biases in Affymetrix oligonucleotide microarrays Jose Manuel Arteaga-Salas, Graham J. G. Upton, William B. Langdon and Andrew.
1 Preprocessing for Affymetrix GeneChip Data 1/18/2011 Copyright © 2011 Dan Nettleton.
Central Dogma 2 Transcription mRNA Information stored In Gene (DNA) Translation Protein Transcription Reverse Transcription SELF-REPAIRING ARABIDOPSIS,
Theoretical and experimental comparisons of gene expression indexes for oligonucleotide microarrays Department of Biostatistics, University pf North Carolina,
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
5 µm Millions of copies of a specific oligonucleotide probe >5 760,000 different complementary probes ~ targets Single stranded, labeled ‘target’
Information Aspects of Nucleic Acids Measurement Technologies Description of nucleic acid measurement technologies Algorithmic, optimization, data analysis.
Felix Naef & Marcelo Magnasco, GL meeting, Nov Outline Background subtraction Probeset statistics Excursions into.
A snapshot that captures the activity
Introduce to Microarray
Affymetrix GeneChip Data Analysis Chip concepts and array design Improving intensity estimation from probe pairs level Clustering Motif discovering and.
Introduction to DNA microarrays DTU - January Hanne Jarmer.
Theoretical and experimental comparisons of gene expression indexes for oligonucleotide microarrays Division of Human Cancer Genetics Ohio State University.
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
and analysis of gene transcription
Analysis of microarray data
Summaries of Affymetrix GeneChip probe level data By Rafael A. Irizarry PH 296 Project, Fall 2003 Group: Kelly Moore, Amanda Shieh, Xin Zhao.
Microarray Preprocessing
with an emphasis on DNA microarrays
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Affymetrix GeneChips Oligonucleotide.
Affymetrix vs. glass slide based arrays
DNA Microarrays M. Ahmad Chaudhry, Ph. D..
Introduction to Gene Chips and Microarray Expression Data
es/by-sa/2.0/. Large Scale Approaches to the Study of Gene Expression Prof:Rui Alves Dept.
Affymetrix GeneChips and Analysis Methods Neil Lawrence.
Data Type 1: Microarrays
Panu Somervuo, March 19, cDNA microarrays.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Introduction to DNA microarrays DTU - May Hanne Jarmer.
Microarrays A snapshot that captures the activity pattern of thousands of genes at once. Custom spotted arrays Affymetrix GeneChip.
Microarray - Leukemia vs. normal GeneChip System.
Bioconductor in R with a expectation free dataset Transcriptomics - practical 2014.
Scenario 6 Distinguishing different types of leukemia to target treatment.
CS491JH: Data Mining in Bioinformatics Introduction to Microarray Technology Technology Background Data Processing Procedure Characteristics of Data Data.
Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Intro to Microarray Analysis Courtesy of Professor Dan Nettleton Iowa State University (with some edits)
Comparison of Microarray Data Generated from Degraded RNA using Five Different Target Synthesis Methods and Commercial Microarrays Scott Tighe and Tim.
Lecture Topic 5 Pre-processing AFFY data. Probe Level Analysis The Purpose –Calculate an expression value for each probe set (gene) from the PM.
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
Model-based analysis of oligonucleotide arrays, dChip software Statistics and Genomics – Lecture 4 Department of Biostatistics Harvard School of Public.
GeneChip® Probe Arrays
Introduction to Microarrays.
Whole Genome Approaches to Cancer 1. What other tumor is a given rare tumor most like? 2. Is tumor X likely to respond to drug Y?
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Introduction to Microarrays. The Central Dogma.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Disease Diagnosis by DNAC MEC seminar 25 May 04. DNA chip Blood Biopsy Sample rRNA/mRNA/ tRNA RNA RNA with cDNA Hybridization Mixture of cell-lines Reference.
Statistical Analyses of High Density Oligonucleotide Arrays Rafael A. Irizarry Department of Biostatistics, JHU (joint work with Bridget Hobbs and Terry.
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data Rafael A. Irizarry Department of Biostatistics, JHU (joint.
Introduction to Oligonucleotide Microarray Technology
Arrays How do they work ? What are they ?. WT Dwarf Transgenic Other species Arrays are inverted Northerns: Extract target RNA YFG Label probe + hybridise.
Microarray - Leukemia vs. normal GeneChip System.
Microarray Technology and Applications
The Basics of Microarray Image Processing
Lecture 3 From Images to Data
Presentation transcript:

GeneChips and Microarray Expression Data David Paoletti

The Problem Determine gene expression (activity) What proteins are being produced by a group of cells?

The Assumption The RNA present in the cell determines what proteins are being produced Efficiency

The Why Understanding Toxicology Drug design Evaluation Specificity Response

What is a GeneChip? 1.28 x 1.28 cm glass wafer  500,000 features 24 x 24 m probe site 25 mer oligo, complementary PM: perfect match MM: mismatch  2.5 M copies GeneChip

The Solution

The Gains Speed Possibility Sensitivity Reproducibility

The Process Poly-A RNA Biotin-labeled Antisense cRNA Cells cDNA IVT L AAAA Fragment (heat, Mg2+) Labeled fragments Hybridize Wash/stain Scan L L L

Hybridization and Staining GeneChip Biotin Labeled cRNA Hybridized Array L L L L + L L L + L L L L SAPE Streptavidin- phycoerythrin L

Specialized Equipment

How Features Are Chosen 5’ Gene Sequence 3’ Multiple oligo probes 25 mers Perfect Match Mismatch

Feature Values Remove outermost rows and columns 83 112 96 32 47 382 165 87 55 246 140 93 104 552 187 65 Remove outermost rows and columns Find 75th percentile of remaining values This value is taken as representative of this feature

Background Noise Removal The array is divided into 16 equal sectors For each sector Find the lowest 2% of the feature intensities Average these Subtract this average from the intensity value of all features in the sector

Noise Calculation

Average Difference Intensity For a given gene For each probe pair for the given gene Calculate the difference PM-MM Calculate ,  for this set If abs( (PM – MM) - )  3, delete from set Remaining set is pairs in avg

Positive & Negative Probe Pairs PM-MM  SDT PM/MM  SRT MM-PM  SDT MM/PM  SRT If both true, mark as positive If both true, mark as negative SDT = Q · STDmult By default, SRT = 1.5, STDmult = 2.0 (low density), 4.0 (high)

Voting Methods for Absolute Call Positive/negative ratio PNR = #pos / #neg Positive fraction PF = #pos / #used Log average ratio

Decision Matrix Absent Marginal Present PNR 3.00 4.00 PF 0.33 0.43 LA 0.90 1.30

Average Difference and Absolute Call Which of these do you base a decision on, for whether a gene is being expressed? Use the absolute call for decision Use average difference to compare those which are present

Conclusions Incredible amalgam of biological and computational processes Allows analyses that would not be performed otherwise Already of proven worth

References Moore, S K; Making chips to probe genes, IEEE Spectrum, March 2001, 54-60. GeneChip Gene Expression Algorithm Training, Part I: Absolute Analysis; Affymetrix. Berberich, S, and McGorry, M; GeneChip protocols; Wright State University.