Gene Expression BMI 731 Winter 2005 Catalin Barbacioru Department of Biomedical Informatics Ohio State University.

Slides:



Advertisements
Similar presentations
Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
Advertisements

Modeling sequence dependence of microarray probe signals Li Zhang Department of Biostatistics and Applied Mathematics MD Anderson Cancer Center.
Introduction to Microarray
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Statistics for Microarrays
Gene Expression Chapter 9.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
DNA microarray and array data analysis
Microarrays Dr Peter Smooker,
The Human Genome Project and ~ 100 other genome projects:
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Central Dogma 2 Transcription mRNA Information stored In Gene (DNA) Translation Protein Transcription Reverse Transcription SELF-REPAIRING ARABIDOPSIS,
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Alternative Splicing As an introduction to microarrays.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
Gene Expression BMI 731 week 5
A snapshot that captures the activity
Introduce to Microarray
Gene Expression Data Analyses (1) Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
B IOINFORMATICS Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 8 Analyzing Microarray Data Aleppo University Faculty of technical.
with an emphasis on DNA microarrays
Gene expression and the transcriptome I
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Lecture 22 Introduction to Microarray
CDNA Microarrays MB206.
Data Type 1: Microarrays
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Microarray Technology
Technology for Systems Biology. Nucleic Acid Hybridization In principle complementary strands will associate Chemistry is quite different on surfaces.
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
Genomics I: The Transcriptome
GeneChip® Probe Arrays
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
Introduction to Microarrays.
Lecture 7. Functional Genomics: Gene Expression Profiling using
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Introduction to Microarrays. The Central Dogma.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Lecture 23 – Functional Genomics I Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Gene expression and DNA microarrays No lab on Thursday. No class on Tuesday or Thursday next week –NCBI training Monday and Tuesday –Feb. 5 during class.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Functional Genomics Carol Bult, Ph.D. Course coordinator The Jackson Laboratory Winter/Spring 2011 Keith Hutchison, Ph.D. Course co-coordinator.
Introduction to Oligonucleotide Microarray Technology
Microarray: An Introduction
MICROARRAY. Microarray  A multiplex lab-on-a-chip  A 2D array on a solid substrate (Usually a glass slide or silicon thin-film cell) that assays large.
Detecting DNA with DNA probes arrays. DNA sequences can be detected by DNA probes and arrays (= collection of microscopic DNA spots attached to a solid.
The Central Dogma. Life - a recipe for making proteins DNA protein RNA Translation Transcription.
Microarray - Leukemia vs. normal GeneChip System.
Microarray Technology and Applications
Overview Expression data basics Introduction Biological network data
Introduction to Microarrays.
Data Type 1: Microarrays
Presentation transcript:

Gene Expression BMI 731 Winter 2005 Catalin Barbacioru Department of Biomedical Informatics Ohio State University

Thesis: the analysis of gene expression data is going to be big in 21st century statistics Many different technologies, including Spotted DNA arrays (Brown/Botstein) Short oligonucleotide arrays (Affymetrix) Serial analysis of gene expression (SAGE) Long oligo arrays (Agilent) Fibre optic arrays (Illumina)

(projected) Year Number of papers Total microarray articles indexed in Medline

Common themes Parallel approach to collection of very large amounts of data (by biological standards) Sophisticated instrumentation, requires some understanding Systematic features of the data are at least as important as the random ones Often more like industrial process than single investigator lab research Integration of many data types: clinical, genetic, molecular…..databases

Central dogma The expression of the genetic information stored in the DNA molecule occurs in two stages: (i) transcription, during which DNA is transcribed into mRNA; (ii) translation, during which mRNA is translated to produce a protein. DNA → mRNA → protein Other important aspects of gene regulation: methylation, alternative splicing.

Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be better, but is currently harder.

DNA microarrays represent an important new method for determining the complete expression profile of a cell. Monitoring gene expression lies at the heart of a wide variety of medical and biological research projects, including classifying diseases, understanding basic biological processes, and identifying new drug targets.

Affymetrix ® Instrument System Platform for GeneChip ® Probe Arrays Integrated Integrated Easy to use Easy to use Exportable Exportable Versatile Versatile

Photolithography

Synthesis of Ordered Oligonucleotide Arrays O O O O OO O O O O Light (deprotection) HO HO O O O T T O O OT T O O O T T C C OT T C C O Light (deprotection) T T O O OT T O O O C A T A TC A T A T A G C T GA G C T G T T C C GT T C C G Mask Substrate Mask Substrate T – C – REPEAT

Affymetrix GeneChip arrays

GeneChip ® Probe Arrays 24µm Millions of copies of a specific oligonucleotide probe Image of Hybridized Probe Array Image of Hybridized Probe Array >200,000 different complementary probes Single stranded, labeled RNA target Oligonucleotide probe * * * * *1.28cm GeneChip Probe Array Hybridized Probe Cell

Perfect Match (PM) Mis Match (MM) Control log(PM / MM) = difference score All significant difference scores are averaged to create “average difference” = expression level of the gene. Each pixel is quantitated and integrated for each oligo feature (range 0-25,000) Analysis of expression level from probe sets

each oligo sequence (20-25 mer) is synthesized as a 20 µ square (feature) each feature contains > 1 million copies of the oligo scanner resolution is about 2 µ (pixel) each gene is quantitated by oligos and compared to equal # of mismatched controls 22,000 genes are evaluated with 20 matching oligos and 10 mismatched oligos = 480,000 features/chip 480,000 features are photolithographically synthesized onto a 2 x 2 cm glass substrate Analysis of expression level from probe sets

Affymetrix arrays Global views of gene expression are often essential for obtaining comprehensive pictures of cell function. For example, it is estimated that between 0.2 to 10% of the 10,000 to 20,000 mRNA species in a typical mammalian cell are differentially expressed between cancer and normal tissues. Whole-genome analyses also benefit studies where the end goal is to focus on small numbers of genes, by providing an efficient tool to sort through the activities of thousands of genes, and to recognize the key players. In addition, monitoring multiple genes in parallel allows the identification of robust classifiers, called "signatures", of disease. Global analyses frequently provide insights into multiple facets of a project. A study designed to identify new disease classes, for example, may also reveal clues about the basic biology of disorders, and may suggest novel drug targets.

Spotted DNA microarrays In ‘‘spotted’’ microarrays, slides carrying spots of target DNA are hybridized to fluorescently labeled cDNA from experimental and control cells and the arrays are imaged at two or more wavelengths Expression profiling involves the hybridization of fluorescently labeled cDNA, prepared from cellular mRNA, to microarrays carrying thousands of unique sequences. Typically, a set of target DNA samples representing different genes is prepared by PCR and transferred to a coated slide to form a 2-D array of spots with a center-to-center distance (pitch) of about 200 μm, providing a pan-genomic profile in an area of 3 cm2 or less. cDNA samples from experimental and control cells are labeled with different color fluors (cytochrome Cy5 and Cy3) and hybridized simultaneously to microarrays, and the relative levels of mRNA for each gene are then determined by comparing red and green signal intensities

Spotted DNA microarrays Scanning Technology Microarray slides are imaged with a modified fluorescence microscope designed for scanning large areas at high resolution (arrayWoRx, Applied Precision, Issaquah, WA, Affymetrix). Fluorescence illumination are obtained from a metal halide arc lamp focused onto a fiber optic bundle, the output of which is directed at the microarray slide and emission recorded through a microscope objective (Nikon) onto a cooled CCD (charge-coupled device) camera. Interference filters are used to select the excitation and emission wavelengths corresponding to the Cy3 and Cy5 fluorescent probes (Amersham Pharmacia). Each image covered a 2.4 x 2.4 mm area of the slide at 5-μm resolution. To scan the entire microarray, a series of images (‘‘panels’’) were acquired by moving the slide under the microscope objective in 2.4-mm increments.

s/genomics/chip/chip.swf

The red/green ratios can be spatially biased. Top 2.5%of ratios red, bottom 2.5% of ratios green

Spotted vs. Affymetrix Arrays Affymetrix strengths: - highly reliable: synthesized in situ - highly reproducible from run to run - no clone maintenance or ‘drift’ - sealed fluidics and controlled temperature - standardized chips increase database power - excellent scanner - complex, but very reliable labelling - excellent cost/benefit ratio - amenable to mutation and SNP detection

Affymetrix weaknesses/limitations - not easily customized: $300K/chip - high labeling cost $170/chip - high per chip cost $350 to $ limited choice of species - requires knowledge of sequence - not designed for competitive protocols

Limitations to all microarrays - dynamic range of gene expression: very difficult to simultaneously detect low and high abundance genes accurately - each gene has multiple splice variants 2 splice variants may have opposite effects (i.e. trk) arrays can be designed for splicing, but complexity ^ 5X - translational efficiency is a regulated process: mRNA level does not correlate with protein level - proteins are modified post-translationally glycosylation, phosphorylation, etc. - pathogens might have little ‘genomic’ effect

Biological question Differentially expressed genes Sample class prediction etc. Testing Biological verification and interpretation Microarray experiment Estimation Experimental design Image analysis Normalization Clustering Discrimination R, G 16-bit TIFF files (Rfg, Rbg), (Gfg, Gbg)