Gene Expression Data Analyses (1) Trupti Joshi Computer Science Department 317 Engineering Building North 573-884-3528(O)

Slides:



Advertisements
Similar presentations
A Little More Advanced Biotechnology Tools
Advertisements

Introduction to Microarray Gene Expression
M. Kathleen Kerr “Design Considerations for Efficient and Effective Microarray Studies” Biometrics 59, ; December 2003 Biostatistics Article Oncology.
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Introduction to the design of cDNA microarray experiments Statistics 246, Spring 2002 Week 9, Lecture 1 Yee Hwa Yang.
Sandrine Dudoit1 Microarray Experimental Design and Analysis Sandrine Dudoit jointly with Yee Hwa Yang Division of Biostatistics, UC Berkeley
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Introduce to Microarray
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
and analysis of gene transcription
By Moayed al Suleiman Suleiman al borican Ahmad al Ahmadi
Analysis of microarray data
with an emphasis on DNA microarrays
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
‘Omics’ - Analysis of high dimensional Data
Lecture 22 Introduction to Microarray
CDNA Microarrays MB206.
Data Type 1: Microarrays
Panu Somervuo, March 19, cDNA microarrays.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
Microarray Technology
Agenda Introduction to microarrays
Literature reviews revised is due4/11 (Friday) turn in together: revised paper (with bibliography) and peer review and 1st draft.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Intro to Microarray Analysis Courtesy of Professor Dan Nettleton Iowa State University (with some edits)
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
Genomics I: The Transcriptome
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
MICROARRAY TECHNOLOGY
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
Introduction to Microarrays.
Lecture 7. Functional Genomics: Gene Expression Profiling using
Microarrays and Gene Expression Arrays
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
Introduction to Microarrays. The Central Dogma.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Lecture 23 – Functional Genomics I Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Gene expression and DNA microarrays No lab on Thursday. No class on Tuesday or Thursday next week –NCBI training Monday and Tuesday –Feb. 5 during class.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Other uses of DNA microarrays
Microarray: An Introduction
DNA Microarray. Microarray Printing 96-well-plate (PCR Products) 384-well print-plate Microarray.
Gene Expression Analysis
Microarray Technology and Applications
Lecture 11 By Shumaila Azam
Introduction to Microarrays.
Data Type 1: Microarrays
Presentation transcript:

Gene Expression Data Analyses (1) Trupti Joshi Computer Science Department 317 Engineering Building North (O)

Lecture Schedule for Gene Expression Analyses l Concept of microarray and experimental design for DNA microarray (9/6/05) l Data transformation and normalization for DNA microarray (9/8/05) l Statistical analysis for DNA microarray and Software comparison (9/13/05) l Clustering Techniques for DNA microarray (Dr. Dong Xu 9/15/05)

Lecture Outline l Central Dogma of Molecular Biology l Introduction to Gene Expression and Microarray l Experimental Design

Lecture Outline l Central Dogma of Molecular Biology l Introduction to Gene Expression and Microarray l Experimental Design

Central Dogma of Molecular Biology Gene Expression mRNA level Protein level

Lecture Outline l Central Dogma of Molecular Biology l Introduction to Gene Expression and Microarray l Experimental Design

Introduction: Gene Expression Same DNA in all cells, but only a few percent common genes expressed (house-keeping genes). A few examples: (1) Specialized cell: over-represented hemoglobin in blood cells. (2) Different stages of life cycle: hemoglobins before and after birth, caterpillar and butterfly. (3) Different environments: microbial in nutrient poor or rich environment. (4) Diversity of life.

Microarray is about gene expression. l All information about living being is coded in DNA as a set of genes. l Each gene contains structural information about protein sequence and regulatory information about protein expression. l Intermediate step between gene and protein is mRNA. l The concentration of mRNA is measured by microarray.

Problem l RNA levels and protein levels are not always directly correlated. l No mRNA no protein; Relation is not simple and not universal. l Functional genomics fill the gap between gene expression and organism function. l The meaning of life is hidden in gene expression value but it is not easy to get it out.

Eucaryote Gene Expression Control DNA Primary RNA transcript mRNA nucleuscytosol RNA transport control inactive mRNA degradation control translation control nucleus membrane transcriptional control protein inactive protein activity control RNA processing control Microarray  mRNA Mass-spec  protein

Principle of DNA Microarray l Complimentary hybridization is the basis of RNA measurement. å Base-pairing rules X DNA: A-T and G-C X RNA: A-U, G-C, G-U A--T G--C T--A C--G

Microarray Technology l Macroarray: sample spot sizes >= 300 microns l Microarray: typically < 200 microns å biochip, DNA chip, DNA microarray, gene array, genome array, gene chip

Initial Ideas of DNA Microarray Immunoassay Ekins, R. and F. W. Chu. Microarrays: their origins and applications. Trends in Biotech. 17:

Application of DNA Microarray Technology  Gene discovery  Biological mechanisms (gene regulatory network, etc.)  Disease diagnosis (cancer, infectious disease, etc.)  Drug discovery: Pharmacogenomics  Toxicological research: Toxicogenomics  Microbial diversity in the environment  …

Increasing Microarray Applications

Advantages and Disadvantages of Micoarray l Advantages: å High-throughput å Analyze gene expressions of different cells or from cells under different condition simultaneously l Disadvantages: å High noise å Relatively high cost

Categories of DNA Microarray  Probe based  cDNA microarray: cDNA (500~5,000 bases) as probe. 10, ,000 spots/slide.  Oligo microarray (Affimetrix Microarray): oligonucleotide (20~80- mer oligos) as probe. 200, ,000 spots/slide.  Dye based  Double label. For example, Cy3 and Cy5. X One sample is labeled with a “green” dye and the other with “red”. X Relative fluorescent intensity of red and green from the same spot.  Single label. X All samples are labeled with one color. X Absolute fluorescent intensity between different slides. X Does not control for the amount of DNA in each spot.

Chips  Typically a glass slide with cDNA or oligo  Printed by robot or synthesized by photo-lithography.  Typical arrays are 25x75 mm. Contains up to 500,000 probed gene fragments.

Probe Layout on Chips  Positive control  Genome DNA  House keeping genes  Negative control  Spots with cDNA from very different species  Blank spots  Spots with buffer  Samples  Technical replicates

Microarray Procedures RNA extraction cDNA prepration cDNA labeling Sample mixing Hybridization Scanning Image Analysis Data transformation and Normalization Statistical analysis Experimental Design Data interpretation

Molecular Interaction on microarray  1 molecule per square angstroms  Large molecules are easily to be folded by themselves  Short targets are better than large targets to interact with tethered oligos  Ideally, target and probe should have the same length  Molecules interaction are dynamic  Competitive hybridization

Lecture Outline l Central Dogma of Molecular Biology l Introduction to Gene Expression and Microarray l Experimental Design

Experimental Protocol l A. Synthesis of cDNA Synthesis of the second strand DNA l B. Labeling l C. Hybridization l D. Scanning

Rational for Experimental Design l Scientific constrains: å Scientific aims and their priorities l Physical constrains: å Number of slides å Amount of mRNA l Goal of an optimal design: Minimize costs from money, time l Maximize the useful information

Issues for Experimental Design l Scientific å Specific questions and their priorities. l Practical (logistic) å Types of mRNA samples: reference, control, treatment. å Amount of material available (mRNA, slides, dyes). l Other factors å The experimental process before hybridization: sample isolation, mRNA extraction, amplification, and labeling. å Controls planned: positive, negative, ratio, and so on. å Verification method: northern blot, reverse transcriptase (RT)-PCR, in situ hybridization, and so on.

Variability and Replicates l Gene expression level for one gene in different slides may not be the same l Replicates: å Technical replicates: the target mRNA is from the same pool (RNA extraction) X Reduce variability å Biological replicates: the target mRNA is from different individual extraction. X Obtain averages of independent data X Validate generalizations of conclusions l Variation within technical replicates are smaller than that within Biological replicates

Importance of Replicates

Graphical Representation of Design  Use directed graphs  Node: sample  Edge: hybridization, use Cy3  Cy5  Weight: replicates Cy3: green Cy5: red Cy3+Cy5: blue

Direct & Indirect Comparison  Compared objectives: T and C  Directive design: TC are on the same slide  Indirect design: TR and CR are on the same slides, respectively. But T and C are on different slides

Variance & Std Deviation l Variance The most common statistical measure of variability of a random quantity or random sample about its mean. Its scale is the square of the scale of the random quantity or sample. l Standard Deviation Standard deviation is the square root of the variance. It measures the spread of a set of observations. The larger the standard deviation is, the more spread out the observations are.

Variance for Indirect Design l For sample T and C: l Differential Expression l Direct design l Indirect design α and β are means of log intensities across slides for a typical gene.

Dye-swapped Replication  Two hybridizations for two mRNA samples are on the two slides, but dye swapped. For example, Cy3 for A and Cy5 for the first hybridization (slide 1), then C5 for A and Cy3 for the second hybridization (slide 2).  Advantage: reduce systematic bias (e.g. dye bias) Two sets of replications Dye-swapped replications

Reference Design It may not be feasible to perform direct design when experimental conditions are more than 3.

Factors in the design l Single factor l Two factors l Multiple factors

Single Factor Experiments

Time-course Experiments

2x2 factorial experiments

Lecture Outline l Central Dogma of Molecular Biology l Introduction to Microarray å Application å Advantage vs. Disadvantage å Chips å Microarray procedure l Experimental design å Rational å Variability and Replication å Graphical representation å Direct comparison and Indirect comparison å Dye swap å Reference design å Single-factor design å Multifactorial design

Reading Assignments l Suggested reading: å Yang, YH and T. Speed Design issues for cDNA microarray experiments. Nature Reviews, 3: å Statistical analysis of gene expression microarray data. Chapter 2. pp Chapman&Hall/CRC Press, 2003.