CS491JH: Data Mining in Bioinformatics Introduction to Microarray Technology Technology Background Data Processing Procedure Characteristics of Data Data.

Slides:



Advertisements
Similar presentations
Analysis of Microarray Genomic Data of Breast Cancer Patients Hui Liu, MS candidate Department of statistics Prof. Eric Suess, faculty mentor Department.
Advertisements

Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
Modeling sequence dependence of microarray probe signals Li Zhang Department of Biostatistics and Applied Mathematics MD Anderson Cancer Center.
Introduction to Microarray
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Gene Expression Chapter 9.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
DNA microarray and array data analysis
Microarray analysis Golan Yona ( original version by David Lin )
Central Dogma 2 Transcription mRNA Information stored In Gene (DNA) Translation Protein Transcription Reverse Transcription SELF-REPAIRING ARABIDOPSIS,
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
5 µm Millions of copies of a specific oligonucleotide probe >5 760,000 different complementary probes ~ targets Single stranded, labeled ‘target’
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Alternative Splicing As an introduction to microarrays.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
What are microarrays? Microarrays consist of thousands of oligonucleotides or cDNAs that have been synthesized or spotted onto a solid substrate (nylon,
Introduce to Microarray
Affymetrix GeneChip Data Analysis Chip concepts and array design Improving intensity estimation from probe pairs level Clustering Motif discovering and.
Introduction to DNA microarrays DTU - January Hanne Jarmer.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
GeneChips and Microarray Expression Data
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
Analysis of microarray data
with an emphasis on DNA microarrays
Microarrays, RNAseq And Functional Genomics CPSC265 Matt Hudson.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
Evolva Biotech SA Microarray and Macro opportunities for Discovery informatics Head of Informatics Mobile.
‘Omics’ - Analysis of high dimensional Data
DNA, Gene, and Genome Translating Machinery for Genetic Information.
DNA microarrays to study gene expression Steve Clough
es/by-sa/2.0/. Large Scale Approaches to the Study of Gene Expression Prof:Rui Alves Dept.
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Data Type 1: Microarrays
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Microarray Technology
Introduction to DNA microarrays DTU - May Hanne Jarmer.
es/by-sa/2.0/. Large Scale Approaches to the Study of Gene Expression Prof:Rui Alves Dept.
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Genomics I: The Transcriptome
GeneChip® Probe Arrays
Topic intro slides More complete coverage of components involved in gene expression More information on expression technologies -what would the ideal chip.
MICROARRAY TECHNOLOGY
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Soybean Microarrays Microarray construction An Introduction By Steve Clough November 2005.
Gene Expression Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Genomic Signal Processing Dr. C.Q. Chang Dept. of EEE.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Introduction to Oligonucleotide Microarray Technology
Microarray: An Introduction
Detecting DNA with DNA probes arrays. DNA sequences can be detected by DNA probes and arrays (= collection of microscopic DNA spots attached to a solid.
Arrays How do they work ? What are they ?. WT Dwarf Transgenic Other species Arrays are inverted Northerns: Extract target RNA YFG Label probe + hybridise.
Microarray - Leukemia vs. normal GeneChip System.
Functional Genomics in Evolutionary Research
Microarray Technology and Applications
Presentation transcript:

CS491JH: Data Mining in Bioinformatics Introduction to Microarray Technology Technology Background Data Processing Procedure Characteristics of Data Data integration and Data mining

Substrates for High Throughput Arrays Nylon Membrane Glass SlidesGeneChip Single label P 33 Single label biotin streptavidin Dual label Cy3, Cy5

GeneChip ® Probe Arrays 24µm Millions of copies of a specific oligonucleotide probe Image of Hybridized Probe Array Image of Hybridized Probe Array >200,000 different complementary probes Single stranded, labeled RNA target Oligonucleotide probe * * * * *1.28cm GeneChip Probe Array Hybridized Probe Cell

GeneChip ® Expression Array Design GeneSequence Probes designed to be Perfect Match Probes designed to be Mismatch Multiple oligo probes 5´3´

Procedures for Target Preparation cDNA Fragment (heat, Mg 2+ ) LLLL Wash & Stain Scan Hybridize (16 hours) Labeled transcript Poly (A) + / Total RNA RNA AAAA IVT(Biotin-UTPBiotin-CTP) Labeled fragments L L L L Cells

Microarray Technology

NSF Soybean Functional Genomics Steve Clough / Vodkin Lab Printing Arrays on 50 slides

Cells from condition A Cells from condition B mRNA Label Dye 2 NSF / U of Illinois Microarray Workshop -Steve Clough / Vodkin Lab Ratio of expression of genes from two sources Label Dye 1 cDNA equaloverunder Mix Total or

GSI Lumonics NSF Soybean Functional Genomics Steve Clough / Vodkin Lab

Beta Actin PKG HPRT Beta 2 microglobulin Rubisco AB binding protein Major latex protein homologue (MSG) Cattle and Soy Controls Array of cattle and soy spiking controls. 50 ug of cattle brain total RNA was labeled with Cy3 (green). 1 ul each of in vitro transcribed soy Rubisco (5 ng), AB binding protein (0.5 ng) and MSG (0.05 ng) were labeled with Cy5. The two labeled samples were cohybridized on superamine slides (Telechem, Inc.). To the right of each set of spots are five negative controls (water).

IgM IgM heavy chain MYLK COL1A2 MYLK IgM Fetal Spleen-Cy3Adult Spleen-Cy5 IgM heavy chain

Placenta vs. Brain – 3800 Cattle Placenta Array cy3 cy5 GenePix Image Analysis Software

1.Experimental Design 2.Image Analysis – raw data 3.Normalization – “clean” data 4.Data Filtering – informative data 5.Model building 6.Data Mining (clustering, pattern recognition, et al) 7.Validation Microarray Data Process

Scatterplot of Normalized Data Adult Fetal

>0.3<-0.3

Characteristics of Data Data can be viewed as a NxM matrix (N >> M): N is the number of genes M is the number of data points for each gene Or Nx(M+K) K is the number of Features describing each gene(genome location, functional description, metabolic pathway et al)

Model for Data Analysis Gene Expression is a Dynamic Process Each Microarray Experiment is a snap shot of the process Need basic biological knowledge to build model For Example: Assumption – In most of experiments, only a small set of genes (100s/1000s) have been affected significantly.

Data Mining Data volumes are too large for traditional analysis methods Large number of records and high dimensional data Only small portion of data is analyzed Decision support process becomes more complex Functions of Data Mining Need for Data Mining Use the data to build predictors – prediction, classification, deviation detection, segmentation Generates more sophisticated summaries and reports to aid understanding of the data – find clusters, partitions in data

Data Mining Methods Classification, Regression (Predictive Modeling) Clustering (Segmentation) Association Discovery (Summarization) Change and deviation detection Dependency Modeling Information Visualization

Cholesterol Biosynthesis Cell Cycle Immediate Early Response Signaling and Angiogenesis Wound Healing and Tissue Remodeling Clustered display of data from time course of serum stimulation of primary human fibroblasts. Eisen et al. Proc. Natl. Acad. Sci. USA 95 (1998) pg 14865

Self Organizing Maps

Molecular Classification of Cancer

Gene Expression Profile of Aging and Its Retardation by Caloric Restriction Cheol-Koo Lee, Roger G. Klopp, Richard Weindruch, Tomas A. Prolla

Expression Landscape of cell-cycle regulated genes in yeast

Multi-dimension data visualization