DNA, Gene, and Genome Translating Machinery for Genetic Information.


DNA, Gene, and Genome

Translating Machinery for Genetic Information

Transcription factors mRNA levels

Automated DNA Sequencing

Data Increase (from NCBI web site)

Partial Display of Human Draft Sequence (Nature, 2001)

Human Genome Map at NCBI

MGALRPTLLPPSLPLLLLLMLGMGCWAREVLVPEGPLYRVAGTAVSISCNVTGY EGPAQQNFEWFLYRPEAPDTALGIVSTKDTQFSYAVFKSRVVAGEVQVQRLQGD AVVLKIARLQAQDQGIYECTPSTDTRYLGSYSGKVELRVLPDVLQVSAAPPGPR GRQAPTSPPRMTVHEGQELALGCLARTSTQKHTHLAVSFGRSVPEAPVGRSTLQ EVVGIRSDLAVEAGAPYAERLAAGELRLGKEGTDRYRMVVGGAQAGDAGTYH CTAAEWIQDPDGSWAQIAEKRAVLAHVDVQTLSSQLAVTVGPGERRIGPGEPLE LLCNVSGALPPAGRHAAYSVGWEMAPAGAPGPGRLVAQLDTEGVGSLGPGYE GRHIAMEKVASRTYRLRLEAARPGDAGTYRCLAKAYVRGSGTRLREAASARSR PLPVHVREEGVVLEAVAWLAGGTVYRGETASLLCNISVRGGPPGLRLAASWWV ERPEDGELSSVPAQLVGGVGQDGVAELGVRPGGGPVSVELVGPRSHRLRLHSL GPEDEGVYHCAPSAWVQHADYSWYQAGSARSGPVTVYPYMHALDTLFVPLL VGTGVALVTGATVLGTITCCFMKRLRKR KDa Protein interacting with prostate cancer suppressor

Molecular biology databases
Sequence databases
– Annotated
– Low-annotation
– Specialized
Structural databases
Motif databases
Genome databases
Proteome databases
RNA expression
Literature
Populations
Mutations
Polymorphisms
Organisms
Pathways

Promoters, ESTs, Tissues and cells, Genome maps, DNA sequences, Molecular Phylogeny, Protein sequences, Protein structures, DNA motifs, Protein motifs, Substrates, Metabolic pathways, Transcription Factors, RNA expression, Mutations/polymorphisms, Gene Family

Database formats
Relational databases
– GDB, GSDB, MGD etc.
– Vendors: Sybase, Oracle etc.
Flat file databases
– GenBank, SWISS-PROT etc.
Object-oriented databases
– ACeDB, AtDB etc.

Molecular biology data types: Organisms, Genome maps
Mouse chromosome X from the Mouse Genome Informatics project

Molecular biology data types: Organisms, Genome maps, DNA sequences, RNA sequences
...AATGGTACCGATGACCTGGAGCTTGGTTCGA...

Molecular biology data types: Organisms, Genome maps, DNA sequences, RNA sequences, Protein sequences
...TRLRPLLALLALWPPPPARAFVNQHLCGSHLVEA...

Molecular biology data types: Organisms, Genome maps, DNA sequences, RNA sequences, Protein sequences, Protein structures, RNA structures
PDB entry 1CIS (P. Osmark, P. Sorensen, F. M. Poulsen)

Molecular biology data types: Organisms, Genome maps, DNA sequences, RNA sequences, Protein sequences, Protein structures, DNA motifs, Protein motifs, RNA expression, RNA structures

DNA microarrays measure variations in RNA levels
The full yeast genome on a chip (DeRisi et al., Science 278:680)
Red dots: genes whose RNA level increased
Green dots: genes whose RNA level decreased

Substrates for High Throughput Arrays
Nylon membrane: single label (33P)
Glass slides: dual label (Cy3, Cy5)
GeneChip: single label (biotin, streptavidin)

GeneChip® Probe Arrays
[Figure: GeneChip Probe Array (1.28 cm); Hybridized Probe Cell (24 µm) with millions of copies of a specific oligonucleotide probe; >200,000 different complementary probes; single stranded, labeled RNA target bound to oligonucleotide probe; image of hybridized probe array]

GeneChip® Expression Array Design
Gene sequence (5´ to 3´)
Multiple oligo probes
Probes designed to be Perfect Match
Probes designed to be Mismatch
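
The Perfect Match / Mismatch design suggests a simple way to summarize one gene: subtract each Mismatch intensity from its paired Perfect Match intensity and average over the probe pairs. The following is only a minimal sketch of that idea. The function name and the intensity values are hypothetical, and it is not claimed to be the algorithm used by the vendor's software.

```python
from statistics import mean


def average_difference(pm, mm):
    """Mean of (PM - MM) over the probe pairs of one probe set.

    pm, mm: equal-length lists of intensities for the Perfect Match
    and Mismatch probes (illustrative values, not real data).
    """
    if len(pm) != len(mm) or not pm:
        raise ValueError("pm and mm must be non-empty and of equal length")
    return mean(p - m for p, m in zip(pm, mm))


# Hypothetical intensities for four probe pairs of a single gene.
pm_intensities = [1200.0, 980.0, 1500.0, 1100.0]
mm_intensities = [300.0, 410.0, 350.0, 280.0]
print(average_difference(pm_intensities, mm_intensities))
```

A large positive value indicates specific hybridization; a value near zero suggests that most of the signal is cross-hybridization also picked up by the Mismatch probes.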

Procedures for Target Preparation
1. Cells
2. Poly (A)+ / Total RNA
3. cDNA
4. IVT (Biotin-UTP, Biotin-CTP): labeled transcript
5. Fragment (heat, Mg2+): labeled fragments
6. Hybridize (16 hours)
7. Wash & Stain
8. Scan

Microarray Technology

NSF Soybean Functional Genomics Steve Clough / Vodkin Lab Printing Arrays on 50 slides

Ratio of expression of genes from two sources
Cells from condition A, cells from condition B
Total RNA or mRNA
cDNA: Label Dye 1, Label Dye 2
Mix
Result per gene: equal, over, under
NSF / U of Illinois Microarray Workshop - Steve Clough / Vodkin Lab
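
The equal / over / under outcome of a two-dye comparison can be sketched as a log2 ratio with a cutoff. This is an illustration under stated assumptions: the function names, the intensity values, and the two-fold cutoff (threshold of 1.0 on the log2 scale) are hypothetical and do not come from the slides.

```python
import math


def log2_ratio(sample_a, sample_b):
    """log2 of the intensity ratio A / B for one spot (both must be > 0)."""
    return math.log2(sample_a / sample_b)


def expression_call(sample_a, sample_b, threshold=1.0):
    """Label a spot as 'over', 'under' or 'equal' in A relative to B.

    threshold is on the log2 scale; 1.0 means a two-fold change
    (an assumed cutoff chosen only for illustration).
    """
    r = log2_ratio(sample_a, sample_b)
    if r >= threshold:
        return "over"
    if r <= -threshold:
        return "under"
    return "equal"


# Hypothetical background-corrected intensities for three spots.
for a, b in [(2000.0, 500.0), (500.0, 2000.0), (1000.0, 1100.0)]:
    print(a, b, expression_call(a, b))
```

Working on the log scale makes up- and down-regulation symmetric: a four-fold increase gives +2 and a four-fold decrease gives -2.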

GSI Lumonics NSF Soybean Functional Genomics Steve Clough / Vodkin Lab

Cattle and Soy Controls
Spots: Beta Actin, PKG, HPRT, Beta 2 microglobulin, Rubisco, AB binding protein, Major latex protein homologue (MSG)
Array of cattle and soy spiking controls. 50 µg of cattle brain total RNA was labeled with Cy3 (green). 1 µl each of in vitro transcribed soy Rubisco (5 ng), AB binding protein (0.5 ng) and MSG (0.05 ng) were labeled with Cy5. The two labeled samples were cohybridized on superamine slides (Telechem, Inc.). To the right of each set of spots are five negative controls (water).

Fetal Spleen-Cy3, Adult Spleen-Cy5
Labeled spots: IgM, IgM heavy chain, MYLK, COL1A2

Placenta vs. Brain – 3800 Cattle Placenta Array (Cy3, Cy5)
GenePix Image Analysis Software

Microarray Data Process
1. Experimental Design
2. Image Analysis – raw data
3. Normalization – “clean” data
4. Data Filtering – informative data
5. Model building
6. Data Mining (clustering, pattern recognition, etc.)
7. Validation
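
Steps 3 and 4 of the process above can be sketched in a few lines. The sketch assumes global median centering as the normalization and an absolute cutoff as the filter. Both choices, the function names, the cutoff of 0.3, and the input values are illustrative assumptions rather than the method used in the presentation.

```python
from statistics import median


def median_center(log_ratios):
    """Normalization: shift log ratios so that their median is zero."""
    m = median(log_ratios)
    return [x - m for x in log_ratios]


def filter_informative(log_ratios, cutoff=0.3):
    """Data filtering: keep values whose magnitude exceeds the cutoff."""
    return [x for x in log_ratios if abs(x) > cutoff]


# Hypothetical raw log ratios for five genes on one array.
raw = [0.5, 1.0, 1.5, 0.0, 2.5]
clean = median_center(raw)
informative = filter_informative(clean)
print(clean)
print(informative)
```

Median centering rests on the assumption that most genes do not change between the two samples, so the bulk of the log ratios should sit around zero.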

Scatterplot of Normalized Data Adult Fetal

> 0.3, < -0.3

Complexity Levels of Microarray Experiments:
1. Compare genes in a control situation versus a treatment situation
Example: Is the level of expression (up-regulated or down-regulated) significantly different in the two situations? (drug design application)
Methods: t-test, Bayesian approach
2. Find multiple genes that share common functionalities
Example: Which related genes depend on one another?
Methods: Clustering (hierarchical, k-means, self-organizing maps, neural network, support vector machines)
3. Infer the underlying gene and protein networks that are responsible for the patterns and functional pathways observed
Example: What is the gene regulation at system level?
Directions: mining regulatory regions, modeling regulatory networks on a global scale
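
For the first level, a per-gene t-test between control and treatment can be sketched as follows. The sketch computes Welch's t statistic (the unequal-variance form) using only the standard library. The choice of Welch's variant, the function name, and the expression values are assumptions made for illustration. Turning the statistic into a p-value requires the t distribution, which is omitted here.

```python
import math
from statistics import mean, variance


def welch_t(control, treated):
    """Welch's t statistic for one gene: treated mean minus control mean,
    divided by the standard error built from the two sample variances."""
    n1, n2 = len(control), len(treated)
    se = math.sqrt(variance(control) / n1 + variance(treated) / n2)
    if se == 0:
        raise ValueError("both groups have zero variance")
    return (mean(treated) - mean(control)) / se


# Hypothetical log expression values: gene -> (control, treated) replicates.
genes = {
    "gene_1": ([1.0, 2.0, 3.0], [4.0, 5.0, 6.0]),
    "gene_2": ([2.0, 2.5, 3.0], [2.1, 2.4, 3.2]),
}
# Rank genes by the magnitude of the statistic, largest first.
ranked = sorted(genes, key=lambda g: abs(welch_t(*genes[g])), reverse=True)
print(ranked)
```

With thousands of genes tested at once, some large statistics arise by chance, so a multiple-testing correction is normally applied before calling genes significant.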

Comparing data from two experiments.

NO DRUG, 1 nM Drug, 1 µM Drug
Statistical filters used: genes present (Presence Call in Affymetrix) in drug-treated samples, ANOVA p<0.02 between groups. Red indicates increased expression and green indicates decreased expression (log(fold change)). Genesight 3 (Biodiscovery Software). Clustering to extract genes that tightly co-express.
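
One simple way to find genes that tightly co-express is to correlate each gene's profile across the conditions with the profile of a chosen seed gene. The sketch below uses Pearson correlation with a threshold. It stands in for the clustering step and is not the method implemented in the software named above. Gene names, profiles, and the 0.9 threshold are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def coexpressed(profiles, seed, min_r=0.9):
    """Genes whose profile correlates with the seed gene at or above min_r."""
    return sorted(
        g for g, p in profiles.items()
        if g != seed and pearson(profiles[seed], p) >= min_r
    )


# Hypothetical log fold changes across three conditions
# (no drug, low dose, high dose).
profiles = {
    "gene_a": [0.0, 1.0, 2.0],
    "gene_b": [0.1, 1.1, 2.3],
    "gene_c": [2.0, 1.0, 0.0],
    "gene_d": [1.0, 3.0, 1.0],
}
print(coexpressed(profiles, "gene_a"))
```

Correlation compares the shape of the profiles rather than their magnitude, so two genes with different absolute levels can still group together.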

NO DRUG, 1 nM Drug, 1 µM Drug
Statistical filters used: genes present (Presence Call in Affymetrix) in the absence of drug, ANOVA p<0.02 between groups.
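
The ANOVA filter compares, for each gene, the variation between the treatment groups with the variation within them. A minimal sketch of the one-way ANOVA F statistic follows. The function name and the replicate values are hypothetical. Converting F into a p-value for a cutoff such as p<0.02 needs the F distribution, which is not implemented here (SciPy's `scipy.stats.f_oneway` returns both the statistic and the p-value).

```python
from statistics import mean


def anova_f(groups):
    """One-way ANOVA F statistic for one gene.

    groups: list of lists, one list of replicate values per condition.
    """
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand = sum(sum(g) for g in groups) / n
    # Between-group and within-group sums of squares.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    if ss_within == 0:
        raise ValueError("no within-group variation")
    return (ss_between / (k - 1)) / (ss_within / (n - k))


# Hypothetical replicates for one gene under three conditions.
no_drug = [1.0, 2.0, 3.0]
low_dose = [2.0, 3.0, 4.0]
high_dose = [5.0, 6.0, 7.0]
print(anova_f([no_drug, low_dose, high_dose]))
```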

Self Organizing Maps

Molecular Classification of Cancer

Gene Expression Profile of Aging and Its Retardation by Caloric Restriction Cheol-Koo Lee, Roger G. Klopp, Richard Weindruch, Tomas A. Prolla

Data Mining Methods
Classification, Regression (Predictive Modeling)
Clustering (Segmentation)
Association Discovery (Summarization)
Change and deviation detection
Dependency Modeling
Information Visualization