Introduction to the EMERALD Dataset  Ron Peterson  Anne Bergstrom Lucas, Agilent  Jean Lozach, Illumina  Marc Salit, NIST  Russ Wolfinger, SAS  Walter.

Slides:



Advertisements
Similar presentations
Analysis of Microarray Genomic Data of Breast Cancer Patients Hui Liu, MS candidate Department of statistics Prof. Eric Suess, faculty mentor Department.
Advertisements

Affymetrix Gene Expression Microarrays Application to Pulmonary Arterial Hypertension Bob Stearman 02/24/2014.
Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
M. Kathleen Kerr “Design Considerations for Efficient and Effective Microarray Studies” Biometrics 59, ; December 2003 Biostatistics Article Oncology.
1. Principles and important terminology 2. RNA Preparation and quality controls 3. Data handling 4. Costs 5. Protocols 6. Information for collaboration.
Bioinformatics Lectures at Rice
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
Gene Expression Chapter 9.
Affymetrix Microarray and Illumina/ Solexa NextGen Sequencing Yuannan Xia, Ph.D Genomics Core Research Facility
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Central Dogma 2 Transcription mRNA Information stored In Gene (DNA) Translation Protein Transcription Reverse Transcription SELF-REPAIRING ARABIDOPSIS,
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
Gene expression and the transcriptome I. Genomics and transcriptome After genome sequencing and annotation, the second major branch of genomics is analysis.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Gene Expression Microarrays Microarray Normalization Stat
Real Time PCR = Quantitative PCR.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Molecular Biology Dr. Chaim Wachtel April 4, 2013.
This Week: Mon—Omics Wed—Alternate sequencing Technologies and Viromics paper Next Week No class Mon or Wed Fri– Presentations by Colleen D and Vaughn.
Whole Genome Expression Analysis
The European Nutrigenomics Organisation Deciding and acting on quality of microarray experiments in genomics Chris Evelo BiGCaT Bioinformatics Maastricht.
CDNA Microarrays MB206.
Genomics – Next-Gen sequencing and Microarrays
Data Type 1: Microarrays
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Microarray Technology
Agenda Introduction to microarrays
Introduction to Affymetrix Microarrays
Scenario 6 Distinguishing different types of leukemia to target treatment.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Glue Grant Human Transcriptome Array. 2 Affymetrix Confidential PNAS (9) ; published ahead of print February 11, 2011, doi: /pnas
Genomics I: The Transcriptome
Topic intro slides More complete coverage of components involved in gene expression More information on expression technologies -what would the ideal chip.
Introduction to Microarrays.
Whole Genome Approaches to Cancer 1. What other tumor is a given rare tumor most like? 2. Is tumor X likely to respond to drug Y?
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Introduction to Microarrays. The Central Dogma.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Proteome and Gene Expression Analysis Chapter 15 & 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Biases in RNA-Seq data. Transcript length bias Two transcripts of length 50 and 100 have the same abundance in a control sample. The expression of both.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Example of a DNA array used to study gene expression (note green, yellow red colors; also note.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Analysis of honey bee microarray gene expression data Sandra Rodriguez Zas and Heather Adams Department of Animal Sciences Institute for Genomic Biology.
Functional Genomics Carol Bult, Ph.D. Course coordinator The Jackson Laboratory Winter/Spring 2011 Keith Hutchison, Ph.D. Course co-coordinator.
Gene Expression Biology 224 Instructor: Tom Peavy October 4 & 6, 2010
Introduction to Oligonucleotide Microarray Technology
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
Microarray: An Introduction
MICROARRAY. Microarray  A multiplex lab-on-a-chip  A 2D array on a solid substrate (Usually a glass slide or silicon thin-film cell) that assays large.
1st workshop of the Aberdeen Microarray Analysis Network (AMAN)
ENCODE Pseudogenes and Transcription
Functional Genomics in Evolutionary Research
Microarray Data Analysis
Data Type 1: Microarrays
Design Issues Lecture Topic 6.
Presentation transcript:

Introduction to the EMERALD Dataset  Ron Peterson  Anne Bergstrom Lucas, Agilent  Jean Lozach, Illumina  Marc Salit, NIST  Russ Wolfinger, SAS  Walter Liggett, NIST  Jean Thierry-Mieg, NCBI  DanielleThierry-Mieg, NCBI

2 EMERALD dataset introduction MicroArray Quality Control Shippy, R. et al, Nature Biotechnology - 24, (2006) Titration working Group Ambion Human Brain RNAStratagene Universal Human RNA

3 EMERALD dataset introduction MAQC Phase II – Conduct a new titration experiment  Agilent performed a one color analysis of the phase I material.  Added more intermediate titrations to a total of 19 samples.  Total of 80 samples processed  Experiment split up and performed over three days  First day used a mixture of reagent kits. Second day fresh kit. Third day reagents from colleague. Ambion BrainHUR

4 EMERALD dataset introduction Evaluation of New Agilent Titration Discriminate Coordinate Discriminate Coordinate Clustering of the arrays by amplify-label date. 3/2, 3/6, 4/10, 4/12

5 EMERALD dataset introduction Sample information for Biological vs Technical variation study. Animal: Rattus norvegicus Strain: Sprague Dawley Crl:CD(SD) Age: 7-8 weeks Sex: Male Treatment: 20% propylene glycol/80%/lactic acid containing 4.3% mannitol, pH 4.0 Duration : intravenous, once per week for 13 weeks Number: 6 A100% Liver1A, 2A, 3A, 4A, 5A, 6A B75% Liver, 25% Kidney1B, 2B. 3B, 4B, 5B, 6B C25% Liver, 75% Kidney1C, 2C, 3C, 4C, 5C, 6C D100% Kidney1D, 2D, 3D, 4D, 5D, 6D

6 EMERALD dataset introduction Study Design  Each Sample was performed in triplicate eg, 1-A-1, 1A-2, 1-A3, etc. Each sample was placed into a single well of a 96 well plate in a randomized pattern. Remaining wells were filled with samples made from pooling animals. -1-3A, B, C, D & 4-6A, B, C, D; single samples. -1-6A, B, C, D; each sample repeated 4 times on plate.  3 duplicate plates were produced and one plate was processed on; Affymetrix Rat Genome U133 plus 2.0 arrays Agilent Whole Rat Genome Oligo Microarray (4x44K) [G4131F] Illumina RatRef-12 v1 Expression BeadChip

7 EMERALD dataset introduction Affymetrix Study design  Chip performed at the Novartis Institutes for Biomedical Research  96 well plate was processed on an Affymetrix GCAS robot.  96 chips were washed on 12 Fluidics Machines (48 chip lots).  48 chips lots were scanned on one of two Affymetrix Scanner. Technical variation parameters. -Sample plate location. -Affymetrix chip lot. -Fluidics station location. -Scanner used. -Day processed (8 of the 96 chips were processed on a different date by rehybridizing the hybridization mix on a new chip.

8 EMERALD dataset introduction Agilent Study Design  Chips were processed at Agilent.  2 different technicians processed the arrays.  12 chips (48 arrays) were processed on 2 different days  A single scanner was used. Technical variation parameters -Technician. -Day processed. -Chip and reagent lots. -Substrate. -Starting total RNA amount (400 ng vs 200 ng).

9 EMERALD dataset introduction Illumina Study Design  Chips were processed at Asuragen (service provider).  2 different technicians processed the arrays.  8 chips (96 arrays) were processed on 4 different days with 4 different kits.  A single scanner was used. Technical variation parameters -Technician. -Day processed. -Chip and reagent lots. -Location on the chip. -cRNA yield.

10 EMERALD dataset introduction Data Access links  The data and supporting material are available from ArrayExpress  Affymetrix as/aer/result?queryFor=Experiment&eAccession=E-TABM-536  Agilent as/aer/result?queryFor=Experiment&eAccession=E-TABM-555  Illumina as/aer/result?queryFor=Experiment&eAccession=E-TABM-554

11 EMERALD dataset introduction Current Plans of MAQC Phase II Titration Group  Jean and Danielle Thierry-Mieg (NCBI) have done a complete annotation of the rat genes in AceView and identified the alternative transcripts tested on all three rat arrays.  The mapping, available at ftp://ftp.ncbi.nlm.nih.gov/repository/acedb/rat, will be used to identify the groups of probes from the three array platforms testing the same transcripts and genes. ftp://ftp.ncbi.nlm.nih.gov/repository/acedb/rat  We will use this correspondence to contrast the performance of the arrays in their ability to identify biological and technical variation.