Tiling Arrays Madelaine Gogol Programmer Analyst Microarray Norman Pavelka Postdoc Rong Li Lab Technology.

Slides:



Advertisements
Similar presentations
Methods to read out regulatory functions
Advertisements

Comparative genomic hybridization (CGH) is a technique for studying chromosomal changes in cancer. As cancerous cells multiply, they can undergo dramatic.
Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Breakdown of 244 total (Yale+Vega) Pseudogenes Amongst Various ENCODE Regions 211 Yale, 178 Vega, Union is 244 More pseudogenes in the manually picked.
Modeling sequence dependence of microarray probe signals Li Zhang Department of Biostatistics and Applied Mathematics MD Anderson Cancer Center.
Technology & Methods Seminar:
Bioinformatics Lectures at Rice
Transcriptome Sequencing with Reference
Gene regulation in cancer 11/14/07. Overview The hallmark of cancer is uncontrolled cell proliferation. Oncogenes code for proteins that help to regulate.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
Gene expression analysis summary Where are we now?
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 18 LECTURE SLIDES.
The Human Genome Project and ~ 100 other genome projects:
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Introduce to Microarray
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
ChrX probes Autosomal probes ChrX probes Autosomal probes Autosomal probes ChrX probes Effect of hybridization temperature on microarray performance Figure.
with an emphasis on DNA microarrays
Technology and Methods Seminar
Large-Scale Copy Number Polymorphism in the Human Genome J. Sebat et al. Science, 305:525 Luana Ávila MedG 505 Feb. 24 th /24.
Page 1 Mouse Genome CGH Microarray 44A. Page 2 Mouse Genome CGH Microarray Kit 44A Designed for CGH, Validated with samples of known aberrations Designed.
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Genomics BIT 220 Chapter 21.
Data Type 1: Microarrays
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Affymetrix CytoScan HD array
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
Vidyadhar Karmarkar Genomics and Bioinformatics 414 Life Sciences Building, Huck Institute of Life Sciences.
Literature reviews revised is due4/11 (Friday) turn in together: revised paper (with bibliography) and peer review and 1st draft.
Microarrays and Their Uses Brad Windle, Ph.D
Scenario 6 Distinguishing different types of leukemia to target treatment.
Mapping Sites of Transcription Across the Drosophila Genome Using High Resolution Tiling Microarrays LBNL, Berkeley CA August 20, 2007 A. WillinghamAffymetrix,
Dose Response 100% 50% 0% IC50, EC50 Dose response curve.
Genome-Scale Mutagenesis Introduction Model systems –Yeast –Mouse Implications for science.
1 Commentary 1.Do not get too worried about "methods" and details. I fully expect there to be concepts and techniques that you simply are not going to.
Nature Genetics Vol.36 Sept 2004 Detection of Large-scale Variation In the Human Genome Iafrate, Feuk, Rivera, Listewnik, Donahoe, Qi, Scherer, Lee any.
Chapter 21 Eukaryotic Genome Sequences
Monday Human and chimp DNA is ~98.7 similar, But, we differ in many and profound ways, Can this difference be attributed, at least in part, to differences.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Other genomic arrays: Methylation, chIP on chip… UBio Training Courses.
1 Genomics The field of biology based on studying the entire DNA sequence of an organism - its “genome”. Genomics tools don’t replace classical genetics.
____ __ __ _______Birol et al :: AGBT :: 7 February 2008 A NOVEL APPROACH TO IMPROVE THE NOISE IN DETECTING COPY NUMBER VARIATIONS USING OLIGONUCLEOTIDE.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Starting Monday M Oct 29 –Back to BLAST and Orthology (readings posted) will focus on the BLAST algorithm, different types and applications of BLAST; in.
Analysis of protein-DNA interactions with tiling microarrays
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.
Introduction to Microarrays. The Central Dogma.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Proteome and Gene Expression Analysis Chapter 15 & 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Johnson - The Living World: 3rd Ed. - All Rights Reserved - McGraw Hill Companies Genomics Chapter 10 Copyright © McGraw-Hill Companies Permission required.
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Biochemistry April Lecture DNA Microarrays.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
Copy Number Analysis in the Cancer Genome Using SNP Arrays Qunyuan Zhang, Aldi Kraja Division of Statistical Genomics Department of Genetics & Center for.
Recent Advances in Genomic Science Julian Sampson Institute of Medical Genetics, Cardiff.
Special Topics in Genomics ChIP-chip and Tiling Arrays.
ENCODE Pseudogenes and Transcription
Microarray Technology and Applications
Other genomic arrays: Methylation, chIP on chip…
Presentation transcript:

Tiling Arrays Madelaine Gogol Programmer Analyst Microarray Norman Pavelka Postdoc Rong Li Lab Technology and Methods Seminar March 29,

Tiling Arrays - Overview What is a tiling array? What can I do with it? –ChIP-chip –CGH –Expression Which tiling arrays are available for my experiments? –in-house yeast tiling array (design details) –Agilent –Affymetrix How will we analyze and visualize the data?

What is a Tiling Array? Gene 1 Probe 1 A microarray with many probes distributed in an evenly spaced way across an entire genome.

What can I do with tiling arrays? Map the transcriptome –what’s being expressed? ChIP-chip –where are proteins binding? CGH –what are the differences in genome structure? Other possibilities –Map the methylome –Genome resequencing –Polymorphism discovery

ChIP-chip PCR w/aa-dNTP Hybridize to Microarray Analyze image, calculate ratios

array CGH Hybridize to Microarray

In-house yeast tiling array (YOGie) Covers the yeast genome Just printed resolution ~ 250 bases freely available

Spotted Microarray Manufacturing

Operon Probe Set 6307 probes, length 70 Designed one per ORF, near 3’ end. YOG arrays (yeast oligonucleotide) YBOX: 3072 new probes

Intergenic Probe Set: Design Design target: yeast Intergenic regions original goal: leave no area > 500 uncovered Gene 1Gene 2 Array Oligo Selector (AOS) Fasta format 140-mer sequences tiling the intergenic regions 9, mer sequences from the intergenic regions

Intergenic Probe Set mer probes No region greater than 360 left uncovered ~ 220 bases between probes on average Chromosome 3 completely tiled YOGi arrays (YOG+intergenic)

5’ probe set: Design Goal: –fill in gaps left by operon set in 5’ region of gene –leave no region > 500 without a probe Targets: ORF regions 5’ of operon probe Gene 1 operon probe 3’5’ Array Oligo Selector (AOS)

5’ probe set: target selection What size targets? –355mers that overlap by (500-70)/2 = target 1target =355

5’ probe set: Reduction Too many probes –reduce to 6666 or less (budget and printing constraints) Probes within 260 bases of eachother – winnowed Tm binding energy number of matches to genome Gene 1 operon probe 3’5’

5’ probe set 6, mer probes tiles the region of each gene between the operon probe and the 5' end. YOGie arrays (YOG+intergenic+enhanced)

In-house yeast tiling array: YOGie Together, the operon, intergenic, and 5’ sets make up our homemade yeast tiling array Freely available Also includes tight tiles of –all centromeres –7 sub-genomic regions kb

Agilent Microarray Manufacturing

Agilent Tiling Arrays ChIP-on-chip Arabidopsis Whole Genome C. elegans Whole Genome Drosophila Whole Genome Human CpG Island Human ENCODE 244K Human Promoter Mouse Promoter Yeast Whole Genome 4 x 44K Yeast Whole Genome 244K Zebrafish Expanded Promoter Zebrafish Proximal Promoter Custom ChIP-on-chip Oligo aCGH Human Genome 244K Human Genome 105K Human Genome 44K Mouse Genome 244K Mouse Genome 105K Mouse Genome 44K

244k 105k 44k 15k $400 $640 $720 $800 $320 $180 $100 per slide Agilent Tiling Arrays: formats and cost per hyb $400

Agilent Custom Array Design Take an agilent microarray design Remove some probes Put in your own probes –Design using agilent’s web application, earray. You can also design everything from scratch

earray

Affymetrix Microarray Manufacturing

Affymetrix Tiling Arrays Arabidopsis Tiling 1.0R Array C.elegans Tiling 1.0R Array Chromosome 21/ Array Set Chromosome 21/22 2.0R Array Drosophila Tiling 1.0R Array ENCODE Array Human Genome Arrays + Mouse Genome Arrays + S. cerevisiae Tiling 1.0R Array S. pombe Tiling 1.0FR Array Cost ~ 500$ per array, so 500$ per hyb.

Summary of Tiling Arrays (only yeast shown) Type#spots per array probe size resolution per slide cost per array cost customizable? Homemade YOGie arrays 26, between 00 some effort Agilent 44K44,000 (4)60160 between $720$180 easily Agilent 244K244,00060 overlap by 18 $400 easily Affy3,200,000 (pm/mm) 25 overlap by 20 $500 no

Tiling array data analysis: still an adventure Affy –TAS (Tiling Analysis Software) Agilent –ChIP Analytics & CGH Analytics Other –genome browsers, R packages, other people’s software, Do-it-yourself, perl, statistical models

Data Visualization: Genome Browsers: UCSC

Data Visualization: Genome Browsers: UCSC

Data Visualization: Genome Browsers: IGB

Data Analysis: Sliding windows ChIPOTle, PeakFinder, R scripts, etc.

Data Analysis: Annotating and comparing peaks

Data Analysis: Average Gene analysis Profile of binding across an average gene

Summary Tiling arrays –CHip-chip, CGH, expression Which ones are available –In-house, Agilent, Affy Data analysis Future...

The Future of Tiling Arrays The resolution continues to increase...

Other Future Genomic Technology 454 and Solexa/Illumina “Next Generation” sequencing “Sequence everything in the tube” Shares some things with tiling arrays –even more unbiased –vast quantities of data –analysis methods are being developed

5 th floor Building 2 N Bioinformatics Microarray AllisonKarinBrianChrisMe Thanks! Microarray Bioinformatics All the labs that use microarrays! Bing Li Workman lab Jennifer Bupp Jasperson lab Norman Pavelka Rong Li Lab

Technology & Methods Seminar  “Adventures in Electron Microscopy” Rhonda Allen Histology Thursday, April 26th, 1:00 p.m. Classroom (1 st floor, Administration Building) Schedule with abstracts and previous presentation slides can be found on: K:\Weekly Seminar Schedule\Thursday -- Technology & Methods Information regarding previous seminars can be found at:

Norman Pavelka (Rong Li lab) On the use of Affymetrix Tiling Arrays for Comparative Genomic Hybridizations March 29, 2007 Technology & Methods Seminar: “Tiling Arrays - Probing Genome and Transcriptome Structure”

Background: Role of MYO1 in cytokinesis Myo1 Phenotype of yeast cells experiencing an acute loss of MYO1: Severe cytokinesis defect Impaired cell viability Phenotype of yeast cells experiencing a chronic loss of MYO1: Extremely heterogenous Occasionally: full recovery of cytokinesis proficiency and of growth ability

Biological question: What genome changes occurred in e-strains? Albertson & Pinkel, Hum Mol Genet (2003) Polyploidization? Aneuploidization?Interstitial deletions?Reciprocal translocations? Non-reciprocal translocations? Amplifications? Single-nucleotide mutations?

U.C. Berkeley Division of Biostatistics Working Paper Series (2002), paper 106. Method: array-based Comparative Genomic Hybridization (aCGH)

Technology: Affymetrix Yeast Tiling Arrays ~6.5 million unique probes on the chip Designed to interrogate the yeast genome with a 5bp resolution: Gresham et al., Science (2006) ~12.5 million bp in the yeast genome

1.Extraction of genomic DNA with Phenol / Chloroform / Isoamylalcohol 2.“Controlled” fragmentation with DNase I (5 min at 37 ° C) 3.End-labeling with TdT and biotin-dUTP 4.Hybridize on Affy chips 5.Stain with streptavidin-PE 6.Wash and scan chips Ladder 75 mU DNase I 150 mU DNase I Fragment length (nt) Strain 7a-1 Strain 2b (wt) Experimental protocol:

2b (low DNase I, large fragments) 2b (high DNase I, small fragments) 7a-1 (low DNase I, large fragments) 7a-1 (high DNase I, small fragments)

Limitations: What genome changes can we see by aCGH? Albertson & Pinkel, Hum Mol Genet (2003) Polyploidization? Aneuploidization?Interstitial deletions?Reciprocal translocations? Non-reciprocal translocations? Amplifications? Single-nucleotide mutations?

Observation #1: Deletion of the MYO1 locus MYO1 locus log 2 (ratio) Chromosome VIII

Observation #2: “Duplication” of the TRP1 locus Caveat #1: No information on where the signal comes from! TRP log 2 (ratio) log 2 (ratio) Chromosome IV

Caveat #2: Highly repetitive sequences! Chromosome II +1 0 log 2 (ratio) +1 0 log 2 (ratio) Ty1 LTR Full-length Ty1 (aka “Saturation” effect)

Observation #3: Gradual loss of signal towards telomeres +1 0 log 2 (ratio) +1 0 log 2 (ratio) Full sequence of chromosome II

Observation #4: Aneuploidies Chr.

Caveat #3: “Dilution” effect

Possible observation #1: Non-reciprocal translocations? Dunham et al., PNAS (2002)

Possible observation #2: Single-nucleotide changes? Gresham et al., Science (2006) log 2 (ratio) Genomic DNA Probes on the chip

Summary: What can be seen by CGH on Tiling Arrays? →Anything that causes a change in the copy number of a DNA segment, e.g. aneuploidies, deletions/amplifications, non-reciprocal translocations, etc. →Mutations that affect the hybridization of multiple overlapping probes, i.e. single-nucleotide changes. What can not be seen by CGH? →Anything that does not cause a change in the copy number of a DNA segment, e.g. polyploidization, reciprocal translocations etc. →If probes are too long and non-overlapping, single-nucleotide mutations will not be detectable. What are the most common pitfalls? →No information about where the signal actually comes from! →No reliable information from probes hybridizing to highly-repetitive sequence (because of “saturation” effect)! →If some chromosomes are gained or lost, this will affect the log-ratios also of all other chromosomes (because of “dilution” effect)!

Acknowledgements: Rong Li lab: Giulia Rancati Rong Li Microarray group: Karin Zueckert-Gaudenz Allison Peak Chris Seidel