The 2007 Microarray Research Group Project

Slides:



Advertisements
Similar presentations
ICSA, 6/2007 Pei Wang, 1 Spatial Smoothing and Hot Spot Detection for CGH data using the Fused Lasso Pei Wang Cancer Prevention Research.
Advertisements

Comparative genomic hybridization (CGH) is a technique for studying chromosomal changes in cancer. As cancerous cells multiply, they can undergo dramatic.
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Tumour karyotype Spectral karyotyping showing chromosomal aberrations in cancer cell lines.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
STAC: A multi-experiment method for analyzing array-based genomic copy number data Sharon J. Diskin, Thomas Eck, Joel P. Greshock, Yael P. Mosse, Tara.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Multi-dimensional Genomic Profiling of Acute Leukemias Characterized by MLL gene rearrangements Eunice S. Wang MD (Medicine) and Norma J. Nowak PhD (Cancer.
CLL Research Consortium FISH studies, Core C June, 2005 NCI Submission.
SIGMA: A Platform to Visualize and Analyze DNA Copy Number Microarray Data Raj Chari, PhD Student BC Cancer Research Centre Department of Cancer Genetics.
Large-Scale Copy Number Polymorphism in the Human Genome J. Sebat et al. Science, 305:525 Luana Ávila MedG 505 Feb. 24 th /24.
Page 1 Mouse Genome CGH Microarray 44A. Page 2 Mouse Genome CGH Microarray Kit 44A Designed for CGH, Validated with samples of known aberrations Designed.
GENOMIC COPY NUMBER Rudy Guerra Department of Statistics Rice University April 14, 2008.
Large Scale Variation Among Human and Great Ape Genomes Determined by Array Comparative Genomic Hybridization Devin P. Locke, Richard Segraves, Lucia Carbone,
Verna Vu & Timothy Abreo
Microarrays and Their Uses Brad Windle, Ph.D
Microarray - Leukemia vs. normal GeneChip System.
1 Commentary 1.Do not get too worried about "methods" and details. I fully expect there to be concepts and techniques that you simply are not going to.
Nature Genetics Vol.36 Sept 2004 Detection of Large-scale Variation In the Human Genome Iafrate, Feuk, Rivera, Listewnik, Donahoe, Qi, Scherer, Lee any.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
____ __ __ _______Birol et al :: AGBT :: 7 February 2008 A NOVEL APPROACH TO IMPROVE THE NOISE IN DETECTING COPY NUMBER VARIATIONS USING OLIGONUCLEOTIDE.
Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.
CGH Data BIOS Chromosome Re-arrangements.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
L.S. Rector 1, N.A. Yamada 2, M.E. Aston 1, M.C. Sederberg 1 R.A. Ach 2, P. Tsang 2, E. Carr 2, A. Scheffer-Wong 2, N. Sampas 2, B. Peter 2, S. Laderman.
Copy Number Analysis in the Cancer Genome Using SNP Arrays Qunyuan Zhang, Aldi Kraja Division of Statistical Genomics Department of Genetics & Center for.
Microarray: An Introduction
In situ hybridization and Chromosome painting
Comparison of Comparative Genomic Hybridization Technologies Across Microarray Platforms Susan Hester1, Laura Reid2, Agnes Viale3, Norma Nowak4, Herbert.
Recurrent Reciprocal Genomic Rearrangements of 17q12 Are Associated with Renal Disease, Diabetes, and Epilepsy  Heather C. Mefford, Séverine Clauin, Andrew.
Microarray - Leukemia vs. normal GeneChip System.
Global Variation in Copy Number in the Human Genome
A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of Array-CGH to FISH in Diagnostic Evaluation  Lu Wang, Mamta Rao,
Integrated Cytogenetic and High-Resolution Array CGH Analysis of Genomic Alterations Associated with MYCN Amplification Cytogenet Genome Res 2011;134:27–39.
ENCODE Pseudogenes and Transcription
Some slides adapted from J. Fridlyand
The array comparative genomic hybridization (aCGH/CMA) technology
Volume 1, Issue 1, Pages (February 2002)
Lisa Edelmann, Raj K. Pandita, Bernice E. Morrow 
by Jaroslaw P. Maciejewski, and Ghulam J. Mufti
Comprehensive Screening of Gene Copy Number Aberrations in Formalin-Fixed, Paraffin-Embedded Solid Tumors Using Molecular Inversion Probe–Based Single-
Today… Review a few items from last class
Gene Chips.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Jianbin Wang, H. Christina Fan, Barry Behr, Stephen R. Quake  Cell 
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
CSCI2950-C Lecture 3 September 13, 2007.
Volume 133, Issue 5, Pages (November 2007)
Genomic alterations in breast cancer cell line MDA-MB-231.
Volume 7, Issue 5, Pages (November 2016)
Microarray Techniques to Analyze Copy-Number Alterations in Genomic DNA: Array Comparative Genomic Hybridization and Single-Nucleotide Polymorphism Array 
A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of Array-CGH to FISH in Diagnostic Evaluation  Lu Wang, Mamta Rao,
Bassem A. Bejjani, Lisa G. Shaffer 
Recurrent Reciprocal Genomic Rearrangements of 17q12 Are Associated with Renal Disease, Diabetes, and Epilepsy  Heather C. Mefford, Séverine Clauin, Andrew.
Histology and genomic copy number alterations in TRAMP tumors.
Introduction to Sequencing
High-Resolution Molecular Characterization of 15q11-q13 Rearrangements by Array Comparative Genomic Hybridization (Array CGH) with Detection of Gene Dosage 
Chromosome 8 cDNA microarray gene expression profile of the amplified regions of 8p11–12 in SUM-44, -52, and -225 cells versus MCF10A HME control cells.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Identification and characterization of a novel KRAS rearrangement in metastatic prostate cancer. Identification and characterization of a novel KRAS rearrangement.
Summary results from analyses to attempt to replicate previously reported DS+AVSD (Down Syndrome with a complete atrioventricular septal defect)-associated.
Other genomic arrays: Methylation, chIP on chip…
Whole-chromosome view of gene density, inverse environmental variation, inverse total variation, and open chromatin signature. Whole-chromosome view of.
The MLPA assay and application to diagnosis of DGS
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Data Type 1: Microarrays
Next-Generation Sequencing of Duplication CNVs Reveals that Most Are Tandem and Some Create Fusion Genes at Breakpoints  Scott Newman, Karen E. Hermetz,
Cell lines with aberrant expression of NRG1 are exquisitely sensitive to downregulation of ERBB3 signaling. Cell lines with aberrant expression of NRG1.
A, heatmap of copy number alterations determined by array CGH for a panel of 79 frozen NSCLC samples. A, heatmap of copy number alterations determined.
Presentation transcript:

The 2007 Microarray Research Group Project Comparison of Comparative Genomic Hybridization Technologies Across Microarray Platforms The 2007 Microarray Research Group Project Susan Hester April 1, 2007

Two goals for this presentation: Present a Comparative Genomic Hybridization Study to the ABRF research community 2. Evaluate commercial CGH platforms for their ability to detect known gains or losses

Background copy number differences between a reference genome and Comparative Genomic Hybridization (CGH) measures DNA copy number differences between a reference genome and a test genome. Early CGH experiments, the DNA targets were hybridized to metaphase chromosome spreads in FISH assays. This technology later evolved: DNA targets are hybridized to microarrays containing cDNA fragments or bacterial artificial chromosomes (BACs). Commercial microarrays are characterized as whole-genome CGH measures, obtaining copy number differences in DNA across entire genomes.

What is CGH and what is it not? CGH has nothing to do with gene expression. CGH arrays are structural, not functional—they look for DNA that is missing or duplicated, not DNA that is expressed. In array CGH, thousands of probes are printed on a microscope slide. These probes are bacterial artificial chromosomes, or BACs, which are sequences of cloned human DNA 120-180Kb long that have been mapped to specific locations on specific chromosomes. To the array is added test DNA and reference or control DNA, each labeled with a different dye. The essence of the experiment is that the two DNA samples compete for hybridization to the BAC probes on the slide, thus the "comparative" aspect of CGH.

The 2007 MARG Study: The research question: How well do BAC and commercial CGH arrays detect known copy number gains or losses in test and reference DNA? Analyzed Human leukemic HL60 DNA compared to reference with 5 platforms: RPCI BAC 19K arrays, Agilent 44K, Illumina HAP 550K, Affymetrix 500K, and Affymetrix U133 expression array. DNA samples analyzed in quadruplicate. 3 different laboratory test sites; Roswell Park Cancer Institute, Memorial Sloan-Kettering Cancer Institute, and Columbus Children’s Hospital. Quality assessment of each platform performed by each test site.

Known copy number gains and losses on cell line HL-60* Amplification of the 8q24 locus, Trisomy 18 Deletions at loci 5q11.2-q31, 6q12, 9p21.3-p22, 10p12-p15, 14q22-q31, 17p12-p13.3, monosomy X *Cancer Genet Cytogenet. 2003 Nov;147(1):28-35; Peiffer et al.(2006)

The fundamental theory behind each platform*: nucleotide base-pairing Before: hybridization: DNA sample (varying locus concentrations reflecting inherent gains/losses) Chromosome #1 probes _ _ _ _ _ _ _ _ _ _ _ _ After hybridization: DNA sample labeled and fragmented _ _ _ _ __ _ _ _ __ _ _ _ _ __ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ gain gain gain -_-- - -_--- *in this example; one color platform loss loss

Chari et al, Cancer Informatics, 2006, 2, 48-58 Example of Array CGH Technology* *in this example; 2-color platform Chari et al, Cancer Informatics, 2006, 2, 48-58

Cont.

Platform Details Platforms # probes or probe sets (resolution) annotation platform resolution protocol replicates primary data BAC 19K1 ~19,000 in duplicate NCBI Build 35 (hg17) 100kb 2-color 4 HL60/lung Mean Log2 (test/ref) Agilent2 ~44,000 (~35 KB) Sub 100kb Affy U1333 54,675 1-color 4 HL60 4 lung Log2(RMA) Affy 500K2 ~500,000 Build 35 (hg17) 0 lung Reference database Illumina2 ~550,000 Log R ratio The test sites were: 1Roswell Park Cancer Institute, Buffalo, NY,2Sloan-Kettering Institute Genomic Core, Rochester, NY 3Columbus Pediatric Hospital, Columbus Ohio

Study Questions: 1. What is the precision of each platform? 2. How accurately does each platform detect known copy number changes? 3. What is the resolution of array CGH?

Results: Assessment of precision- coefficient of variation across replicates

Coefficient of variation statistics BAC Agilent Illumina Affy GE Affy Nsp Affy Sty Mean 2.13 3.16 5.98 9.64 10.69 9.44 Median 1.96 2.66 5.23 8.71 9.91 8.47 Std Dev 1.09 2.43 3.52 4.78 5.24 5.60 Std Err 0.01 0.00 0.02 Size 18067 42897 555353 54676 524629 476709 Min 0.05 0.71 0.15 Max 16.87 77.89 98.05 59.57 48.97 50.59

Platform Visualizations: Chromosome 8 gain Chromosome 17 deletion Chromosome 5 deletion

RPCI BAC 19K array chr8 -2 -1 1 2 3 chr17 3 +8q24 -17p12 2 1 -1 -2

Agilent 44K Chr. 5q deletion

Visualization of each platform: Illumina HAP 550K Chr. 5q deletion Replicate 1 Replicate 2 Replicate 3 Replicate 4 -5q11

Affymetrix U133 Gene Expression Array Chromosome 17

Partek: Affy500K Chr 17p deletion Replicate 1 Replicate 2 Replicate 3

Results: Assessment of Accuracy: concordance with known gains/losses Gain/Loss -5q11 -6q12 +8q24 -9p21 -10p12 -14q22 -17p12 Tri 18 monoX BAC Y N Agilent Affy U133 GE Affy 500K Illumina red=amplification black=loss

Results: Assessment of platform resolution Circular binary segmentation analysis (CBS)* This approach is used to translate noisy intensity measurements into regions of equal copy number. Detects the point at which signal changes. Log ratios of normalized signal intensities are used. log2 (signal HL60/ref DNA) ex: normal state is 2copies/2copies=1 log2 (1)=0 *Olshen, A.B. et al. (2004)-Circular Binary Segmentation for the Analysis of array-based DNA Copy Number Data

Adopted;http://www.birc.dk/bioinformatics2006/talks/BrennanBioinf2006.pdf

Circular Binary Segmentation CBS Calculation Point of transition in signal Simulated Intensity Ratio Segment

Assessment of platform resolution on a known deletion or gain: RPCI BAC 19K Agilent 44K Illumina HAP550 Affymetrix U133 Gene Expression Affymetrix 500K

If 1 copy present in HL-60 and 2 in ref; then intensity ratio= ½=0.5; log2(0.5)= -1 Log2 between 0 & -1= 1cn loss 0 & 0.5= 1cn gain 0.5 & 1= 2cn gain

CBS Summary Details for known gains/losses: Deletion Chr.5 and Gain Chr.8 Platform chromosome loc.start loc.end num.probes* seg.mean Est. Copy number RPCI BAC 5 53,817,423 139,426,041 491 -0.5315 -1 Agilent 53,787,209 139,475,226 1003 -0.1914 Illumina 53,729,928 113,350,609 10438 -0.6753 Affy GE 53,536,915 139,532,618 1212 -0.5862 Affy 500K** 53,925,665 73,549,145 13886 -0.3785 8 98,584,025 130,781,748 14 1.2942 >2 126,357,705 130,632,571 26 0.3993 >1 126,302,381 130,771,462 428 1.1630 126,335,997 128,498,243 22 1.612 126,000,000 131,000,000 1192 1.7650 *used only probes>10 **Data generated using Partek Software to determine gains/losses; Green= deletion Red=gain

Did Agilent underestimate the change? Chr.8 -entire view zoomed-in view of amplification at 120 to 130Mb follows

of our Agilent data compared to other platforms; CBS 2.011 CBS 1.1630 CBS 1.612 CBS 0.3993 Results: lowest CBS ratio value and small dynamic range of our Agilent data compared to other platforms; huge number of segments in Illumina data

CBS analysis finds 3 novel gains or losses** BAC& Affy500K agree -Chromosome 2 Illumina, Agilent, Affy GE, BAC and Affy500K all agree on Chromosome 16 Affy500K and Affy GE agreement on Chromosome 19 ** Agreement means that 2 of the 5 platforms found the same gain or losses in the same location (start to end location)

Table of Novel Gains and Losses  Platform Chr.2 Chr.16 Chr.19 BAC Y N Agilent Affy U133 Affy 500K Illumina Red=gain Black=loss  Platform Chr.2 Chr.16 Chr.19 BAC Y N Agilent Affy U133 Affy 500K Illumina

Conclusions Good precision across all platforms: CVs ranged from 2.1 to 10.7% across the platforms Illumina and Affymetrix 500K identified 9 of 9 (100%) of the known losses and gains in HL60 DNA Agilent, Affymetrix U133 gene expression, and RPCI BAC arrays identified 8 of 9 (89%) of known losses and gains in HL60 DNA CBS used to assess resolution of each platform; all platforms detected known gains/losses 3 novel changes were also detected by at least 2 or more platforms

Acknowledgments The Microarray Research Group (MARG): Susan Hester, Laura Reid, Agnes Viale, Norma Nowak, Herbert Auer, Kevin Knudtson, Bill Ward, Jay Tiesman, Caprice Rosato, Aldo Massimi, Greg Khitrov, and Nancy Denslow Laboratory test sites: Roswell Park Cancer Institute, Memorial Sloan-Kettering Cancer Institute, and Columbus Children’s Hospital Data analysis efforts: US Environmental Protection Agency Roswell Park Cancer Institute Columbus Children’s Hospital Expression Analysis Funding support for this project: The Executive Board of the Association of Biomolecular Resource Facilities

MARG poster number is RG4-S