Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.

Slides:



Advertisements
Similar presentations
ICSA, 6/2007 Pei Wang, 1 Spatial Smoothing and Hot Spot Detection for CGH data using the Fused Lasso Pei Wang Cancer Prevention Research.
Advertisements

Comparative genomic hybridization (CGH) is a technique for studying chromosomal changes in cancer. As cancerous cells multiply, they can undergo dramatic.
Genomic DNA Variation Computer-Aided Discovery Methods Baylor College of Medicine course Term 3, 2010/2011 Lecture on Wednesday, February 2 nd,
Bioinformatics lectures at Rice University Li Zhang Lecture 10: Networks and integrative genomic analysis-2 Genome instability and DNA copy number data.
Methods for copy number variation: hidden Markov model and change- point models.
SW-ARRAY: a dynamic programming solution for the identification of copy-number changes in genomic DNA using array comparative gnome hybridization data.
Tumour karyotype Spectral karyotyping showing chromosomal aberrations in cancer cell lines.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
TIGR Spotfinder: a tool for microarray image processing
DNA Copy Number Analysis Qunyuan Zhang, Ph.D. Division of Statistical Genomics Department of Genetics & Center for Genome Sciences Washington University.
By: Katie Adolphsen, Robin Aldrich, Brandon Hu, Nate Havko.
Some slides adapted from J. Fridlyand BioSys course: DNA Microarray Analysis – Lecture, 2007 Analysis of Array CGH Data by Hanni Willenbrock.
Genomic Arrays: Tools for cancer gene discovery Ian Roberts MRC Cancer Cell Unit Hutchison MRC Research Centre
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
1 Example of HMMs: copy number data. 2 DNA copy number is the number of copies of a genomic segment present in the cell. Copy numbers are measured in.
STAC: A multi-experiment method for analyzing array-based genomic copy number data Sharon J. Diskin, Thomas Eck, Joel P. Greshock, Yael P. Mosse, Tara.
Algorithms for Smoothing Array CGH data
DNA Copy Number Analysis Qunyuan Zhang Division of Statistical Genomics Department of Genetics & Center for Genome Sciences Washington University School.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Evaluating cell lines as tumor models by comparison of genomic profiles Domcke, S. et al. Nat. Commun 4:2126.
Genome-wide Copy Number Analysis Qunyuan Zhang,Ph.D. Division of Statistical Genomics Department of Genetics & Center for Genome Sciences Washington University.
Analysis of microarray data
Microarray Preprocessing
ChrX probes Autosomal probes ChrX probes Autosomal probes Autosomal probes ChrX probes Effect of hybridization temperature on microarray performance Figure.
SIGMA: A Platform to Visualize and Analyze DNA Copy Number Microarray Data Raj Chari, PhD Student BC Cancer Research Centre Department of Cancer Genetics.
Manifestation of Novel Social Challenges of the European Union in the Teaching Material of Medical Biotechnology Master’s Programmes at the University.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
TCGA The Cancer Genome Atlas Project January 24, 2008.
DNA Copy Number Analysis Qunyuan Zhang,Ph.D. Division of Statistical Genomics Department of Genetics & Center for Genome Sciences Washington University.
Agenda Introduction to microarrays
Large Scale Variation Among Human and Great Ape Genomes Determined by Array Comparative Genomic Hybridization Devin P. Locke, Richard Segraves, Lucia Carbone,
Microarray - Leukemia vs. normal GeneChip System.
Dose Response 100% 50% 0% IC50, EC50 Dose response curve.
Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
ImArray - An Automated High-Performance Microarray Scanner Software for Microarray Image Analysis, Data Management and Knowledge Mining Wei-Bang Chen and.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
____ __ __ _______Birol et al :: AGBT :: 7 February 2008 A NOVEL APPROACH TO IMPROVE THE NOISE IN DETECTING COPY NUMBER VARIATIONS USING OLIGONUCLEOTIDE.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Introduction to caIntegrator caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
1 Classification and Feature Selection Algorithms for Multi-class CGH data Jun Liu, Sanjay Ranka, Tamer Kahveci
CGH Data BIOS Chromosome Re-arrangements.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Copy Number Analysis in the Cancer Genome Using SNP Arrays Qunyuan Zhang, Aldi Kraja Division of Statistical Genomics Department of Genetics & Center for.
Tumor Heterogeneity: From biological concepts to computational methods Bo Li, PhD Dana Farber Cancer Institute Harvard Statistics Department.
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
The 2007 Microarray Research Group Project
A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of Array-CGH to FISH in Diagnostic Evaluation  Lu Wang, Mamta Rao,
Some slides adapted from J. Fridlyand
The array comparative genomic hybridization (aCGH/CMA) technology
Volume 1, Issue 1, Pages (February 2002)
Fig. 8. Recurrent copy number amplification of BRD4 gene was observed across common cancers. Recurrent copy number amplification of BRD4 gene was observed.
Rearrangements of Gene A and Gene B
DNA Chip Data Interpretation Tools: Genmapp & Dragon View
Mariëlle I. Gallegos Ruiz, MSc, Hester van Cruijsen, MD, Egbert F
CSCI2950-C Lecture 3 September 13, 2007.
Timon P. H. Buys, BSc, Sarit Aviel-Ronen, MD, Thomas K
Microarray Techniques to Analyze Copy-Number Alterations in Genomic DNA: Array Comparative Genomic Hybridization and Single-Nucleotide Polymorphism Array 
A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of Array-CGH to FISH in Diagnostic Evaluation  Lu Wang, Mamta Rao,
Volume 25, Issue 5, Pages e5 (October 2018)
Diverse abnormalities manifest in RNA
Histology and genomic copy number alterations in TRAMP tumors.
Identification and characterization of a novel KRAS rearrangement in metastatic prostate cancer. Identification and characterization of a novel KRAS rearrangement.
Presentation transcript:

Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu

Today’s Topics Review aCGH and its data analysis Homework of aCGH data analysis using tools in Genboree and ruby

Chromosomal Aberrations REF: Albertson et al

Array CGH Label Patient DNA with Cy3 Label Control DNA with Cy5 Hybridize DNA to genomic clone microarray Analyze Cy3/Cy5 fluorescence ratio of patient to control (log of Cy3/Y5)

Workflow of aCGH Analysis Finished chips (scanner) Raw image data (experiment info ) (image processing software) Probe level raw intensity data Background adjustment, Normalization, transformation Raw copy number (CN) data [log ratio of tumor/normal intensities] Segmentation and boundary determination Estimation of CN Characterizing individual genomic profiles

Background Adjustment/Correction Reduces unevenness of a single chip Before adjustment After adjustment Corrected Intensity (S’) = Observed Intensity (S) – Background Intensity (B) Eliminates non-specific hybridization signal Normalization

Reduces technical variation between chips Before After S – Mean of S S’ = STD of S S’ ~ N(0,1 ) Normalization Log Transformation before Log transformation S after Log transformation Log(S) S : Probe raw intensity; S’ : Log transformation, S’ = log 2 (S) CN = S’ tumor - S’ normal = log 2 (S tumor /S normal )

Segmentation/Smoothing CN Clone/Chromosome

CN Clone/Chromosome Segmentation/Smoothing

Goal:To partition the clones into sets with the same copy number and to characterize the genomic segments. Noise reduction Detection of Loss, Normal, Gain, Amplification Breakpoint analysis Biological model: genomic rearrangements lead to gains or losses of sizable contiguous parts of the genome. Recurrent (over tumors) aberrations may indicate an oncogene or a tumor suppressor gene

AWS - Adaptive Weights Smoothing CBS - Circular Binary Segmentation HMM - Hidden Markov Model partitioning Many more All existing methods amount to unsupervised, location- specific partitioning and operating on individual chromosomes. Segmentation Methods

Workflow of aCGH Data Analysis Finished chips (scanner) Raw image data (experiment info ) (image processing software) Probe level raw intensity data Background adjustment, Normalization, transformation Raw copy number (CN) data [log ratio of tumor/normal intensities] Segmentation and boundary determination Estimation of CN Characterizing individual genomic profiles

Homework: Analyze TCGA Data

The Cancer Genome Atlas Project (TCGA) Goal: find genomic alterations that cause cancer (mutations, CNA, methylation, …) Pilot project 1. brain (glioblastoma multiforme): 186 pairs of tumor and normal samples 2. lung (squamous) 3. ovarian (serous cystadenocarcinoma )

Flowchart of Data Analysis Raw copy number (CN) data [log ratio of tumor/normal intensities] Segmenttion and boundary determination Estimation of CN Characterizing individual genomic profiles Annotation Identify Recurrent Genes

Ruby: Mapping Probes

LFF format

Upload Data

Data Analysis: Segmentation

Data Analysis: Combine Tracks

Data Analysis: Annotation Selector

Data Analysis: Mapping Genes

Data Analysis: Recurrent Genes

Overview of Data Analysis Raw copy number (CN) data [log ratio of tumor/normal intensities] Data Preprocessing (Ruby) and uploading data to Genboree Segmentation (Segmentation Tool) Characterizing individual genomic profiles Combing data Annotation (Annotation Selector; Attribute Lifter) Identify Recurrent Genes (Ruby)

You Need To Submit 1.ruby script from step 1 that creates your lff file 2.ruby script from step 5 that parses your table 3.two-column final output from step 5