Genomic Arrays: Tools for cancer gene discovery Ian Roberts MRC Cancer Cell Unit Hutchison MRC Research Centre

Slides:



Advertisements
Similar presentations
Visualization and analysis of large data collections: a case study applied to confocal microscopy data Wim de Leeuw, Swammerdam Institute for Life Sciences,
Advertisements

We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Cancer-inducing genes - CRGs (cooperation response genes) Paper Presentation Nadine Sündermann.
Image Analysis Phases Image pre-processing –Noise suppression, linear and non-linear filters, deconvolution, etc. Image segmentation –Detection of objects.
Agilent CytoGenomics 1.5 For Detection of CNC, LOH and UPD
AIIA Lab, Department of Informatics Aristotle University of Thessaloniki Z.Theodosiou, F.Raimondo, M.E.Garefalaki, G.Karayannopoulou, K.Lyroudia, I.Pitas,
SW-ARRAY: a dynamic programming solution for the identification of copy-number changes in genomic DNA using array comparative gnome hybridization data.
Tumour karyotype Spectral karyotyping showing chromosomal aberrations in cancer cell lines.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
TIGR Spotfinder: a tool for microarray image processing
Development, Implementation and Testing of a DNA Microarray Test Suite Ehsanul Haque Mentors: Dr. Cecilie Boysen Dr. Jim Breaux ViaLogy Corp.
Some slides adapted from J. Fridlyand BioSys course: DNA Microarray Analysis – Lecture, 2007 Analysis of Array CGH Data by Hanni Willenbrock.
Genetic algorithms applied to multi-class prediction for the analysis of gene expressions data C.H. Ooi & Patrick Tan Presentation by Tim Hamilton.
STAC: A multi-experiment method for analyzing array-based genomic copy number data Sharon J. Diskin, Thomas Eck, Joel P. Greshock, Yael P. Mosse, Tara.
Algorithms for Smoothing Array CGH data
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Low Level Statistics and Quality Control Javier Cabrera.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Analysis of microarray data
Microarray Preprocessing
SIGMA: A Platform to Visualize and Analyze DNA Copy Number Microarray Data Raj Chari, PhD Student BC Cancer Research Centre Department of Cancer Genetics.
MCGH Analyzer1 Hans A. Kestler André Müller.
Page 1 Mouse Genome CGH Microarray 44A. Page 2 Mouse Genome CGH Microarray Kit 44A Designed for CGH, Validated with samples of known aberrations Designed.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Analysis of Molecular and Clinical Data at PolyomX Adrian Driga 1, Kathryn Graham 1, 2, Sambasivarao Damaraju 1, 2, Jennifer Listgarten 3, Russ Greiner.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
Configuration Management (CM)
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Agenda Introduction to microarrays
GenomeVIP: A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon’s Cloud R. Jay Mashl October 20, 2014.
BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Identification of Copy Number Variants using Genome Graphs
Other genomic arrays: Methylation, chIP on chip… UBio Training Courses.
____ __ __ _______Birol et al :: AGBT :: 7 February 2008 A NOVEL APPROACH TO IMPROVE THE NOISE IN DETECTING COPY NUMBER VARIATIONS USING OLIGONUCLEOTIDE.
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.
Class 23, 2001 CBCl/AI MIT Bioinformatics Applications and Feature Selection for SVMs S. Mukherjee.
Extracting binary signals from microarray time-course data Debashis Sahoo 1, David L. Dill 2, Rob Tibshirani 3 and Sylvia K. Plevritis 4 1 Department of.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb GP3xCLI GenePix Post-Processing.
CGH Data BIOS Chromosome Re-arrangements.
First of all: “Darnit Jim, I’m a doctor not a bioinformatician!”
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Copy Number Analysis in the Cancer Genome Using SNP Arrays Qunyuan Zhang, Aldi Kraja Division of Statistical Genomics Department of Genetics & Center for.
LIGO-G9900XX-00-M DMT Monitor Verification with Simulated Data John Zweizig LIGO/Caltech.
Microarray: An Introduction
PROTEIN IDENTIFIER IAN ROBERTS JOSEPH INFANTI NICOLE FERRARO.
GEO (Gene Expression Omnibus) Deepak Sambhara Georgia Institute of Technology 21 June, 2006.
Development of an interactive pipeline for Genome wide association analysis Falola Damilare & Adigun Taiwo – Covenant University Bioinformatics research.
A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of Array-CGH to FISH in Diagnostic Evaluation  Lu Wang, Mamta Rao,
Discovery of Multiple Differentially Methylated Regions
Figure 2: Make a component
Some slides adapted from J. Fridlyand
Kendy K. Wong, Ronald J. deLeeuw, Nirpjit S. Dosanjh, Lindsey R
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Timon P. H. Buys, BSc, Sarit Aviel-Ronen, MD, Thomas K
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of Array-CGH to FISH in Diagnostic Evaluation  Lu Wang, Mamta Rao,
Validation and Implementation of a Custom Next-Generation Sequencing Clinical Assay for Hematologic Malignancies  Michael J. Kluk, R. Coleman Lindsley,
Cyclin E1 Is Amplified and Overexpressed in Osteosarcoma
Defining Ploidy-Specific Thresholds in Array Comparative Genomic Hybridization to Improve the Sensitivity of Detection of Single Copy Alterations in Cell.
BF528 - Genomic Variation and SNP Analysis
Other genomic arrays: Methylation, chIP on chip…
Presentation transcript:

Genomic Arrays: Tools for cancer gene discovery Ian Roberts MRC Cancer Cell Unit Hutchison MRC Research Centre

2/17 What’s a genomic array? A platform of regularly spaced genomic sequences  All known genes or a subset of genes of interest A tool for querying the genome about damage  Genomic gains (oncogenes)  Genomic losses (tumour suppressor genes) Applications  Research  disease gene discovery  Clinical  diagnostic tests

Comparative genomic hybridisation Tumour DNA (Test) Normal DNA (Reference) + Available probe GAIN: More test probe than reference probe (oncogene) LOSS: Reference probe in excess of test (tumour suppressor) Vast majority is normal Array platform

4/17 New generation arrays produce large amounts of data Agilent 244K array 243,504 defined spots Raw data is foreground and background signal intensities in two channels Median ratio of foreground is important.

aCGH data analysis using camgrid

6/17 Genomic array analysis strategy using R 1. array data is processed by snapCGH R package Correct array data for background noise and mean distribution Order data by genomic location Apply an aCGH segmentation algorithm Draw some plots 2. Determine significant findings (in house R functions) Common and minimum genomic regions of gain and loss Summarise output R  snapCGH  parrot R on camgrid 

7/17 Old vs. New genomic array plots Chromosome 7

Significant region detection is computationally intensive

Distributed aCGH analysis Consolidate output Preprocess data Input data to snapCGH (e.g. 3 chrs, 2 analysis methods) Condor Job 1 Condor Job 2 Generate genome ordered data and condor dagman analysis batch files Chr 1Chr 2Chr 3 DNA copy GLAD DNA copy GLAD DNA copy GLAD Perform aCGH analysis + region detection (1 run per Chr per analysis method) DNAcopy dagman description file Score combining 1. Clone call scoring n. Clone call scoring Segmentation Step CRI MRI Detection Dagman job 1 … n

10/17 Condor job scripting in BASH & R BASH function  Responsible for producing required condor files for discrete jobs  Default_submit has 2 positional parameters R script name  $1 Data files  $2  Initiates aCGH analysis on grid. Condor dagman R function set  R-scripter Writes the appropriate R script for the current job  R-condor-submitter Writes the condor job submission file  R-condor-executer Writes the condor job executable file  R-job-descriptor Writes the condor dagman description file

11/17 End user abstraction – start_aCGH.sh aCGH analysis undertaken by a single shell command  Manages array data input  Collects user specified parameters Chromosome range Segmentation algorithms Significance thresholds  Links condor R job scripting

12/17 start_aCGH.sh session on mole

…. continued … 1 hr – 6 hr later! aCGH region information and plots

14/17 Summary findings (38 arrays) Rapid identification of regions of interest Easy comparison of aCGH analysis via different algorithms Bio HMM DNAcopy Sample percentage Region size

15/17 Real life application Retrospective analysis confirms initial findings! (summary of 38 samples) OSMR Sample percentage Region size

16/17 Future development Tailor output for specific user requirements Produce overall summary plot Apply approach to expression arrays

Grace Ng Steph Carter Konstantina Karagavriliidou Jenny Barna Mark Calleja Nick Coleman