Deploying Galaxy for use with High Throughput Screening

Slides:



Advertisements
Similar presentations
In Silico Primer Design and Simulation for Targeted High Throughput Sequencing I519 – FALL 2010 Adam Thomas, Kanishka Jain, Tulip Nandu.
Advertisements

Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Advantages of C. elegans: 1. rapid life cycle 2. hermaphrodite
Information Retrieval in Practice
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Visual Basic and Perl Applications for Genome Project Management. Svetlana N.Yurgel* 1, Brenda K. Schroeder 1, Hao Jin 2 and Michael L. Kahn 1,3. Institute.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Genome-Wide RNAi Analysis of Growth and Viability in Drosophila Cells Boutros et al.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Overview of Search Engines
Biological Sequence Analysis BNFO 691/602 Spring 2014 Mark Reimers
Before we start: Align sequence reads to the reference genome
NGS Analysis Using Galaxy
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Bioinformatics and OMICs Group Meeting REFERENCE GUIDED RNA SEQUENCING.
Genome-Wide RNAi Analysis of Growth and Viability in Drosophila Cells Michael Boutros, Amy A. Kiger, Susan Armknecht, Kim Kerr, Marc Hild, Britta Koch,
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
A new way of seeing genomes Combining sequence- and signal-based genome analyses Maik Friedel, Thomas Wilhelm, Jürgen Sühnel FLI Introduction: So far,

Reconstruction of Transcriptional Regulatory Networks
Touchstone Automation’s DART ™ (Data Analysis and Reporting Tool)
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto.
Sackler Medical School
The Power of “Genetics” LOSS OF FUNCTION Easy in yeast Difficult in mammals Powerful tool to address roles in developmental or signaling networks Gene.
NCBI Genome Workbench Chuong Huynh NIH/NLM/NCBI Sao Paulo, Brasil July 15, 2004 Slides from Michael Dicuccio’s Genome Workbench.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Introduction to biological molecular networks
RNA-ligand interactions and control of gene expression
No reference available
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Accessing and visualizing genomics data
Information Retrieval in Practice
Konstantin Okonechnikov Qualimap v2: advanced quality control of
Virginia Commonwealth University
Bacterial infection by lytic virus
Networks and Interactions
Cancer Genomics Core Lab
Search Engine Architecture
RNAi Overview
Preprocessing Data Rob Schmieder.
Quality Control & Preprocessing of Metagenomic Data
Telephone    Provider of Global Contract Research Services Accelerating Preclinical Research, Drug Discovery.
Bioinformatics in the Dynamic Genome Course
Regulatory Genomics Lab
ANIMAL TARGET PREDICTION - TIPS
Joseph JaJa, Mike Smorul, and Sangchul Song
Genome Sequence Annotation Server
Rod Eyles1, John Juma1, Morag Ferguson1, Trushar Shah1 1 IITA, Nairobi
Intelligent Information System Lab
University of Pittsburgh
Chapter 1 The Systems Development Environment
Large Scale Annotation of Genomic Datasets with Genephony
RNA interference: the new somatic cell genetics?
Explore Evolution: Instrument for Analysis
Working in the Post-Genomic C. elegans World
ChIP-seq Robert J. Trumbly
Wiley Interdiscip Rev Dev Biol May;7(3):e314
Regulatory Genomics Lab
Engineering parathyroid cells to treat secondary hyperparathyroidism
Regulatory Genomics Lab
The Power of “Genetics”
RNAi screening formats.
Genome-wide Functional Analysis Reveals Factors Needed at the Transition Steps of Induced Reprogramming  Chao-Shun Yang, Kung-Yen Chang, Tariq M. Rana 
Presentation transcript:

Deploying Galaxy for use with High Throughput Screening Grainne Kerr Signaling and Functional Genomics

RNA interference (RNAi) dsRNA (>27 nt) Dicer siRNA (18 – 21 nt) RISCs RISC targets homologous mRNA Targeted degradation

Long double stranded DNA (dsDNA) Small interfering DNA (siDNA) RNAi and Screening Target genes with Long double stranded DNA (dsDNA) Small interfering DNA (siDNA) Success depends on: Specificity (homology to target and non-targets, interferon response, concentration dependant cytotoxic responses) Efficiency (GC content, repeats, target site accessibility etc) Gene silencing – a reverse engineering approach Efficiency: GC conent (30 – 52%) Low internal stability at sese strand 3’ terminus Absense of inverted repeats Base preferences at certain positions Chemical modifications Target site accessibilty Specificity (sequence independent) Interforn response Concentration dependent effects Specificity (sequence dependant) Perfect homogy to unintended transcripts Perfect homology to ‘seed’ Imperfect homology Genome wide RNAi experiments – two functions really: 1 – for screens – the goal is identification of a few new core components of specifically designed assay – then they will be taken forward for more in depth research 2 – for surveys – the aim here is the systematic mapping of phenotypic profiles and possible genetic interaction networks.

High Throughput Screening Systematic silencing of whole genome Every gene is targeted. Phenotypic luminescence readout Experimental perturbations on a genome wide scale…. Phenotype readout – typically a fluorescence readout ….

Experimental Steps RNAi Library Design Cell based assay format Genome Annotation Library Annotation RNAi Library Design Plate Config Plate List Screen Description Cell based assay format Screen Data Files Screen Log Files Large scale experiment Analysis Reports Computational analysis

Experimental Steps RNAi Library Design Cell based assay format Genome Annotation Library Annotation RNAi Library Design Plate Config Plate List Screen Description Cell based assay format Screen Data Files Screen Log Files Large scale experiment Obvious that there is a lot of manipulation of files and data between steps Library annotation files…. Values for each gene, in replicates …. Design of siRNA or dsRNA reagents Normalization of plates Determination of significant changes in phenotype Grouping and finding patterns in the significant hits Long term storage, retrieval and presentation of results Analysis Reports Computational analysis

Galaxy’s role in HTS Glues different stages together

Computational Analysis Design of siRNA or dsRNA reagents Normalization of plates Determination of significant changes in phenotype Grouping and finding patterns in the significant hits Retrieval and presentation of results

Overview Design of reagents Analysis of the read-out data Selecting and analysis of “hit” genes

NEXT RNAi Automated design and evaluation of RNAi sequences on a genome wide scale Thomas Horn, Thomas Sandmann and Michael Boutros. Design and evaluation of genome-wide libraries for RNAi screens. Genome Biol. 2010 Jun 15;11(6):R61.

NEXT-RNAi Workflow Target sequence Feature tables Low-complexity filter In silico dicing Prediction of specificity and efficiency Discarding of siRNAs below threshold Design of long dsRNAs primers Ranking of designs Homology, feature and genome mapping Flat files, HTML report, GBrowse

NEXT RNAi Input and dependencies Fasta file of target sequences Target-group file (e.g. SNP’s, transcript variants) Requirements: Bowtie – sequence specificity Primer3 – evaluate the primers Optional BLAST BLAT – align longer sequences RNAfold – evaluate the 3d structure Mdust – evaluate primer sequences NEXT RNAI is a powerful workflow that does a lot.

NEXT RNAi in Galaxy Why Galaxy? Number of input parameters Number of dependent choices Guidance of choices – conditional inputs in Galaxy GUI - Not command line Incorporation into workflows

NEXT RNAi in Galaxy Guides the user threw the maze of inputs The more complicated parameters are hidden from the user until required.

NEXT-RNAI Output Outputs – tab delimited text file, comprehensive html report, including a graphical display of designs in gbrowse. Furhter information is given in GFF and fasta files. Note – a lot of information

NEXT RNAi in Galaxy Summary Allows for the rapid batch design of RNAi libraries for: Genome wide set Defined batch of genes Evaluation of commercial libraries Re-evaluation of libraries with release of updated genome annotation There are tools available to design dsRNA’s or siRNAs on a gene by gene basis – but this is different as it allows the user to do it in a rapid and high throughput manner. Uses multiple parameters, not just sequence specificity. Can be used to create multiple siRNAs per gene to give independent phenotypes. Reannotation of libraries – when updated genome annotation is released….

Systematic analysis of screens cellHTS Systematic analysis of screens Phenotypic results range from single numerical value to multiple dimensional images Standardization of experimental information Standardization of analysis Standardisation of analysis within the lab possible – no home custom remedies – makes integration possible Boutros, M., L. Bras, and W. Huber. (2006). Analysis of cell-based RNAi screens. Genome Biology 7:R66.

Per plate quality control cellHTS Import raw data files Per plate quality control Data normalization Scoring of phenotypes Raw data files = the row / well / luminesence value from reader. There is 1 file for each replicate measurement per plate. There is usually a separate set of files for each reporter. Annotation and analysis

cellHTS in Galaxy Vs.

cellHTS in Galaxy Cell HTS is good – but need to input a lot of files

Web based interactive guide to create input files. webcellHTS Web based interactive guide to create input files. Creation of input files is error prone and daunting for the novice user, and prone to errors. This is a user friendly interface to doing creating the files. Guides the user through the steps.

HTS Workflows Annotation Libraries storage

HTS Workflows Standardizing analysis

HTS Workflows Incorporation with other specific screening analysis tools e.g. Redundant siRNA activity analysis (RSA) e.g. cell HTS -> RSA -> join with data libraries, join with data from other screens. Very complicated analysis being simplified greatly into a few steps … RSA written in perl, cellHTS written in R – no problem….. Check for viability and/or effect…… “Present analysis methodologies for large scale RNAi datasets typically rely on ranking screening data and are based on singel siRNA activity or significance value. These analysis focus on the identification of highly active siRNAs (wells), which tupcally fall within the top 1% of the assayed activities and ignore much of the remainder of the dataset. Furthermore these strategies do not exploit redundancies in the genome-sclae libraries, which typically contain 2-4 siRNAs per gene. Thus, it is difficult to systematically identify genes for which multiple siRNAs are active across a screen, which do not fall within an upper threshold (i.e. a moderately active siRNA).” Koenig et al. Nature Methods König et al. A probability based approach for the analysis of large-scale RNAi screens, Nature Methods, 2007

HTS Workflows

HTS Workflows Simple joins are very useful!!!

Outlook Continue to develop and improve tools Expand tools to deal with wider range of screen data e.g. double knockdowns Training courses to develop familiarity with other tools Galaxy has to offer

Acknowledgements Michael Boutros Thomas Horn Chen Chen Oliver Pelz Thomas Sandmann The IT team in Signaling and Functional Genomics