R AG P OOLS : RNA-As-Graph-Pools A Web Server to Assist the Design of Structured RNA Pools for In-Vitro Selection R AG P OOLS : RNA-As-Graph-Pools A Web.

Slides:



Advertisements
Similar presentations
RNA & NKS Erik A. Schultes Hedgehog Research hedgehogresearch.info June 16, 2006.
Advertisements

Gene expression From Gene to Protein
1 An Adaptive GA for Multi Objective Flexible Manufacturing Systems A. Younes, H. Ghenniwa, S. Areibi uoguelph.ca.
Towards RNA structure prediction: 3D motif prediction and knowledge-based potential functions Christian Laing Tamar Schlick’s lab Courant Institute of.
6 - 1 Chapter 6 The Secondary Structure Prediction of RNA.
Measuring the degree of similarity: PAM and blosum Matrix
The Many Roles of Computational Science in Drug Design and Analysis Mala L. Radhakrishnan Department of Chemistry, Wellesley College June 17, 2008 DOE.
A A A AA A A A A A A A A A A A A U A C GU C G AU U U A A A C C C G G G U U U AAA G G G C C C C C U A C G U C G AU U U A A A C C C G G G U U U A A.
Predicting the 3D Structure of RNA motifs Ali Mokdad – UCSF May 28, 2007.
Structural bioinformatics
An unbiased adaptive sampling algorithm for the exploration of RNA mutational landscapes under evolutionary pressure Jérôme Waldispühl, PhD School of Computer.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Luddite: An Information Theoretic Library Design Tool Jennifer L. Miller, Erin K. Bradley, and Steven L. Teig July 18, 2002.
OCFS: Optimal Orthogonal Centroid Feature Selection for Text Categorization Jun Yan, Ning Liu, Benyu Zhang, Shuicheng Yan, Zheng Chen, and Weiguo Fan et.
Creating enzymes not found in nature Burckhard Seelig University of Minnesota & Harvard Medical School.
Methods Six putative ncRNA datasets are analyzed: (1) FANTOM3 ncRNAs, (2) FANTOM3 stringent ncRNAs, RNAz predictions (3) set1.P0.5 (P>0.5), (4) set1.P0.9.
CISC667, F05, Lec19, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) RNA secondary structure.
The Wonderful World of RNA DNARNA protein. Complexity of RNA Folding 1 strand 1 strand 4 building blocks 4 building blocks Basic structural element: double.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
Finding Regulatory Motifs in DNA Sequences. Motifs and Transcriptional Start Sites gene ATCCCG gene TTCCGG gene ATCCCG gene ATGCCG gene ATGCCC.
GENE EXPRESSION.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Experiments and theory about an elementary coding system based on RNA Brookhaven Laboratory 01/13/2008 Jean Lehmann Center for Studies in Physics and Biology.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
RNA-metabolite interactions (riboswitches). RNA aptamers RNA aptamers are structures that bind specifically to target ligands Many aptamers have been.
Phylogenetic Reconstruction based on RNA Secondary Structural Alignment Benny Chor, Tel-Aviv Univ. Joint work with Moran Cabili, Assaf Meirovich, and Metsada.
RNA processing Proks vs. Euks In proks, transcription & translation coupled In euks, processes are temporally& spatially separated so more control.
A new way of seeing genomes Combining sequence- and signal-based genome analyses Maik Friedel, Thomas Wilhelm, Jürgen Sühnel FLI Introduction: So far,
WANG HANLU Advance in research of aptamers and related drugs.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Function-Information Relationship in Nucleic Acids Andrej Luptak U NIVERSITY oU NIVERSITY of C ALIFORNIA ‧ I RVINE Andrej Luptak
Ben-Gurion University Research focus on riboswitches Bioinformatics Research Focus: Searching for Novel Riboswitches in Newly Sequenced Genomes Danny Barash†
Outline More exhaustive search algorithms Today: Motif finding
Efficient Bacterial Transcription of DNA Nanocircle Vectors with Optimized Single-Stranded Promoters Tarsuo Ohmichi; Angéle Maki; Eric T. Kool A modular.
Markov Chain Monte Carlo and Gibbs Sampling Vasileios Hatzivassiloglou University of Texas at Dallas.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
Protein Synthesis Unit 5. Protein Synthesis DNA  RNA  Proteins 4 Steps: 1)Transcription  information is DNA is copied to RNA (nucleic acid  nucleic.
Maik Friedel, Swetlana Nikolajewa, Thomas Wilhelm Theoretical Systems Biology, FLI-Jena, Germany Codons and the reverse codons.
Progress toward Predicting Viral RNA Structure from Sequence: How Parallel Computing can Help Solve the RNA Folding Problem Susan J. Schroeder University.
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
MS Sequence Clustering
Chapter 13 RNA and Protein Synthesis. DNA: The Blueprints The structure of DNA “CUT the Pyramids ”
WHAT FACTORS PROMOTE THE EMERGENCE OF BIOCOMPLEXITY? WHAT FACTORS PROMOTE THE EMERGENCE OF BIOCOMPLEXITY? Robert M. Hazen, Carnegie Institution Kavli Futures.
RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:
Synthetic Sequence Design for Signal Location Yaw-Ling Lin ( 林 耀 鈴 ) Dept Computer Sci and Info Engineering College of Computing and Informatics Providence.
Markov Chain Models BMI/CS 576 Colin Dewey Fall 2015.
Motif Search and RNA Structure Prediction Lesson 9.
Maik Friedel, Thomas Wilhelm, Jürgen Sühnel FLI-Jena, Germany Introduction: During the last 10 years, a large number of complete.
Computational Biology, Part 3 Representing and Finding Sequence Features using Frequency Matrices Robert F. Murphy Copyright  All rights reserved.
Development of the Theophylline Riboswitch 1 on 2 Meeting 09/03/13.
But how to count? An RNA could be very long; there may be many possible ways that base pairs can be formed: e.g., ……ACGGUACGUC….. conflicting pairs A-U,
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
1 Repeats!. 2 Introduction  A repeat family is a collection of repeats which appear multiple times in a genome.  Our objective is to identify all families.
Replication, Transcription, Translation PRACTICE.
Rapid ab initio RNA Folding Including Pseudoknots via Graph Tree Decomposition Jizhen Zhao, Liming Cai Russell Malmberg Computer Science Plant Biology.
Evolutionary Computation Evolving Neural Network Topologies.
Multiple sequence alignment (msa)
NOTES; DNA.
InVITRO selection (SELEX)
Predicting RNA Structure and Function
Integrating Meta-Path Selection With User-Guided Object Clustering in Heterogeneous Information Networks Yizhou Sun†, Brandon Norick†, Jiawei Han†, Xifeng.
THECATSAWTHEBATANDATETHEBIGFATRAT
Laura Lancaster, Harry F. Noller  Molecular Cell 
Panos Oikonomou, Hani Goodarzi, Saeed Tavazoie  Cell Reports 
Department of Computer Science Ben-Gurion University
Department of Computer Science Ben-Gurion University
Deep Learning in Bioinformatics
Volume 11, Issue 1, Pages 1-12 (April 2015)
Presentation transcript:

R AG P OOLS : RNA-As-Graph-Pools A Web Server to Assist the Design of Structured RNA Pools for In-Vitro Selection R AG P OOLS : RNA-As-Graph-Pools A Web Server to Assist the Design of Structured RNA Pools for In-Vitro Selection The 3rd Annual ROC Meeting – Madison, WI May 28-29, RNA Pool Design for In Vitro Selection 2. Modeling of Pool Synthesis 3. Features of R AG P OOLS 4. Conclusions Namhee Kim Laboratory of Prof. Tamar Schlick New York University

NYU/BIOMATH In Vitro Selection An experimental approach to screen large (~10 15 ) random- sequence libraries of RNAs for a specific function (e.g., binding property) Numerous aptamers and ribozymes were discovered from in vitro selection D. Wilson and J.W. Szostak, Annu.Rev.Biochem 68:611 (1999)

NYU/BIOMATH Targeted RNA Pool Design – Already an experimental goal J.H. Davis and J.W. Szostak, Proc. Natl. Acad. Sci. 99:11616 (2002) M.W. Lau, K.E. Cadieux, and P.J. Unrau, J. Am. Chem. Soc. 126:15686 (2004) – Random pools are biased to simple topologies – Complex structures are more active N. Kim, H.H. Gan, and T. Schlick, RNA 13:478 (2007) Proposal Design better pools by mixing base composition to target novel structures J. Gevertz et al., RNA 11:853 (2005) J. Carothers et al., J. Am. Chem. Soc. 126:5130 (2004)

NYU/BIOMATH 4 2. Modeling of Pool Synthesis – By optimizing compositions of A, U, C and G in four containers (Mixing Matrix) and starting sequence, we seek to design pools with target topologies e.g., 40% 20% 10% 30% A 20% 30% 40% 10% U 30% 10% 20% 20% G 10% 40% 30% 40% C instead of 25% 25% 25% 25% A 25% 25% 25% 25% U 25% 25% 25% 25% G 25% 25% 25% 25% C

NYU/BIOMATH 5 3. Algorithm for Structured Pool Design Step 1. Specify a target distribution of topologies/shapes Step 2. Define candidates for starting sequences and mixing matrices that aim to cover the sequence space Step 3. Compute motif distributions corresponding to all starting sequence/mixing matrix pairs Step 4. Choose the number of mixing matrices to approximate the designed pool Step 5. Find an optimal combination of starting sequences and mixing matrices and associated weights to approximate the target RNA motif distribution R AG P OOLS : RNA-As-Graph-Pools Web Server N. Kim, J. S. Shin, S. Elmetwaly, H.H. Gan, and T. Schlick, submitted (2007)

NYU/BIOMATH 6

7

8

9 Examples of Structured Pools Examples of Structured Pools InputOutput Target structure distributions Number of mixing matrices Starting sequences Optimized associated weights, mixing matrices and starting sequences 4 1, 4 2 : 30%, 30% 2 (conservati on of C and G) All78%, MM13, modified GTP aptamer 22%, MM12, Hammerhead ribozyme 5 1, 5 2, 5 3 : 20%, 20%, 20% 3All12%, MM1, 70S 83%, MMT12, tRNA 5%, MMT4, DsrA ncRNA 5 1, 6 1 : 30%, 30% nt38.5%, MM3, tRNA 61.5%, MMT8, let-7 ncRNA 5 2, 6 2 : 20%, 20% 2Riboswitch77%, MM19, TPP riboswitch 23%, MMT4, TPP riboswitch

NYU/BIOMATH 10

NYU/BIOMATH 11

NYU/BIOMATH 12

NYU/BIOMATH Conclusions wThe R AG P OOLS offers a general tool for designing and analyzing structured RNA pools with specified target motif distributions wIn the near future, we expect to expand the set of starting sequences and mixing matrices and provide more detailed analyses of local structural properties wContact us at:

NYU/BIOMATH 14 Acknowledgments wProf. Tamar Schlick wDr. Hin Hark Gan wJin Sup Shin wShereef Elmetwaly wAll members of the Schlick Lab ●NYU McCracken fellowship and IGERT NSF fellowship ●NSF, NIH and HFSP

NYU/BIOMATH 15 Mixing Matrix Motivated by Biological Mutations M AA =M CC =M GG =M UU (A:1-6) M CC =M GG (B:7-10) M AA =M UU (C:11-14) M AC =M UG (D:15-18) M CA =M GU (E:19-22) M AA M AC M AG M AU M CA M CC M CG M CU M GA M GC M GG M GU M UA M UC M UG M UU Mixing Matrix M motivated by biological mutations A C G U ACGUACGU

NYU/BIOMATH 16 Starting Sequences and Coverage of Sequence Space Starting sequences (a) 5 1 motif (e) 4 2 motif

NYU/BIOMATH 17 Motif Distributions (e) 4 2 motif and Matrices1-22

NYU/BIOMATH 18 GTP Aptamer Pool GTP Aptamer Pool –Complex 5 2 and 4 2 motifs are targeted –Sequence/structure contour plot of the designed pools is different from random pool –Targeted structured pool depends on targeted function J. Carothers et al., J. Am. Chem. Soc. 126:5130 (2004)