17.02.2005 Fast and effective prediction of miRNA targets Marc Rehmsmeier CeBiTec, Bielefeld University, Germany Junior Research Group Bioinformatics of.

Slides:



Advertisements
Similar presentations
MicroARNs, comparaison, prediction
Advertisements

Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Table 2 shows that the set TFsf-TGblbs of predicted regulatory links has better results than the other two sets, based on having a significantly higher.
Improving miRNA Target Genes Prediction Rikky Wenang Purbojati.
Naveen K. Bansal and Prachi Pradeep Dept. of Math., Stat., and Comp. Sci. Marquette University Milwaukee, WI (USA)
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
McPromoter – an ancient tool to predict transcription start sites
RNA Structure Prediction
CISC667, F05, Lec18, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Gene Prediction and Regulation.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
RNA Structure Prediction Rfam – RNA structures database RNAfold – RNA secondary structure prediction tRNAscan – tRNA prediction.
Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs LEE P. LIM, NELSON C. LAU, PHILIP GARRETT-ENGELE ANDREW GRIMSON,
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
MicroRNA Target Prediction Using Muscle Atrophy Genes As Models Caltech Wold Lab Mentors: Dr. Barbara Wold Diane Trout Brandon King Gilberto Hernandez,
Predicting RNA Structure and Function. Following the human genome sequencing there is a high interest in RNA “Just when scientists thought they had deciphered.
Whole Genome Polymorphism Analysis of Regulatory Elements in Breast Cancer AAGTCGGTGATGATTGGGACTGCTCT[C/T]AACACAAGCGAGATGAAGAAACTGA Jacob Biesinger Dr.
Predicting RNA Structure and Function
In Silico Sciences The RNA secondary structure dependence of RNA protein interactions and its implications for the post transcriptional regulation of gene.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
microRNA computational prediction and analysis
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
BLAST: Basic Local Alignment Search Tool Urmila Kulkarni-Kale Bioinformatics Centre University of Pune.
CLOSING DISCUSSION Advances and challenges in Protein-RNA: recognition, regulation, and prediction Discussion facilitator: Manny Ares Banff International.
Multiple testing correction
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Bantam Encodes a Developmentally Regulated microRNA that Controls Cell Proliferation and Regulates the Proapoptotic Gene hid in Drosophila Julius Brennecke,
Quantitative PCR Analysis of DNA, RNAs, and Proteins in the Same Single Cell A. Ståhlberg, C. Thomsen, D. Ruff, and P. Åman December 2012
Where Will They Strike Next? microRNA targeting tactics in the war on gene expression Jeff Reid Miller “Lab” Baylor College of Medicine.
Performance measurement. Must be careful what performance metric we use For example, say we have a NN classifier with 1 output unit, and we code ‘1 =
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
RNA Folding. RNA Folding Algorithms Intuitively: given a sequence, find the structure with the maximal number of base pairs For nested structures, four.
Predicting MicroRNA Genes and Target Site using Structural and Sequence Features: Machine Learning Approach Malik Yousef Institute of Applied Research,
1 Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine Chenghai Xue, Fei Li, Tao He,
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
RNA Structure Prediction
CSLS Retreat 2007 Matan Hofree & Assaf Weiner 1. Outline  A brief introduction to microRNA  Project motivation and goal  Selecting the data sets 
Exogean: an expert gene annotation framework based on directed acyclic coloured multigraphs ENCODE Gene Prediction Workshop - EGASP/2005 Sarah Djebali,
MicroRNAs and Other Tiny Endogenous RNAs in C. elegans Annie Chiang JClub Ambros et al. Curr Biol 13:
Localising regulatory elements using statistical analysis and shortest unique substrings of DNA Nora Pierstorff 1, Rodrigo Nunes de Fonseca 2, Thomas Wiehe.
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
Improving Intergenic miRNA Target Genes Prediction Rikky Wenang Purbojati.
Geuvadis achievements and contributions Robert Häsler, functional genomics.
RBP1 Splicing Regulation in Drosophila Melanogaster Fall 2005 Jacob Joseph, Ahmet Bakan, Amina Abdulla This presentation available at
Pre-mRNA secondary structures influence exon recognition Michael Hiller Bioinformatics Group University of Freiburg, Germany.
How can we find genes? Search for them Look them up.
Motif Search and RNA Structure Prediction Lesson 9.
Pattern Discovery and Recognition for Understanding Genetic Regulation Timothy L. Bailey Institute for Molecular Bioscience University of Queensland.
Abstract Premise Figure 1: Flowchart pri-miRNAs were collected from miRBase 10.0 pri-miRNAs were compared to hsa and ptr genomes using BlastN and potential.
RNA Structure Prediction
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Office hours: M 10/ :30am 2-5pm in Bio 6.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Figure S1 cmSP165/166 severe b MIM159 (mir159a) a Figure S1: Detailed phenotype of severe cmSP165/166 and MIM159 plants. Severe cmSP165/166 (15x) plants.
ANIMAL TARGET PREDICTION - TIPS
Vienna RNA web servers
Predicting RNA Structure and Function
Volume 107, Issue 7, Pages (December 2001)
Volume 123, Issue 6, Pages (December 2005)
Edwards Allen, Zhixin Xie, Adam M. Gustafson, James C. Carrington  Cell 
Andrew D. Bosson, Jesse R. Zamudio, Phillip A. Sharp  Molecular Cell 
MiR-125b, a MicroRNA Downregulated in Psoriasis, Modulates Keratinocyte Proliferation by Targeting FGFR2  Ning Xu, Petter Brodin, Tianling Wei, Florian.
Pairing beyond the Seed Supports MicroRNA Targeting Specificity
Prediction of Mammalian MicroRNA Targets
Identification of FD targets by pSUC2::GFP:FD ChIP-seq in Col-0 and ft-10 tsf-1 and pFD::GFP:FD ChIP-seq in fd-2. Identification of FD targets by pSUC2::GFP:FD.
Molecular Therapy - Nucleic Acids
Deep Learning in Bioinformatics
Mechanisms contributing to posttranscriptional regulation (PTR) under DR. Mechanisms contributing to posttranscriptional regulation (PTR) under DR. Of.
Presentation transcript:

Fast and effective prediction of miRNA targets Marc Rehmsmeier CeBiTec, Bielefeld University, Germany Junior Research Group Bioinformatics of Regulation

Small interfering RNAs versus small temporal RNAs Hannon. Nature. 418: , 2002.

miRNA/target duplexes Grosshans and Slack. The Journal of Cell Biology, 156(1):17-21, 2002.

A direct approach Given a miRNA and a potential target: What are the energetically most favourable binding sites? Calculation of multiple mfe secondary structure duplexes

The language of RNA duplexes hybrid =nil ><< tt (region,region)||| unpaired_left_top ||| closed... h unpaired_left_top =ult <<< tt (base,empty) ~~~ unpaired_left_top ||| unpaired_left_bot... h unpaired_left_bot = ulb <<< tt (empty,base) ~~~ unpaired_left_bot||| edangle... h edangle = eds <<< tt (base, base) ~~~ closed ||| edt <<< tt (base,emptybase) ~~~ closed ||| edb <<< tt (emptybase,base) ~~~ closed... h

closed =stacking_region||| bulge_top||| bulge_bottom||| internal_loop||| end_loop... h stacking_region = sr <<< basepair ~~~ closed bulge_top= (bt <<< basepair ~~~ tt (uregion, empty)) `topbound` closed bulge_bottom = (bb <<< basepair ~~~ tt (empty, uregion)) `botbound` closed internal_loop= (il <<< basepair ~~~ tt (uregion,uregion)) `symbound` closed end_loop = el <<< basepair ~~~ tt (region,region) The language of RNA duplexes

Dynamic Programming recurrences Time/memory complexity: linear in target length

let-7/lin-41 binding sites position: 688, mfe: kcal/mol position: 737, mfe: kcal/mol

Requirements For prediction of miRNA targets in large databases we need: A fast program  Good statistics

Length normalisation of minimum free energies

p-values of individual binding sites

Poisson statistics of multiple binding sites Probability of k binding sites: with For small p-values: The probability of at least k binding sites:

Comparative analysis of orthologous targets

Multi-species p-values Poisson p-values: multi-species p-value: General case: k species

A dependence problem We should see a p-value as often as it says (blue curve), but we don‘t (red curve).

let-7b/NME4 (human/mouse) binding sites -GGCTCAAGCTGCCCTTACCACCCCATCCCCCACGCAGGACCAACTACCTCCGTCAGCAAGAACCCAAGCCCACATCCAAACCTGCCTGTCCCAAACCAC GGGCTTGCACTGCCTTCTGCACTTCAGGTCT-ACCCATGACCTACTACCTCTGTCAACAAGAAGTCAAGCCCCCATGC---TTCCCATGTCCCCAAAC-- **** ***** * *** ** * ** ** **** ******** **** ****** ******* *** * * ****** ** * TTACTTCCCTGTTCACCTCTGCCCCACCCCAGCCCAGAGGAGTTTGAGCCACCAACTTCAGTGCCTTTCTGTACCCCAAGCCAGCACAAGATTGGACCAA -CACTCCCTACTCCCGCTCTACCCAACTCCAGCCCAGGGGAGTCTAAGCCTCAACTCTATGTGCCTTTTTGTATCCTAAGTCAATACAATATTGGACCAT *** ** * * **** *** ** ********* ***** * **** * * * ******** **** ** *** ** **** ********* TCCTTTTTGCACCAAAGTGCCGGACAACCTTTGTGGTGGGGGGGGGTCTTCACATTATCATAACCTCTCCTCTAAAGGGGAGGCATTAAAATTCACTGTG GTCCTTGTGTACAAAAGTGCCAGACAACCTTTG GGGCATTGTCA-AAGGTGACTTCACCTGCCTCAAAGGAGAGACATTAAAATTT--TATG * ** ** ** ******** *********** *** * *** * * * * ** * ***** *** ********** * ** CCCAGCACATGGGTGGTACACTAATTATGACTTCCCCCAGCTCTGAGGTAGAAATGACGCCTTTATGCAAGTTGTAAGGAGTTGAACAGTAAAGAGGAAG CTTAAAAT * * * Multi-species p-value with k = 2:1.5e-08 Multi-species p-value with k = 1.1:5.0e-05 k = 1.1 is the effective k

Effective number of orthologous targets

Requirements For prediction of miRNA targets in large databases we need: A fast program  Good statistics 

True and false positives and negatives Classify as Positives Classify as Negatives TP FP TN FN Positives Negatives

TP FP TN FN Sensitivity and specificity p-values control specificity Spec TP FP TN FN Spec

RNAhybrid Target prediction workflow target db miRNA registry individual p-values multi-species p- values Poisson p- values bantam #sites target gene E-value Dm Dp Ag CG CG CG CG CG CG CG CG

Prediction of Drosophila miRNA targets 78 miRNAs 28,645 3‘UTRs (1/3 from D. mel, 1/3 from D. pseu, 1/3 from A. gamb)

Bantam hits targetE-value#sites Dm # sites Dp #site s Ag nervous fingers Distal-less Wrinkled (Hid)

miR-7 hits targetE-value#sites Dm # sites Dp #sites Ag CG Twin of m E(spl) region transcript m E(spl) region transcript m  CG CG Him CG Arginine methyltransferase

miR-2 hits targetE-value#sitesE- value #sitesE- value #sites grim reaper sickle miR-2amiR-2bmiR-2c plus a number of others

RNAhybrid functionality length normalisation Poisson statistics web server seed/loop constraints miRNA specific statistics effective k comparative analysis multiple binding sites RNAhybrid

miRNA target selection surprise miRNA target selection rank based p-values E-values user guidance p-values indicate not only biochemical possibility, but also biological function.

Acknowledgements Peter Steffen, Robert Giegerich, Jan Krüger Matthias Höchsmann Alexander Stark, Julius Brennecke, Stephen M. Cohen Sven Rahmann Gregor Obernosterer Robert Heinen Leonie Ringrose

References Rehmsmeier M, Steffen P, Höchsmann M and Giegerich R. Fast and effective prediction of microRNA/target duplexes. RNA, 10: , bibiserv.techfak.uni-bielefeld.de/rnahybrid