Predicting MicroRNA Genes and Target Site using Structural and Sequence Features: Machine Learning Approach Malik Yousef Institute of Applied Research,

Slides:



Advertisements
Similar presentations
Transcriptional regulation and promoter analysis
Advertisements

Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
Two short pieces MicroRNA Alternative splicing.
Naveen K. Bansal and Prachi Pradeep Dept. of Math., Stat., and Comp. Sci. Marquette University Milwaukee, WI (USA)
MiRNA-drug resistance mechanisms Summary Hypothesis: The interplay between miRNAs, signaling pathways and epigenetic and genetic alterations are responsible.
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
MiRNA Platform Overview The Agilent miRNA Microarray System A New Microarray-based Tool for Profiling Human miRNAs.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
A turbo intro to (the bioinformatics of) microRNAs 11/ Peter Hagedorn.
Catalyzing ways of thinking. Ferocious Beauty: Genome Liz Lerman Dance Exchange World Premiere February 3, 2006 Wesleyan University.
Comparative Motif Finding
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Computational biology seminar
What are microRNAs? Morten Lindow. Fire et al, Nature 1998 Worm embryo under phase contrast In situ staining for mex3 mRNA Mex3 inhibited with anti-sense.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
The Model To model the complex distribution of the data we used the Gaussian Mixture Model (GMM) with a countable infinite number of Gaussian components.
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
MicroRNA Target Prediction Using Muscle Atrophy Genes As Models Caltech Wold Lab Mentors: Dr. Barbara Wold Diane Trout Brandon King Gilberto Hernandez,
Accelerating biology with bioinformatics: collaboration with lab scientists Lewitter, RECOMB BE, July 2011.
Genetics: From Genes to Genomes
 MicroRNAs (miRNAs) are a class of small RNA molecules, about ~21 nucleotide (nt) long.  MicroRNA are small non coding RNAs (ncRNAs) that regulate.
Chapter 15 Noncoding RNAs. You Must Know The role of noncoding RNAs in control of cellular functions.
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
Development of Bioinformatics and its application on Biotechnology
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Finish up array applications Move on to proteomics Protein microarrays.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
RNA Folding. RNA Folding Algorithms Intuitively: given a sequence, find the structure with the maximal number of base pairs For nested structures, four.
Stefan Aigner Christian Carson Rusty Gage Gene Yeo Crick-Jacobs Center Salk Institute Analysis of Small RNAs in Stem Cell Differentiation.
Inferring transcriptional and microRNA-mediated regulatory programs in glioblastma Setty, M., et al.
Changes to Syllabus: Quizzes put back: Change Oct. 3 to Oct. 17
Jessica L. Feig PhD1, Keith M. Giles PhD2, Iman Osman MD2, Andrew G
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
SmallRNAs Small molecules, big functions. Brief history The first described microRNA, lin-4 was cloned and characterised as a translational repressor.
© Wiley Publishing All Rights Reserved. RNA Analysis.
Youngae Lee Identification of microRNA function by target prediction and expression profiling.
Molecular Genetics - From DNA to Trait. How Are Different Types of Cells Created and Maintained? Different types of cells are created by differential.
Michael Cummings David Reisman University of South Carolina Gene Regulation Part 2 Chapter 9.
Copyright OpenHelix. No use or reproduction without express written consent1.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Causes and consequences of microRNA dysregulation in cancer
MicroRNAs and Other Tiny Endogenous RNAs in C. elegans Annie Chiang JClub Ambros et al. Curr Biol 13:
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Bioinformatics and Computational Biology
A comparative study of survival models for breast cancer prognostication based on microarray data: a single gene beat them all? B. Haibe-Kains, C. Desmedt,
Computational prediction of miRNA and miRNA-disease relationship
Typically, classifiers are trained based on local features of each site in the training set of protein sequences. Thus no global sequence information is.
Motif Search and RNA Structure Prediction Lesson 9.
Abstract Premise Figure 1: Flowchart pri-miRNAs were collected from miRBase 10.0 pri-miRNAs were compared to hsa and ptr genomes using BlastN and potential.
RNA Structure Prediction
Non-Coding RNA Helen Nordquist November 13, 2015.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Office hours: M 10/ :30am 2-5pm in Bio 6.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
BIO409/509 Cell and Molecular Biology. SECOND Methods paper assignment due Wed., 4/20 (you don’t do this assignment if you are in the 4H STEM Ambassador.
University of Pavia Dep. of Electrical, Computer and Biomedical Engineering Laboratory of Bioinformatics, Mathematical Modelling and Synthetic Biology.
Mestrado Integrado em Medicina Biologia Celular e Molecular II
The Transcriptional Landscape of the Mammalian Genome
Classification with Gene Expression Data
Identifying Conserved microRNAs in a Large Dataset of Wheat Small RNAs
Nobel price 2006 Andrew Z. Fire Craig C. Mello
Deep sequencing of the small RNA transcriptome of normal and malignant human B cells identifies hundreds of novel microRNAs by Dereje D. Jima, Jenny Zhang,
Combinatorial effects of microRNAs to suppress the Myc oncogenic pathway by María J. Bueno, Marta Gómez de Cedrón, Gonzalo Gómez-López, Ignacio Pérez de.
MicroRNAs: regulators of gene expression and cell differentiation
MicroRNAs and Parallel Stem Cell Lives
Noncoding RNA roles in Gene Expression
Presentation transcript:

Predicting MicroRNA Genes and Target Site using Structural and Sequence Features: Machine Learning Approach Malik Yousef Institute of Applied Research, The Galilee Society, Israel Institute of Applied Research, The Galilee Society, Israel Louise Showe LabLouise Showe Lab Wistar Institute, UPENN, USA Wistar Institute, UPENN, USA Neurocomputation Laboratory of CRI-Haifa

microRNA Precursor >hsa-mir-1-1 MI UGGGAAACAUACUUCUUUAUAUGCCCAUAUGGACCUGCUAAGCUA UGGAAUGUAAAGAAGUAUGUA UCUCA Mature > hsa-mir-1-1 uggaauguaaagaaguaugua Cancer genomics: Small RNAs with BIG impacts Paul S. Meltzer, Nature During the past few years, molecular biologists have been stunned by the discovery of hundreds of genes that encode small RNA molecules. MicroRNA expression profiles classify human cancers Jun Lu. Et al,Nature 2005 MicroRNA in cancer analysis of microRNA expression in over 300 individuals shows that microRNA profiles could be of value in cancer diagnosis

2 MicroRNAs Promote Spread of Tumor Cells : By blocking the translation of tumor suppressor genes, miRNAs have been shown to facilitate the development of many types of cancer. Small RNAs Can Prevent Spread of Breast Cancer :The tiny RNAs prevent the spread of cancer by interfering with the expression of genes that give cancer cells the ability to proliferate and migrate MicroRNAs May Be Key To HIV's Ability To Hide, Evade Drugs

miRNA processing

PART I MICRORNA PREDICTION

BayesMiRNAfind: Naïve Bayes For miRNA Gene Prediction 873 outside jobs were processed in 2 weeks on the Wistar Bioinformatics Core cluster.

One-ClassMirnaFind : One-Class microRNA gene prediction Web Server BMC-Algorithms for Molecular Biology

Advantage of our tools  Allowing predicting miRNAs for multi-species [ Vir- mir db,Li et al 2007], most of the other tools are species-specific  The input is not limited! (full genome)  Predict also non-conserved miRNA  The features seems to be more accurate describing the miRNA class.

Two-Class : The Computation procedure components

BayesMiRNA: Mouse Genome (one strand)  Out of 212 mature miRNAs from the mouse genome 135 are at the DNA + strand  Running on a parallel compute cluster with 100 nodes  (  The whole computation procedure took about 6.5 days to complete

PART II MICRORNA TARGET SITE PREDICTION Bioinformatics Journal

miRNA target site prediction Morten Lindow miRNA-group, Bioinformatics Centre University of Copenhagen

Performance of NBmiRTar

3’UTR microRNAs MiRanda Naïve Bayes classifier Predicted microRNA targets Orthologs score Summary of microRNA target prediction Classifier Mouse 3’UTR (mavid) Sequence alignment Database Human 3’ UTR Miranda score naïve Bayes score Folding energy Filters Filter

NBmiRTar

Results with Human Known Targets miRNANumber of confirmed targets Miranda Predictions Recovery by Miranda BayesMirnaTarget Predictions Recovery by BayesMirna Target NB-filter / / / / / / / / / / / / / / / Sum NBmiRTar reduces Miranda prediction by about 75% with recovery rate of 77%. NBmiRTar + NB-filter (threshold 0.9) reduces Miranda prediction by about 81% with recovery rate of 77%.

427 known human mature miRNA MiRanda Predictions 50 genes (59 TFs). human 3’UTR Showe Lab Experiment NBmiRTar Predictions Filters: MiRanda 110 NB 0.9 Orthologs Mouse 3’UTR Genes MiRanda 110 NB genes that have been shown to be down regulated at the message level after treatment

Malik Yousef, Segun Jung, Louise C Showe and Michael K Showe, Learning from Positive Examples when the Negative Class is Undetermined- microRNA gene identification. Algorithms for Molecular Biology, (Accepted)(2008). Malik Yousef, Segun Jung, Andrew V. Kossenkov, Louise C. Showe and Michael K. Showe, Na ï ve Bayes classifier for microRNA target gene identification, Bioinformatics, 15 November 2007; 23: Malik Yousef, Hagit Shatkay, Michael Nebozhyn, Louise C. Showe and Michael K. Showe, Combining Multi-Species Genomic Data for MicroRNA Identification Using Na ï ve Bayes Classifier. Bioinformatics, Vol. 22, No. 11, p (2006) Related publications