Computational prediction of miRNA and miRNA-disease relationship

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Advertisements

Author: Jim C. Huang etc. Lecturer: Dong Yue Director: Dr. Yufei Huang.
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
Transcriptomics Jim Noonan GENE 760.
By Russell Armstrong Supervisor Mrs Wei Ji Diagnosis Analysis of Lung Cancer by Genome Expression Profiles.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
MicroRNA genes Ka-Lok Ng Department of Bioinformatics Asia University.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
MiRNA targets Using undergraduate molecular biology labs to discover targets of miRNAs in humans Adam Idica, Jordan Thompson, Irene Munk Pedersen, Pavan.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Radiogenomics in glioblastoma multiforme
Identifying and classifying functional small RNAs from pine Ryan Morin BC Genome Sciences Centre (presenting research conducted in the lab of Dr. Peter.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Finish up array applications Move on to proteomics Protein microarrays.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Predicting MicroRNA Genes and Target Site using Structural and Sequence Features: Machine Learning Approach Malik Yousef Institute of Applied Research,
1 Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine Chenghai Xue, Fei Li, Tao He,
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
TMpro: Transmembrane Helix Prediction using Amino Acid Properties and Latent Semantic Analysis Madhavi Ganapathiraju, N. Balakrishnan, Raj Reddy and Judith.
Apostolos Zaravinos and Constantinos C Deltas Molecular Medicine Research Center and Laboratory of Molecular and Medical Genetics, Department of Biological.
MicroRNA regulation in Arabidopsis thaliana
Youngae Lee Identification of microRNA function by target prediction and expression profiling.
Introduction Sample Projects Resources Summary Future Plans Bioinformatics Support Information Session Karsten Hokamp TCD 3rd October, 2007.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
Last class Plasmid isolation from bacteria Paper 2: miRNAs in iPSCs.
Transcriptomics Sequencing. over view The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non coding RNA produced.
EB3233 Bioinformatics Introduction to Bioinformatics.
Copyright OpenHelix. No use or reproduction without express written consent1.
Lecture 11. Topics in Omic Studies (Cancer Genomics, Transcriptomics and Epignomics) The Chinese University of Hong Kong CSCI5050 Bioinformatics and Computational.
Improved and Promising Identification of Human MicroRNAs by Incorporating a High-Quality Negative Set.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
Hidden Markov Model and Its Application in Bioinformatics Liqing Department of Computer Science.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
MicroRNA Prediction with SCFG and MFE Structure Annotation Tim Shaw, Ying Zheng, and Bram Sebastian.
Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin Hoadley, KA et al. Cell 158(4):
Abstract Premise Figure 1: Flowchart pri-miRNAs were collected from miRBase 10.0 pri-miRNAs were compared to hsa and ptr genomes using BlastN and potential.
RNA Structure Prediction
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Center for Bioinformatics and Genomic Systems Engineering Bioinformatics, Computational and Systems Biology Research in Life Science and Agriculture.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Advances and challenges in computational modeling and statistical learning of biological systems Qi Liu Department of Biomedical Informatics Vanderbilt.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Building Excellence in Genomics and Computational Bioscience miRNA Workshop: miRNA biogenesis & discovery Simon Moxon
David Amar, Tom Hait, and Ron Shamir
邹权 (PH.D.&Professor) 天津大学 计算机科学与技术学院
The Transcriptional Landscape of the Mammalian Genome
Identifying Conserved microRNAs in a Large Dataset of Wheat Small RNAs
邹权 (PH.D.&Professor) 天津大学 计算机科学与技术学院
Optimizing Biological Data Integration
Gene expression.
Global Transcriptional Dysregulation in Breast Cancer
Microarray Technology and Applications
Step 1: amplification and cloning procedures
Predicting Active Site Residue Annotations in the Pfam Database
Bioinformatics Biological Data Computer Calculations +
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Characterization of microRNA transcriptome in tumor, adjacent, and normal tissues of lung squamous cell carcinoma  Jun Wang, MD, PhD, Zhi Li, MD, PhD,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
SIFGD: Setaria italica Functional Genomics Database
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Computational prediction of miRNA and miRNA-disease relationship Quan Zou (邹权) PH.D.&Professor School of Computer Sci&Tech Tianjin University, China

Contents background microRNA identification isomiR microRNA and disease outlook

Crucial regulatory molecule: Background-miRNA Crucial regulatory molecule: 1/3 human genes cell development cell proliferation cell apoptosis tumorigenesis …

2. predicting the targets 1. mining the pre-miRNA, miRNA Precursor, Pre-miRNA cell nucleus mature miRNA cytoplasm 2. predicting the targets target

Identification of microRNA AUCGUGCAGAGACUAGACUGACAUCGUGCAGAGACUAGACUGACAUCGUGCAGAGACUAGACUGACAUCGUGCAGAGA CUAGACUGACAUCGUGCAGAGACUAG ACUGAC >1 tgcgcgaauucacccauggauccauucaucuuccaagggcaccagc >2 agcgcgaauuccaagucacccauggauccauucaucuggcagcgu >3 agucgcgaauucaucaucuuccaagggcacccauggauccaucca

Ref: Xue C, et al. Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine. BMC Bioinformatics, 2005, 6(1): 310.

Ref: Xue C, et al. Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine. BMC Bioinformatics, 2005, 6(1): 310.

microRNA prediction based on machine learning obvious differences weak generalization

Importance of negative samples Negative Testing Set Positive Training Set Decision Boundary Negative Training Set

Importance of negative samples Negative Testing Set Positive Training Set New Negative Training Set New Decision Boundary

Flow innovation point Human CDs Extend Blast 100nt 100nt Human Mature microRNAs Mature-like Reads Compute Secondary Structures Extract Prediction Model Parameter Filter Rebuilt Original Negative Set Mined Sequences innovation point Replace

Leyi Wei, Minghong Liao, Yue Gao, Rongrong Ji, Zengyou He Leyi Wei, Minghong Liao, Yue Gao, Rongrong Ji, Zengyou He*, Quan Zou(邹权)*. Improved and Promising Identification of Human MicroRNAs by Incorporating a High-quality Negative Set. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2014, 11(1):192-201 (SCI, IF2011=1.543) 2017/4/26

Novel miRNA found by our method 1

Dinoflagellates genome (甲藻) Lin, et al. The Symbiodinium kawagutii genome illuminates dinoflagellate gene expression and coral symbiosis. Science. 2015, 350(6261): 691-694.

miRNA family classification PFAM(~2000) VS miRNA family(~2000) Troubles Multiple classes Few samples imbalaned 1

Quan Zou. , Yaozong Mao, Lingling Hu, Yunfeng Wu, Zhiliang Ji Quan Zou*, Yaozong Mao, Lingling Hu, Yunfeng Wu, Zhiliang Ji*. miRClassify: An advanced web server for miRNA family classification and annotation. Computers in Biology and Medicine. 2014, 45:157-160.(SCI, IF2011=1.089) ESI high cited paper 2017/4/26 1

Question UACACUGUGGAUCCGGUGAGGUAGUAGGUUGUAUAGUUUGGAAUAUUACCACCGGUGAACUAUGCAAUUUUCUACCUUACCGGAGACAGAACUCUUCGA UGAGGUAGUAGGUUGUAUAGUU ------uaca gga U --- aaua cugu uccggUGAGGUAG AGGUUGUAUAGUUu gg u |||| ||||||||||||| |||||||||||||| || gaca aggccauuccauc uuuaacguaucaag cc u agcuucucaa --g u ugg acca 1

1

Contents background microRNA identification isomiR microRNA and disease outlook

Why called isomiR? isoform vs isomiR

Background-isomiR miRNA variants, isomiRs, physiological isoforms Various length distributions, 5’/3’ ends The annotated miRNA sequence is only one specific isomiR in the miRNA locus Imprecise and alternative cleavage Modification/addition events SNP RNA editing

Materials and methods Public databases, in-house sequencing datasets, published data Bioinformatics & biostatistics Software/script Molecular biology method Data analysis: Biology/interaction Data analysis: Method/prediction Data analysis: Evolution/miRNA*

Where does isomiR happen? across different species normal vs cancer isomiR data - TCGA

isomiR difference in cancer 3’ addition: not dominant IsomiR expression: Stable across different samples Abnormal isomiR pattern in cancer cells and tissues

Contents background microRNA identification isomiR microRNA and disease outlook

Ref:Quan Zou, et al. Prediction of microRNA-disease associations based on social network analysis methods. BioMed Research International. 2015, 2015: 810514

Similarity between two microRNAs targets of miR1 targets of miR2 targets of miR1 targets of miR2 targets of miR1 targets of miR2

Function similarity of targets targets network 0.7 0.8 0.9 0.6 g1 g2 g4 g3 Strength miR1 miR2 g1 g2 g4 g3 0.4 0.5 0.8 0 miR2 miR1 FSmiR Strength(wij) Function similarity of targets 0 0.5 0.8 0.7 Ref: Yungang Xu, et al. Inferring the Soybean (Glycine max) microRNA functional network based on target gene network . Bioinformatics, 2014, 30 (1):94-103.

Outlook How many novel microRNAs are still left? All the microRNA research methods can be extended to ncRNA and lncRNA isomiR would be the next hot topic in microRNA research Diseases would be the hot spots for ever!

Thanks! Quan Zou, PhD&Professor School of Computer Science and Technology Tianjin University Email: zouquan@nclab.net http://lab.malab.cn/~zq/