Computer Science and Engineering PhD in Computer Science Monday, November 07, 2011 9:00 a.m. – 11:00 a.m. Swearingen Conference Room 3A75 Network Based.

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Advertisements

D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
N.U.S. - January 13, 2006 Gert Lanckriet U.C. San Diego Classification problems with heterogeneous information sources.
Consistent probabilistic outputs for protein function prediction William Stafford Noble Department of Genome Sciences Department of Computer Science and.
A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles Authors: Chia-Hao Chin 1,4,
Computational biology and computational biologists Tandy Warnow, UT-Austin Department of Computer Sciences Institute for Cellular and Molecular Biology.
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Herpes Jeff Brown Dante Kappotis Robert Vanderley Anthony Biasella.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Inferring Cellular Networks Using Probabilistic Graphical Models Jianlin Cheng, PhD University of Missouri 2009.
KDD Cup Task 2 Mark Craven Department of Biostatistics & Medical Informatics Department of Computer Sciences University of Wisconsin
Spectral coordinate of node u is its location in the k -dimensional spectral space: Spectral coordinates: The i ’th component of the spectral coordinate.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
PutidaNET :Interactome database service and network analysis of Pseudomonas putida KT2440 (P. putida KT2440) Korean BioInformation Center (KOBIC) Seong-Jin,
Problem Statement and Motivation Key Achievements and Future Goals Technical Approach Investigators: Yang Dai Prime Grant Support: NSF High-throughput.
Big Data Network Genomics Network Inference and Perturbation to Study Chemical-Mediated Cancer Induction Stefano Monti Section of Computational.
Semantic Similarity over Gene Ontology for Multi-label Protein Subcellular Localization Shibiao WAN and Man-Wai MAK The Hong Kong Polytechnic University.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Improving PPI Networks with Correlated Gene Expression Data Jesse Walsh.
Networks and Interactions Boo Virk v1.0.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
A Method for Protein Functional Flow Configuration and Validation Woo-Hyuk Jang 1 Suk-Hoon Jung 1 Dong-Soo Han 1
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Exploring Alternative Splicing Features using Support Vector Machines Feature for Alternative Splicing Alternative splicing is a mechanism for generating.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Meng-Han Yang September 9, 2009 A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Top X interactions of PIN Network A interactions Coverage of Network A Figure S1 - Network A interactions are distributed evenly across the top 60,000.
Understanding Network Concepts in Modules Dong J, Horvath S (2007) BMC Systems Biology 2007, 1:24.
EB3233 Bioinformatics Introduction to Bioinformatics.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
COMPUTATIONAL BIOLOGIST DR. MARTIN TOMPA Place of Employment: University of Washington Type of Work: Develops computer programs and algorithms to identify.
Introduction to biological molecular networks
Applications of Supervised Learning in Bioinformatics Yen-Jen Oyang Dept. of Computer Science and Information Engineering.
The Correlational Research Strategy Chapter 12. Correlational Research The goal of correlational research is to describe the relationship between variables.
GO based data analysis Iowa State Workshop 11 June 2009.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
A Tutorial of the PrePPI Database Presenters: Gabriel Leis and Katrina Sherbina Loyola Marymount University Departments of Biology and Computer Science.
 Signal Transduction transmits signals from outside to the inside of the cell  Integer Linear Programming model is used to unravel STN.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
A Yeast Synthetic Network for In Vivo Assessment of Reverse-Engineering and Modeling Approaches Cantone, I., Marucci, L., Iorio, F., Ricci, M., Belcastro,
Bioinformatics: Cool stuff you can do with Computers and Biology Oded Magger Tel Aviv University / Autodesk inc. GIP course 2010.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Fig. 1 Computing the four node TOMs for nodes A,B,C,D in two simple networks 1) tA,B,C,D=0+40+6=0.667 and 2) tA,B,C,D=1+41+6= From: Network neighborhood.
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Extra Tree Classifier-WS3 Bagging Classifier-WS3
A User’s Guide to GO: Structural and Functional Annotation
SEG5010 Presentation Zhou Lanjun.
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Characteristics of tissue‐specific co‐expression networks (CNs)‏
Deep Learning in Bioinformatics
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
MEET-IP Memory and Energy Efficient TCAM-based IP Lookup
Presentation transcript:

Computer Science and Engineering PhD in Computer Science Monday, November 07, :00 a.m. – 11:00 a.m. Swearingen Conference Room 3A75 Network Based Prediction of Protein Localization Using Diffusion Kernel Abstract With the availability of an overwhelming amount of high-throughput biological data, biologists and medical researchers increasingly depend on computational algorithms for hypothesis generation and prediction. One area of bioinformatics research is the development of algorithms for predicting subcellular localization of both monoplex and multiplex proteins. Most of current localization prediction algorithms employ features derived from protein sequence data and external functional annotations such as gene ontology or physicochemical properties. However, there is no method that can exploit rich localization information in a protein-protein correlation network since correlated proteins tend to be co-localized within the cell. Here we propose a novel diffusion kernel and logistic regression based algorithm, NetLoc, for protein localization prediction by exploiting protein correlation networks. NetLoc is applied to yeast protein localization prediction using four types of protein networks including physical protein-protein interaction (PPI) networks, genetic PPI networks, mixed PPI networks, and co-expressed PPI networks. Experiments showed that protein networks can provide rich information for localization prediction, achieving an AUC score up to We also showed that networks with high connectivity and high percentage of co-localized PPI lead to better prediction performance. Compared to a previous network feature based prediction algorithm with an AUC score of 0.52 on the yeast PPI network, NetLoc achieved significantly better overall performance with an AUC of 0.74 on the same dataset. We also investigated how the prediction performance of NetLoc was affected by the network characteristics such as ratio of the number of co-localized PPI (coPPI) to the number of non- co-localized PPI (ncPPI) and the density of annotated coPPI in the network. For a given network with a specific number of proteins, NetLoc performance increases with increasing coPPI/ncPPI ratio and increasing density of annotated coPPI. Another limitation of current protein localization algorithms is that they are not capable of predicting multi-location proteins. NetLoc algorithm addressed this limitation by calculating probabilistic scores for all locations for each query protein. Evaluation on the Yeast multi- localization protein dataset showed that the overall success rate of NetLoc is 88%, which is much higher than the existing method (73%) tested on the same dataset. Finally, we proposed and evaluated two methods for network based localization prediction based on multiple protein correlation networks. One is by constructing a unified protein correlation network. The other is to use multiple network kernels. Experiment showed that both methods can improve the NetLoc performance compared to original individual network.