Prediction of Protein Binding Sites in Protein Structures Using Hidden Markov Support Vector Machine.

Slides:



Advertisements
Similar presentations
1 Gesture recognition Using HMMs and size functions.
Advertisements

Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
G53MLE | Machine Learning | Dr Guoping Qiu
Profiles for Sequences
Pattern Recognition and Machine Learning
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Principle of Locality for Statistical Shape Analysis Paul Yushkevich.
Bio 465 Summary. Overview Conserved DNA Conserved DNA Drug Targets, TreeSAAP Drug Targets, TreeSAAP Next Generation Sequencing Next Generation Sequencing.
Conditional Random Fields
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
Computational Analysis of USA Swimming Data Junfu Xu School of Computer Engineering and Science, Shanghai University.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Protein Tertiary Structure Prediction
1 correlating graph-theoretical centrality indices with interface residue propensity or: where do things stick together? Stefan Maetschke Teasdale Group.
Chapter 8 Prediction Algorithms for Smart Environments
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
LSM3241: Bioinformatics and Biocomputing Lecture 3: Machine learning method for protein function prediction Prof. Chen Yu Zong Tel:
Predicting the Cellular Localization Sites of Proteins Using Decision Tree and Neural Networks Yetian Chen
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Eric C. Rouchka, University of Louisville SATCHMO: sequence alignment and tree construction using hidden Markov models Edgar, R.C. and Sjolander, K. Bioinformatics.
Computational prediction of protein-protein interactions Rong Liu
2 o structure, TM regions, and solvent accessibility Topic 13 Chapter 29, Du and Bourne “Structural Bioinformatics”
CS774. Markov Random Field : Theory and Application Lecture 19 Kyomin Jung KAIST Nov
Externally Enhanced Classifiers and Application in Web Page Classification Join work with Chi-Feng Chang and Hsuan-Yu Chen Jyh-Jong Tsay National Chung.
Exploring Alternative Splicing Features using Support Vector Machines Feature for Alternative Splicing Alternative splicing is a mechanism for generating.
Protein-Protein Interaction Hotspots Carved into Sequences Yanay Ofran 1,2, Burkhard Rost 1,2,3 1.Department of Biochemistry and Molecular Biophysics,
Protein Secondary Structure Prediction G P S Raghava.
Linear Discrimination Reading: Chapter 2 of textbook.
Meng-Han Yang September 9, 2009 A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.
Identification of amino acid residues in protein-protein interaction interfaces using machine learning and a comparative analysis of the generalized sequence-
Training Conditional Random Fields using Virtual Evidence Boosting Lin Liao, Tanzeem Choudhury †, Dieter Fox, and Henry Kautz University of Washington.
An Artificial Neural Network Approach to Surface Waviness Prediction in Surface Finishing Process by Chi Ngo ECE/ME 539 Class Project.
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
A New Supervised Over-Sampling Algorithm with Application to Protein-Nucleotide Binding Residue Prediction Li Lihong (Anna Lee) Cumputer science 22th,Apr.
Feature Extraction Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and.
HMMs and SVMs for Secondary Structure Prediction
Information Extraction Entity Extraction: Statistical Methods Sunita Sarawagi.
Combining Evolutionary Information Extracted From Frequency Profiles With Sequence-based Kernels For Protein Remote Homology Detection Name: ZhuFangzhi.
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
Protein Prediction with Neural Networks! Chris Alvino CS152 Fall ’06 Prof. Keller.
Machine Learning Methods of Protein Secondary Structure Prediction Presented by Chao Wang.
Ubiquitination Sites Prediction Dah Mee Ko Advisor: Dr.Predrag Radivojac School of Informatics Indiana University May 22, 2009.
Gist 2.3 John H. Phan MIBLab Summer Workshop June 28th, 2006.
A Hybrid Model of HMM and RBFN Model of Speech Recognition 길이만, 김수연, 김성호, 원윤정, 윤아림 한국과학기술원 응용수학전공.
Neural Networks Lecture 11: Learning in recurrent networks Geoffrey Hinton.
Protein motif /domain Structural unit Functional unit Signature of protein family How are they defined?
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Feasibility of Using Machine Learning Algorithms to Determine Future Price Points of Stocks By: Alexander Dumont.
Graphical Models for Segmenting and Labeling Sequence Data Manoj Kumar Chinnakotla NLP-AI Seminar.
Announcements 1. Textbook will be on reserve at library 2. Topic schedule change; modified reading assignment: This week: Linear discrimination, evaluating.
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
BNFO 615 Fall 2016 Usman Roshan NJIT. Outline Machine learning for bioinformatics – Basic machine learning algorithms – Applications to bioinformatics.
Structured prediction
Madhavi Ganapathiraju Graduate student Carnegie Mellon University
Hidden Markov Models (HMM)
Unsupervised Learning and Autoencoders
Machine Learning Week 1.
Extra Tree Classifier-WS3 Bagging Classifier-WS3
חיזוי ואפיון אתרי קישור של חלבון לדנ"א מתוך הרצף
Combining HMMs with SVMs
Support Vector Machine (SVM)
Hidden Markov Models Part 2: Algorithms
Prediction of IgE-binding epitopes by means of allergen surface comparison and correlation to cross-reactivity  Fabio Dall'Antonia, PhD, Anna Gieras,
Finding Functionally Significant Structural Motifs in Proteins
Sequence Based Analysis Tutorial
CISC 841 Bioinformatics (Fall 2007) Hybrid models
CSC321: Neural Networks Lecture 11: Learning in recurrent networks
Neural Networks for Protein Structure Prediction Dr. B Bhunia.
Presentation transcript:

Prediction of Protein Binding Sites in Protein Structures Using Hidden Markov Support Vector Machine

Slate:the target protein. Blue:the binding partner. Magenta:interface residues. SSSEIKIVRDEYGMPHIYANDTWHLFYGYG IIINIINNIINNNIIIIIIINIINIIINNN Input Output

Machine Learning Methods Applied Classification methods Sequential labelling methods ANNSVM CRF

FEATURES Neighboring residue profile feature Neighboring residue profile feature Hydrophobicity Hydrophobicity Sequence conservation Sequence conservation Secondary structure Secondary structure Solvent accessible surface area Solvent accessible surface area

Hidden Markov Support Vector Machine Emission feature function Transition feature function Corresponding weight

Hidden Markov Support Vector Machine  Spatially neighboring residue profile feature  Spatially neighboring residue accessible surface (ASA) feature Emission feature function

Hidden Markov Support Vector Machine Transition feature function

Hidden Markov Support Vector Machine Transition feature function

Hidden Markov Support Vector Machine Corresponding weight

Hidden Markov Support Vector Machine Source Code: ☆ The cutting-plane algorithm makes it linear

DATA SET

Influence of the number of training samples on the prediction performance and running time

The inter-relation information between neighboring residues is relevant for discrimination

The window size has not significant influence on the performance

Actual interface residues ANN SVMCRFHM-SVM Comparison with related methods

Actual interface residues ANN SVMCRFHM-SVM Comparison with related methods

SUMMARY Prediction of protein binding sites Prediction of protein binding sites Hidden Markov Support Vector Machine Hidden Markov Support Vector Machine Result Analysis Result Analysis Comparison with other methods Comparison with other methods Influence of the number of training samples Influence of the number of training samples The information between neighboring residues The information between neighboring residues Window size Window size Discussion Discussion