What is a Project Purpose –Use a method introduced in the course to describe some biological problem How –Construct a data set describing the problem –Define.

Slides:



Advertisements
Similar presentations
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Sequence information, logos and Hidden Markov Models Morten Nielsen, CBS, BioCentrum,
Advertisements

Gibbs sampling Morten Nielsen, CBS, BioSys, DTU. Class II MHC binding MHC class II binds peptides in the class II antigen presentation pathway Binds peptides.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU T cell Epitope predictions using bioinformatics (Neural Networks and hidden.
Bioinformatics Motif Detection Revised 27/10/06. Overview Introduction Multiple Alignments Multiple alignment based on HMM Motif Finding –Motif representation.
Training and applying hidden Markov models and support vector machines for prediction of T-cell epitopes Van Hai Van, Cao Thi Ngoc Phuong, Tran Linh Thuoc.
MHC Polymorphism Ole Lund. Objectives What is HLA polymorphism? What is it good for? How does it make life difficult for vaccine design? Definition of.
Profile Hidden Markov Models Bioinformatics Fall-2004 Dr Webb Miller and Dr Claude Depamphilis Dhiraj Joshi Department of Computer Science and Engineering.
Hidden Markov models for detecting remote protein homologies Kevin Karplus, Christian Barrett, Richard Hughey Georgia Hadjicharalambous.
Algorithms in Bioinformatics Morten Nielsen BioSys, DTU.
Biological sequence analysis and information processing by artificial neural networks.
Structural bioinformatics
Artificial Neural Networks 2 Morten Nielsen Depertment of Systems Biology, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Protein-DNA interactions: amino acid conservation and the effects of mutations on binding specificity Nicholas M. Luscombe and Janet M. Thornton JMB (2002)
Tools to analyze protein characteristics Protein sequence -Family member -Multiple alignments Identification of conserved regions Evolutionary relationship.
Biological sequence analysis and information processing by artificial neural networks Morten Nielsen CBS.
Profile-profile alignment using hidden Markov models Wing Wong.
MHC Polymorphism. MHC Class I pathway Figure by Eric A.J. Reits.
Protein Fold recognition Morten Nielsen, Thomas Nordahl CBS, BioCentrum, DTU.
1 Computational Analysis of Protein-DNA Interactions Changhui (Charles) Yan Department of Computer Science Utah State University.
Introduction to BioInformatics GCB/CIS535
Course Summary June 2, 2005 Programming Workshop Overview of course (presentation) Protein modeling, part 2 Instructor evaluations.
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen.
Biological sequence analysis and information processing by artificial neural networks.
Project list 1.Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting clustering (Hobohm)
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
Protein Structures.
Presented by Liu Qi An introduction to Bioinformatics Algorithms Qi Liu
Artificial Neural Networks for Secondary Structure Prediction CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (slides by J. Burg)
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Algorithms in Bioinformatics Morten Nielsen Department of Systems Biology, DTU.
Rising accuracy of protein secondary structure prediction Burkhard Rost
Scoring Matrices Scoring matrices, PSSMs, and HMMs BIO520 BioinformaticsJim Lund Reading: Ch 6.1.
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
Protein Secondary Structure Prediction Based on Position-specific Scoring Matrices Yan Liu Sep 29, 2003.
Eric C. Rouchka, University of Louisville SATCHMO: sequence alignment and tree construction using hidden Markov models Edgar, R.C. and Sjolander, K. Bioinformatics.
Sequence encoding, Cross Validation Morten Nielsen BioSys, DTU
Project list 1.Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting clustering (Hobohm)
Introduction to Bioinformatics Biostatistics & Medical Informatics 576 Computer Sciences 576 Fall 2008 Colin Dewey Dept. of Biostatistics & Medical Informatics.
Bioinformatics Multiple Alignment. Overview Introduction Multiple Alignments Global multiple alignment –Introduction –Scoring –Algorithms.
Web Servers for Predicting Protein Secondary Structure (Regular and Irregular) Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial.
HMMs for alignments & Sequence pattern discovery I519 Introduction to Bioinformatics.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
LOGO iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance- Pairs and Reduced Alphabet Profile into the General Pseudo Amino.
Basic Overview of Bioinformatics Tools and Biocomputing Applications II Dr Tan Tin Wee Director Bioinformatics Centre.
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
Prediction of Protein Binding Sites in Protein Structures Using Hidden Markov Support Vector Machine.
Hidden Markov Model and Its Application in Bioinformatics Liqing Department of Computer Science.
Convolutional LSTM Networks for Subcellular Localization of Proteins
Combining Evolutionary Information Extracted From Frequency Profiles With Sequence-based Kernels For Protein Remote Homology Detection Name: ZhuFangzhi.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
Prediction of T cell epitopes using artificial neural networks Morten Nielsen, CBS, BioCentrum, DTU.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
1 Computational Approaches(1/7)  Computational methods can be divided into four categories: prediction methods based on  (i) The overall protein amino.
“ Using Sequence Motifs for Enhanced Neural Network Prediction of Protein Distance Constraints ” J.Gorodkin, O.Lund, C.A.Anderson, S.Brunak On ISMB 99.
Using the Fisher kernel method to detect remote protein homologies Tommi Jaakkola, Mark Diekhams, David Haussler ISMB’ 99 Talk by O, Jangmin (2001/01/16)
Free for Academic Use. Jianlin Cheng.
Computational Structure Prediction
Sequence Based Analysis Tutorial
Protein Structures.
Morten Nielsen, CBS, BioSys, DTU
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen
Protein structure prediction.
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen
Introduction to Bioinformatics Tuesday, 19 March
Neural Networks for Protein Structure Prediction Dr. B Bhunia.
Presentation transcript:

What is a Project Purpose –Use a method introduced in the course to describe some biological problem How –Construct a data set describing the problem –Define which method to use –(Develop method) –Train and evaluate method –(Compare performance to other methods) Documentation –Write report in form of a research article (10-15 pages) Abstract Introduction Materials and method Results Discussion References

Project list 1.Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting clustering (Hobohm) techniques 2.Peptide MHC binding predictions using artificial neural networks with different sequence encoding schemes 3.Comparative study of PSSM, ANN for peptide MHC binding 1.Analysis on how data size defines predictive performance for PSSM and ANN 4.NN-align, a neural network-based method for motif recognition and peptide-binding prediction 5.Improved protein template identification using hidden Markov models (HMMER) 6.Implementation of HMM Baum-Welsh algorithm 7.Gibbs sampler for MHC class II binding

PSSM Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting techniques –Compare methods for sequence weighting Clustering vs heuristics –Benchmark (Peters et al 2006) covering some 20 MHC molecules, compare to best other methods

NN sequence encoding Peptide MHC binding predictions using artificial neural networks with different sequence encoding schemes –Benchmark (Peters et al 2006) covering some 20 MHC molecules, compare to best other methods –Compare sequence encoding schemes Sparse, Blosum, composition, charge, amino acids size,..

Comparative study Compare methods for MHC peptide binding –PSSM –ANN Does size matter? Data: Benchmark by Peters et al 2006 covering some 20 MHC molecules

Hidden Markov models Improved protein template identification using hidden Markov models (HMMER) –Train profile HMM to remote protein fold recognition Use the Hmmer program to construct profile HMM for selected set of proteins from the CASP8 competition Use Hmmer model to identify PDB templates for homology modeling for CASP8 targets

HMM Implement Baum-Welsh HMM training –Based on code from Tapas Kanungo HMM toolkit A tar file can be found at: – A zip file: – README file – Tutorial talks – – Test code on un-fair casino example

Gibbs sampler Gibbs sampler approach to the prediction of MHC class II binding motifs –Develop Gibbs sampler to prediction of MHC class II binding motifs –Benchmark Nielsen et al 2007 covering 14 HLA-DR alleles