© University of Reading 2007 Dr Liam J. McGuffin RCUK Academic Fellow 20 April 2014 McGuffin Group.

Slides:



Advertisements
Similar presentations
© University of Reading Dr Liam J. McGuffin RCUK Academic Fellow 20 April 2014 McGuffin Group.
Advertisements

© University of Reading Finance and Corporate Services 20 April 2014 Purchase To Pay Phil Southwell – Head of Financial and HR Systems.
© University of Reading Go to View > Master > Slide Master to put your unit name here 20 April 2014 IT Services Identity Management.
Predicting Kinase Binding Affinity Using Homology Models in CCORPS
Antibody Structure Prediction and the Use of Mutagenesis in Docking Arvind Sivasubramanian, Aroop Sircar, Eric Kim & Jeff Gray Johns Hopkins University,
Functional Site Prediction Selects Correct Protein Models Vijayalakshmi Chelliah Division of Mathematical Biology National Institute.
Evaluating Diagnostic Accuracy of Prostate Cancer Using Bayesian Analysis Part of an Undergraduate Research course Chantal D. Larose.
Lecture 22: Evaluation April 24, 2010.
Bin Ma, CTO Bioinformatics Solutions Inc. June 5, 2011.
Protein Functional Site Prediction The identification of protein regions responsible for stability and function is an especially important post-genomic.
CS 8751 ML & KDDEvaluating Hypotheses1 Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal.
Performance measures Morten Nielsen, CBS, BioCentrum, DTU.
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
True/False. False True Subject May Go Here True / False ? Type correct answer here. Type incorrect answer here.
ROC Curves.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Contraband Detection and Retesting. The Inspection Problem A sensor is a device used to attempt to determine some truth about an object; we will assume.
Welcome to class today! Chapter 12 summary sheet Jimmy Fallon video
Determine whether each curve below is the graph of a function of x. Select all answers that are graphs of functions of x:
How do we know whether a marker or model is any good? A discussion of some simple decision analytic methods Carrie Bennette on behalf of Andrew Vickers.
Identification System Errors Guide to Biometrics – Chapter 6 Handbook of Fingerprint Recognition Presented By: Chris Miles.
Protein Structures.
Gmat 2700 Geometry of Coordinate Reference Systems Alexandra Lyle Student No Session 1, 2006 The Globe Presentation by Alexandra Lyle SCHOOL OF.
Hypothesis Testing.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Protein Structure Prediction. Historical Perspective Protein Folding: From the Levinthal Paradox to Structure Prediction, Barry Honig, 1999 A personal.
Modelling binding site with 3DLigandSite Mark Wass
ZORRO : A masking program for incorporating Alignment Accuracy in Phylogenetic Inference Sourav Chatterji Martin Wu.
Data Analysis 1 Mark Stamp. Topics  Experimental design o Training set, test set, n-fold cross validation, thresholding, imbalance, etc.  Accuracy o.
How do we know whether a marker or model is any good? A discussion of some simple decision analytic methods Carrie Bennette (on behalf of Andrew Vickers)
Prediction of protein disorder Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry Eotvos Lorand University, Budapest,
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Multi-part Messages in KMIP John Leiseboer, QuintessenceLabs.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Classification Performance Evaluation. How do you know that you have a good classifier? Is a feature contributing to overall performance? Is classifier.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study George Chikenji*, Yoshimi Fujitsuka, and Shoji Takada*
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Protein Secondary Structure, Bioinformatics Tools, and Multiple Sequence Alignments Finding Similar Sequences Predicting Secondary Structures Predicting.
Structure prediction: Homology modeling
Computational Intelligence: Methods and Applications Lecture 16 Model evaluation and ROC Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Biometric for Network Security. Finger Biometrics.
Quiz 1 review. Evaluating Classifiers Reading: T. Fawcett paper, link on class website, Sections 1-4 Optional reading: Davis and Goadrich paper, link.
Finding, Aligning and Analyzing Non Coding RNAs Cédric Notredame Comparative Bioinformatics Group Bioinformatics and Genomics Program.
Data Analytics CMIS Short Course part II Day 1 Part 4: ROC Curves Sam Buttrey December 2015.
Performance measures Morten Nielsen, CBS, Department of Systems Biology, DTU.
Machine Learning in Practice Lecture 9 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Title Category #1 Category #2 Category #3Category #
Machine Learning in Practice Lecture 9 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Improved Protein Secondary Structure Prediction. Secondary Structure Prediction Given a protein sequence a 1 a 2 …a N, secondary structure prediction.
Intrinsically disordered proteins Zsuzsanna Dosztányi EMBO course Budapest, 3 June 2016.
Using the Fisher kernel method to detect remote protein homologies Tommi Jaakkola, Mark Diekhams, David Haussler ISMB’ 99 Talk by O, Jangmin (2001/01/16)
An Empirical Comparison of Supervised Learning Algorithms
Methods: The IntFOLD Server
Image Classification via Attribute Detection
Protein Structures.
Homology Modeling.
Protein structure prediction.
Protein Disorder Prediction
Yang Liu, Perry Palmedo, Qing Ye, Bonnie Berger, Jian Peng 
Grace W. Tang, Russ B. Altman  Structure 
Volume 17, Issue 7, Pages (July 2009)
Bioinformatics 김유환, 문현구, 정태진, 정승우.
Atomistic Ensemble Modeling and Small-Angle Neutron Scattering of Intrinsically Disordered Protein Complexes: Applied to Minichromosome Maintenance Protein 
Structural Flexibility of CaV1. 2 and CaV2
Discussion of Protein Disorder Prediction
Presentation transcript:

© University of Reading Dr Liam J. McGuffin RCUK Academic Fellow 20 April 2014 McGuffin Group Methods for Prediction of Protein Disorder Two methods for different categories: DISOclust – Server version DISOclust – Manual version

To put your footer here go to View > Header and Footer2 DISOclust (Server) Simple clustering method – unsupervised Compares multiple models from nFOLD3 server Calculates per-residue accuracy for each model using ModFOLDclust Outputs probability of disorder (1 minus the mean per-residue accuracy) Combines score with the scaled DISOPRED score Manual method – same protocol but using all server models P d = posterior probability of disorder M = the set of models S rm = S r score for a model (m). Disorder score 1-(mean residue accuracy) S-score (distance between residues) S i = S-score for residue i d i = distance between aligned residues d 0 = distance threshold (3.9) Residue accuracy (mean S-score) S r = predicted residue accuracy for model N = number of models A = set of alignments S ia = Si score for a residue in a structural alignment (a)

To put your footer here go to View > Header and Footer3 False positive rate 0-1 True positive rate False positive rate True positive rate AUC, Area Under Curve (see ROC plots below); SE, Standard Error in AUC score; AUC(0-0.1), partial area under curve between false positives. Method AUC SEAUC (0-0.1)AUC-SEAUC+SE DISOclust_server DISOclust_manual DISOPRED

To put your footer here go to View > Header and Footer4 Answers to specific questions… In your analysis of disorder do you treat short disordered regions, e.g. a missing loop in a crystal structure, differently than a disordered domain or an entirely disordered protein? No, all regions are treated the same. No specific methods for long or short regions. Can you briefly describe your disorder analysis, i.e. is it based on physical principals, machine learning or a combination of both. Results from s tructure based method (DISOclust) are combined with results from a sequenced based machine learning method (DISOPRED). DISOclust significantly improved all CASP7 methods (see paper). Does your analysis of disorder prediction affect your template free modeling, i.e. does the disorder prediction aid your free model prediction? If so, in what way, in practice, did you use your disorder prediction for free modeling? Did not carry out FM, although the method does work for FM targets Can your disorder prediction distinguish between regions predicted to be fully disordered, i.e. 'cooked spaghetti', or alternatively an ensemble of a few alternative conformations? Correctly identified T0484 and T0500 as fully disordered. Works equally well on long/short regions of disorder. The DISOclust server provides visualisation of multiple alternative conformations.

To put your footer here go to View > Header and Footer5 The DISOclust server McGuffin, L. J. (2008) Intrinsic disorder prediction from the analysis of multiple protein fold recognition models. Bioinformatics, 24,