Picormatics Today’s goal: Give you an overview of some recent technological bioinformatics developments that can be applied to picornaviruses. Where possible.

Slides:



Advertisements
Similar presentations
Bd Building Admin Site Layout Charlotte Jenkins Advanced Web Technologies Assignment 2.
Advertisements

Automatic Timeline Generation from News Articles Josh Taylor and Jessica Jenkins.
Aladdin Overview *************************** NOTICE ************************* PROPRIETARY AND CONFIDENTIAL MATERIAL. DISTRIBUTION, USE, AND DISCLOSURE.
Case Study: Dopamine D 3 Receptor Anthagonists Chapter 3 – Molecular Modeling 1.
R4 Dynamically loading processes. Overview R4 is closely related to R3, much of what you have written for R3 applies to R4 In R3, we executed procedures.
Blast to Psi-Blast Blast makes use of Scoring Matrix derived from large number of proteins. What if you want to find homologs based upon a specific gene.
Analysis of the Structure-Function-Dynamics Relationship of G-Protein Coupled Receptors Augustus J. Olthafer & Dr Sanchita Hati Department of Chemistry.
1 ADVANCED MICROSOFT POWERPOINT Lesson 5 – Using Advanced Text Features Microsoft Office 2003: Advanced.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Data mining in bioinformatics: problems and challenges Sorin Draghici WWW:
Nuclear receptor function
©CMBI 2003 MUTANT DESIGN BIO- INFORMATICS QUESTION ‘MOLECULAR BIOLOGY’ BIOPHYSICS.
Power and weakness of data Power: data + software + bioinformatician = answer. Weakness: Data errors. Data poorly understood. Poor software. Never enough.
© Wiley Publishing All Rights Reserved. Analyzing Protein Sequences.
©CMBI 2006 Molecular motors At the organism – population level, motors are needed to transfer materials. At the organelle – organ level, motors are needed.
Protein Functional Site Prediction The identification of protein regions responsible for stability and function is an especially important post-genomic.
©CMBI 2003 MUTANT DESIGN BIO- INFORMATICS QUESTION ‘MOLECULAR BIOLOGY’ BIOPHYSICS G Vriend CMBI KUN Nijmegen Netherlands
Progressive MSA Do pair-wise alignment Develop an evolutionary tree Most closely related sequences are then aligned, then more distant are added. Genetic.
Seminar series 2 Protein structure validation. In 't verleden ligt het heden; in 't nu, wat worden zal. The past: Linus Pauling ‘Inventor’ of helix and.
What can (many) sequences tell us?
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Applications of Homology Modeling Hanka Venselaar.
Signals in Sequences The number of sequences available for analysis rapidly approaches infinite. We need new ways to look at all this information.
Evaluating alignments using motif detection Let’s evaluate alignments by searching for motifs If alignment X reveals more functional motifs than Y using.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Making a Virtual Book With PowerPoint 2007 How to make a virtual book Using PowerPoint 2007 This is not a presentation template. This is not the venue.
Proceso kintamybių modeliavimas Modelling process variabilities Donatas Čiukšys.
What can (many) sequences tell us?. Nuclear receptor function.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Rising accuracy of protein secondary structure prediction Burkhard Rost
Bioinformatics 2 -- Lecture 8 More TOPS diagrams Comparative modeling tutorial and strategies.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
©CMBI 2003 MUTANT DESIGN BIO- INFORMATICS QUESTION ‘MOLECULAR BIOLOGY’ BIOPHYSICS.
Bacterial Genetics - Assignment and Genomics Exercise: Aims –To provide an overview of the development and.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Integrating the Bioinformatic Technology Group into your research programme Introduction People and Skills Examples Integrating the BTG Contacts BHRC Away.
Tracking CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Localising regulatory elements using statistical analysis and shortest unique substrings of DNA Nora Pierstorff 1, Rodrigo Nunes de Fonseca 2, Thomas Wiehe.
ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
Sequence Alignments with Indels Evolution produces insertions and deletions (indels) – In addition to substitutions Good example: MHHNALQRRTVWVNAY MHHALQRRTVWVNAY-
3DM: Protein Super-family Platforms 3DM Protein super-family data integration Tom van den Bergh Bio-Prodict.
3DM: Protein engineering Super-family platforms Bio-Prodict DM super-family systems Henk-Jan Joosten Remko Kuipers Tom v/d Bergh Bas Vroling.
Pairwise sequence alignment Lecture 02. Overview  Sequence comparison lies at the heart of bioinformatics analysis.  It is the first step towards structural.
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
What is phage display? An in vitro selection technique using a peptide or protein genetically fused to the coat protein of a bacteriophage.
Prezi Tutorial Nadia Kudla, PharmD; Teresa Breslin, PharmD; Sydney Hendry, MD; Amanda Wojtusik, PharmD UPMC St. Margaret Faculty Development Fellowship.
Recap Resizing the Vector Push_back function Parameters passing Mechanism Primitive Arrays of Constants Multidimensional Arrays The Standard Library string.
Behrouz A. Forouzan TCP/IP Protocol Suite, 3rd Ed.
Intro to Real World Robotics Upcoming course project Martin Jagersand
Understanding Search Engines
Shelling Protein Interfaces
Lab 8.3: RNA Secondary Structure
Pfam: multiple sequence alignments and HMM-profiles of protein domains
Bioinformatics Biological Data Computer Calculations +
Nuclear Receptor structures
Molecular motors At the organism – population level, motors are needed to transfer materials. At the organelle – organ level, motors are needed to transfer.
Hands-on: Reviewing BLAST
A Mutation in the Fibroblast Growth Factor 14 Gene Is Associated with Autosomal Dominant Cerebral Ataxia  John C. van Swieten, Esther Brusse, Bianca M.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Homology modeling in short…
Volume 7, Issue 6, Pages (June 2001)
Presentation transcript:

Picormatics Today’s goal: Give you an overview of some recent technological bioinformatics developments that can be applied to picornaviruses. Where possible in less than a day's work, I have applied those techniques, as an example, to 'my' virus: R14. This seminar is available (without © ) from:

Some notes up front Your community is not very WWW oriented. This is concluded from a low number of cross pointers, high numbers of dead links and incomplete sites, and from a lack of update dates,contact addresses, references, etc. Your community is not very bioinformatics oriented either. Example, holds a beautifully complete list of VP1 sequences, one-by-one....

Your 'simple' bioinformatics options Many protein structures Many protein sequences This allows for structure based sequence alignments that are very precise, and therefore allow for novel sequence analysis techniques 1) Correlated mutation analysis 2) Sequence variability analysis

Structure-based alignment This is top left corner of alignment of ~1000 sequences of ~300 residues

Correlated mutations APGADSFGDFHKM Gray is conserved ALGADSFRDFRRL Black is variable ARGLDPFGMNHSI Red/green are AGGLDPFRMNRRV correlated mutations Correlated mutations guarantee a function. Function is determined by the position in the structure; not by the residue type.

Correlated mutations Pilot indicates this works for VP1,2,3 too.

Correlated mutations and drug design

Correlations between residues and ligands. Correlated mutations and drug design

Automatic structure comparison Rhino 9 Polio 2 FMDV 1 Mengo 1

Automatic structure comparison

R14 drug placed in R16

antagonist agonist Automatic structure comparison Example from nuclear hormone receptor drug design study

Back to sequences First rule of sequence analysis: If a residue is conserved, it is important.

Sequence analysis (continued) Second rule of sequence analysis: If a residue is very conserved, it is very important.

But what about the variable residues 20 E i =  p i ln(p i ) i=1

Sequence variability is the number of residues that is present in more than 0.5% of all sequences. But what about the variable residues

Entropy - Variability Entropy = Information Variability = Chaos

11 main function 12 first shell around main function 22 core residues (signal transduction) 23 modulator 33 mainly surface Entropy - Variability

Most information about mutations is carefully hidden in the literature. Automatic extraction of this information is no longer science-fiction. More than 90% of the 2226 mutations used for the previous few slides were extracted automatically from the literature. We extracted160 more mutations 'by hand'. Problems are mainly related to protein/gene nomenclature, residue numbering, and unclear description of the effects. Mutation information

Mutation data

A PubMed search gives: picornavirus mutation1176(2) rhinovirus mutation101(62) poliovirus mutation600(144) mengovirus mutation30(29) About 1 in 5 (in a small manually checked subset) contained identifiable mutation information in the abstract. But unfortunately often with nomenclature that 'our' software doesn't understand yet. Picorna mutation information

Now something totally different Motion is the main ingredient for protein function. Even if that function is as 'dumb' as being a container for the RNA. For example, all early Rhino directed drugs were aimed at reducing the mobility of its VP1…

The simulation of protein motion is normally called molecular dynamics, or MD. MD is commonly known as a very difficult technique for which you need the help of an army of mathematicians. That is no longer true. Dynamite (based on Bert de Groot's CONCOORD software) predicts protein motions via the WWW. Protein dynamics calculation

A short break for a word from our sponsors Laerte Oliveira Our industrial sponsor: FLORENCEFLORENCE HORNHORN Wilma KuipersWeesp Bob Bywater Copenhagen Nora vd WendenThe Hague Mike SingerNew Haven Ad IJzermanLeiden Margot BeukersLeiden Fabien CampagneNew York Øyvind EdvardsenTroms Ø Simon FolkertsmaFrisia Henk-Jan JoostenWageningen Joost van DurmaBrussels David Lutje HulsikUtrecht Tim HulsenGoffert Manu BettlerLyon Elmar Krieger Simon Folkertsma David Tim AdjeMargot Fabien Manu