Abstract Visualization of protein structural data is an important aspect of protein research. Incorporation of genomic annotations into a protein structural.

Slides:



Advertisements
Similar presentations
Managing References : Mendeley
Advertisements

WEB AND WIRELESS AUTOMATION connecting people and processes InduSoft Web Solution Welcome.
Digital Certificate Installation & User Guide For Class-2 Certificates.
SG KB 2009 NIGMS Workshop: Enabling Technologies for Structural Biology Section on Structural Analysis Margaret J. Gabanyi March 4, 2009 How to Use the.
® Microsoft Office 2010 Browser and Basics.
Python-based Tools and Web Services for Structural Bioinformatics Randy Heiland, Charles Moad IU Pervasive Technology Labs Sean Mooney, IU School of Medicine.
© 2010 Delmar, Cengage Learning Chapter 1 Getting Started with Dreamweaver.
© Wiley Publishing All Rights Reserved. How Most People Use Bioinformatics.
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Single Search By Rakphao Theppan, librarian Searching Online Resources.
IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
USF – Bioinformatics Master Project Outline Buck Institute The Mooney Lab Project Description 1.
XP Browser and Basics1. XP Browser and Basics2 Learn about Web browser software and Web pages The Web is a collection of files that reside.
Archives and Information Retrieval
1 Computing for Todays Lecture 22 Yumei Huo Fall 2006.
Windows Server WHAT IS ACTIVE DIRECTORY? FUNDAMENTALS OF THE ACTIVE DIRECTORY – Benefits of Using the Active Directory in an Enterprise Environment.
Browser and Basics Tutorial 1. Learn about Web browser software and Web pages The Web is a collection of files that reside on computers, called.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Installing Windows XP Professional Using Attended Installation Slide 1 of 41Session 2 Ver. 1.0 CompTIA A+ Certification: A Comprehensive Approach for all.
Using 3D-SURFER. Before you start 3D-Surfer can be accessed at For visualization.
Use Watch folders to automatically add PDFs to Mendeley Desktop. When you place a document in a watched folder, it will be automatically added to Mendeley.
Linux Operations and Administration
WebGBrowse A Web Server for GBrowse Configuration Ram Podicheti B.V.Sc. & A.H. (D.V.M.), M.S. Staff Scientist – Bioinformatics Center for Genomics and.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Basics of Web Databases With the advent of Web database technology, Web pages are no longer static, but dynamic with connection to a back-end database.
Connecting to Network. ♦ Overview ► A network connection is required to communicate with other computers when they are in a network. Network interface.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
CIS 375—Web App Dev II Microsoft’s.NET. 2 Introduction to.NET Steve Ballmer (January 2000): Steve Ballmer "Delivering an Internet-based platform of Next.
Chapter 6 The World Wide Web. Web Pages Each page is an interactive multimedia publication It can include: text, graphics, music and videos Pages are.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
Friday 17 rd December 2004Stuart Young Capstone Project Presentation Predicting Deleterious Mutations Young SP, Radivojac P, Mooney SD.
An and Collaboration Suite LI 815 XR Kristen Gripp.
1 PyMOL Evolutionary Trace Viewer 1.1 Lichtarge Lab Sept. 13, 2010.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
BIOINFORMATICS IN BIOCHEMISTRY Bioinformatics– a field at the interface of molecular biology, computer science, and mathematics Bioinformatics focuses.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Copyright OpenHelix. No use or reproduction without express written consent1.
Jessica Dantzer Mooney Lab Center for Computational Biology and Bioinformatics Indiana University School of Medicine
BLAST Basic Local Alignment Search Tool (Altschul et al. 1990)
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
CS4710 Why Progam?. Why learn to program? Utility of programming skills: understand tools modify tools create your own automate repetitive tasks automate.
Introduction to Morpho BEAM Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
NCBI Genome Workbench Chuong Huynh NIH/NLM/NCBI Sao Paulo, Brasil July 15, 2004 Slides from Michael Dicuccio’s Genome Workbench.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
3D-EM DAS Extending DAS to 3D-EM and Fitting /02/26.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
A collaborative tool for sequence annotation. Contact:
1 PRINCIPAL INVESTIGATOR USE OF THE ST ScI ELECTRONIC GRANTS MANAGEMENT SYSTEM January, 2001.
IS493 INFORMATION SECURITY TUTORIAL # 1 (S ) ASHRAF YOUSSEF.
PubMed/How to Search, Display, Download & (module 4.1)
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Active Directory. Computers in organizations Computers are linked together for communication and sharing of resources There is always a need to administer.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
(ITI310) By Eng. BASSEM ALSAID SESSIONS 10: Internet Information Services (IIS)
HANDS-ON ConSurf! Web-Server: The ConSurf webserver.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
MIRC Overview Medical Imaging Resource Center John Perry RSNA 2009.
XP Creating Web Pages with Microsoft Office
WikID installation/training
Integrated technology
Take a REST from manual searching: PDBe, programmatically
Mendeley Download Instructions
Integrated technology
Integrated technology
Explore Evolution: Instrument for Analysis
Mendeley Download Instructions
Welcome to the GrameneMart Tutorial
Presentation transcript:

Abstract Visualization of protein structural data is an important aspect of protein research. Incorporation of genomic annotations into a protein structural context is a challenging problem, because genomic data is too large and dynamic to store on the client and mapping to protein structures is often nontrivial. To overcome these difficulties we have developed a suite of SOAP-based Web services and extended the commonly used structural visualization tools UCSF Chimera and Delano Scientific PyMOL via plugins. The initial services focus on (1) displaying both polymorphism and disease associated mutation data mapped to protein structures from arbitrary genes and (2) structural and functional analysis of protein structures using residue environment vectors. With these tools, users can perform sequence and structure based alignments, visualize conserved residues in protein structures using BLAST, predict catalytic residues using an SVM, predict protein function from structure, and visualize mutation data in SWISS-PROT and dbSNP. The plugins are distributed to academics, government and nonprofit organizations under a restricted open source license. The Web services are easily accessible from most programming languages using a standard SOAP API. Our services feature secure communication over SSL and high performance multi-threaded execution. They are built upon a mature networking library, Twisted, that allow for new services to easily be integrated. Services are self-described and documented automatically enabling rapid application development. The plugin extensions are developed completely in the Python programming language and are distributed at The LSW Website contains developer tools and mailing lists, and we encourage other developers to extend their applications using our services. LifeScienceWeb Services: Integrated Analysis of Protein Structural Data Charles Moad*, Randy Heiland*, Sean D. Mooney  *Pervasive Technology Labs  Center for Computational Biology and Bioinformatics, Department of Medical and Molecular Genetics Indiana University, Indianapolis, Indiana Updates The annotations are currently updated every 2-3 months. Internally, we provide services for annotating genes or coordinates not in the PDB usually through a collaboration. For information on how to do this please contact Sean Mooney, Acknowledgements CM and RH are funded through the IPCRES Initiative grant from the Lilly Endowment. SDM is funded from a grant from the Showalter Trust, an Indiana University Biomedical Research Grant and startup funds provided through INGEN. The Indiana Genomics Initiative (INGEN) is funded in part by the Lilly Endowment. The authors would like to thank the authors of UCSF Chimera and PyMOL for their help in extending their applications. You can download these tools from the following: UCSF Chimera: Delano Scientific PyMOL: Project Goals Web services are an efficient way to provide genomic data in the context of protein structural visualization tools. Our goal is to define a series of bioinformatic web services that can be used to extend protein structural visualization tools, and other extensible computational biology desktop applications. Our current focus is on extending UCSF Chimera ( and Delano Scientific PyMOL( Figure 1: Screen grab of the current services list from Services currently offered include: ClustalW alignments Mutation PDB mapping SVM based catalytic residue prediction Sequence conservation based on PSI-BLAST PSSM Services Model Web services are an efficient way to provide genomic data in the context of protein structural visualization tools. Our goal is to define a set of bioinformatic web services that can be used to extend protein structural visualization tools, and other extensible computational biology desktop applications. We are currently focused on extending UCSF Chimera ( and Delano Scientific PyMOL ( Our services use the SOAP protocol and are currently developed using open source Python-based projects. Software Plugin Extensions We have extended UCSF Chimera and Delano Scientific PyMOL to access our services. The three primary services we provide now are: 1.Disease associated mutation and SNP to protein structure mapping and visualization 2.Protein sequence and structure residue analysis with PSI-BLAST and S-BLEST 3.Catalytic residue prediction using a support vector machine (Youn, E., et al. submitted) Installation Plugin installation is easy and can be performed for a user without root privileges. Currently, all platforms supported by UCSF Chimera and PyMOL are supported and include UNIX platforms, LINUX, Mac OS X and Windows XP. For either of the two clients supported (PyMOL or UCSF Chimera), simply follow the directions linked on the download page at They will thereafter be available from the menu, as shown below. Figure 2: Running our tools from the client application, shown using PyMOL. Automated Sequence and Structural Analysis of Protein Structures Using PSI-BLAST and S-BLEST, we provide analysis of residue environments that match between protein structures in a queried database. Additionally, if the found environments represent similar structure or function classes, the environments that are most structurally associated to those environments are returned. This service is authenticated and SSL encrypted, and all coordinate data and analysis data are stored on our servers. Currently, users can query the ASTRAL 40 v1.69 and ASTRAL 95 v1.69 nonredundant domain datasets, as well as other commonly used nonredundant protein structure databases. Figure 3: MutDB controller window, shown using PyMOL. Controller features include (from the top): Tabbed selection of query type and controller options. Query entry text box and resulting hits from PDB shown below, with PDB ID, chain, residues, and TITLE of PDB. Once a PDB ID above is selected, the coordinates are downloaded and the mutations from Swiss-Prot (SP) and dbSNP (SNP) are retrieved. The database source, type, position, mutation and wildtype flag are displayed. Upon selection, the mutation is highlighted in the coordinate visualization window. Status window that displays the number of mutations or PDB coordinates found. Mutation information window displays a link to the source (which opens in the browser), the position and annotations in that may be available, including PubMed ID (as link), phenotype and a link to MutDB.org. Figure 4: MutDB structure visualization window showing a highlighted mutation using PyMOL. Citations Dantzer J, Moad C, Heiland R, Mooney S. (2005) "MutDB services: interactive structural analysis of mutation data". Nucleic Acids Res., 33, W Peters B, Moad C, Youn E, Buffington K, Heiland R, Mooney S, “Identification of Similar Regions of Protein Structures Using Integrated Sequence and Structure Analysis Tools”. Submitted. Mooney, S.D., Liang, H.P., DeConde, R., Altman, R.B., Structural characterization of proteins using residue environments. Proteins, (4): p Figure 5: S-BLEST controller window shown using UCSF Chimera. On the right, the control box has (from top): Tabs for selecting hits in database with matching environments (or significant sequence similarity using PSI-BLAST) or common functional annotations in the hits. A pull down selection box showing the PDB ID’s with matching environments and the Z-score between the best environments. Upon selection the hit is downloaded and displayed in the visualization window (left). A button to retrieve a ClustalW alignment between the the selected hit structure and the query. The most significantly matched residue environments between the query and the hit. Displays Z-score, the matched residues, the ranking of that match (overall for that query residue environment) and the Manhattan distance. When residues are selected from this list, the coordinates in the visualization window are aligned using a the Chimera match command. Below the windows a ClustalW alignment is shown Visualization of Mutations on Protein Structures We provide mapping between mutations and SNPs and protein structures. The mutations are mapped using Smith-Waterman based alignments. Swiss-Prot mutations and nonsynonymous SNPs in dbSNP are currently supported. See for a current list of the versions of each dataset we provide. Figure 6: S-BLEST controller window showing the function analysis tab using UCSF Chimera. LSW server client WSDLs Twisted (twistedmatrix.com) pywebsvcs.sf.net SOAP (We will address service discovery in the future)