Helen M. Berman, Rutgers University EMBO Practical Course Section: Searching Structure Databases September 26, 2008 PSI Structural Genomics Knowledgebase.

Slides:



Advertisements
Similar presentations
PubMed/How to Search, Display, Download & (module 4.1)
Advertisements

In the Format section, we have activated the Bibliographic style drop down menu. From this page, you can choose a specific journal or format (e.g. BMC.
Annual Reviews: A Nonprofit Scientific Publisher Bringing the Best Review Literature to the Worldwide Scientific Community for over 75 Years.
Welcome to informaworld TM. The following demo will show you just a few of the features on informaworld TM. Please select where you would like start. ePublication.
SG KB 2009 NIGMS Workshop: Enabling Technologies for Structural Biology Section on Structural Analysis Margaret J. Gabanyi March 4, 2009 How to Use the.
PSI Structural Genomics Knowledgebase Helen M. Berman Bottlenecks Workshop April 14, 2008.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Annual Reviews: A Nonprofit Scientific Publisher Bringing the Best Review Literature to the Worldwide Scientific Community for over 75 Years 1.
1.
On line (DNA and amino acid) Sequence Information Lecture 7.
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Gene Ontology John Pinney
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Archives and Information Retrieval
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Lecture 2.21 Retrieving Information: Using Entrez.
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
IST Computational Biology1 Information Retrieval Biological Databases 2 Pedro Fernandes Instituto Gulbenkian de Ciência, Oeiras PT.
The Protein Data Bank (PDB)
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
A Product of Enterprise Content Management System (CMS) Web & Portal Content Management Systems for faster web publishing Copyright.
Definitions Collaboration – working together on team projects and sharing information, often through ad-hoc processes, to accomplish project goals. Document.
PubMed/How to Search, Display, Download & (module 4.1)
Getting started on informaworld™ How do I register with informaworld™? What do I do if I forget my password? My institution does not subscribe to any journals,
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Copyright OpenHelix. No use or reproduction without express written consent1.
Information Resources for Bioinformatics 1 MARC: Developing Bioinformatics Programs July, 2008 Alex Ropelewski Hugh Nicholas
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
© Wiley Publishing All Rights Reserved. Protein and Specialized Sequence Databases.
Springerlink.com Introduction to SpringerLink springerlink.com.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
Copyright OpenHelix. No use or reproduction without express written consent1.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
We have displayed the Browse publisher drop down menu. This You have full access to: list for an institution where all the material is included in the.
Data and Dissemination Core 1. Overview and EFI Website – Heidi Imker, UIUC 2. EFI LabDB LIMS – Wladek Minor, UVA 3. SFLD – Patsy Babbitt, UCSF (post lunch)
Copyright OpenHelix. No use or reproduction without express written consent1.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
From the Advanced Search page of the Cochrane Library, we have clicked on the Cochrane Reviews: By Topic hyperlink. This has displayed the Topics for Cochrane.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services.
A collaborative tool for sequence annotation. Contact:
Real World Experiences in Operating a Collaboratory: The Protein Data Bank Helen M. Berman Board of Governors Professor of Chemistry.
Protein Structure Database for Structural Genomics Group Jessica Lau December 13, 2004 M.S. Thesis Defense.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
EBI is an Outstation of the European Molecular Biology Laboratory. Literature Resources at the EBI Information Workshop on European Bioinformatics Resources.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
User Guide Enhanced Knowledge Hub. 2 Note Accessing Knowledge Hub 1 2 Access K-Hub by selecting: 1.Knowledge Hub tab, OR 2.Knowledge Hub under My Communities.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
From the initial page of the Cochrane Library, we have clicked on the Cochrane Reviews: By Topic hyperlink. This has displayed the Topics for Cochrane.
SG KB 2009 NIGMS Workshop: Enabling Technologies for Structural Biology Section on Structural Analysis Helen M. Berman March 4, 2009 How to use the PSI.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
OncoTrack Bioinformatics Workshop Max Planck Institute for Molecular Genetics, Berlin Wednesday 6 th November 2013 TimeSubject 13:30-15:00 Introduction.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Getting the Most out of the PDBe
Archives and Information Retrieval
From: Structural database resources for biological macromolecules
Department of Genetics • Stanford University School of Medicine
TargetDB and PEPCDB •
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Helen M. Berman, Rutgers University EMBO Practical Course Section: Searching Structure Databases September 26, 2008 PSI Structural Genomics Knowledgebase

Knowledgebase

SG KB Knowledgebase Vision The PSI Structural Genomics Knowledgebase (PSI SGKB) will turn the products of the PSI effort into major advances in knowledge that can be used to understand living systems and human disease. It will be a key resource for the advancement of biology, biochemistry, functional genomics, pharmacology, bioinformatics, chemistry, education and clinical medicine.

SG KB Knowledgebase Goals To provide a “marketplace of ideas” that  connects protein sequence information to 3D structures and homology models  enhances functional annotations  provides access to new experimental protocols and materials To kick start and enable advancements in structural genomics  by communicating and providing visibility and accessibility of information and technology advances of the PSI  through presentation and discussion of the most provocative challenges with the general community  by fostering community collaborations

SG KB PSI SGKB features  Database searchable by sequence, text, and PDB ID  Search results include aggregate reports and inventories  Links to PSI projects, external resources, and publications  SG Gateway with Nature delivers featured articles, PSI news and events, featured molecules and technologies, molecules of unknown function and broader SG content  Notification to public about recently solved PSI structures or new editorial content

SG KB  To capture, make accessible, and highlight elements of the high- throughput pipelines for use by various scientific communities  To leverage such information through the generation of molecular models and functional annotation Scope Genomic Based Target Selection Data Collection Structure Determination Isolation, Expression, Purification, Crystallization PDB Deposition & Release Models Annotations Publications Metrics Technology Experimental Tracking Target Selection Materials

SG KB Knowledgebase Users  Biologists  Biochemists  Functional Genomicists  Pharmacologists  Bioinformatics  Chemists  Clinical Researchers and Physicians  Teachers and Students

A Tour of the PSI SGKB

SG KB 1 PSI SGKB Homepage Receive alerts Explore structures of unknown function View latest structures & statistics Teasers for this month’s editorial content

SG KB 1 Structural Genomics Update Editorial content:  Research Advances  Featured Molecule  Research Library  News  Events Calendar Search Box available

SG KB 1 About this site  Additional help content (getting started), site map, contact information, and terms of use About PSI  Information about the Protein Structure Initiative and the PSI SGKB PSI centers  Links to the PSI Large-Scale and Specialized Centers PSI Resources  Links to a list of our Biomedical Protein Target themes, Target Selection documentation, and the Modeling, Technology, Experimental Data Tracking, Materials, and Publications Resources NPG Resources  Links to the other Nature gateways, journals and other resources provided by the Nature Publishing Group

SG KB 1 E-alerts: Receive news of PSI SGKB updates by or RSS feed  Updates to editorial content (monthly)  Newly released structures (weekly) Functional Sleuth: explore protein structures solved by the PSI whose functions are unknown Latest PSI statistics Provides current tallies of structures solved  View detailed reports of which structures have solved by the PSI (“Metrics”)  View the latest structures solved by the PSI

SG KB Functional Sleuth

SG KB 1 Metrics I.1.ANumber of novel experimental PSI-2 structures 1219 I.1.BNumber of distinct experimental PSI-2 structures non-redundant sequences 1714 I.1.DTotal number of experimental PSI-2 structures 1933 I.1.ENumbers of experimentally determined distinct residues Numbers of experimentally determined novel residues I.2.JNumber of experimental structures of human proteins 71 I.2.KNumber of experimental structures of eukaryotic proteins 206 I.2.MNumber of experimental structures of membrane proteins 10 I.2.NNumber of experimental structures determined at the atomic level using x-ray crystallography 1753 Number of experimental structures determined at the atomic level using NMR methods 180 PSI-2 Summary Statistics Updated Sept 5, 2008  novel structures - structures with less than 30% sequence identity to an existing structure at the time of PDB deposition  distinct proteins - structures with non-redundant sequences less than 98% sequence identity

SG KB See latest structures…

Searching the PSI SGKB

SG KB Searching the PSI SGKB

SG KB Searching the PSI SGKB 1 Begin your search here:  By protein sequence  By keyword (plain text)  By structure (PDB ID) All PSI SGKB data and resources are accessible from one central Search Box

SG KB Sequence/PDBid search  Available structures of proteins with similar/identical amino acid sequences  Any structural and functional properties (annotation) determined from these protein structures  Available theoretical/homology models created with amino acid sequences similar to your query  Any information about similar protein sequences (targets) studied by the PSI structural genomics efforts  The protocols used during those PSI research efforts  Ordering information to obtain DNA clone materials, if available.

SG KB Sequence/PDBid search

SG KB Structures In the Structures tab, experiment and reference information about the structure is displayed:  View matching sequence alignment and sequence identity  Link to RCSB PDB’s Structure Explorer to learn more about the structure  View information about chemical substrates in the experiment (bound ligands and substrates)  Download the 3D atomic coordinates for the molecule  If published, connect to its citation and abstract at PubMed.

SG KB Structures

SG KB Annotations Genomic features: gene identifier, name and synonyms, operon/regulon mappings from databases Protein sequence features: amino acid sequence, taxonomy & phylogeny, isoforms, single nucleotide polymorphisms, post-translational modifications, and sequence families. Structure features: secondary structure, oligomeric state, structure and functional domains, DNA binding motifs, sites of interaction Ligands: information about bound ligands Functional/Biochemical classifications: enzyme class, substrate specificity and catalysis, epitope mapping, cellular location, organ location Protein Networks and Biological Systems: enzymatic pathways and networks information Literature: synonyms for protein names, links to PubMed by database identifier and related text and authors Information from more than 50 external annotation resources

SG KB Annotations  every annotation provided is a link to more content

SG KB Future Annotations Layout Quick Annotations Summary will indicate available information  annotations will be organized by scientific category

SG KB Models In the Models tab, a list of the homology models available from the integrated Protein Models Portal are displayed  view the structural model, and interact with it in a Java window (AstexViewer)  download the model’s atomic coordinates  view predicted domain annotations from databases such as InterPro  view sequence/domain annotations related to the template structure, such as SCOP and CATH

SG KB Models AstexViewer lets you view the model

SG KB Experimental Data Tracking TargetDB contains worldwide structural genomics protein target information.  Search by sequence, Target ID, project site, status, update date, protein name, and source organism  Links to other sequence databases, domain databases, other structural genomics centers, and the RCSB PDB  Download target data  Target statistics summary PepcDB contains all the functionality of TargetDB plus  Experimental protocols  Detailed status history of experimental trials  Information on failed experiments

SG KB TargetDB Search

SG KB Experimental Tracking PepcDB search form

SG KB Protocols from PepcDB

SG KB Materials Repository Directly order targets of interest

SG KB Text Search With a plain text search, find information from:  PSI Center web pages  Publications resource  Technology resource  Annotation database

SG KB Text Search Site Search access web sites and files from 10 PSI centers and the Technology Portal

SG KB Text Search Structure Publications  records displays the PDB ID and the link to the RCSB PDB Structure Explorer page  their doi and Pubmed identifier  a link to the abstract

SG KB Text Search Annotations Text search may find annotations from the database if the text query is biological term

SG KB Text Search Methodology Publications  their doi and Pubmed identifier  a link to the abstract

SG KB Technology Module PSI Centers are actively developing technologies and methodologies for all aspects of the structure determination pipeline Functional Annotation Publications Genomic Based Target Selection Data Collection Structure Determination Isolation, Expression, Purification, Crystallization PDB Deposition & Release

SG KB Technologies

SG KB Publications to Date

SG KB Acknowledgements KB GroupPSI Resources Wendy TaoAndrei Kouranov (Exp. Data Tracking) Raship ShahTorsten Schwede (Models) James ChunPaul Adams (Technology) Margaret Gabanyi Josh La Baer (Materials) Tom OldfieldWladek Minor (Publications) John Westbrook Access Information Nature Matthew Day Boyana Konforti KB Steering Committee Chair, Eaton Lattman