I am not a PDBid I am a Biological Macromolecule Philip E. Bourne University of California San Diego

Slides:



Advertisements
Similar presentations
PubMed/How to Search, Display, Download & (module 4.1)
Advertisements

EndNote Web Reference Management Software (module 5)
NIH Public Access Compliance Cleveland Health Sciences Library Case Western Reserve University Kathleen C. Blazar.
Pensoft Writing Tool (PWT) Lyubomir Penev ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussles ViBRANT.
Introduction to Mendeley. What is Mendeley? Mendeley is a reference manager allowing you to manage, read, share, annotate and cite your research papers...
Making small data big! The Biodiversity Data Journal (BDJ) Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts, Vincent Smith ViBRANT.
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
1.
PMID and PMCID Primer How to find PMCID for use in NIH reports and other documents Andrea Twiss-Brooks Co-Director, Science Libraries.
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2004.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2005.
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
Evidence-Based Information Retrieval in Bioinformatics
Using MeSH: Medical Subject Headings Before searching, use the MeSH database to identify search terms.
SCIENTIFIC SOLUTIONS Thomson ResearchSoft Paul Torpey April 8, 2005.
New Modes of Scholarly Communication and Learning Philip E. Bourne University of California San Diego 1WSU December 2, 2008.
Management and Distribution of Chemical Data in the Protein Data Bank John Westbrook, Dimitris Dimitropoulos, Jasmine Young, Peter Rose, Philip E. Bourne.
1 NIH Public Access Policy Policy on Enhancing Public Access to Archived Publications Resulting From NIH-Funded Research (Public Access Policy)
Jean Phillips Schwerdtfeger Library Space Science and Engineering Center University of Wisconsin-Madison November 2005.
PubMed/How to Search, Display, Download & (module 4.1)
Moving beyond free text. Authors Scientist does research Scientist publishes research results in journal article Old Paradigm:
Machine Learning in the New World of Scholarly Communication Philip E. Bourne University of California San Diego
The Role of Ontologies in Improved Scholarly Communication Philip E. Bourne University of California San Diego
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
The Alive Tree of Knowledge A new concept in Collective Interactive Knowledge Integration.
What is SciVee? SciVee Partners University of California, San Diego.
“If I could do it over again, I’d publish that paper in PLoS Biology.” James Watson 50 th anniversary of the publication of DNA structure Eagle Pub, Cambridge,
Some Thoughts on Scholarly Communication and the Role of Bio-ontologies Philip E. Bourne University of California San Diego
RLIMS-P: A Rule-Based Literature Mining System for Protein Phosphorylation Hu ZZ 1, Yuan X 1, Torii M 2, Vijay-Shanker K 3, and Wu CH 1 1 Protein Information.
Introduction to Mendeley. What is Mendeley? Mendeley is a reference manager allowing you to manage, read, share, annotate and cite your research papers...
Proteopedia: A Breakthrough in Biomolecular Structure Communication Eric Martz UMass MCB Colloquium October 27, 2008.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
Architecture for a Database System
The Promise of Open Access Philip E. Bourne PhD University of California San Diego Open Access Day October 14, 2008
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Committed to making the world’s scientific and medical literature a public resource.
My Bibliography/eRA Commons Integration More utility, less work Bart Trawick Neil Thakur Commons Working Group, 9/22/09.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Open Science One Person’s View and What We Are Doing About It Philip E. Bourne University of California San Diego 1PSB Open Science Workshop.
The Evolving Digital Mathematics Library: A Mathematics Librarian’s Perspective Timothy W. Cole University of Illinois at Urbana-Champaign 8 Dec
Towards Data Attribution & Citation in the Life Sciences Philip E. Bourne UCSD 8/22/11Data Attribution and Citation.
Revised 7/19/10.  This policy states that, as of April 7, 2008, all articles resulting from U.S. National Institutes of Health (NIH) funds must be submitted.
Philip E. Bourne Professional Development Lecture 7 Understanding and Working the Publishing Process.
Data Integration and Management A PDB Perspective.
How to start to write a scientific paper Ashgan Mohamed, Ph.D Assistant Professor Cairo University.
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
Student Edition: Gale Info Trac Database Lesson Grades 9-12 High School Student Edition: Gale Info Trac Database Lesson Grades 9-12 High School Anita Cellucci.
Blogs and Wikis Tim Bornholtz. Purpose Many new technologies are available on the internet that enable people to publish and edit content without expensive.
Copyright OpenHelix. No use or reproduction without express written consent1.
Taming the Big Data in Computational Chemistry #euroCRIS2015 Barcelona 9-11-XI-2015 Carles Bo ICIQ (BIST) -
Telling Research Stories Through SciVee Philip E. Bourne University of California San Diego AAAS February 21, 2010.
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Organize. Collaborate. Discover. 1 Introduction to Mendeley.
PubMed Basics Barbara A. Wood, MLIS Calder Library University of Miami Miller School of Medicine.
MEDLINE®/PubMed® PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center An introduction.
Windows Vista Configuration MCTS : Internet Explorer 7.0.
Copyright 2007, Paradigm Publishing Inc. BACKNEXTEND 8-1 LINKS TO OBJECTIVES Import data from another Access table Import data from another Access table.
Ingenuity Pathway Analysis Alex Pico. Description "IPA is a software application that enables researchers to analyze and understand the complex biological.
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
Computer Aided Software Engineering (CASE)
Next Generation Preprint Service
The NIH Public Access Policy
Reference management soft wares Endnote & Mendeley
Philip Bourne University of California San Diego
University of California San Diego
New Features Update Web of Knowledge : Discovery Starts Here
Presentation transcript:

I am not a PDBid I am a Biological Macromolecule Philip E. Bourne University of California San Diego

Striving to be Recognized The “identity” of a macromolecular structure – functional and structural features and its broad role in a living system – is not established very easily by the majority of biologists. Given the technology available to us today surely it is time that this situation changed?

This is Not to Say that the Identity has not Improved Improved chemical description of polymers and monomers Remove sequence and taxonomic inconsistencies Improved representation of viruses Primary citation assignments REMARKS, SF files, NMR restraints…. Henrick et al. NAR : D426-D433

For Example… Chemical Components Dictionary: –Model and idealized coordinates –Chemical descriptors (e.g. SMILES) and systematic names –Stereochemical assignments and aromatic bond assignments –IUPAC nomenclature for standard amino acids and nucleotides with the exception of the well-established convention for C- terminal atoms OXT and HXT –More conventional atom labeling –Removal of redundant ligands –Additional description of protonation states

This now sets the stage for the next stage of identity development

The Problem Can be Defined as A Need to Change the Workflow

Workflow Entry Point Sequence Literature Structure Function Pathway…

The best way to change the workflow is to remove the barrier between the literature (knowledge) and the PDB (data) How Can This Happen?

Possibility 1 – Proteopedia A Completely New Beginning Advantages –Anyone can contribute simply –Community consensus seems to support quality (e.g. Wikipedia) Disadvantages –Where is the reward? –Wiki format limited for providing a structural identity Eran Hodis, Eric Martz, Jaime Prilusky, Joel L. Sussman

Possibility 2 - iSee Advantages –High quality annotation Disadvantages –Time consuming –Does not scale

Possibility 3 – Database and Literature Integration Advantages –Reward through publication –Potentially comprehensive –Retains full power of the database and literature Disadvantages –Literature accessibility –Harder to do

The Disadvantage of Literature Accessibility is Disappearing Slowly The NIH Public Access Policy is a Term and Condition of Award for all grants and cooperative agreements active in Fiscal Year 2008 (October 1, September 30, 2008) or beyond, and for all contracts awarded after April 7, 2008.

So What is the Policy for NIH Sponsored Research? You can only agree to a journal copyright policy if that policy allows you to deposit the paper in PubMed Central (PMC) The paper must be deposited in PMC How this happens depends on the journal

BioLit Our Effort at Database-Literature Integration J.L.Fink, S. Kushch, P. Williams & P.E.Bourne 2008 BioLit: Integrating Biological Literature with Databases NAR 36(S2) W P.E.Bourne, J.L.Fink, M.Gerstein 2008 Open Access: Taking Full Advantage of the Content PLoS Comp. Biol. (Editorial) 4(3) e

1. A link brings up figures from the paper 0. Full text of PLoS papers stored in a database 2. Clicking the paper figure retrieves data from the PDB which is analyzed 3. A composite view of journal and database content results BioLit: Tools for New Modes of Scientific Dissemination Biolit integrates biological literature and biological databases and includes: –A database of journal text –Authoring tools to facilitate database storage of journal text –Tools to make static tables and figures interactive 4. The composite view has links to pertinent blocks of literature text and back to the PDB The Knowledge and Data Cycle

How Much of the Structure Literature is Currently Found in the Accessible PMC? articles were not parasable 7% PDBids out of referenced in ?? PMC articles 338 Figures have legends that include PDBids

ICTP Trieste, December 10, 2007

Where Can we Go From Here with BioLit? The Ideal Situation is to Capture Relationships as the Paper is Written

BioLit Plugin Project Rather than Post-processing the Document the Author Controls the Semantic Tagging

Author Paper Word File in Docx format Publisher BioLit Plugin Project

Plugin Architecture

Context-Sensitive Data Access Display of information of database entries when the user clicks on the ID in the document Display of ontology terms related to terms in the document text, using local database search

Ontologies are Stored in a Local Database

User Configurable Selection Fully user configuration ontology and database identifier selection All searches occur within the user’s desktop computer Desired ontologies are downloaded and installed automatically, and update periodically BioLit installer XML file provides the application with the information needed to download and install ontologies.

Possibility 4. SciVee - A Different Kind of Learning Experience Why not listen to the enthusiastic author talk about the structure while you see the structure respond to their dialog?

YouTube for Scientists

Motivation

Pubcast – Video Integrated with the Full Text of the Paper

Pubcast - Making PSP Washington DC Feb. 2008

Channels – Just Like TV ICTP Trieste, December 2007

Professional Profile ICTP Trieste, December 2007

Create & Join Communities and Discussion Groups ICTP Trieste, December 2007

Finding What you Want Tag clouds generated automatically from MESH headings Full text of the papers indexed Browsing by audience type, subject, language etc.

SciVee – Viral Projects Sweetwater School District “Postercasts” Science video competitions “Pubumentaries”

Summary New modes of learning about structure are possible Number 6 never did get identified Time will tell whether a PDBid will become more than a number

Acknowledgements SciVee Team –Apryl Bailey –Tim Beck –Leo Chalupa –Marc Friedman –Alex Ramos –Willy Suwanto BioLit Team J. Lynn Fink Sergey Kushch Parker Williams Greg Quinn CT Watch 2007, 3(3) 26-31

Questions?

Questions?