Applying Royal Society of Chemistry Cheminformatics Skills to Support the PharmaSea Project Antony Williams, Alexey Pshenichnov, Valery Tkachenko, Ken.

Slides:



Advertisements
Similar presentations
EPrints - Introducing EPrints 3 Software William J Nixon Digital Library Development Manager, University of Glasgow With many thanks to Les Carr and the.
Advertisements

Supporting Engagement in Open Access: a Publishers Perspective
Guideline for discussion/presentation/critique #1: Really understand the paper …
THE GLOBAL CHEMISTRY NETWORK David James Executive Director, Strategic Innovation Jim Iley Executive Director, Science and Education 3 rd September 2013.
UK National Chemical Database Service: An integration of commercial and public chemistry services to support chemists in the United Kingdom Antony Williams,
Rapid, Small-scale Dereplication of Bioactive Extracts John Blunt University of Canterbury New Zealand 1.
Monica Omodei CAUL/ANDS Webinar – “Joining the Dots” July 17, 2014 Identification of Funders and Grants.
ChemSpider: Searching by Chemical Name. ChemSpider  What is ChemSpider?  How to conduct a search  What do you get?
Personalia: Pre-Sheffield Batchelor’s degree in Chemistry at Oxford Pre-university job in my local public library system Chemistry or information science?
The Royal Society of Chemistry: Advancing Excellence in the Chemical Sciences Dan Dyer Head of Sales.
Royal Society of Chemistry developments to support open drug discovery Antony Williams, Ken Karapetyan, Valery Tkachenko, Colin Batchelor Alexey Pshenichnov.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
SciFinder ® : Part of the process™ 2006 Edition. SciFinder ® : Part of the process™ 2006 Edition SciFinder ® 2006 provides new, powerful capabilities.
How community crowdsourcing and social networking is helping to build a quality online resource for chemists.
Crowdsourced Curation of Chemistry Data. How Bad is Online Chemistry Data? Antony Williams Wolfram Summit, September 2010.
Crowdsourcing Chemistry for the Community – 5 Years of Experiences Antony Williams NFAIS, February 28 th 2012.
The Value of a Unique Researcher Identifier to ChemSpider Projects Antony Williams ORCID Meeting, Boston, May 18 th 2011.
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry Resources (and lessons from President Bush) Antony Williams 5th Meeting on.
Royal Society of Chemistry activities to develop a data repository for chemistry-specific data Aileen Day, Alexey Pshenichnov, Ken Karapetyan, Colin Batchelor,
Chemical Database Projects Delivered by RSC eScience at the FDA Meeting “Development of a Freely Distributable Data System for the Registration of Substances”
ChemSpider – A Combination Platform of Free Chemistry Database, Free Prediction Engines and Crowdsourcing Environment Antony Williams University of Oregon,
Big Data Supporting Drug Discovery Cautionary Tales from the World of Chemistry for Translational Informatics Valery Tkachenko RSC-CSIR/OSDD meeting Pune,
Searching the Chemical Literature: Reference Books and Online Resources Dr. Sheppard Chemistry 4401L.
ChemModLab: A Web-based Cheminformatics Modeling Laboratory S. Stanley Young + ECCR and ChemSpider Teams.
Chemical health and safety data online – data consistency Antony Williams iRAMP Meeting, Ithaca, Feb 2014.
Marrying ACD/Labs technologies to eScience Projects at the Royal Society of Chemistry Antony Williams ACD/Labs User Meeting June 2013.
The Benefits of Participation in the Social Web of Science Antony Williams Research Square October 30 th 2014.
BMJ and Data Sharing Claire Bower, Digital Communications
CAS — Bringing You the World’s Chemistry Knowledge.
MAST Users Group – July 9, 2009 other planned and ongoing projects within MAST.
Vendor Session: ChemSpider, from Royal Society of Chemistry.
Data enhancing the Royal Society of Chemistry publication archive Antony Williams, Colin Batchelor, Peter Corbett, Ken Karapetyan and Valery Tkachenko.
Royal Society of Chemistry SCELC Vendor Day Thomas Romano Regional Sales Manager – The Americas.
MDL Information Systems, Inc. Powering the Process of Invention Donna del Rey Director, Business Planning
Clustering the Royal Society of Chemistry chemical repository to enable enhanced navigation across millions of chemicals Valery Tkachenko, Ken Karapetyan,
A Chemistry Data Repository to Serve Them All Antony Williams.
Structure verification and elucidation using the ChemSpider database Antony J Williams, Valery Tkachenko and Alexey Pshenichnov SERMACS, November 16 th.
General & Background InformationPractical & Useful DataDetailed, Original Research Encyclopedias Dictionaries Reference Texts Books Safety Information.
Enhancements to Galaxy for delivering on NIH Commons
Who is NCCT? National Center for Computational Toxicology – part of EPA’s Office of Research and Development Research driven by EPA’s Chemical Safety for.
The CompTox Chemistry Dashboard: an informational data hub at the
US EPA’s CompTox Chemistry Dashboard
Presenters: Charles Romain and Clare Bakewell
Open Research Data and Open Access publications: How do they sit in the Web of Science? Guillaume Rivalle, Manager, Europe solution specialists
Preliminaries Have you sign up for SciFinder account? Login to PC
Preliminaries Have you sign up for SciFinder account? Login to PC
Experiences in Hosting Big Chemistry Data Collections for the Community Antony Williams July 30th 2014, NIST.
Dealing with the complex challenge of managing diverse chemistry data online Antony Williams, Valery Tkachenko, Alexey Pshenichnov and Ken Karapetyan.
Using Chemistry Databases for Literature, Substance and Reaction Searching for Chemistry Year 3 Students (CM3291) Wee Kin.
SCEC Drupal Website Development Overview and Status
An Overview of Data-PASS Shared Catalog
Royal Society of Chemistry
Using Chemistry Databases for Literature, Substance and Reaction Searching for Chemistry Year 3 Students (CM3291) Magdeline.
ORCID ID: Driving needs for analytical data exchange standards and the potential impacts on the chemical sciences Antony Williams.
Nuclear magnetic resonance NMR spectroscopy is a key analytical technique for structure elucidation of a wide range of materials from small molecules to.
VI-SEEM Data Repository
Preliminaries Have you sign up for SciFinder account? Login to PC
Who knew I would get here from there: How I became the ChemConnector
Beyond the paper resume and how to develop an online profile as a scientist Antony Williams.
VI-SEEM Data Repository
Jay Bhatt Drexel University Libraries
Web Advisory Group (WAG) Implementation Plan
Overview of open resources to support automated structure verification
OMPOL – Visualisation of large chemical spaces
Mobilizing EPA’s CompTox Chemistry Dashboard Data on Mobile Devices
‘The eCrystals Federation’ Management and Publication of Small Molecule Structure Data for the Whole Crystallographic Community S.J. Colesa*, J.G. Freya,
Using Chemistry Databases for Literature, Substance and Reaction Searching for Chemistry Year 3 Students (CM3291) Pattarin.
TGDATA 13/2, WGDIKE 14/2 Good afternoon everyone and thanks for the opportunity of presenting at this meeting the progress.
Preprints and literature provenance in Europe PMC
Preprints and literature provenance in Europe PMC
Presentation transcript:

Applying Royal Society of Chemistry Cheminformatics Skills to Support the PharmaSea Project Antony Williams, Alexey Pshenichnov, Valery Tkachenko, Ken Karapetyan, David Sharpe ACS San Francisco August 2014

Cancer Deaths Worldwide

Top Treatments for Cancer

Importance of Natural Products Over half of all drugs introduced between 1940 and 2006 were of natural origin or inspired by natural compounds

Natural Products for all of us!

We Are Doomed I Tell You!!!

We Are Doomed I Tell You!!!

The Dangers of Algal Blooms!

Nature’s Little Pharmacy

We Are Doomed I Tell You!!!

Antibiotic resistance

Discovery Curve Decay

RSC and Natural Products

Focus on Marine Natural Products RSC cheminformatics support to include: Deliver “PharmaSea website” Provide access to natural products subset Develop “dereplication techniques” Searching NMR features against database Develop advanced searches for MS data Host Open Data from the PharmaSea project and make available to the community

http://www.pharma-sea.eu/

The PharmaSea Website RSC is open-sourcing a chemical registry system as a result of Open PHACTS Chemical Registry system used to underpin the PharmaSea website – behind login Will be enhanced with data deposition capabilities and “dereplication”

The PharmaSea Website

The PharmaSea Website

The PharmaSea Website

New Repository Architecture doi: 10.1007/s10822-014-9784-5

New Repository Architecture

Compounds

Reactions

Analytical data

Crystallography data

Deposition of Data

Extending PharmaSea Site PharmaSea website will be extended Spectral data handling: Support Dereplication

Identifying novel compounds Compounds are collected from the ocean Extraction via chromatography Analytical sciences including: UV-Vis data (Lambda-max) Mass spectrometry (formula/mass) NMR spectroscopy (HNMR/2D) Utilized for dereplication,,,

Is this already known or not??

Identifying novel compounds 4 Me singlets 4 Me doublets 1 OMe singlet Aromatic protons

Identifying novel compounds 2D NMR data will give details regarding substitutions and this information can be used in the dereplication process

What we need is… If we could have: A DB containing known marine natural products This would give formula and mass for searching The DB has all spectral data available for each compound If experimental data are not available then use the compound to COMPUTE spectral features

RSC Acquires Marinlit All Marinlit chemical compounds in ChemSpider Marinlit developers are dereplication experts

Structure searchable database Index literature related to marine natural products: 26K articles and growing Structure searchable database Data includes taxonomy, location and literature “Spectral features” generated algorithmically Utilize the spectral features for dereplication MarinLit is ‘article-centric’ and not compound centric. Compounds are only indexed when they are newly discovered, revised, or new to marine. All compound records link to the paper they were first mentioned. They are not linked to subsequent articles that describe them.

PharmaSea Dereplication Work in progress: Produce “dereplication widget” to embed in the PharmaSea website Generate “structure features” file for every new compound deposited to PharmaSea Ideal would be to utilize spectral data directly to elucidate structures – “Computer Assisted Structure Elucidation”. ACD/Labs….

CASE-based Elucidation Computers can elucidate structures today with greater efficiency and success than many scientists – see Patrick Wheeler’s talk Natural products specifically can be very challenging and CASE is well-proven ACD/Labs have delivered their CASE-system (ACD/Structure Eludicator) to the project

1D & 2D NMR Synchronized Processing The Software displays correlations for assigned spectra and structures, and highlights correlations that are likely to be erroneous.

ChemSpider supporting CASE RSC delivered entire ChemSpider structure dataset for inclusion into the Structure Elucidator software.

CASE vs Microscopy? DOI: 10.1002/anie.201203960

Single Molecule AFM

CASE vs Microscopy? DOI: 10.1002/anie.201203960

Next:Tagging Natural Products

Next:Tagging Natural Products

Next:Tagging Natural Products

Next:Tagging Natural Products

Future Plans Roll out tagging on ChemSpider to crowdsource marine natural products subset Implement tagging for further details onto PharmaSea website Collaborate with other natural product sources Mass spectrometry fragmentation prediction

Future Plans – MS Fragmenter

Future Plans – MS Fragmenter

Future Plans

To be published: 2015 (RSC) Modern NMR Approaches To The Structure Elucidation of Natural Products Volume 1: Instrumentation and Software Volume 2: Data Acquisition and Applications to Compound Classes Edited by Antony Williams, RSC, Gary Martin, Merck and David Rovnyak, Bucknell University

To be published: 2015 (Springer) Computer-based Structure Elucidation from Spectral Data Will include a functional demo version of the ACD/Structure Elucidator software to teach the basic approaches to computer-assisted structure elucidation Authored by Mikhail Elyashberg, Kirill Blinov and Antony Williams

Acknowledgments Alexey Pshenichnov, Ken Karapapetyan and Valery Tkachenko (RSC – US Cheminformatics) Marcel Jaspars (University of Aberdeen) John Blunt and Murray Munro (Marinlit) Serin Dabb (RSC, Marinlit) Patrick Wheeler and David Hardy (ACD/Labs)

Thank you Email: williamsa@rsc.org ORCID: 0000-0002-2668-4821 Twitter: @ChemConnector Personal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams 57