Marrying ACD/Labs technologies to eScience Projects at the Royal Society of Chemistry Antony Williams ACD/Labs User Meeting June 2013.

Presentation transcript:

RSC eScience
Royal Society of Chemistry is a member society (>47,000), Publisher and Innovator in eScience
Host of many online databases and services
– ChemSpider, SyntheticPages, SpectraSchool,…
Participant in multiple grant-based projects
– National Chemical Database Service
– Open PHACTS
– PharmaSea

Multiple ACD/Labs Tools in use…
Structure “checking” routines for data
Nomenclature generation and conversion
Physicochemical prediction algorithms
Web-based spectral display widget
“Interactive Lab” web-based prediction tools
But first an intro to ChemSpider…

ChemSpider 28 million chemicals with associated data…

I want to know about “Vincristine”

Vincristine: Identifiers and Properties

Predicted Properties

Vincristine: Vendors and Sources Linked by Structure

Vincristine: Patents

Google Patents

Vincristine: Articles Linked by Name

RSC Databases

RSC Database Linkthrough

Spectra

Where do data come from?
ChemSpider users deposit data
Some contributions from NIST
Chemical vendors are starting to provide data. Synthonix are one of our major contributors

Crowdsourced “Annotations”
Users can add
– Compounds
– Descriptions/Syntheses/Commentaries
– Links to articles via DOIs
– Spectral data
– Crystallographic Information Files
– Photos
– MP3 files
– Videos

Crowdsourced Curation
Crowdsourced curation: identify and tag errors, edit names and synonyms, identify records to deprecate

Spectral Uploading Locate the structure of interest and deposit spectrum

Spectral Uploading Various types of NMR spectra supported

Regular Updates

Multiple Spectra for One Structure

ChemSpider ID H1 NMR

ChemSpider ID C13 NMR

ChemSpider ID HHCOSY

ChemSpider ID HSQC

ChemSpider ID HMBC

Available Spectra

Number of Spectra
IR 5389
HNMR 1679
CNMR 1207
UV-Vis 183
EI 90
2D1H13CD 68
Raman 51
NIR 32
2D1H1HCOSY 21
2D1H13CLR 10
CI+ve 8
PNMR
spectra against 6890 compounds

Some usage statistics
ca. 200 visitors at any one time, ~30,000 visits per day
Mar 4-Apr 3, 2013
– Visits = 731,656
– Unique Visitors = 527,008
Independent servers to support other projects
Does not include web service calls

ChemSpider as a Foundation
ChemSpider is a foundation for projects:
– >500 data sources aggregated and mapped
– Continually curated and updated with new data
– Normalized data around a structure-centric data model
– Providing an API allows integration to support other internal projects
– Providing API access outside RSC extends the reach
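One way to picture a structure-centric data model is a single record per chemical structure, with every data source attaching its contribution to that record. The following is a minimal sketch under stated assumptions, not ChemSpider's actual schema: the class names, the structure key format, the source names and the example values are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class StructureRecord:
    """Everything known about one structure (hypothetical schema)."""
    structure_key: str                                   # assumed unique per structure
    names: set = field(default_factory=set)              # synonyms from all sources
    data_by_source: dict = field(default_factory=dict)   # source name -> data dict


class StructureStore:
    """Aggregates depositions from many sources around structure keys."""

    def __init__(self):
        self._records = {}

    def deposit(self, structure_key, source, name=None, **data):
        # Reuse the record if this structure is already known, else create it.
        record = self._records.setdefault(structure_key, StructureRecord(structure_key))
        if name is not None:
            record.names.add(name)
        record.data_by_source.setdefault(source, {}).update(data)
        return record

    def lookup(self, structure_key):
        return self._records.get(structure_key)


# Placeholder key and values, for illustration only.
store = StructureStore()
store.deposit("EXAMPLE-KEY-0001", "vendor_catalogue", name="compound X", purity_percent=98.0)
store.deposit("EXAMPLE-KEY-0001", "user_deposition", name="Compound-X", spectrum_type="HNMR")
record = store.lookup("EXAMPLE-KEY-0001")
print(sorted(record.names), sorted(record.data_by_source))
```

Both depositions land on the same record because they share a structure key, which is the point of normalizing around the structure rather than around the source or the name.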

Micropublishing Syntheses

ChemSpider SyntheticPages

Olympicene

Web Services Example: Spectral Data

Spectral Game

Increasing Complexity

SpectralGame in the hand

SpectraSchool

SpectraSchool

Recently Added – THANKS ACD/Labs!
Storage and display of ASSIGNED spectra

Access ChemSpider APIs – Programmatic access used by Mobile Apps, Funded Consortia projects, many Academic groups
Widgets – UI components for embedding in other websites
Data – Data access, downloads, reuse, licensing

Flexible ChemSpider API

Linking Names to Structures

It is so difficult to navigate…
What’s the structure?
Are they in our file?
What’s similar?
What’s the target?
Pharmacology data?
Known Pathways?
Working On Now?
Connections to disease?
Expressed in right cell type?
Competitors?
IP?

3-year Innovative Medicines Initiative project
Integrating chemistry and biology data using semantic web technologies
Open source code, open data and open standards
Academics, Pharma companies, Publishers

ChemSpider Contributions
The host of the chemistry services
– Supplier of “standardized” chemical data files
– Chemistry searching (structure, substructure etc)
– Curator and data quality checking
Presently rolling out the Open PHACTS chemical registration system

FP7 Initiative. PharmaSea: increasing value and flow in the marine biodiscovery pipeline
Improve the quality, volume and value of active agents discovered in the marine environment and increase the speed at which they can be delivered

PharmaSea Dereplication via ChemSpider
Hosting of natural products datasets
Integrated storage of analytical data (ACD/Labs)
Analytical data algorithms & integration
– Mass spec searching
– predicted fragmentation
– NMR feature searching
– NMR prediction
– Computer-assisted structure elucidation
Integration to ACD/Structure Elucidator
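Dereplication by mass can be illustrated with a simple lookup: compare an observed neutral monoisotopic mass against the theoretical masses of known compounds and keep those within a parts-per-million tolerance. This is a minimal sketch only, not the PharmaSea, ChemSpider or ACD/Labs implementation. The function names, the three-compound candidate list, the observed mass and the 5 ppm default are assumptions for illustration, and adducts and predicted fragmentation are not handled.

```python
# Monoisotopic masses of the most abundant isotopes, in daltons.
MONOISOTOPIC_MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}


def monoisotopic_mass(composition):
    """Sum element masses for a composition such as {"C": 8, "H": 10}."""
    return sum(MONOISOTOPIC_MASS[element] * count for element, count in composition.items())


def ppm_error(observed, theoretical):
    """Signed mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def dereplicate(observed_mass, candidates, tolerance_ppm=5.0):
    """Return (name, ppm error) pairs within tolerance, closest match first."""
    hits = []
    for name, composition in candidates.items():
        error = ppm_error(observed_mass, monoisotopic_mass(composition))
        if abs(error) <= tolerance_ppm:
            hits.append((name, error))
    return sorted(hits, key=lambda hit: abs(hit[1]))


# Tiny stand-in for a hosted natural products dataset (illustrative only).
CANDIDATES = {
    "caffeine": {"C": 8, "H": 10, "N": 4, "O": 2},
    "theobromine": {"C": 7, "H": 8, "N": 4, "O": 2},
    "glucose": {"C": 6, "H": 12, "O": 6},
}

hits = dereplicate(194.0805, CANDIDATES)  # hypothetical observed neutral mass
for name, error in hits:
    print(f"{name}: {error:+.2f} ppm")
```

A mass match alone only narrows the candidate list, which is why the slide pairs it with NMR searching, NMR prediction and structure elucidation.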

UK Chemical Database Service

Ilab Integration – NMR DB Searching

Ilab Integration – NMR Prediction

National Chemistry Data Repository
Imagine all chemistry-related data from all academic projects in the UK in ONE system
Security model for the data to be embargoed, private or public (available to the entire world!)
Provide tools for easy data upload, review, automated validation
– chemicals, reactions, spectral data, alphanumeric data
Use the data for algorithm training…
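The three visibility levels named on the slide (embargoed, private, public) could be modelled along the following lines. This is a hedged sketch, not the repository's design: the class and function names are hypothetical, and the rule that embargoed data becomes visible to everyone once its embargo date passes is an assumption.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum
from typing import Optional


class Visibility(Enum):
    PRIVATE = "private"
    EMBARGOED = "embargoed"
    PUBLIC = "public"


@dataclass
class Deposition:
    owner: str
    visibility: Visibility
    embargo_until: Optional[date] = None  # only meaningful when EMBARGOED


def can_view(deposition, user, today):
    """Decide whether `user` (None for an anonymous visitor) may see the data."""
    if user == deposition.owner:
        return True  # owners always see their own data
    if deposition.visibility is Visibility.PUBLIC:
        return True
    if deposition.visibility is Visibility.EMBARGOED and deposition.embargo_until is not None:
        # Assumption: an expired embargo makes the data public.
        return today >= deposition.embargo_until
    return False


embargoed = Deposition("alice", Visibility.EMBARGOED, embargo_until=date(2014, 1, 1))
print(can_view(embargoed, "bob", date(2013, 6, 1)))  # before the embargo ends
print(can_view(embargoed, "bob", date(2014, 1, 1)))  # on the embargo date
```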

In Discussions At Present
Develop the world’s largest online spectroscopy database of integrated data
Does ACD/Labs have tools to help?
– Automated depositions – Silent Automation
– Processing and validation – Spectrus
– Databasing – Spectrus DB
– Web-based integration into ChemSpider

Where else can we get RICH data?

DERA: Data Enable the RSC Archive
How much data is in the archive, in the publications and in the supplementary info?
– How many compounds for ChemSpider?
– How many syntheses for ChemSpider reactions?
– How many characterization measurements?
Property Data
Spectral Data
Graphs and charts to be used for modeling?

What if we could capture it all?

The Future of Data
In Publications
– Interactive plots, spectra, buy that compound, predict that property
– Validation of data going INTO publications
– NMR prediction, CASE validation, PhysProp comparisons
From the lab
– How much data NEVER gets published and is still useful? Failed Reactions?
More Open Data…

Acknowledgements
RSC eScience Team
ACD/Labs – Pranas Japartas and Karim Kassam
GGA – Indigo Toolkit and Bingo Cartridge
The community of depositors
The Open Source Community

Thank you Personal Blog: SLIDES: