The CompTox Chemistry Dashboard: an informational data hub at the

Slides:



Advertisements
Similar presentations
IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Advertisements

Managing References : Mendeley
EndNote Web Reference Management Software (module 5.1)
EndNote Web Reference Management Software (module 5)
Perspectives from EPA’s Endocrine Disruptor Screening Program
UNEP Advisory Group Meeting Geneva, Switzerland December 12, 2014
Reference Management Software Tools Mendeley. Table of Contents: Part A Background/Location Signup/Login Import References Organize (Manage) References.
1 High Production Volume (HPV) Challenge Program Diane Sheridan Chief, Existing Chemicals Branch, Chemical Control Division, Office of Pollution Prevention.
Key Considerations for Report Generation & Customization Richard Wzorek Director, Production IT Confidential © Almac Group 2012.
Single Search By Rakphao Theppan, librarian Searching Online Resources.
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features INIS Training Seminar 7-11 October 2013, Vienna Domenico.
Office of Research and Development National Center for Computational Toxicology April 6, 2010 Exposure-Based Chemical Prioritization Workshop: Exploring.
1 Do More Searching in Less Time Fall Term 2010 Helen B. Josephine
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Managing references : Mendeley
Searching the Scientific Literature Douglas A. Loy.
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
1 Chuck Koscher, CrossRef New Developments Relating to Linking Metadata Metadata Practices on the Cutting Edge May 20, 2004 Chuck Koscher Technology Director,
Databases and Library Catalogs Global Index Medicus/Global Health Library PubMed Source Bibliographic Database: International Health and Disability.
AIRNow Web Services Data to Go! Prepared by Steven A. Ludewig, Timothy S. Dye Sonoma Technology, Inc. Petaluma, CA John E. White U.S. Environmental Protection.
AMBIT Chemoinformatics Software for Data Management Joanna Jaworska Nina Jeliazkova P&G Brussels, Ideaconsult Ltd., Belgium Bulgaria.
AMBIT Chemoinformatics Software for Data Management Joanna Jaworska Nina Jeliazkova P&G Brussels, Ideaconsult Ltd., Belgium Bulgaria.
ChemModLab: A Web-based Cheminformatics Modeling Laboratory S. Stanley Young + ECCR and ChemSpider Teams.
Marrying ACD/Labs technologies to eScience Projects at the Royal Society of Chemistry Antony Williams ACD/Labs User Meeting June 2013.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Sharon M. Jordan Assistant Director for Program Integration U.S. DOE Office of Scientific & Technical Information Vantage Point: Government R&D Results.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Shawn Jones INDUS Corporation January 18, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2029.
1 Do More Searching in Less Time Winter Term 2013 Helen B. Josephine
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features The Role of the International Nuclear Information System.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
Office of Research and Development Photo image area measures 1.5” H x 7” and can be masked by a collage strip of one, two or three images. The photo image.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
Chemicals Policy and Health (CP&H) Introduction to Ecetoc TRA GPS Risk Assessment and REACH/GHS implementation in practice Leo Heezen Cefic 30 – 31 May.
GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
Glencoe Introduction to Multimedia Chapter 2 Multimedia Online 1 Internet A huge network that connects computers all over the world. Show Definition.
THE NCSU LIBRARIES the gateway to knowledge for the North Carolina State University community and partners.
Searching the Scientific Literature Douglas A. Loy.
Introduction to PubChem BioAssay
Bibliography and reference manager programs (EndNote, Mendeley, Zotero) 2015 Attila Skulteti
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Who is NCCT? National Center for Computational Toxicology – part of EPA’s Office of Research and Development Research driven by EPA’s Chemical Safety for.
The KNIME workflow for automated processing of PHYSPROP data
Kamel Mansouri Chris Grulke Richard Judson Antony Williams
US EPA’s CompTox Chemistry Dashboard
Contents Module 6: E-journal, E-books and Internet Resources
Evaluation of NCI Research Resources
Research Organisation Subgroup June 1, 2017
Bibliographic data management with RefWorks for beginners
Five years of helping chemists to create an online presence using freely available resources Antony Williams National.
Comments on ASFA Input Helen Wibley, FAO 2016 ASFA Advisory Board Meeting – Hanoi, Viet Nam.
Bibliography and reference manager programs, Endnote 2018 Attila Skulteti
S-121 Maritime Limits and Boundaries
Development of TracMyAir Smartphone App for Predicting Exposures to Ambient PM2.5 and Ozone Michael Breen,1 Yadong Xu,1 Catherine Seppanen,2 Sarav Arunachalam,2.
Overview of open resources to support automated structure verification
EndNote by: fatimah alotaibi.
Mobilizing EPA’s CompTox Chemistry Dashboard Data on Mobile Devices
CICC Combines Grid Computing with Chemical Informatics
Bibliography and reference manager programs, Endnote 2018 Attila Skulteti
Connected Vehicle Reference Implementation Architecture (CVRIA)
ISI Web of Knowledge update: April 2009
Overview of Oracle Site Hub
Reference Management Software Tools Mendeley (Part A)
Beyond Science and Decisions: Problem Formulation to Dose Response
OCLC, WorldCat and Connexion
Searching tools for each program
Data compilation and pre-validation
Citation databases and social networks for researchers: measuring research impact and disseminating results - exercise Elisavet Koutzamani
Search for Article Citation
Presentation transcript:

The CompTox Chemistry Dashboard: an informational data hub at the National Center for Computational Toxicology 1Antony Williams*, 1Chris Grulke, 1Jennifer Smith, 2Kamel Mansouri, 2Andrew McEachran, 1Grace Patlewicz, 2Jeremy Fitzpatrick, 1Ann Richard, and 1Jeff Edwards 1U.S. EPA, National Center for Computational Toxicology (NCCT), Research Triangle Park, NC, 2Oak Ridge Institute for Science and Education (ORISE) Participant, Research Triangle Park, NC, ACS Meeting, San Francisco April 2-6, 2017 ORCID: 0000-0002-2668-4821 Antony Williams l williams.antony@epa.gov l 919-541-1033 Problem Definition and Goals Accessing ~10 Million Predicted Properties Online The CompTox Chemistry Dashboard Problem: There is limited access online to freely available data to support computational toxicology in environmental science. Goals: To deliver online access via a simple to use web-based interface supporting diverse types of data associated with environmental chemistry, and specifically computational toxicology. To develop predictive models from the data and use these models to predict properties for the ca. 750,000 chemicals within the database and make the predicted values available. To provide details regarding the performance of the models. To make the data available as downloadable Open data. The dashboard provides access to ~750,000 chemicals from EPA’s DSSTox database [1]. It integrates curated experimental data [2] used to produce our “OPERA” models. All chemicals were passed through the prediction models and detailed model reports showing global and local applicability domains and nearest neighbor results are displayed in the application. The QSAR Modeling Report Formats (QMRF) for each model are available for each predicted endpoint. The landing page of the dashboard is a simple text entry box allowing a type-ahead search for systematic, trade and trivial names, CAS Registry Numbers and InChIs. Dashboard Entry Page Where possible, links are provided to related Wikipedia articles. An associated mol file is available for download to the desktop, and a summary report containing record data can be provided as a PDF file. Abstract The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program integrates biology, chemistry, and computer science to help prioritize chemicals for further research based on potential human health risks. This work involves computational and data driven approaches that integrate chemistry, exposure and biological data. Much has been learned from the development of a disparate suite of software applications and recent work has focused on the integration of the various data sources into a new software architecture. This architecture is intended to reduce the learning curve for multiple applications, uses curated data sources to improve data integration and recall, and ultimately delivers better data in a more consumable form for both the user visiting a website and to computers visiting web services. The resulting application is the CompTox Chemistry dashboard. This application provides access to ~750,000 chemicals and associated experimental and predicted properties, high-throughput screening data from the ToxCast project, and product and functional use data. Flexible searching supports simple chemical identifier look-up based on chemical name and CAS registry number (CASRN) and structure identification is feasible using mass and formula based searching to support mass spectrometrists performing non-targeted analysis. Batch-based searching provides the user with the ability to look up large collections of chemical data using inputs based on name, CASRN, InChI keys and other identifiers and to export associated information in a series of standard file formats. The CompTox Chemistry Dashboard architecture and development approach has delivered a foundation on which to build new applications for use within the Agency and for use by the research community. This poster reviews the available types of data and the present capabilities and functionality of the CompTox Chemistry Dashboard. Chemical Record Page: Atrazine A summary of available chemical properties for Bisphenol A The model report for the melting point prediction for Bisphenol A – including nearest neighbors. For records with chemical structure representations, various inherent properties (e.g. formula and mass) and predicted physicochemical properties (logP, water solubility etc.) are provided. Future Work Continue to expand the data in terms of chemicals, toxicity data, additional experimental data Release NCCT models as interactive online prediction tools in the near future via the dashboard. Integrate the suite of EPA T.E.S.T3 physicochemical and toxicity prediction models to expand the collection of available models. Add additional functionality supporting the display of bioassay data. Chemical Properties Panel The Toxicity Values tab provides access to data assembled from a series of public resources including EPA data (i.e. IRIS and PPRTV reports, ToxRef DB). Data can be downloaded as TSV and Excel files. References EPA Distributed Structure-Searchable Toxicity (DSSTox) Database, http://www.epa.gov/chemical-research/distributed-structure-searchable-toxicity-dsstox-database Mansouri et al. An automated curation procedure for addressing chemical errors and inconsistencies in public datasets used in QSAR modelling, SAR QSAR Environ Res. 2016 Nov;27(11):939-965. EPA Toxicity Estimation Software Tool (T.E.S.T.) software http://www.epa.gov/chemical-research/toxicity-estimation-software-tool-test Toxicity Values Panel Literature searching using integration to a series of online resources can be performed using the CASRN and chemical name. This includes Google Scholar, PubMed and PubChem patents. Future Work The authors would like to acknowledge specific colleagues within our center for their contributions to the development of the dashboard: Nancy Baker, Richard Judson, Sean Watford and John Wambaugh. Literature: Pubmed Abstract Sifter This presentation does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.