Tox21/ToxCast AR Pathway Model

Slides:



Advertisements
Similar presentations
Perspectives from EPA’s Endocrine Disruptor Screening Program
Advertisements

Office of Research and Development Computational Toxicology and High-Throughput Risk Assessment Richard Judson U.S. EPA, National Center for Computational.
Evaluating Existing in vitro Endocrine Data Jeff Pregenzer, Director of Endocrine Studies, CeeTox.
UNEP Advisory Group Meeting Geneva, Switzerland December 12, 2014
Richard A. Becker, Ph.D., D.A.B.T American Chemistry Council Arlington, Virginia Comments on “Dose Setting” EDMVS Meeting July 23-24, 2002.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Priority-setting for the Endocrine Disruptor Screening Program: Pesticide Active Ingredients Penelope A. Fenner-Crisp Office of Pesticide Programs U.S.
National Pesticide Program A New Toxicology Testing Paradigm: Meeting Common Needs Steven Bradbury, Director Environmental Fate and Effects Division Office.
PROBABILISTIC ASSESSMENT OF THE QSAR APPLICATION DOMAIN Nina Jeliazkova 1, Joanna Jaworska 2 (1) IPP, Bulgarian Academy of Sciences, Sofia, Bulgaria (2)
EDSP Validation Gary E.Timm Senior Technical Advisor Office of Science Coordination and Policy U.S. Environmental Protection Agency.
Office of Research and Development National Center for Computational Toxicology April 6, 2010 Exposure-Based Chemical Prioritization Workshop: Exploring.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
What Do Toxicologists Do?
QSAR Modelling of Carcinogenicity for Regulatory Use in Europe Natalja Fjodorova, Marjana Novič, Marjan Vračko, Marjan Tušar, National institute of Chemistry,
Criteria for Screens— Review of the EDSTAC Recommendations Presentation to the EDMVS July 23, 2002.
Introductory Overview
Status of the U.S. Endocrine Disruptor Screening Program (EDSP) Status of the U.S. Endocrine Disruptor Screening Program (EDSP) September
Office of Pesticide Programs 21st Century Screening Assessment of Pesticides – A Regulatory View Vicki Dellarco, Ph.D. Senior Science Advisor Office of.
Office of Research and Development National Center for Computational Toxicology Keith Houck, PhD ISRPT 2009 Endocrine Workshop 9-10 Sept 2009 What Can.
EDSP’s Approach to Test Protocol Validation Office of Science Coordination and Policy U.S. Environmental Protection Agency.
High Throughput Genotoxicity Profiling of the US EPA ToxCast TM Chemical Library S Little 1, AW Knight 2, L Birrell 2, G Akerman 3, N McCarroll 3, D Dix.
1 Innovative Science To Improve Public Health EDKB: Endocrine Disruptors Knowledge Base at the FDA Huixiao Hong, Ph.D. Center for Bioinformatics Division.
Slide 1 of 24 EPA’s Endocrine Disruptor Screening Program (EDSP) Use of Exposure Data in Priority Setting Bill Wooge Office of Science Coordination and.
The US Tox21 Collaboration: Advances Made and Lessons Learned
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Fine scale air quality modeling using dispersion and CMAQ modeling approaches: An example application in Wilmington, DE Jason Ching NOAA/ARL/ASMD RTP,
A Comparison of Pesticide Environmental Risk Indicators for Agriculture Thomas Greitens Esther Day.
Effort 1 – Voluntary Genomics Data Submission (VGDS) FDA Guidance to Industry: Pharmacogenomics data submission (Draft 2003, final publication 2005) –Invite.
EDSP: T IER 1 T ESTING I NFORMATION C OLLECTION ISRTP 2010 Endocrine Workshop EDSP Compliance December 13, 2010 Susan Ferenc, DVM, Ph.D.
Toxicological Knowledge Base (a Definition) Response Dose “ An in computero aggregated set of the most germane literature citations and biological activity.
ECCR Overview/MLSCN. NIH Roadmap Series of initiatives designed to pursue major opportunities in biomedical research and gaps in current knowledge that.
OpenMolGRID Open Computing GRID for Molecular Science and Engineering OpenMolGRID European Union 5th Framework Programme project
1 State of Play Prioritisation of Substances By modelling Hazard & Exposure Klaus Daginnus Institute for Health & Consumer Protection Joint Research Centre,
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
Using Secondary Data to Evaluate Diverse Groups of Chemical and Nonchemical Stressors in Cumulative Risk Assessment Amanda M. Evans 1 Glenn E. Rice 2,
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
PRACTICES AND INTERVENTIONS FOR PREVENTION OF ALCOHOL USE AMONG YOUNG PEOPLE IN EUROPE Presented by: Anna Markina University of Tartu.
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Contact: Patrick Phillips,
What is PIAAC?.
Who is NCCT? National Center for Computational Toxicology – part of EPA’s Office of Research and Development Research driven by EPA’s Chemical Safety for.
Lipinski’s rule of five
The CompTox Chemistry Dashboard: an informational data hub at the
The KNIME workflow for automated processing of PHYSPROP data
Endocrine Disruptor Screening and Testing: An Industry Update
Kamel Mansouri Chris Grulke Richard Judson Antony Williams
US EPA’s CompTox Chemistry Dashboard
from scientific literature Principal Scientist (Chemoinformatics)
Health Life Horizon Project & HL7 Business Case
ATOM Accelerating Therapeutics for Opportunities in Medicine
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Susan Makris U.S. EPA, Office of Research and Development
Five years of helping chemists to create an online presence using freely available resources Antony Williams National.
Important Points in Drug Design based on Bioinformatics Tools
Overview of the RHIS Rapid Assessment Tool
Daniela B. Friedman, University of South Carolina
Overview of open resources to support automated structure verification
HTS and Mixtures: Lessons Learned
Mobilizing EPA’s CompTox Chemistry Dashboard Data on Mobile Devices
CICC Combines Grid Computing with Chemical Informatics
EU: First- & Second-Generation Immigrants
Benjamin Wooden, Nicolas Goossens, Yujin Hoshida, Scott L. Friedman 
Development of a protocol for identification of reference conditions, and boundaries between high, good and moderate status in lakes and watercourses (REFCOND)
Comments on Using Existing Data for the Endocrine Screening Testing Lorenz Rhomberg, PhD Principal Gradient ISRTP 2009 Endocrine Workshop.
Ovanes Mekenyan, Milen Todorov, Ksenia Gerova
Mario Carere, Ann-Sofie Wernersson, Teresa Lettieri, Robert Kase
Consortium: National networks in 16 European countries.
Consortium: National networks in 16 European countries.
THE FRENCH DIVERSITY CHARTER: A SUCCESSFUL STORY
Presentation transcript:

Tox21/ToxCast AR Pathway Model CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity Kamel Mansouri1, Nicole Kleinstreuer2, Eric Watt1, Jason Harris1 and Richard Judson3 1ORISE Fellow, NCCT, ORD/U.S. EPA, RTP, NC, USA 2NIH/NIEHS/DNTP/NICEATM, RTP, NC, USA 3NCCT, ORD/U.S. EPA, RTP, NC, USA Kamel Mansouri l mansouri.kamel@epa.gov l ORCID: 0000-0002-6426-8036 Abstract Participants Prediction set DTU/food: Technical University of Denmark/ National Food Institute. USA EPA/NCCT: U.S. EPA/ National Center for Computational Toxicology. USA TUM: Technical University Munich. Germany ILS&NIH/NIEHS: ILS Inc & NIH/NIEHS. USA IRCSS: Istituto di Ricerche Farmacologiche “Mario Negri”. Italy MTI. Molecules Theurapetiques in silico. France LockheedMartin&EPA:   Lockheed Martin IS&GS/ High Performance Computing. USA NIH/NCATS: National Institutes of Health/ National Center for Advancing Translational Sciences. USA IdeaConsult. Bulgaria NIH/NCI: National Institutes of Health/ National Cancer Institute. USA UFG. Federal University of Golas. Brazil UMEA/Chemistry: University of UMEA/ Chemistry department. Sweden UNC/MML: University of North Carolina/ Laboratory for Molecular Modeling. USA UniBA/Pharma: University of Bari/ Department of Pharmacy. Italy UNIMIB/Michem: University of Milano-Bicocca/ Milano Chemometrics and QSAR Research Group. Italy VCCLab. Virtual Computational Chemistry Laboratory. Germany NCSU. NC State University, Bioinformatics Research Center. USA EPA/NRMRL. National Risk Management Research Laboratory. USA FDA/NCTR/DBB: U.S. FDA/ National Center for Toxicological Research/Division of Bioinformatics and Biostatistics. USA INSUBRIA. University of Insubria. Environmental Chemistry. Italy Tartu. University of Tartu. Institute of Chemistry. Estonia NIH/NTP/NICEATM. USA Chemistry Institute. Lab of Chemometrics. Slovenia SWETOX. Swedish toxicology research center. Sweden LZU: Lanzhou University. China BDS. Biodetection Systems. Netherlands IBMC. Institute of Biomedical Chemistry. Russia UNIMORE. University of Modena Reggio-Emilia. Italy MSU. Moscow State University. Russia ZJU. Zhejiang University. China JKU. Johannes Kepler University. Austria CTIS. Centre de Traitement de l'Information Scientifique. France ECUST. East China University of Science and Technology. China UNISTRA/Infochim: University of Strasbourg/ ChemoInformatique. France The list of chemicals to be predicted and prioritized for AR activity: Background: In order to protect human health from chemicals that can mimic natural hormones, the U. S. Congress mandated the U.S. EPA to screen chemicals for their potential to be endocrine disruptors through the Endocrine Disruptor Screening Program (EDSP). However, the number of chemicals to which humans are exposed is too large (tens of thousands) to be accommodated by the EDSP Tier 1 battery Objectives: To solve this, combinations of in vitro high-throughput screening (HTS) assays and computational models are being developed to help prioritize chemicals for more detailed testing. Methods: Previously, CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) demonstrated the effectiveness of combining many QSAR models trained on HTS data to prioritize a large chemical list for estrogen receptor activity. The limitations of single models were overcome by combining all models built by the consortium into consensus predictions. CoMPARA is a larger scale collaboration between 35 international groups, following the steps of CERAPP to model androgen receptor activity using a common training set of 1746 compounds provided by U.S. EPA. Eleven HTS ToxCast/Tox21 in vitro assays were integrated into a computational network model to detect true AR activity. Bootstrap uncertainty quantification was used to remove potential false positives/negatives. Reference chemicals (158) from the literature were used to validate the model. Results: The model combining ToxCat/Tox21 assays showed 95.2% and 97.5% balanced accuracies for AR agonists and antagonists respectively. The resulting data was used to build qualitative and quantitative models and a consensus combining the different structure-based and QSAR modeling approaches. Then, a library of ~80k chemical structure, including ~11k chemicals curated from PubChem literature data using ScrubChem tools was integrated with CoMPARA’s consensus predictions. Conclusion: The results of this project will be used to prioritize a large set of more than 50k chemicals for further testing over the next phases of ToxCast/Tox21, among other projects. CERAPP list: 32,464 unique QSAR-ready structures (standardized, organic, no mixtures…) EDSP Universe (10K) Chemicals with known use (40K) (CPCat & ACToR) Canadian Domestic Substances List (DSL) (23K) EPA DSSTox (version 1)– structures of EPA/FDA interest (15K) ToxCast and Tox21 (In vitro ER data) (8K) EINECS: European INventory of Existing Commercial chemical Substances ~60k structures ~55k QSAR-ready structures ~38k non overlapping with the CERAPP list ~18k overlap with DSSTox 29,904 + 17,984 = 47,888 unique standardized QSAR-ready structures Evaluation set The tool ScrubChem, a curated version of PubChem Bioassay, was used to build datasets from public data. Starting from ~80K bioactivities & 11K chemicals, results were grouped by target, chemical, modality, and outcome in order to derive hit calls summarizing results from different experiments. Hit calls were further filtered for quality by the number of pieces of evidence (n) and ratio of agreement between those values. The effect of a quality filter on the size of a dataset is shown in the example to the right. This list will then be filtered and used as an evaluation set for the built models. Project planning Evidence (n) Steps Tasks 1: Training and prioritization sets NCCT/ EPA - ToxCast assays for training set data - AUC values and discrete classes for continuous/classification modeling - QSAR-ready training set and prioritization set 2: Experimental validation set - Collect and clean experimental data from the literature - Prepare validation sets for qualitative and quantitative models 3: Modeling & predictions All participants - Train/refine the models based on the training set - Deliver predictions and applicability domains for evaluation 4: Model evaluation - Evaluate the predictions of each model separately - Assign a score for each model based on the evaluation step 5: Consensus modeling - Use the weighting scheme based on the scores to generate the consensus - Use the same validation set to evaluate consensus predictions 6: Manuscript writing - Descriptions of modeling approaches for each individual model - Input of the participants on the draft of the manuscript Training data Tox21/ToxCast AR Pathway Model This model was used to combine the following ToxCast assays and generate AUC scores that were used to train CoMPARA models after removing potential false positives/negatives. e.g., There are 932 chemicals in the selection grid (28 active and 924 inactive) with a hit call derived from 5 or more pieces of evidence (n). Each bin contains its own average for the ratio of agreement on hit calls in that bin. For example, the first bin (n=5) has 499 chemicals which on average have 98.8% agreement in the 5 data points used for their hit calls. Evidence (n) Assay Name Biological Process NVS_NR_hAR receptor binding NVS_NR_cAR NVS_NR_rAR OT_AR_ARSRC1_0480 cofactor recruitment OT_AR_ARSRC1_0960 ATG_AR_TRANS mRNA induction OT_AR_ARELUC_AG_1440 gene expression Tox21_AR_BLA_Agonist_ratio Tox21_AR_LUC_MDAKB2_Agonist Tox21_AR_BLA_Antagonist_ratio Tox21_AR_LUC_MDAKB2_Antagonist Tox21_AR_LUC_MDAKB2_Antagonist* Conclusions This project is prioritizing ~50K chemicals in a fast, accurate, and economic way. Generated high quality data and models that can be reused. Free & open-source code and workflows shared with the community. Data and predictions will be available for visualization on the EDSP dashboard: http://actor.epa.gov/edsp21/ and on the CompTox dashboard: https://comptox.epa.gov/dashboard/ The prioritized lists will help in the selection of chemicals that will be tested in the next phases of ToxCast/Tox21. A joint paper with all participants and multiple satellite papers will be published in peer reviewed journals. 1720 unique structures Agonist: ~50 actives Antagonist: ~160 actives Binding: ~170 actives 2 types of data: Continuous AUC scores Discrete hit calls Disclaimer: The views expressed in this poster are those of the authors and do not necessarily reflect the views or policies of the U.S. Environmental Protection Agency.