Download presentation
Presentation is loading. Please wait.
Published byGeoffrey Cobb Modified over 6 years ago
1
QSAR Application Toolbox: Second steps: Predicting aquatic toxicity by trend analysis
2
Background This is a step-by-step presentation designed to take the first time user of the Toolbox through the workflow of a data filling exercise.
3
Objectives This presentation demonstrates a number of functionalities of the Toolbox : Identify analogues for a target chemical, Retrieve experimental results available for those analogues, and Fill data gaps by trend-analysis.
4
Specific Aims To introduce the first time user to the work flow of the Toolbox, To familiarize the first time user with the six modules of the Toolbox, To familiarize the first time user with the basic functionalities within each module, To explain to the first time user the rationale behind each step of the exercise.
5
The Exercise In this exercise we will predict the acute toxicity to daphnids for an untested compound, (3-ethyl-5-methyl-4-methoxyphenol), which will be the “target” chemical. This prediction will be accomplished by collecting a small set of test data for chemicals considered to be in the same category as the target molecule. The category will be defined using the following categorization schemes: - EcoSAR – for structural grouping - OASIS acute aquatic MOA – mechanistic grouping
6
Trend analysis For a given (eco)toxicological endpoint, the members of a category are often related by a trend (e.g. increasing, decreasing or constant). The trend could be related to molecular mass, carbon chain length, or to some other physicochemical property. A demonstration of consistent trends in the behaviour of a group of chemicals is one of the desirable attributes of a chemical category and one of the indicators that a common mechanism for all chemicals is involved. When some chemicals in a category have measured values and a consistent trend is observed, missing values can be estimated by simple scaling from the measured values to fill in the data gaps.
7
Tracks After opening the Toolbox, the user has to choose between three use tracks (or workflows): (Q)SAR Track Category Track Flexible Track For a first time users of the Toolbox, it is recommended to select the Category Track. As the user becomes more familiar with the functionalities of the Toolbox, it is recommended to use the Flexible Track.
8
Workflow Each track follows the same general six-step workflow:
Chemical Input Profiling Endpoints Category Definition Filling Data Gaps Reporting
9
Tracks and Workflow
10
Chemical Input Click on the “Category Track”.
This takes you to the first module, which is “Chemical input”. This module provides the user with several means of entering the chemical of interest or the target chemical. Since all subsequent functions are based on chemical structure, the goal here is to make sure the molecular structure assigned is the correct one.
11
Chemical Input Screen
12
Ways of Entering a Chemical
There are several ways to enter a target chemical. Most often used are: Chemical Abstract Services (CAS) number (#), SMILES (simplified molecular information line entry system) notation, and Drawing the structure using the structural editor included in the Toolbox. Click on Drawing.
13
Drawing target chemical 3-ethyl-5-methyl-4-methoxyphenol
14
Target Chemical The Toolbox now searches the Toolbox databases and inventories for the presence of the drawn chemical structure. It is displayed as a 2D image. Click on the box next to “Substance Information”; this displays the chemical identification information. Note that no CAS number or name is displayed for this chemical. It means that it is not listed in the chemical inventories implemented in the Toolbox. The workflow on the first module is now complete, click “Next”.
15
Chemical Identification Information
16
Profiling “Profiling” refers to the electronic process of retrieving relevant information on the target compound, other than environmental fate, ecotoxicity, and toxicity data, which are stored in the Toolbox database. Available information includes likely mechanism(s) of action.
17
Side-Bar to Profiling Detailed explanations of the different profilers are provided in the Manual accompanying the Toolbox. In addition, for most of the profilers, background information can be retrieved by highlighting one of the profilers (for example OASIS acute aquatic MOA, Protein Binding ) and clicking on the button “Show Category Boundaries’’.
18
Highlight “Protein binding“ and Click on ‘’Show Category Boundaries’’
19
Insert Window Appears with the Explanation of Protein binding category
20
Profiling Target Chemical
Select the “Profiling methods” you wish to use by clicking on the box before the name of the profiler. This selects (a red check mark appears) or deselects(red check disappears) profilers. For this example check the following mechanistic categorization methods which are relevant for aquatic toxicity: - EcoSAR – for structural grouping - OASIS acute aquatic MOA – mechanistic grouping - Protein binding – mechanistic grouping - Verhaar categorization – grouping by reactivity Click on “Apply” (see next slide).
21
Profilers for 3-ethyl-5-methyl-4-methoxyphenol
22
Profiling The actual profiling will take several seconds depending on the number and type of profilers selected. The results of profiling automatically appear as a dropdown box under the target chemical (see next slide). Please note the specific profiling results by ECOSAR Classification and OASIS Acute Toxicity MOA. These results will be used to search for suitable analogues in the next steps of the exercise. Click on “Next” to move to the module “Endpoints”.
23
Profiles of 3-ethyl-5-methyl-4-methoxyphenol
24
Endpoints “Endpoints” refer to the electronic process of retrieving the environmental fate, ecotoxicity and toxicity data that are stored in the Toolbox database. Data gathering can be executed in a global fashion (i.e., collecting all data of all endpoints) or on a more narrowly defined basis (e.g., collecting data for a single or limited number of endpoints).
25
This Example In this example, we limit our data gathering to common aquatic toxicity endpoints from databases containing aquatic toxicity data (ECETOX, ECOTOX and OASIS Aquatic). Click on the box with a list of database names and select them. Click on “Gather data” (see next slide).
26
Data Gathering
27
Next Step in Data Gathering
Toxicity information on the target chemical is electronically collected from the selected datasets. In this example, an insert window appears stating there was “no data found” for the target chemical (see next slide). Close the insert window.
28
No data for Target Chemical
29
Recap You have entered the target chemical being sure of the correct structure. You have profiled the target chemical and found no experimental data is currently available for this structure. In other words, you have identified a data gap, which you would like to fill. Click on “Next” to move to the category definition module.
30
Category Definition This module provides the user with several means of grouping chemicals into a toxicologically meaningful category that includes the target molecule. This is the critical step in the workflow. Several options are available in the Toolbox to assist the user in defining the category definition.
31
Grouping Methods The different grouping methods allow the user to group chemicals into chemical categories according to different measures of “similarity” so that within a category data gaps can be filled by trend-analysis. For example, starting from a target chemical for which a specific EcoSAR classification is identified, analogues can be found with the same classification and for which experimental results are available.
32
Side Bar to EcoSAR Categorization
EcoSAR has been used by the U.S. Environmental Protection Agency since 1981 to predict the aquatic toxicity of new industrial chemicals in the absence of test data. SARs are developed for chemical classes (e.g. phenols) based on measured test data that have been submitted by industry or they are developed from other data sources. Toxicity values for new chemicals may then be calculated from the resulting regression equation.
33
Side Bar to EcoSAR Categorization
To date, over 150 SARs have been developed for more than 50 chemical classes. These chemical classes range from the very large, e.g., neutral organics, to the very small, e.g., aromatic diazoniums. Some chemical classes have only one SAR, such as acid chlorides, for which only an equation for 96-hour LC50 for fish has been developed. The EcoSAR classification in the Toolbox is used for grouping of chemicals by structural (mechanistic) similarity
34
EcoSAR category definition
To define a category based on an EcoSAR classification: Highlight “EcoSAR classification”. Click on “Defining Category” (see next slide).
35
Defining the Category
36
Confirmation An insert window confirming the EcoSAR phenols classification of the target chemical appears. Click on “OK”.
37
Naming Category Another insert window listing the default category name appears. Click on “OK”.
38
Analogues The Toolbox now identifies all chemicals corresponding to the ECOSAR classification of “phenols” which are listed in the databases selected under “Endpoints”. 559 analogues are identified. These 559 compounds along with the target chemical form a category (Phenols), which can be used for data filling (see next slide).
39
Mechanistic Analogues
40
Recap You have identified a mechanistic category (EcoSAR “phenols” classification) for the target chemical (3-ethyl-5-methyl-4-methoxyphenol). The available experimental results for these 559 chemicals can now be collected from the previously selected databases (ECETOC, ECOTOX and OASIS Aquatic).
41
Next Step in Gathering Data
Highlight the category “[559] Phenols (EcoSAR classification)”. The inserted window entitled “Read Data?” appears (see next slide). Click OK.
42
Reading the Selected Data
43
Reading the Selected Data
Select mode of reading data. Click OK. Due to the overlap between the Toolbox databases same data for intersecting chemicals could be found simultaneously in more than one database. The data redundancy is identified and the user has the opportunity to select either a single data value or all data values.
44
Summary of Aquatic toxicity Information for Analogues
45
Side-Bar on Data Note the structure of the 599 compounds with experimental results are shown. Double clicking on a structure enlarges the view of the structure.
46
Double clicking on a structure enlarges the view of the structure
Side-Bar on Data Double clicking on a structure enlarges the view of the structure
47
Navigating Through the Data Tree
The user can navigate through the data tree by closing or opening the nodes of the tree. In this example, results from aquatic toxicity are available. By double clicking on a cell in the data matrix, additional information on the test result (for example Daphnia magna for LC50 48h) is made available (see next slide).
48
Navigating Through the Data Tree
By double clicking on a cell in the data matrix, additional information on the test result (for example Daphnia magna for LC50 48h) is made available
49
Recap You have now retrieved the available experimental data on aquatic toxicity for 599 chemicals with classified as “phenols” by EcoSAR, which were found in the databases ECETOC,ECOTOX and OASIS Aquatic. You are ready to fill the data gap. Click on “Next” to access the module “Filling data gaps”.
50
The Filling Data Gap Window
Take a moment to examine the filling data matrix on the next slide. Note it contains information on the chemicals, which form the category, the 3 options for data filling, and a means of selecting data points used to fill the data gap (see next slide).
51
The Filling Data Gap Window
52
Filling Data Gaps This step in the work flow provides the user with three options for making an endpoint-specific prediction for the target molecule. As noted earlier, these options, in increasing order of complexity, are by read-across, by trend analysis, and through the use of external QSAR models. In this example we only use trend analysis.
53
Filling Data Gaps
54
Selecting the Data Points
Before applying trend analysis, the Toolbox allows the user to decide which type of results should be used in case more than one result is available for any analogue, (i.e., all values, average values, minimum or maximum results) (see next slide). Select “Average value” It should be noted that averaging results is only useful for quantitative endpoints.
55
Data Point Selection
56
Applying Trend-analysis
Highlight the data endpoint box corresponding to Daphnia magna / LC50 / 48 h under the target chemical. It will be empty as it is the data gap. Next, with the “trend analysis” box highlighted, click “Apply” (see next slide).
57
Apply Trend-analysis
58
Results of Trend-analysis
59
Interpreting the Trend-analysis
The resulting plot outlines the experimental LC50 results of all analogues (Y axis) according to a descriptor (X axis) with Log Kow being the default descriptor (see next slide). The RED dot represents the estimated result for the target chemical. The BLUE dots represent the experimental results available for the analogues. The GREEN dots (see following slides) represent analogues belonging to different subcategories. Before accepting the estimated result for the target chemical, the trend analysis should be further refined by subcategorisation (see following slides)
60
Side-Bar to Subcategorisation
In the Toolbox, a category refers to a group of chemicals which have the same profiling result according to one of the profilers listed in the module “Profiling” Subcategorisation refers to the process of applying additional profilers to the previously defined category, identifying chemicals which have differing profiling results and eventually eliminating these chemicals from the category.
61
Subcategorization The analogues which are dissimilar to the target chemical with respect to: - Substance type (mixtures and hydrolizing chemicals) The categorization based on substance type allows keeping among the analogues only those that are of the same chemical type: discrete chemicals, mixtures, polymers, inorganics, organometalics. The current target is a discrete chemical hence the analogues should also be discrete chemicals. - OASIS Mode of action (all except phenols and anilines) The categorization based on mode of action identifies analogues having the same mode of action as the target which is in the group of phenols and anilines. can be removed from the initial list of analogues previously defined by EcoSAR classification.
62
Subcategorization (1) Click on “Subcategor”
Select “Substance type” from the Grouping methods list. Click on “Remove” to eliminate chemicals with a different substance type.
63
Subcategorization (1) One salt (hydrolyzing chemical) has to be eliminated among the analogues being of a different substance type compared to the target (discrete chemical)
64
Subcategorization (2) Click on “Subcategor.”
Select “OASIS acute aquatic MOA” from the Grouping methods list. Click on “Remove” to eliminate chemicals with a different mode of action.
65
Subcategorization(2)
66
Results
67
Filled Data Gap The remaining chemicals in the graph now all have a consistent profile relevant for aquatic toxicity (i.e. substance type, ECOSAR classification and OASIS Acute Toxicity MOA) By accepting the prediction (click on “Accept”) the data gap is filled (see next slide). Click on “Next” to access the report module.
68
Filled Data Gap
69
Report The final step in the workflow, report, provides the user with a downloadable written audit trail of what the Toolbox did to arrive at the prediction. Click on “Show history”. This study history can be printed or copied to be inserted in a more detailed report (see next slide). Click on “Finish”.
70
Finished
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.