TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS Council Directive 76/464/EEC of the European.

Slides:



Advertisements
Similar presentations
C A INTRODUCTION An Environmental Quality Objective (EQO), intended as a real “No Effect Concentration” (NEC), is not accessible experimentally. The usual.
Advertisements

MODELLING OF PHYSICO-CHEMICAL PROPERTIES FOR ORGANIC POLLUTANTS F. Consolaro, P. Gramatica and S. Pozzi QSAR Research Unit, Dept. of Structural and Functional.
Chapter 10 Curve Fitting and Regression Analysis
P M V Subbarao Professor Mechanical Engineering Department
PROBABILISTIC ASSESSMENT OF THE QSAR APPLICATION DOMAIN Nina Jeliazkova 1, Joanna Jaworska 2 (1) IPP, Bulgarian Academy of Sciences, Sofia, Bulgaria (2)
The Mole – A measurement of matter
ABSTRACT The BEAM EU research project focuses on the risk assessment of mixture toxicity. A data set of 124 heterogeneous chemicals of high concern as.
Lectures Molecular Bonding Theories 1) Lewis structures and octet rule
4 Th Iranian chemometrics Workshop (ICW) Zanjan-2004.
Quantitative Structure-Activity Relationships (QSAR) Comparative Molecular Field Analysis (CoMFA) Gijs Schaftenaar.
Bioinformatics IV Quantitative Structure-Activity Relationships (QSAR) and Comparative Molecular Field Analysis (CoMFA) Martin Ott.
CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Section 10.1 Measuring Matter
Quantitative Genetics
Objectives To understand the definition of molar mass
Percent Composition, Empirical Formulas, Molecular Formulas
Chapter 6 Chemical Quantities.
Calibration & Curve Fitting
Objectives of Multiple Regression
QSAR Qualitative Structure-Activity Relationships Can one predict activity (or properties in QSPR) simply on the basis of knowledge of the structure of.
Combining Statistical and Physical Considerations in Deriving Targeted QSPRs Using Very Large Molecular Descriptor Databases Inga Paster and Mordechai.
Non ionic organic pesticide environmental behaviour: ranking and classification F. Consolaro and P. Gramatica QSAR Research Unit, Dept. of Structural and.
Molecular Descriptors
Chapter 10 & 11 Chemical quantities and Chemical Reactions.
(a.k.a: The statistical bare minimum I should take along from STAT 101)
RESULT and DISCUSSION In order to find a relation between the three rate reaction constant (k OH, k NO3 and k O3 ) and the structural features of chemicals,
Surveillance monitoring Operational and investigative monitoring Chemical fate fugacity model QSAR Select substance Are physical data and toxicity information.
Chapter Calculations with Chemical Formulas and Equations Chemistry 1061: Principles of Chemistry I Andy Aspaas, Instructor.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Descriptive Statistics
ABSTRACT Bioconcentration by aquatic biota is an important factor in assessing the environmental behaviour and potential hazard evaluation of a chemical,
CONCLUSIONS CONCLUSIONS - Missing values of the principal physico-chemical properties are predicted by validated regression models by using different kinds.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
The aquatic toxicity values of 57 esters, with experimental and predicted LC50 in fish, EC50 in Daphnia and seaweed and IGC in Entosiphon sulcatum, were.
Paola GRAMATICA a, Paola LORENZINI a, Angela SANTAGOSTINO b and Ezio BOLZACCHINI b a University of Insubria, Dep. of Structural and Functional Biology,
Using the MOLE. Percentage composition Percentage composition is the mass of individual elements in a compound expressed as a percentage of the mass of.
AN APPROACH TO DETERMINE THE APPLICATION DOMAIN OF GROUP CONTRIBUTION MODELS Nina Jeliazkova 1 Joanna Jaworska 2, (2) Central Product Safety, Procter &
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
DESIRABILITY OF POPs ACCORDING TO THEIR ATMOSPHERIC MOBILITY The main goal pursued in this work is the formulation of a POP ranking by atmospheric mobility.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Martin Waldseemüller's World Map of 1507 Zanjan. Roberto Todeschini Viviana Consonni Davide Ballabio Andrea Mauri Alberto Manganaro chemometrics molecular.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
A B S T R A C T The study presents the application of selected chemometric techniques to the pollution monitoring dataset, namely, cluster analysis,
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
ABSTRACT The behavior and fate of chemicals in the environment is strongly influenced by the inherent properties of the compounds themselves, particularly.
P. Gramatica and F. Consolaro QSAR Research Unit, Dept. of Structural and Functional Biology, University of Insubria, Varese, Italy.
QSAR AND CHEMOMETRIC APPROACHES TO THE SCREENING OF POPs FOR ENVIRONMENTAL PERSISTENCE AND LONG RANGE TRANSPORT FOR ENVIRONMENTAL PERSISTENCE AND LONG.
Organic pollutants environmental fate: modeling and prediction of global persistence by molecular descriptors P.Gramatica, F.Consolaro and M.Pavan QSAR.
Log Koc = MW nNO – 0.19 nHA CIC MAXDP Ts s = 0.35 F 6, 134 = MW: molecular weight nNO: number of NO bonds.
Section 6.1 Atoms and Moles 1.To understand the concept of average mass 2.To learn how counting can be done by weighing 3.To understand atomic mass and.
F.Consolaro 1, P.Gramatica 1, H.Walter 2 and R.Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental.
MUTAGENICITY OF AROMATIC AMINES: MODELLING, PREDICTION AND CLASSIFICATION BY MOLECULAR DESCRIPTORS M.Pavan and P.Gramatica QSAR Research Unit, Dept. of.
P. Gramatica 1, H. Walter 2 and R. Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental Research.
Use of Machine Learning in Chemoinformatics
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
The Mole  Just to clear up any misconceptions when we use the term “mole” we are not referring to this small blind fellow.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Molecular Formula Calculations Combustion & Weight Percent C x H y + (x + y/4) O 2  x CO 2 + y/2 H 2 O C 2 H 5 OH + 3 O 2  2 CO H 2 O.
TOPIC 1 STOICHIOMETRIC RELATIONSHIPS 1.2 THE MOLE CONCEPT.
So that k k E 5 = - E 2 = = x J = x J Therefore = E 5 - E 2 = x J Now so 631.
MATH-138 Elementary Statistics
Ch. 7.4 Determining Chemical Formulas
PHYSICO-CHEMICAL PROPERTIES MODELLING FOR ENVIRONMENTAL POLLUTANTS
Hierarchical Classification of Calculated Molecular Descriptors
Prediction of Coal Free-Swelling Index using Mathematical Modelling
Principal Component Analysis (PCA)
P. Gramatica1, F. Consolaro1, M. Vighi2, A. Finizio2 and M. Faust3
Multivariate Statistics
Activity #16- Classification of Matter Chart Make the chart below in your notebook: ATOM ELEMENT COMPOUND MIXTURE MOLECULE.
M.Pavan, P.Gramatica, F.Consolaro, V.Consonni, R.Todeschini
Presentation transcript:

TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS Council Directive 76/464/EEC of the European Communities (EEC 1976a) includes the so-called “List 1 compounds” that are dangerous compounds for aquatic environments, selected mainly on the basis of their toxicity, persistence and bioaccumulation.Thus it is very important to obtain all the information and data relevant to the particular substances in living aquatic organisms. If no data are available to make an appropriate judgement for a specific substance, the substance is considered a candidate for List 1 until such data become available. For many chemicals there is little reliable information detailing their relative toxicity, so the application of molecular descriptors and chemometric methods in Quantitative Structure-Activity Relationships (QSAR)studies is used to predict toxicological data for different aquatic organisms. INTRODUCTION All the toxicity data are expressed in mmol/l and in logarithmic scale as log (1/response). The values used for calculations were selected by prof. Marco Vighi (Dept. of Environmental Sciences, Milano) from among the more reliable data of all the sets available. Selected data were produced with comparable, officially accepted testing methods (e.g. standard OECD or EEC Guidelines). 30 min EC 50 of the light emitted by a photoluminescent bacterium (Photobacterium phosphoreum) obtained by a standard automated method (Microtox). Available for 33 homogeneous molecules. 96 h EC 50 of unicellular chlorophiceans (Selenastrum, Chlorella or comparable species) obtained with standard methods were used. Available for 45 molecules. 48 h EC 50 obtained with standard methods were selected. Available for 94 molecules. 96 h LC 50 obtained with standard methods and produced with Onchorinchus mikiss, Poecilia reticulata or Pimephales promelas were selected. Available for 88 molecules. Bacteria Algae Daphnia Fish The minimum energy conformations of all the compounds were obtained by the molecular mechanics method of Allinger (MM+), using the package HyperChem. All descriptors were calculated from the obtained coordinates using the package WHIM- 3D/QSAR. Principal Component Analysis (PCA) was performed by STATISTICA. The Selection of the best Subset Variables (VSS method) for modelling the toxicity was done by taking a Genetic Algorithm (GA-VSS) approach, where the response is obtained by Ordinary Least Squares regression (OLS), using the package Moby Digs for variable selections. All the calculations were performed using the leave-one-out procedure of cross-validation, maximising the cross-validated R squared (Q 2 ), (Quick rule). To avoid an overestimation of the predictive capability of the models, the leave-more-out procedure (with N cross-validation groups, I.e. a 30% of objects left out at each step) was also performed (Q 2 LMO ). Standard Deviation Error in Prediction (SDEP) and Standard Deviation Error in Calculation (SDEC) are also reported, together with the multiple correlation coefficient (R 2 ). For the obtained models, the leverages approach was performed, with the aim of estimating the reliability of the predicted data and allowing only reliable predicted data to be considered. MOLECULAR DESCRIPTORS The molecule structure has been represented by different set of descriptors: mono-dimensional (count), two- dimensional (graph-invariants) and three-dimensional (3D-WHIM, 3D-Weighted Holistic Invariant Molecular) by the software produced by the Milano Chemometric Research Group of prof. Roberto Todeschini (1). Count descriptors (38) directly encode particular features of molecular structure and are simply obtained from the chemical structural formula of molecules, counting defined elements such as atoms (nAT), bonds (nBT), rings (nCIC), H-bond acceptors (nHA) and H-bond donors (nHD); atom type counts are obtained such as number of hydrogens, carbons, halogens (nH, nC, nX respectively). The second set is constituted by the more frequently used 34 graph-invariants descriptors (topological and information indices). The molecular weight (MW) is always used. For the 3D representation of the molecules, the WHIM descriptors, recently proposed and widely applied by Todeschini and Gramatica (2), have been used: a set constituted by the 33 non-directional WHIM and the 66 directional WHIM. WHIM descriptors are molecular indices that represent different sources of chemical information about the whole 3D-molecular structure in terms of size, shape, symmetry and atom distribution. These indices are calculated from (x,y,z)-coordinates of a 3D-structure of the molecule, usually from a spatial conformation of minimum energy, within different weighting schemes in a straightforward manner and represent a very general approach to describe molecules in a unitary conceptual framework. (1) R. Todeschini, WHIM-3D/QSAR- Software for the calculation of the WHIM descriptors, rel. 4.1 for Windows, Talete srl, Milano (Italy) Download: (2) R. Todeschini and P. Gramatica, 3D-modelling and prediction by WHIM descriptors. Part 5. Theory development and chemical meaning of the WHIM descriptors, Quant. Struct.-Act. Relat., 16 (1997) ; Part 6. Applications in QSAR Studies, same, F. Consolaro and P. Gramatica QSAR Research Unit, Dept. of Structural and Functional Biology, University of Insubria, Varese, Italy. Web-site: EXPERIMENTAL DATA Uniform dimension Principal Component Analyses were performed on all the experimental toxicity data with the aim of highlighting the distribution of the studied compounds. It can be noted that along the first component the compounds are well separated by the global toxicity, while along the second principal component they are separated by their specific toxicity: Toxicity in Bacteria Toxicity in Bacteria (33 objects) Tox = nBO nO WIA P2s Ke R 2 = 89.8% Q 2 LOO = 86.1% Q 2 LMO = 82.0% SDEP = 0.26 SDEC = 0.22 F 5,27 = S = 0.23 nBO: n. of skeleton bonds nO: n. of oxigen atoms WIA: average Wiener index P2s: shape dir-WHIM descriptor Ke: shape glob-WHIM descriptor Toxicity in Algae Toxicity in Algae (45 objects) Tox = nS nOH nNH P1s Tm As R 2 = 70.6% Q 2 LOO = 61.5% Q 2 LMO = 58.1% SDEP = 0.56 SDEC = 0.49 F 6,38 = S = 0.52 nS: n. of sulphur atoms nOH: n. of OH groups nNH2: n. of NH2 groups P1s: shape dir-WHIM descriptors Tm and As: dimensional glob.-WHIM descriptors Toxicity in Daphnia Toxicity in Daphnia (94 objects) Tox = nP nHA IDM E1m R 2 = 84.2% Q 2 LOO = 82.1% Q 2 LMO = 81.7% SDEP = 0.68 SDEC = 0.64 F 4,89 = S = 0.65 nP: n. of phosphorous atoms nHA: n. of h bonds acceptors IDM: mean inf. cont. on the dist. magn. E1m: atom distribution dir-WHIM descriptor Toxicity in Fish Toxicity in Fish (88 objects) Tox = nNO nHD IDM Du De Ds R 2 = 81.5% Q 2 LOO = 78.1% Q 2 LMO = 77.8% SDEP = 0.58 SDEC = 0.53 F 6,81 = S = 0.55 nNO: n. of NO groups nHD: n. of H bonds donors IDM: mean inf. cont. on the dist. magn. Du, De and Ds: atom distribution glob.-WHIM descriptors Bacteria Daphnia Algae Fish PCA on toxicity of bacteria, algae, Daphnia and fish Training set: 15 mol. Cum. E.V. = 86.6% (PC1 = 72.8%) PCA on toxicity of algae, Daphnia and fish Training set: 37 mol. Cum. E.V. = 90.0% (PC1 = 69.0%) PCA on toxicity of Daphnia and fish Training set: 79 mol. Cum. E.V. = 100% (PC1 = 86.5%) CONCLUSIONS The used procedures have confirmed the quite satisfactory predictive capability of the obtained models. The role of the descriptors in predicting the toxic effects can be explained, though there are a few uncertainties. Count descriptors play an important role in all models because of their capability in explaining particular features of some groups of chemicals; also the shape (P, k) and the density factors (E, D) are determinant in predicting the toxicity of the studied compounds. Using the reliable predicted data it was possible to add many toxicological data to the available experimental values. The graphics below and the annexed table report all the available experimental data and in addition the values predicted by our models (pink data). PCA on toxicity of bacteria, algae, Daphnia and fish n. tot. mol.: 54 Cum. E.V. = 81.5% (PC1 = 58.2%) PCA on toxicity of algae, Daphnia and fish n. tot. mol.: 97 Cum. E.V. = 93.7% (PC1 = 77.0%) PCA on toxicity of Daphnia and fish n. tot. mol.: 125 Cum. E.V. = 100% (PC1 = 88.3%) METHODS 2s/P003