MODELLING OF PHYSICO-CHEMICAL PROPERTIES FOR ORGANIC POLLUTANTS F. Consolaro, P. Gramatica and S. Pozzi QSAR Research Unit, Dept. of Structural and Functional.

Slides:



Advertisements
Similar presentations
C A INTRODUCTION An Environmental Quality Objective (EQO), intended as a real “No Effect Concentration” (NEC), is not accessible experimentally. The usual.
Advertisements

CE Introduction to Environmental Engineering and Science Readings for This Class: O hio N orthern U niversity Introduction Chemistry, Microbiology.
UNIT 2 Inorganic Nomenclature*, Intermolecular Forces, and Properties of Solutions *Students are responsible for reviewing nomenclature on their own.
PROBABILISTIC ASSESSMENT OF THE QSAR APPLICATION DOMAIN Nina Jeliazkova 1, Joanna Jaworska 2 (1) IPP, Bulgarian Academy of Sciences, Sofia, Bulgaria (2)
Lipinski’s rule of five
ABSTRACT The BEAM EU research project focuses on the risk assessment of mixture toxicity. A data set of 124 heterogeneous chemicals of high concern as.
Basic Steps of QSAR/QSPR Investigations
Introduction to GW contamination and
1 Persistent, Bioaccumulative and Toxic Pollutants Persistent, Bioaccumulative and Toxic Pollutants Programme (PBT) launched by the EPA in 1998: - Reduce.
Quantitative Structure-Activity Relationships (QSAR) Comparative Molecular Field Analysis (CoMFA) Gijs Schaftenaar.
Bioinformatics IV Quantitative Structure-Activity Relationships (QSAR) and Comparative Molecular Field Analysis (CoMFA) Martin Ott.
Speed of Gases. Speed of Gases Root Mean Square Speed of Gases R = J *K / mol J = 1kg * m2 / s2 MM = molar mass.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Quantitative Structure-Activity Relationships (QSAR)  Attempts to identify and quantitate physicochemical properties of a drug in relation to its biological.
Environmental Processes Partitioning of pollutants 3.i Sorption involving organic matter (between air/soil and water/soil)
Combining Statistical and Physical Considerations in Deriving Targeted QSPRs Using Very Large Molecular Descriptor Databases Inga Paster and Mordechai.
Non ionic organic pesticide environmental behaviour: ranking and classification F. Consolaro and P. Gramatica QSAR Research Unit, Dept. of Structural and.
Molecular Descriptors
Molecular Polarity & Intermolecular Forces
Fundamental processes in soil, atmospheric and aquatic systems 2(ii) Partitioning.
RESULT and DISCUSSION In order to find a relation between the three rate reaction constant (k OH, k NO3 and k O3 ) and the structural features of chemicals,
Sung Kyu (Andrew) Maeng. Contents  QSAR Introduction  QSBR Introduction  Results and discussion  Current QSAR project in UNESCO-IHE.
Surveillance monitoring Operational and investigative monitoring Chemical fate fugacity model QSAR Select substance Are physical data and toxicity information.
A unifying model of cation binding by humic substances Class: Advanced Environmental Chemistry (II) Presented by: Chun-Pao Su (Robert) Date: 2/9/1999.
ABSTRACT Bioconcentration by aquatic biota is an important factor in assessing the environmental behaviour and potential hazard evaluation of a chemical,
Predicting a Variety of Constant Pure Compound Properties by the Targeted QSPR Method Abstract The possibility of obtaining a reliable prediction a wide.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
CONCLUSIONS CONCLUSIONS - Missing values of the principal physico-chemical properties are predicted by validated regression models by using different kinds.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
The aquatic toxicity values of 57 esters, with experimental and predicted LC50 in fish, EC50 in Daphnia and seaweed and IGC in Entosiphon sulcatum, were.
Paola GRAMATICA a, Paola LORENZINI a, Angela SANTAGOSTINO b and Ezio BOLZACCHINI b a University of Insubria, Dep. of Structural and Functional Biology,
Identifying Applicability Domains for Quantitative Structure Property Relationships Mordechai Shacham a, Neima Brauner b Georgi St. Cholakov c and Roumiana.
University of Texas at AustinMichigan Technological University 1 Module 2: Evaluating Environmental Partitioning and Fate: Approaches based on chemical.
DESIRABILITY OF POPs ACCORDING TO THEIR ATMOSPHERIC MOBILITY The main goal pursued in this work is the formulation of a POP ranking by atmospheric mobility.
TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS Council Directive 76/464/EEC of the European.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
The Gas State  Gases are everywhere – atmosphere, environmental processes, industrial processes, bodily functions  Gases have unique properties from.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
Polychlorinated Biphenyls (PCBs)
Fugacity-based environmental modelsmodels Level 1--the equilibrium distribution of a fixed quantity of conserved chemical, in a closed environment at equilibrium,
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
ABSTRACT The behavior and fate of chemicals in the environment is strongly influenced by the inherent properties of the compounds themselves, particularly.
P. Gramatica and F. Consolaro QSAR Research Unit, Dept. of Structural and Functional Biology, University of Insubria, Varese, Italy.
QSAR AND CHEMOMETRIC APPROACHES TO THE SCREENING OF POPs FOR ENVIRONMENTAL PERSISTENCE AND LONG RANGE TRANSPORT FOR ENVIRONMENTAL PERSISTENCE AND LONG.
Organic pollutants environmental fate: modeling and prediction of global persistence by molecular descriptors P.Gramatica, F.Consolaro and M.Pavan QSAR.
McKim Conference on Predictive Toxicology
Log Koc = MW nNO – 0.19 nHA CIC MAXDP Ts s = 0.35 F 6, 134 = MW: molecular weight nNO: number of NO bonds.
حرارة وديناميكا حرارية
F.Consolaro 1, P.Gramatica 1, H.Walter 2 and R.Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental.
MUTAGENICITY OF AROMATIC AMINES: MODELLING, PREDICTION AND CLASSIFICATION BY MOLECULAR DESCRIPTORS M.Pavan and P.Gramatica QSAR Research Unit, Dept. of.
Molar Mass of an Unknown Gas
P. Gramatica 1, H. Walter 2 and R. Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental Research.
Use of Machine Learning in Chemoinformatics
Intermolecular Forces Topic 4.3. Intermolecular Forces Intramolecular forces – refer to the forces that hold atoms together within molecules or formula.
© 2009, Prentice-Hall, Inc. Effusion Effusion is the escape of gas molecules through a tiny hole into an evacuated space.
2 pt 3 pt 4 pt 5pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2pt 3 pt 4pt 5 pt 1pt 2pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4pt 5 pt 1pt Mixture or Compound Mixture or Compound.
Pure Substances and Mixtures. Pure Substances Elements cannot be broken down chemically into anything else – they are pure substances. Compounds can be.
Pure Substances and Mixtures
Lecture 44 Electronegativity and Polarity Ozgur Unal
Statistical Data Analysis - Lecture /04/03
Intermolecular Forces
PHYSICO-CHEMICAL PROPERTIES MODELLING FOR ENVIRONMENTAL POLLUTANTS
Hierarchical Classification of Calculated Molecular Descriptors
SIMPLE LINEAR REGRESSION MODEL
Virtual Screening.
P. Gramatica1, F. Consolaro1, M. Vighi2, A. Finizio2 and M. Faust3
Information Sources Day 1.
Chapter 11 Liquids, solids, and intermolecular forces
M.Pavan, P.Gramatica, F.Consolaro, V.Consonni, R.Todeschini
Chemistry/Physical Setting
Presentation transcript:

MODELLING OF PHYSICO-CHEMICAL PROPERTIES FOR ORGANIC POLLUTANTS F. Consolaro, P. Gramatica and S. Pozzi QSAR Research Unit, Dept. of Structural and Functional Biology, University of Insubria, Varese, Italy web-site: INTRODUCTION Many organic pollutants are causes of contamination of air, water, soil and food, with possible adverse effects on human and animal health. The proper use of chemicals must be based on an understanding of their behaviour in the environment. This behaviour is strongly influenced by the chemical’s physico-chemical properties. Unfortunately, for a large number of these compounds, the experimental data of several physico-chemical properties are not known or, when known, not all data are homogeneous; this hinder an accurate and comparable evaluation of the future environmental fate of the considered compounds. EEC Priority List 1 compounds and Persistent Organic Pollutants (POPs) are the classes of compounds considered here. APPLICATION OF QSARs FOR THE PREDICTION OF PHYSICO-CHEMICAL PROPERTIES Due to the difficulties in obtaining reliable experimental data, QSAR studies provide a complementary tool that can profitably be used to provide data from accurately validated models. Quantitative structure-activity relationship approaches are based on the assumption that the structure of a molecule must contain features responsible for its physical and chemical properties, and on the possibility of representing a molecule by numerical descriptors. Thus, molecular descriptors represent the way by which chemical information is transformed and coded, to deal with chemical, pharmacological and toxicological problems. Many chemical descriptors have been used in this work: 1D-descriptors, that are structural descriptors obtained from a simple knowledge of the molecular formula, 2D or topological descriptors, that are obtained from the knowledge of the molecular topology, and 3D-descriptors, recently developed (1), that contain information about the whole 3D-molecular structure in terms of size, shape, symmetry and atom distribution. Before the calculation of the descriptors, the minimum energy conformations of all the compounds were obtained by the molecular mechanics method of Allinger (MM+). Then, the selected properties were modelled by the Selection of the best Subset Variables (VSS) method, through the Genetic Algorithm (GA-VSS) approach, where the response is obtained by Ordinary Least Squares regression (OLS). All calculations were performed using the leave-one- out and leave-more-out procedures of cross-validation, maximising the cross-validated R squared (Q 2 ). Standard Deviation Error in Prediction (SDEP) and Standard Deviation Error in Calculation (SDEC) are also evaluated. In spite of the great variability of the molecular structures of the studied compounds, models with good predictive power have been obtained. The reliability of predicted data was subsequently checked by the leverage approach; only reliable predicted data were then used. (1) R. Todeschini, WHIM-3D/QSAR- Software for the calculation of the WHIM descriptors. Rel. 4.1 for Windows, Talete s.r.l., Milano (Italy) Download: EEC Priority List 1 compounds are chemicals of high diffusion that are dangerous for the environment. The hazard relates to both their toxicity for different organisms and their physico-chemical properties that determine their environmental fate, persistence and bioaccumulation. This List includes a large number of heterogeneous compounds, for which the following physico-chemical properties were considered: melting point, boiling point, density, refraction index, vapour pressure, water solubility and log Kow. Log vapour pressure = nBO E10 -4 IDMT HY Log water sol. = Ss L3v Ku Log K ow = MW UI L3m Log vapour pressure = P1s Tv Log water sol. = nC nCl nOH nNH2 MW: molecular weight Se: total electronegativity HY: hydrophilicity factor E3m: emptiness along the third direction with atomic masses weights nBO: number of bonds IDMT: total information content on the distance magnitude HY: hydrophilicity factor nC: number of C atoms nCl: number of Cl atoms nOH: number of OH groups nNH2: number of NH2 groups The dimension of the molecules (MW, Se) appears the most important factor. The selection of E3m descriptor confirms the importance of the emptiness factor and the atomic masses in the determination of this property. The hydrophilicity factor selected by this model seems to be the most relevant variable in predicting vapour pressure (by standardised regression coefficients). In this model all the selected variables are count descriptors. CONCLUSIONS The good predictive power of models, here presented, indicates that they will accurately predict environmental fate and effects of organic chemicals for which experimental data are not available. Reliable predictive models will enable fast and accurate assessments to be made of the environmental profiles of proven or potentially hazardous chemicals. The ability to accurately estimate the environmental fate and effect of commercial chemicals, and to obtain a better insight into the structural features important for such behaviour, will be very helpful in formulating improved environmental policy. For this reason we have available, for anyone interested in it, a table with all the data predicted by the above reported regression models. Density = E10 -3 MW E10 -2 Se HY E3m Persistent Organic Pollutants (POPs) EEC Priority List 1 compounds Persistent Organic Pollutants are long-lived and fat-soluble compounds that have toxic effects on animal reproduction, development and immunological function. Some POPs are also probably carcinogenic. These compounds are of 12 chemical classes, including polychlorinated biphenyls (PCBs), polycyclic aromatic hydrocarbons (PAHs), polychlorinated dibenzo-p-dioxins, pesticides and chlorinated benzene. The POPs environmental fate depends upon a variety of physico-chemical properties such as vapour pressure, Henry’s law constant, boiling point, melting point, log K ow, log K oc, water solubility and also total surface area and molar volume. Ss: total electrotopological charge L3v:dimension along the third direction with van der Waals volume weight Ku: shape with unitary weights This model points out the importance of size variables (Ss) and the presence of out-of-plane atoms (L3v) in predicting water solubility. The descriptor Ku underlines the relevance of molecular shape (more linear molecules (K=1) are less soluble). MW: molecular weight UI: insaturation index L3m:dimension along the third direction with atomic mass weight The descriptors MW and UI confirm the importance of molecular size and insaturation in order to predict POPs K ow. L3m points out the relevance along the third dimension of chlorine atoms. P1s: shape along principal axis with electrotopological charge weight Tv: dimension linear term with van der Waals weights The selected descriptors point out the importance of the molecular size (Tv) and of the linearity (P1s) in determining POP vapour pressure. 2r/P002