A) I. I. Mechnikov National University, Chemistry Department, Dvorianskaya 2, Odessa 65026, Ukraine, b) Department of Molecular.

Slides:



Advertisements
Similar presentations
Multidimensional Parallel Column Gas Chromatography P. M. Owens and D. W. Loehle Center for Molecular Sciences United States Military Academy West Point,
Advertisements

Simple Linear Regression and Correlation
Lab #20 Write-Up (Due ) Rubric Your Report Header (Establishing Equilibrium) Objective Procedure & Materials: Refer to Handout Data: Graph attached.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
Objectives (BPS chapter 24)
CHEMISTRY ANALYTICAL CHEMISTRY Fall Lecture 17 Chapter 13: Acid-Base Titrations.
Regression and Correlation
Basic Statistical Concepts
Simple Linear Regression Statistics 700 Week of November 27.
IE-331: Industrial Engineering Statistics II Spring 2000 WEEK 1 Dr. Srinivas R. Chakravarthy Professor of Operations Research and Statistics Kettering.
Application and Efficacy of Random Forest Method for QSAR Analysis
Linear Regression Analysis
PRED 354 TEACH. PROBILITY & STATIS. FOR PRIMARY MATH Lesson 14 Correlation & Regression.
Results Conclusion C Results CFD study on heat transfer and pressure drop characteristics of an offset strip-fin heat exchanger in helium systems Objectives.
AN ITERATIVE METHOD FOR MODEL PARAMETER IDENTIFICATION 4. DIFFERENTIAL EQUATION MODELS E.Dimitrova, Chr. Boyadjiev E.Dimitrova, Chr. Boyadjiev BULGARIAN.
Biostatistics Unit 9 – Regression and Correlation.
REVISION C4 1 st Half! (H) An acid can be neutralised by adding a ______ or an ______ to it. An _______ is a soluble _______. An alkali can be neutralised.
STRATEGY TO HELP UNDERGRADUATE STUDENTS TO CALCULATE ACIDITY CONSTANTS BASED ON SPECTROSCOPIC MEASUREMENTS Diego Airado Rodríguez 1, Florentina Cañada.
A unifying model of cation binding by humic substances Class: Advanced Environmental Chemistry (II) Presented by: Chun-Pao Su (Robert) Date: 2/9/1999.
S TUDY O F T HE S ECOND V IRIAL C OEFFICIENTS : N EW C HALLENGE F OR QSPR Elena Mokshyna, Victor E. Kuz’min, Vadim I. Nedostup.
Lab #24 Write-Up (Due ) Rubric Your Report Lab Worksheet
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Identifying Applicability Domains for Quantitative Structure Property Relationships Mordechai Shacham a, Neima Brauner b Georgi St. Cholakov c and Roumiana.
Regression Regression relationship = trend + scatter
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Bashkir State Univerity The Chair of Mathematical Modeling , Ufa, Zaki Validi str. 32 Phone: ,
University of Auckland New Zealand Geothermal Group Department of Engineering Science Computer Modelling of Gas and Liquid Tracers in Geothermal Reservoirs.
Physical and chemical equilibrium of CO2-Water-Mineral system using Aspen Plus process simulator Technical University of Delft Ali Akbar Eftekhari Hans.
L e c t u r e 2L e c t u r e 2L e c t u r e 2L e c t u r e 2 Precipitation equilibrium Associate prof. L.V. Vronska Associate prof. M.M. Mykhalkiv.
Acid/Base Chemistry Part II CHEM 2124 – General Chemistry II Alfred State College Professor Bensley.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
A WEIGHTED CALIBRATION METHOD OF INTERFEROMETRIC SAR DATA Yongfei Mao Maosheng Xiang Lideng Wei Daojing Li Bingchen Zhang Institute of Electronics, Chinese.
WARM-UP Do the work on the slip of paper (handout)
Physical Property Modeling from Equations of State David Schaich Hope College REU 2003 Evaluation of Series Coefficients for the Peng-Robinson Equation.
Scatter Plots, Correlation and Linear Regression.
Selection of Molecular Descriptor Subsets for Property Prediction Inga Paster a, Neima Brauner b and Mordechai Shacham a, a Department of Chemical Engineering,
A "Reference Series" Method for Prediction of Properties of Long-Chain Substances Inga Paster and Mordechai Shacham Dept. Chem. Eng. Ben-Gurion University.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Atmospheric Chemistry Chemical effects on cloud activation with special emphasis on carbonaceous aerosol from biomass burning M. C. Facchini, S. Decesari,
Differentiate between physical and chemical changes and properties.[CHE.4A] October 2014Secondary Science - Chemistry.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Theory of dilute electrolyte solutions and ionized gases
Differential Equations Linear Equations with Variable Coefficients.
A molecular descriptor database for homologous series of hydrocarbons ( n - alkanes, 1-alkenes and n-alkylbenzenes) and oxygen containing organic compounds.
Physical Science and You Chapter One: Studying Physics and Chemistry Chapter Two: Experiments and Variables Chapter Three: Key Concepts in Physical Science.
Unit 3 Section : Regression  Regression – statistical method used to describe the nature of the relationship between variables.  Positive.
1 Prediction of Phase Equilibrium Related Properties by Correlations Based on Similarity of Molecular Structures N. Brauner a, M. Shacham b, R.P. Stateva.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Acids, Bases, and the pH scale. Acids and Bases  A chemical compound that releases H + to solution is an acid.  A compound that accepts H + and removes.
Physiochemical properties of drugs Using the Sirius T3 to make measurements.
Surface Tension Measurements of Organic, Inorganic and Mixed Aqueous Solutions Acknowledgments Thanks to the UNH Chemistry Department and Dr. Greenslade’s.
CHEE 323J.S. Parent1 Reaction Kinetics and Thermodynamics We define a catalyst as a substance that increases the rate of approach to equilibrium of a reaction.
CHAPTER 3 Describing Relationships
A.Liudchik, V.Pakatashkin, S.Umreika, S.Barodka
Regression and Correlation
Chemistry I Unit IV Objectives Chapter 10
Hierarchical Classification of Calculated Molecular Descriptors
Taras Shevchenko University, Kiev, Ukraine
Regression Analysis PhD Course.
2-7 Curve Fitting with Linear Models Holt Algebra 2.
VO Biophysik | G. Schauberger | Institut für Med. Physik & Biostatistik Reconstruction of airborne emissions by inverse dispersion modelling and.
Correlation and Regression
Pei-Yu Lin1#, Ming-Huang Wang1, Wen-Ta Chiu2,3 and Yuh-Shan Ho4*
Adequacy of Linear Regression Models
Introduction to Analytical Chemistry
3.2. SIMPLE LINEAR REGRESSION
Physical Properties of Matter
Presentation transcript:

a) I. I. Mechnikov National University, Chemistry Department, Dvorianskaya 2, Odessa 65026, Ukraine, b) Department of Molecular Structure and Chemoinformatics, A.V. Bogatsky Physical- Chemical Institute National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine c) Badger Technical Services, LLC, Vicksburg, Mississippi, USA d) Interdisciplinary Center for Nanotoxicity, Department of Chemistry, Jackson State University, Jackson, Mississippi, 39217, USA TWO-LAYER QSPR MODEL FOR PREDICTION OF ORGANIC COMPOUNDS A QUEOUS SOLUBILITY AT VARIOUS TEMPERATURES 2013 Presented by: Klimenko K.

Odessa national university Chemistry department

Challenges of aqueous solubility determination Other factors which can effect solubility 1.Pressure 2.Solution equilibrium 3.pH 4.State of substance 5.Methods for excessive solute removal These factors are frequently not taken to the account when solubility determination is carried out. Moreover, there is no universally recognized method for the experiment, therefore, solubility data can be variegated. 3

Temperature-solubility relationship Example solubility temperature coefficient(k j ) 4

Assessment of regression equation fit 5

Two-layer QSPR approach for aqueous solubility model development Molecular descriptors QSPR of aqueous solubility at 25 o C (lg(x j ) 25 ) Aqueous solubility prediction in range 0<t<100 lg(x j ) t = f (lg(x j ) 25, k j, t) QSPR of solubility temperature coefficient (k j ) 6

Feature net procedure for QSPR solubility model development Solubility temperature coefficient (k j ) calculation from experimental data QSPR model for coefficient prediction (k j ) Generating Simplex descriptors QSPR solubility model 0<t<100 0 C 7 Prediction of (k j ) value for all compounds in the set Calculation of descriptor k j (t-25), for temperature factor impact implementation

Statistical characteristics of QSPR models for solubility temperature coefficients 8 T1T2T3T4T5Average n65 Variable number Tree number R2R R 2 test R 2 (oob) S (ws) S (oob) S (ts) n – number of data points T(1-5) – test sets

Obs. vs Pred. solubility coefficient plot 9

Statistical characteristics of feature net QSPR models for solubility at temperature range 0>t>100 0 C T1T2T3T4T5Average m548 n1484 Variable number200 Tree number150 R2R R 2 test R 2 (oob) S (ws) S (oob) S (ts) m – number of compounds 10

Obs. vs Pred. solubility model plot 11

Distribution of prediction error for compounds with various molecular mass 12

Physicochemical parameters' relative influence on solubility in general model 13

Prediction of aqueous solubility for compounds from external test set(t=25,m=28) 14 Compounds nameobs.pred.Compounds nameobs.pred. acebutolol pyrimethamine Amoxicillin salicylic acid trazodone sulfamerazine folic acid sulfamethizole furosemide terfenadine hydrochlorothiazide thiabendazole imipramine tolbutamide indometacin Benzocaine ketoprofen benzthiazide lidocaine clozapin meclofenamic acid dibucaine naphthoic acid diethylstilbestrol Bendroflumethiazide diflunisal probenecid dipyridamole model1/Ttwo-layerfeature net S3,571,391,18 % accurate predictions17,942,946,4

Prediction of aqueous solubility at different temperatures t= o C t= o C t= o C t= o C t= o C m=5,k=35 %acc.pred.comp=75 %acc.pred.data points=71,4 15

Conclusion -SiRMS allows developing QSPR models for successful aqueous solubility in temperature range о С. -Linear regression equation is the best to describe solubility logarithm dependence on temperature. It is also useful for defining solubility temperature coefficient. -Electrostatics (25%) and lipophilicity (18%) have max impact on solubility. Temperature factor’s influence is also substantial and equals 3%. -Information derived from 2D-structure is sufficient for aqueous solubility prediction. 16

Thank you for your attention!