Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria.

Slides:



Advertisements
Similar presentations
Spatial autoregressive methods
Advertisements

Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria.
Latent Semantic Analysis
Chapter 6 Eigenvalues and Eigenvectors
Surface normals and principal component analysis (PCA)
1er. Escuela Red ProTIC - Tandil, de Abril, 2006 Principal component analysis (PCA) is a technique that is useful for the compression and classification.
Artificial Neural Networks
ABSTRACT The BEAM EU research project focuses on the risk assessment of mixture toxicity. A data set of 124 heterogeneous chemicals of high concern as.
Spectral Clustering Scatter plot of a 2D data set K-means ClusteringSpectral Clustering U. von Luxburg. A tutorial on spectral clustering. Technical report,
The Potts and Ising Models of Statistical Mechanics.
Chapter 9 Graph algorithms Lec 21 Dec 1, Sample Graph Problems Path problems. Connectedness problems. Spanning tree problems.
Chapter 3 Determinants and Matrices
The Irreducible reps are orthogonal. Hence for the reducible rep and a particular irreducible rep j  (character of Reducible Rep)(character of Irreducible.
Visual Recognition Tutorial1 Random variables, distributions, and probability density functions Discrete Random Variables Continuous Random Variables.
6 1 Linear Transformations. 6 2 Hopfield Network Questions.
Analytic Geometry in Three Dimensions
Matrices CS485/685 Computer Vision Dr. George Bebis.
Arithmetic Operations on Matrices. 1. Definition of Matrix 2. Column, Row and Square Matrix 3. Addition and Subtraction of Matrices 4. Multiplying Row.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 5QF Introduction to Vector and Matrix Operations Needed for the.
Molecular Descriptors
Lecture7 Topic1: Graph spectral analysis/Graph spectral clustering and its application to metabolic networks Topic 2: Concept of Line Graphs Topic 3: Introduction.
Linear Algebra Review 1 CS479/679 Pattern Recognition Dr. George Bebis.
1 February 24 Matrices 3.2 Matrices; Row reduction Standard form of a set of linear equations: Chapter 3 Linear Algebra Matrix of coefficients: Augmented.
8.1 Vector spaces A set of vector is said to form a linear vector space V Chapter 8 Matrices and vector spaces.
Presentation on Matrices and some special matrices In partial fulfillment of the subject Vector calculus and linear algebra ( ) Submitted by: Agarwal.
Computer Science 112 Fundamentals of Programming II Introduction to Graphs.
ECE 8443 – Pattern Recognition LECTURE 03: GAUSSIAN CLASSIFIERS Objectives: Normal Distributions Whitening Transformations Linear Discriminants Resources.
Network Systems Lab. Korea Advanced Institute of Science and Technology No.1 Appendix A. Mathematical Background EE692 Parallel and Distribution Computation.
Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
“Topological Index Calculator” A JavaScript application to introduce quantitative structure-property relationships (QSPR) in undergraduate organic chemistry.
Linear algebra: matrix Eigen-value Problems
Martin Waldseemüller's World Map of 1507 Zanjan. Roberto Todeschini Viviana Consonni Davide Ballabio Andrea Mauri Alberto Manganaro chemometrics molecular.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
A Review of Some Fundamental Mathematical and Statistical Concepts UnB Mestrado em Ciências Contábeis Prof. Otávio Medeiros, MSc, PhD.
ABSTRACT The behavior and fate of chemicals in the environment is strongly influenced by the inherent properties of the compounds themselves, particularly.
1 Some Inequalities on Weighted Vertex Degrees, Eigenvalues, and Laplacian Eigenvalues of Weighted Graphs Behzad Torkian Dept. of Mathematical Sciences.
What is the determinant of What is the determinant of
1 Matrix Algebra and Random Vectors Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
Eigenvalues The eigenvalue problem is to determine the nontrivial solutions of the equation Ax= x where A is an n-by-n matrix, x is a length n column.
F.Consolaro 1, P.Gramatica 1, H.Walter 2 and R.Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental.
Signal & Weight Vector Spaces
MATRICES Operations with Matrices Properties of Matrix Operations
By Josh Zimmer Department of Mathematics and Computer Science The set ℤ p = {0,1,...,p-1} forms a finite field. There are p ⁴ possible 2×2 matrices in.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Mesh Segmentation via Spectral Embedding and Contour Analysis Speaker: Min Meng
Objectives: Normal Random Variables Support Regions Whitening Transformations Resources: DHS – Chap. 2 (Part 2) K.F. – Intro to PR X. Z. – PR Course S.B.
Lecture 5 Graph Theory prepped by Lecturer ahmed AL tememe 1.
2.1 Matrix Operations 2. Matrix Algebra. j -th column i -th row Diagonal entries Diagonal matrix : a square matrix whose nondiagonal entries are zero.
MTH108 Business Math I Lecture 20.
CS479/679 Pattern Recognition Dr. George Bebis
Matrices and vector spaces
Network analysis.
Degree and Eigenvector Centrality
Section 7.12: Similarity By: Ralucca Gera, NPS.
Statistical Analysis Professor Lynne Stokes
1. Use the quadratic formula to find all real zeros of the second-degree polynomial
CS485/685 Computer Vision Dr. George Bebis
6-4 Symmetric Matrices By毛.
Derivative of scalar forms
Graph Operations And Representation
1.3 Vector Equations.
Chapter 3 Linear Algebra
Matrix Algebra and Random Vectors
Topological Index Calculator III
Maths for Signals and Systems Linear Algebra in Engineering Lectures 13 – 14, Tuesday 8th November 2016 DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR)
...molecule is said to be a “polar”
Principal Component Analysis
M.Pavan, P.Gramatica, F.Consolaro, V.Consonni, R.Todeschini
Detecting Important Nodes to Community Structure
Presentation transcript:

Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria decision making environmetrics experimental design artificial neural networks statistical process control Milano Chemometrics and QSAR Research Group Department of Environmental Sciences University of Milano - Bicocca P.za della Scienza, Milano (Italy) Website: michem.unimib.it/chm/

Roberto Todeschini Milano Chemometrics and QSAR Research Group Molecular descriptors Autocorrelations, eigenvalue-based and information indices Iran - February 2009

Contents  Autocorrelation descriptors  Autocorrelation descriptors  Molecule representation by matrices  Molecule representation by matrices  Eigenvalue-based descriptors  Eigenvalue-based descriptors  Information content  Information content  Information indices  Information indices

- quadratic molecular property - quadratic molecular property with interaction terms w is the vector collecting the weights of each atom Autocorrelation on a molecular graph 1 = (1,A) (A,A) (A,1)

Moreau - Broto autocorrelation of a topological structure 1984 LAG Autocorrelation on a molecular graph

Example : 4-hydroxy-2-butanone Autocorrelation on a molecular graph

Eigenvalue descriptors are derived from the diagonalization of symmetric matrices derived from a molecular graph, such as:  Adjacency matrix  Vertex distance matrix  Edge adjacency matrix  Edge distance matrix  Detour matrix  Geometrical distance matrix  Covariance matrix... and any weighted symmetric matrix Eigenvalue-based descriptors

Lovasz - Pelikan index (or leading eigenvalue) The largest eigenvalue derived from the adjacency matrix 1973

Eigenvalue-based descriptors General functions of eigenvalues

Eigenvalue-based descriptors The trace of the adjacency matrix (and of the distance matrix) is equal to zero.

Eigenvalue-based descriptors VAA indices (from adjacency matrix) Balaban et al., 1991

VEA indices (from adjacency matrix) Balaban et al., 1991 where A is largest negative eigenvalue derived from the adjacency matrix Eigenvector-based descriptors

VAD, VED and VRD indices (from distance matrix) Balaban et al., 1991 The same indices defined above are calculated on the topological distance matrix Eigenvalue-based descriptors

The geometry matrix G (or geometric distance matrix) is a square symmetric matrix whose entry r st is the geometric distance calculated as the Euclidean distance between the atoms s and t: Molecular geometry

Distance / distance matrix Distance / distance matrix (DD) Randic et al., 1994

Folding degree index Randic et al., 1994 This quantity tends to 1 for linear molecules (of infinite length) and decreases in correspondence with the folding of the molecule. The largest eigenvalue derived from the distance/distance matrix Eigenvalue-based descriptors

Conventional bond order  single bond:  * = 1  double bond:  * = 2  triple bond:  * = 3  conjugated bond:  * = 1.5

Eigenvalue-based descriptors BCUT descriptors Burden - CAS - University of Texas eigenvalues BCUT descriptors Burden - CAS - University of Texas eigenvalues The largest absolute eigenvalues 1, 2, 3,..., L, derived from the following B matrix: 1997  * conventional bond order w atomic properties

Topological information indices Indices based on the information content and entropy measures derived from the molecular graphs.

Information content The information content of a system having n elements is a measure of the degree of diversity of the elements in the set. where G is the number of different equivalence classes and n g is the number of elements in the g-th class and

Information content Maximum information content Total information content

The Shannon entropy of a system having n elements is the mean information content of a set of elements where G is the number of different equivalence classes and p g is the probability of the g-th class and Information content

Maximum entropy Standardized entropy

Information content... on atoms n = 9 C = 7 F = 2 n = 9 C = 7 F = 1 Br = 1 I C = 7 log log 2 2 = = I T = – = I C = 7 log (1 log 2 1) = = I T = – = H = -(7/9) log 2 (7/9) + -(2/9) log 2 (2/9) = = H * = / = I MAX = 9 log 2 9 = H MAX = log 2 9 = H = -(7/9) log 2 (7/9) - 2 (1/9) log 2 (1/9) = x = H * = / = 0.311

Information content... on vertex degrees n = 9 V1 = 3 V2 = 3 V3 = on vertex degree magnitudes SV1 = 3 SV2 = 6 SV3 = 9 n = 18 V1 = 3 V2 = 6 V3 = 9 H = -(3/18) log 2 (3/18) - (6/18) log 2 (6/18) -(9/18) log 2 (9/18) = xxxx H = 3*[-(3/9) log 2 (3/9)] = xxx

Department of Environmental Sciences University of Milano - Bicocca P.za della Scienza, Milano (Italy) Website: michem.disat.unimib.it/chm/ THANK YOU Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria decision making environmetrics experimental design artificial neural networks statistical process control Milano Chemometrics and QSAR Research Group

X

X

X

X

X

Roberto Todeschini Milano Chemometrics and QSAR Research Group Molecular descriptors Autocorrelations, eigenvalue-based and information indices Prof. Roberto Todeschini Dr. Davide Ballabio Dr. Viviana Consonni Dr. Alberto Manganaro Dr. Andrea Mauri

X

X

X

Autocorrelation ona molecular graph