Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria.

1 Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria decision making environmetrics experimental design artificial neural networks statistical process control Milano Chemometrics and QSAR Research Group Department of Environmental Sciences University of Milano - Bicocca P.za della Scienza, 1 - 20126 Milano (Italy) Website: michem.unimib.it/chm/

2 Roberto Todeschini Milano Chemometrics and QSAR Research Group Molecular descriptors Autocorrelations, eigenvalue-based and information indices Iran - February 2009

3 Contents
- Autocorrelation descriptors
- Molecule representation by matrices
- Eigenvalue-based descriptors
- Information content
- Information indices

4 Autocorrelation on a molecular graph
- quadratic molecular property
- quadratic molecular property with interaction terms
w is the vector collecting the weights of each atom. Matrix dimensions of the quadratic form: 1 = (1, A) (A, A) (A, 1)

5 Autocorrelation on a molecular graph. Moreau - Broto autocorrelation of a topological structure (1984), as a function of the lag.

6 Autocorrelation on a molecular graph. Example: 4-hydroxy-2-butanone
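The slide's formula and figure did not survive in the transcript, so the following is only a minimal sketch of a Moreau - Broto style autocorrelation on the H-depleted graph of 4-hydroxy-2-butanone. It assumes the usual definition (sum of w_i * w_j over atom pairs whose topological distance equals the lag, with lag 0 giving the sum of squared weights). The atom numbering and the choice of vertex degrees as weights are assumptions made for illustration, and the function names are hypothetical.

```python
from collections import deque

def topological_distances(n_atoms, bonds):
    """All-pairs shortest path lengths (in bonds) via breadth-first search."""
    neighbours = [[] for _ in range(n_atoms)]
    for i, j in bonds:
        neighbours[i].append(j)
        neighbours[j].append(i)
    dist = [[None] * n_atoms for _ in range(n_atoms)]
    for start in range(n_atoms):
        dist[start][start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in neighbours[u]:
                if dist[start][v] is None:
                    dist[start][v] = dist[start][u] + 1
                    queue.append(v)
    return dist

def moreau_broto(weights, dist, lag):
    """Sum of w_i * w_j over atom pairs at topological distance `lag`.

    Lag 0 is the sum of squared weights; for lag > 0 each unordered
    pair is counted once.
    """
    n = len(weights)
    if lag == 0:
        return sum(w * w for w in weights)
    return sum(weights[i] * weights[j]
               for i in range(n) for j in range(i + 1, n)
               if dist[i][j] == lag)

# H-depleted graph of 4-hydroxy-2-butanone, CH3-C(=O)-CH2-CH2-OH.
# Numbering is an arbitrary choice for this sketch:
# 0: C (methyl), 1: C (carbonyl), 2: O (carbonyl), 3: C, 4: C, 5: O (hydroxyl)
bonds = [(0, 1), (1, 2), (1, 3), (3, 4), (4, 5)]
dist = topological_distances(6, bonds)

# Assumed weighting scheme: the vertex degree of each atom
weights = [1, 3, 1, 2, 2, 1]

ats = [moreau_broto(weights, dist, lag) for lag in range(5)]
print(ats)  # [20, 18, 13, 7, 2]
```

Any other atomic property (mass, electronegativity, polarizability) can be substituted for the weights without changing the rest of the sketch.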

7 Eigenvalue-based descriptors
Eigenvalue descriptors are derived from the diagonalization of symmetric matrices obtained from a molecular graph, such as:
- Adjacency matrix
- Vertex distance matrix
- Edge adjacency matrix
- Edge distance matrix
- Detour matrix
- Geometrical distance matrix
- Covariance matrix
... and any weighted symmetric matrix

8 Lovasz - Pelikan index (or leading eigenvalue), 1973. The largest eigenvalue derived from the adjacency matrix.
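As an illustration of the leading eigenvalue, the sketch below estimates it by power iteration using only the standard library. The function name, the shift by the identity matrix, and the two example carbon skeletons are assumptions of this sketch, not part of the slide; it also assumes a connected graph.

```python
import math

def leading_eigenvalue(adjacency, iterations=500):
    """Largest eigenvalue of a connected graph's adjacency matrix.

    Power iteration is run on A + I rather than on A: bipartite graphs
    (chains, trees) have eigenvalues in +/- pairs, and the shift makes the
    leading one strictly dominant. The shift is subtracted at the end.
    """
    n = len(adjacency)
    x = [1.0 / math.sqrt(n)] * n          # unit-length starting vector
    rayleigh = 1.0
    for _ in range(iterations):
        # y = (A + I) x
        y = [x[i] + sum(adjacency[i][j] * x[j] for j in range(n))
             for i in range(n)]
        rayleigh = sum(a * b for a, b in zip(x, y))   # x has unit length
        norm = math.sqrt(sum(v * v for v in y))
        x = [v / norm for v in y]
    return rayleigh - 1.0

# H-depleted graph of n-butane: a chain of four carbons
butane = [[0, 1, 0, 0],
          [1, 0, 1, 0],
          [0, 1, 0, 1],
          [0, 0, 1, 0]]

# H-depleted graph of isobutane: one central carbon bonded to three others
isobutane = [[0, 1, 1, 1],
             [1, 0, 0, 0],
             [1, 0, 0, 0],
             [1, 0, 0, 0]]

lam_butane = leading_eigenvalue(butane)        # approx. 1.618
lam_isobutane = leading_eigenvalue(isobutane)  # approx. 1.732
```

For matrices of realistic size, a library eigensolver would normally be preferred over this hand-written iteration.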

9 Eigenvalue-based descriptors General functions of eigenvalues

10 Eigenvalue-based descriptors The trace of the adjacency matrix (and of the distance matrix) is equal to zero.
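A standard linear-algebra consequence, added here as a reminder rather than taken from the slide: the eigenvalues of a symmetric matrix sum to its trace, so for such a matrix the positive and negative eigenvalues cancel exactly. In the block below, the bold A stands for the adjacency matrix and n for the number of vertices; both symbols are choices made for this note.

```latex
\operatorname{tr}(\mathbf{A}) = \sum_{i=1}^{n} a_{ii} = \sum_{i=1}^{n} \lambda_i = 0
\quad\Longrightarrow\quad
\sum_{\lambda_i > 0} \lambda_i = \sum_{\lambda_i < 0} \lvert \lambda_i \rvert
```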

11 Eigenvalue-based descriptors VAA indices (from adjacency matrix) Balaban et al., 1991

12 Eigenvector-based descriptors. VEA indices (from adjacency matrix), Balaban et al., 1991, where A is the largest negative eigenvalue derived from the adjacency matrix.

13 Eigenvalue-based descriptors. VAD, VED and VRD indices (from distance matrix), Balaban et al., 1991. The same indices defined above are calculated on the topological distance matrix.

14 Molecular geometry. The geometry matrix G (or geometric distance matrix) is a square symmetric matrix whose entry r_st is the geometric distance between the atoms s and t, calculated as the Euclidean distance.
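A minimal sketch of the geometry matrix, assuming Cartesian coordinates are available for every atom. The function name and the three coordinates are hypothetical and chosen only so that the distances are easy to check by hand.

```python
import math

def geometry_matrix(coords):
    """Square symmetric matrix of Euclidean interatomic distances r_st."""
    return [[math.dist(p, q) for q in coords] for p in coords]

# Hypothetical coordinates (in angstrom) for three atoms
coords = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 2.0, 0.0)]
G = geometry_matrix(coords)

for row in G:
    print(["%.3f" % r for r in row])
```

The diagonal is zero and the matrix is symmetric, since r_st = r_ts.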

15 Distance / distance matrix (DD), Randic et al., 1994

16 Eigenvalue-based descriptors. Folding degree index, Randic et al., 1994, based on the largest eigenvalue derived from the distance/distance matrix. This quantity tends to 1 for linear molecules (of infinite length) and decreases as the molecule folds.
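The formula itself is missing from the transcript. The sketch below assumes the commonly quoted definitions: each off-diagonal entry of the distance/distance matrix is the geometric distance divided by the topological distance, and the folding degree is the largest eigenvalue divided by the number of atoms. It handles only unbranched chains, where the topological distance between atoms i and j is |i - j|. The function names and coordinates are hypothetical.

```python
import math

def largest_eigenvalue(matrix, iterations=1000):
    """Leading eigenvalue of a symmetric non-negative matrix.

    Power iteration on (matrix + I); the identity shift keeps the leading
    eigenvalue strictly dominant and is subtracted at the end.
    """
    n = len(matrix)
    x = [1.0 / math.sqrt(n)] * n
    rayleigh = 1.0
    for _ in range(iterations):
        y = [x[i] + sum(matrix[i][j] * x[j] for j in range(n))
             for i in range(n)]
        rayleigh = sum(a * b for a, b in zip(x, y))   # x has unit length
        norm = math.sqrt(sum(v * v for v in y))
        x = [v / norm for v in y]
    return rayleigh - 1.0

def folding_degree_chain(coords):
    """Folding degree of an unbranched chain given atom coordinates in order."""
    n = len(coords)
    dd = [[0.0 if i == j
           else math.dist(coords[i], coords[j]) / abs(i - j)
           for j in range(n)] for i in range(n)]
    return largest_eigenvalue(dd) / n

# Five atoms with unit bond lengths: fully extended vs. bent back on itself
straight = [(float(i), 0.0, 0.0) for i in range(5)]
folded = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0),
          (0.0, 1.0, 0.0), (0.0, 2.0, 0.0)]

phi_straight = folding_degree_chain(straight)
phi_folded = folding_degree_chain(folded)
print(phi_straight, phi_folded)
```

Under these assumptions a straight chain of n atoms has all off-diagonal entries equal to 1, so the largest eigenvalue is n - 1 and the index is (n - 1) / n: 0.8 for five atoms, approaching 1 as the chain grows. This matches the limiting behaviour described on the slide.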

17 Conventional bond order π*
- single bond: π* = 1
- double bond: π* = 2
- triple bond: π* = 3
- conjugated bond: π* = 1.5

18 Eigenvalue-based descriptors. BCUT descriptors (Burden - CAS - University of Texas eigenvalues), 1997. The largest absolute eigenvalues λ1, λ2, λ3, ..., λL derived from the B matrix, which is built from π* (conventional bond order) and w (atomic properties).

19 Topological information indices Indices based on the information content and entropy measures derived from the molecular graphs.

20 Information content. The information content of a system having n elements is a measure of the degree of diversity of the elements in the set: $I_C = \sum_{g=1}^{G} n_g \log_2 n_g$, where G is the number of different equivalence classes, n_g is the number of elements in the g-th class, and $n = \sum_{g=1}^{G} n_g$.

21 Information content. Maximum information content: $I_{MAX} = n \log_2 n$. Total information content: $I_T = I_{MAX} - I_C$.

22 Information content. The Shannon entropy of a system having n elements is the mean information content of a set of elements: $H = -\sum_{g=1}^{G} p_g \log_2 p_g$, where G is the number of different equivalence classes, p_g is the probability of the g-th class, and $p_g = n_g / n$.

23 Maximum entropy: $H_{MAX} = \log_2 n$. Standardized entropy: $H^{*} = H / H_{MAX}$.

24 Information content ... on atoms
Both examples have n = 9 atoms, so $I_{MAX} = 9 \log_2 9 = 28.529$ and $H_{MAX} = \log_2 9 = 3.170$.
Example 1: n = 9, C = 7, F = 2
$I_C = 7 \log_2 7 + 2 \log_2 2 = 19.651 + 2.000 = 21.651$
$I_T = 28.529 - 21.651 = 6.878$
$H = -(7/9) \log_2 (7/9) - (2/9) \log_2 (2/9) = 0.282 + 0.482 = 0.764$
$H^{*} = 0.764 / 3.170 = 0.241$
Example 2: n = 9, C = 7, F = 1, Br = 1
$I_C = 7 \log_2 7 + 2 (1 \log_2 1) = 19.651 + 0 = 19.651$
$I_T = 28.529 - 19.651 = 8.878$
$H = -(7/9) \log_2 (7/9) - 2 (1/9) \log_2 (1/9) = 0.282 + 2 \times 0.352 = 0.986$
$H^{*} = 0.986 / 3.170 = 0.311$
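The arithmetic on this slide can be reproduced with a short sketch. It assumes the relations implied by the worked numbers: I_C is the sum of n_g log2 n_g over the equivalence classes, I_MAX = n log2 n, I_T = I_MAX - I_C, H is the Shannon entropy of the class frequencies, and H* = H / log2 n. The function name and the dictionary keys are hypothetical.

```python
import math
from collections import Counter

def information_indices(elements):
    """Information content and entropy measures for a list of class labels."""
    counts = list(Counter(elements).values())   # n_g for each equivalence class
    n = sum(counts)
    i_max = n * math.log2(n)
    i_c = sum(n_g * math.log2(n_g) for n_g in counts)
    h = -sum((n_g / n) * math.log2(n_g / n) for n_g in counts)
    h_max = math.log2(n)
    return {
        "I_MAX": i_max,
        "I_C": i_c,
        "I_T": i_max - i_c,
        "H": h,
        "H_MAX": h_max,
        "H*": h / h_max,
    }

# The two atom partitions from the slide
first = information_indices(["C"] * 7 + ["F"] * 2)
second = information_indices(["C"] * 7 + ["F", "Br"])

for name, result in (("C7 F2", first), ("C7 F1 Br1", second)):
    print(name, {key: round(value, 3) for key, value in result.items()})
```

With these definitions I_T equals n times H, which is a quick consistency check on the slide's values (9 x 0.764 is approximately 6.878).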

25 Information content
... on vertex degrees: the vertex degrees are 1 1 1 2 2 2 3 3 3, so n = 9, V1 = 3, V2 = 3, V3 = 3
$H = 3 \times [-(3/9) \log_2 (3/9)] = 1.585$
... on vertex degree magnitudes: SV1 = 3, SV2 = 6, SV3 = 9, so n = 18, V1 = 3, V2 = 6, V3 = 9
$H = -(3/18) \log_2 (3/18) - (6/18) \log_2 (6/18) - (9/18) \log_2 (9/18) = 1.459$

26 Department of Environmental Sciences University of Milano - Bicocca P.za della Scienza, 1 - 20126 Milano (Italy) Website: michem.disat.unimib.it/chm/ THANK YOU Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria decision making environmetrics experimental design artificial neural networks statistical process control Milano Chemometrics and QSAR Research Group

33 Roberto Todeschini Milano Chemometrics and QSAR Research Group Molecular descriptors Autocorrelations, eigenvalue-based and information indices Prof. Roberto Todeschini Dr. Davide Ballabio Dr. Viviana Consonni Dr. Alberto Manganaro Dr. Andrea Mauri

37 Autocorrelation on a molecular graph

