Download presentation
Presentation is loading. Please wait.
Published bySilvester Carpenter Modified over 9 years ago
1
1 A bridge between PLS path modeling and ULS-SEM Michel Tenenhaus
2
2 A SEM tree Svante Wold Harald Martens PLS regression (1983) SIMCA-P The Unscrambler C. Ringle SMART-PLS Chatelin-Esposito Vinzi Fahmy-Jäger-Tenenhaus XLSTAT-PLSPM (2008) W. Chin PLS-Graph Herman Wold NIPALS (1966) PLS approach (1975) J.-B. Lohmöller LVPLS 1.8 (1984) SEM Component-based SEM (Score computation) Covariance-based SEM (CSA) (Model validation) K. Joreskog (LISREL, 1970) R. McDonald(1996) M. Tenenhaus (2001) H. Hwang Y. Takane GSCA (2004) H. Hwang VisualGSCA 1.0 (2007) Generalized Structured Component Analysis (ALS)
3
3 Covariance-based Structural Equation Modeling Latent Variables : Structural model (Inner model):
4
4 Structural Equation Modeling Measurement model (outer model) : MV LV MV Endogenous Exogenous
5
5 Structural Equation Modeling Mixing Structural and measurement models : Residual variances are diagonal matrices
6
6 Structural Equation Modeling Covariance matrix for manifest variables : Outer model Inner model Exogen. LV Cov. Structural Residual variance Measurement Residual variance
7
7 Covariance-based SEM ULS algorithm : S = Observed covariance matrix for MV Goodness-of-fit Index (Jöreskog & Sorbum): PCA Generalization
8
8 Path model describing causes and consequences of Customer Satisfaction: The inner model. value satisfaction Image Perceived Customer Expectation Perceived quality Loyalty Customer Complaints
9
9 Use of AMOS 6.0 Méthod = ULS This is a computational trick: Residual variances are passed to errors and can always be computed afterwards. First Roderick McDonald’s idea (1996) Measurement residual variances are canceled:
10
10 Covariance-based SEM ULS algorithm with the McDonald’s constraints: S = Observed covariance matrix for MV Goodness-of-fit Index (Jöreskog & Sorbum):
11
11 Use of AMOS 6.0 - Méthod = ULS - Measurement residual variances = 0
12
12 Results Outer LV Estimates: 2 nd McDonald’s idea + Fornell’s idea PLS estimate of LV: - Mode A - LV inner estimate = theoretical LV - LV inner estimate computation is useless. GFI =.869
13
13 Cross-validation by bootstrap
14
14 Cross-validation by bootstrap
15
15 Comparison between the Fornell-ULS and Fornell-PLS standardized weights
16
16 Comparison between the Fornell-ULS and Fornell-PLS standardized weights and all Cor(LV Fornell-ULS, LV Fornell-PLS ) >.99
17
17 First particular case : Factor Analysis and Principal Component Analysis
18
18 First particular case : FA and PCA Factor Analysis Reflective mode Principal component Analysis Formative mode AVE =.69 AVE =.74.15.21.42.36.08 a posteriori computation
19
19 FA vs PCA : Variance reconstruction FA does not care for variance reconstruction PCA cares for variance reconstruction FA (reflective mode) yields to better covariances reconstruction than PCA (formative mode).
20
20 MIMIC mode (with this new approach) (Multiple effect indicators for multiple causes) Formative mode (multiple cause) Reflective mode (multiple effect)
21
21 MIMIC mode (usual in CSA) Formative mode (multiple cause) Reflective mode (multiple effect) ? ?
22
22 MIMIC mode (usual in CSA) Formative mode (multiple cause) Reflective mode (multiple effect) ?
23
23 MIMIC mode (better) Formative mode (multiple cause) Reflective mode (multiple effect) (Same than before) Proposal: Compute a global score as PCA oriented vs the dependent block
24
24 Second particular case : Multi-block data analysis
25
Sensory analysis of 21 Loire Red Wines (J. Pagès) X 1 = Smell at rest, X 2 = View, X 3 = Smell after shaking, X 4 = Tasting X1X1 X2X2 X3X3 X4X4 3 Appellations4 Soils Illustrative variable 4 blocks of variables
26
PCA of each block: Correlation loadings
27
PCA of each block: Correlation loadings GFI =.301
28
28 Multi-block data analysis = Confirmatory Factor Analysis VIEW SMELL AFTER SHAKING SMELL AT REST SMELL AT REST TASTING GFI =.849
29
29 First dimension Using MV with significant loadings
30
30 First global score GFI =.973 2nd order CFA
31
31 Validation of the first dimension Correlations 1.6211.865.7621.682.813.8951.813.920.942.944 Rest1 View Shaking1 Tasting1 Score1 Rest1ViewShaking1Tasting1
32
32 Second dimension
33
33 2 nd global score GFI =.905
34
34 Validation of the second dimension Correlations 1.7891.782.8031.944.904.928 Rest2 Shaking2 Tasting2 Score2 Rest2Shaking2Tasting2
35
35 Mapping of the correlations with the global scores Score 1 related with quality Score 2 unrelated with quality
36
36 Correlation with global quality New result. Not obtained with other multi-block data analysis methods, nor with factor analysis of the whole data.
37
37 Wine visualization in the global score space Wines marked by Appellation
38
38 Wine visualization in the global score space Wines marked by Soil
39
DAM = Dampierre-sur-Loire
40
A soft, warm, blackberry nose. A good core of fruit on the palate with quite well worked tannin and acidity on the finish; Good length and a lot of potential. DECANTER (mai 1997) (DECANTER AWARD ***** : Outstanding quality, a virtually perfect example) Cuvée Lisagathe 1995
42
42 = Sun exposure = Sun protection A = Gender (A 1 = Men, A 2 = Women) Data Model = 0 + 1 A 1 + 2 *A 1 + 3 *A 2 + (1) 1 < 0? 2 < 3 ? Third particular case : Analysis of covariance between two blocks of binary variables (with C. Guinot et E. Mauger (CERIES))
43
43 No gender effect Gender main effect W M 1 = 0, 2 = 3 1 0, 2 = 3 Interaction *gender W M 2 32 3 Theory: background = 0 + 1 A 1 + 2 *A 1 + 3 *A 2 + (1)
44
44 X 1 X 1 Sun exposure during lifetime (4) X 4 X 4 Sun exposure during hobbies (2) X 5 X 5 Practice of naturism (1) A Gender Y Y Sun protection behavior over the past year (6) X 3 X 3 Sun exposure during nautical sports (2) X 2 X 2 Sun exposure during mountain sports (2) Score for sun protection Theory: background Score for sun exposure
45
45 Equation (1) is replaced by: Yc = 0 + 1 A 1 + 2 Xw*A 1 + 3 Xw*A 2 + (2) = 0 + 1 A 1 + 2 (X*A 1 )w + 3 (X*A 2 )w + (3) Question: How to estimate and test the parameters w, c, 0, 1, 2, 3 ? Theory: background = 0 + 1 A 1 + 2 *A 1 + 3 *A 2 + (1)
46
46 Covariance based SEM with constraints No group effect on the measurement model Y3Y3Y3Y3 Y2Y2Y2Y2 Y1Y1Y1Y1 Y A1A1A1A1 X 1 *A 1 X 2 *A 1 X 3 *A 1 X 1 *A 2 X 2 *A 2 X 3 *A 2 X*A 1 X*A 2 2 2 33 11 w1w1 w1w1 w2w2 w3w3 w2w2 w3w3 c1c1 c2c2 c3c3 Theory: methods
47
47 We have applied the ULS-SEM to a study on sun- exposure behavior in 8,084 French adults. Development of skin cancers Premature skin ageing. Data came from the SU.VI.MAX study* *Hercberg S. et al. Arch Intern Med. 2004;164:2325-42 Application: material
48
48 Use of AMOS method = ULS
49
Product used besides voluntarily sun exposure periods Nb of days of practice of nautical activites > 400 days Sun exposure during practice of nautical sports Nb of days of lifetime hobbies > 900 days Sun exposure during practice of hobbies Practice of naturism during lifetime Product used for face with SPF > 15 Product used for body with SPF > 15 Several times while sun exposure While sun exposure While sun tanning Nb of days of lifetime hobbies > 900 days Sun exposure during practice of hobbies Nb of days of mountain sport actvities > 200 days Sun exposure during practice of mountain sports Intensity of sun exposure moderate or severe Basking in the sun is important or very important Sun exposure between 11am and 4 pm Sun exposure of body and face. Basking in the sun is important or very important Intensity of sun exposure moderate or severe Nb of days of mountain sport actvities > 200 days Practice of naturism during lifetime... Sun exposure during practice of mountain sports Nb of days of practice of nautical activites > 400 days Sun exposure during practice of nautical sports Sun exposure between 11am and 4 pm Sun exposure of body and face Sun expo. (M) Sun expo. (W) 1.46 2.30 1.35 2.67.64.62 1.23 1.00 2.13 3.91 2.82 2.30 1.46 3.91 2.13 1.00 1.23.64 2.67 1.35.62 Sun Protection 1.00.82.90.45.40.62 Men d -.17.75 1.24 AMOS results [-.20, -.14] [.64,.89] Confidence Interval (Bootstrap) [1.12, 1.38] GFI =.870
50
50 = 0 + 1 M + 2 *M + 3 *W + Conclusion 1.Women tend to protect themselves from the sun more than men ( 1 < 0). 2.This difference between men and women increases as lifetime sun exposure increases ( 3 - 2 > 0) CoefficientsEstimateLowerUpper 22 Sun exposure (Men) Sun Protection.754.647891 33 Sun exposure (Women) Sun Protection1.2371.1231.385 11 Men Sun Protection -.166 -.198 -.139 Results: AMOS ULS
51
51 LV estimation using PLS (Mode A, Fornell’s normalisation) Example for Sun Protection:
52
52 Sun protection over the past year score *SPF: Sun protection factor + 0.24If sun protection products used while sun tanning + 0.20If sun protection products used throughout voluntarily sun exposure periods + 0.22If sun protection products applied several times during sun exposure periods + 0.14If the sun protection product used for the face has a SPF* over 15 + 0.09If the sun protection product used for the body has a SPF* over 15 + 0.11If sun protection products used besides voluntarily sun exposure periods Results
53
53 Lifetime sun exposure score * Median value of the duration was used as a threshold for dichotomisation 0.14If sun exposure of the body and the face 0.11If sun exposure between 11 a.m. and 4 p.m. 0.07If basking in the sun is declared important or extremely important 0.20If self-assessed intensity of sun exposure is declared moderate or severe 0.10If sun exposure during practice of mountain sports 0.05If the number of days of lifetime mountain sports activities > 200 days* 0.06If sun exposure during practice of nautical sports 0.03If the number of days of lifetime nautical sports activities > 400 days* 0.13If sun exposure during practice of hobbies 0.07If the number of days of lifetime hobby activities > 900 days* 0.03If practice of naturism during lifetime Results
54
54 Coefficients EstimateLowerUpper 22 Sun exposure (Men) Sun Protection.754.647.891 33 Sun exposure (Women) Sun Protection1.2371.1231.385 11 Men Sun Protection -.166 -.198 -.139 Standard Parameter Estimate Error t Value Pr > |t| Intercept 0.0729460737 B 0.01213456 6.01 <.0001 score_x1_protect 0.2473795070 B 0.02574722 9.61 <.0001 GENRE Femmes 0.1269948620 B 0.01557730 8.15 <.0001 GENRE Hommes 0.0000000000 B... score_x1_protec*GENRE Femmes 0.1613712617 B 0.03316612 4.87 <.0001 score_x1_protec*GENRE Hommes 0.0000000000 B... Main effect Gender + Interaction Sun exposure*Gender are highly significant. Results: analysis of covariance
55
55 When mode A is chosen, outer LV estimates using Covariance-based SEM (ULS or ML) or Component based SEM (PLS) are always very close. It is possible to mimic PLS with a covariance-based SEM software (McDonald,1996, Tenenhaus, 2001). Covariance-based SEM authorizes to implement constraints on the model parameters. This is impossible with PLS. Conclusion 1: SEM-ULS > PLS
56
56 When SEM-ULS does not converge or does not give an admissible solution, PLS is an attractive alternative. PLS offers many optimization criterions for the LV search (but rigorous proofs are still to be found). PLS still works when the number of MV is very high and the number of cases very small. The new software XLSTAT-PLSPM will be available at the beginning of 2008. Conclusion 2: PLS > SEM-ULS
57
57 References - M. Tenenhaus, E. Mauger, C. Guinot : « Use of ULS-SEM and PLS-SEM to measure interaction effect in a regression model relating two blocks of binary variables » in Handbook of Partial Least Squares (PLS): Concepts, Methods and Applications (V. Esposito Vinzi, J. Henseler, W. Chin, H. Wang, Eds), Springer, 2007. - M. Tenenhaus : « A bridge between PLS path modeling and ULS-SEM » PLS’07, Oslo. - Tenenhaus M., Esposito Vinzi V., Chatelin Y.-M., Lauro C. (2005) : « PLS path modeling » Computational Statistics & Data Analysis, 48, 159-205.
58
Final conclusion « All the proofs of a pudding are in the eating, not in the cooking ». William Camden (1623)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.