1 740f02classsize18 The Confounding Effect of Class Size on the Validity of Object- Oriented Metrics Khaled El Eman, etal IEEE TOSE July 01.

Slides:



Advertisements
Similar presentations
Sociology 680 Multivariate Analysis Logistic Regression.
Advertisements

Brief introduction on Logistic Regression
Fig. 4-1, p Fig. 4-2, p. 109 Fig. 4-3, p. 110.
CLUSTERING SUPPORT FOR FAULT PREDICTION IN SOFTWARE Maria La Becca Dipartimento di Matematica e Informatica, University of Basilicata, Potenza, Italy
HSRP 734: Advanced Statistical Methods July 24, 2008.
Slide 1Fig. 22.1, p.669. Slide 2Fig. 22.3, p.670.
Slide 1Fig. 17.1, p.513. Slide 2Table 17.1, p.514.
Towards Logistic Regression Models for Predicting Fault-prone Code across Software Projects Erika Camargo and Ochimizu Koichiro Japan Institute of Science.
1 ECE 453 – CS 447 – SE 465 Software Testing & Quality Assurance Case Studies Instructor Paulo Alencar.
1 Empirical Validation of Three Software Metrics Suites to Predict Fault-Proneness of Object-Oriented Classes Developed Using Highly Iterative or Agile.
Metrics. Basili’s work Basili et al., A Validation of Object- Oriented Design Metrics As Quality Indicators, IEEE TSE Vol. 22, No. 10, Oct. 96 Predictors.
Prediction of fault-proneness at early phase in object-oriented development Toshihiro Kamiya †, Shinji Kusumoto † and Katsuro Inoue †‡ † Osaka University.
P.464. Table 13-1, p.465 Fig. 13-1, p.466 Fig. 13-2, p.467.
P449. p450 Figure 15-1 p451 Figure 15-2 p453 Figure 15-2a p453.
Fig. 11-1, p p. 360 Fig. 11-2, p. 361 Fig. 11-3, p. 361.
Analysis of CK Metrics “Empirical Analysis of Object-Oriented Design Metrics for Predicting High and Low Severity Faults” Yuming Zhou and Hareton Leung,
Table 3 Predicting Time 3 Religiosity from Time 1 Religiosity and College Major at Time 1 LISREL Models (Z – ratios in parentheses)
Confidence Estimation for Machine Translation J. Blatz et.al, Coling 04 SSLI MTRG 11/17/2004 Takahiro Shinozaki.
Table 6-1, p Fig. 6-1, p. 162 p. 163 Fig. 6-2, p. 164.
SW388R7 Data Analysis & Computers II Slide 1 Logistic Regression – Hierarchical Entry of Variables Sample Problem Steps in Solving Problems.
Voting Behavior of Naturalized Citizens: Sarah R. Crissey Thom File U.S. Census Bureau Housing and Household Economic Statistics Division Presented.
Business Research Method Measurement, Scaling, Reliability, Validity
Chidamber & Kemerer Suite of Metrics
Statistics for clinical research An introductory course.
Defect Management Defect Injection and Removal
Japan Advanced Institute of Science and Technology
Amsterdam Rehabilitation Research Center | Reade Multiple regression analysis Analysis of confounding and effectmodification Martin van de Esch, PhD.
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
1 OO Metrics-Sept2001 Principal Components of Orthogonal Object-Oriented Metrics Victor Laing SRS Information Services Software Assurance Technology Center.
Business Intelligence and Decision Modeling Week 11 Predictive Modeling (2) Logistic Regression.
A Validation of Object-Oriented Design Metrics As Quality Indicators Basili et al. IEEE TSE Vol. 22, No. 10, Oct. 96.
The influence of developer quality on software fault- proneness prediction Yansong Wu, Yibiao Yang, Yangyang Zhao,Hongmin Lu, Yuming Zhou,Baowen Xu 資訊所.
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Logistic Regression July 28, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Research Heaven, West Virginia FY2003 Initiative: Hany Ammar, Mark Shereshevsky, Walid AbdelMoez, Rajesh Gunnalan, and Ahmad Hassan LANE Department of.
Analysis of Case Control Studies E – exposure to asbestos D – disease: bladder cancer S – strata: smoking status.
Logistic Regression. Linear Regression Purchases vs. Income.
AMMBR II Gerrit Rooks. Checking assumptions in logistic regression Hosmer & Lemeshow Residuals Multi-collinearity Cooks distance.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
1 Chapter 15 Data Analysis: Basic Questions © 2005 Thomson/South-Western.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Logistic Regression Analysis Gerrit Rooks
Object-Oriented (OO) estimation Martin Vigo Gabriel H. Lozano M.
Exponential Functions. When do we use them? Exponential functions are best used for population, interest, growth/decay and other changes that involve.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
CSE 5331/7331 F'07© Prentice Hall1 CSE 5331/7331 Fall 2007 Regression Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Statistical Comments on Retrospective Analysis Girish Aras, Ph.D. Jonathan Ma, Ph.D. Center for Drug Evaluation and Research, FDA.
Instructor Resource Chapter 12 Copyright © Scott B. Patten, 2015.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
5. Evaluation of measuring tools: reliability Psychometrics. 2011/12. Group A (English)
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
A Parametrized Strategy of Gatekeeping, Keeping Untouched the Probability of Having at Least One Significant Result Analysis of Primary and Secondary Endpoints.
Logistic Regression: Regression with a Binary Dependent Variable.
Fig. 6-CO, p p. 185a p. 185b p. 185c p. 185d.
Logistic Regression When and why do we use logistic regression?
LOGISTIC REGRESSION 1.
Mingyu Feng Neil Heffernan Joseph Beck
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Logistic Regression.
Figure 11-1.
Fig. 6-CO, p. 211.
07CO, p. 190.
Figure Overview.
Figure Overview.
Flowchart of Manuscript Selection
Clinical prediction models
Case-control studies: statistics
Presentation transcript:

1 740f02classsize18 The Confounding Effect of Class Size on the Validity of Object- Oriented Metrics Khaled El Eman, etal IEEE TOSE July 01

2 740f02classsize18 Terminology u Confounding u Empirical u Binary classification u univariate u Type I error u Type II error u Odds ratio

3 740f02classsize18 Odds Ratio

4 740f02classsize18 Unmatched case-control study

5 740f02classsize18 Size Confounding – fig 2

6 740f02classsize18 Table 2 and 3

7 740f02classsize18 Testing for confounding u Fig 3 u If association is not significant, assume no confounding – wrong

8 740f02classsize18 Figure 3

9 740f02classsize18 Testing for confounding u Do logistic regression u If the regression coefficient change dramatically with and without, then strong indication of confounding

10 740f02classsize18 Figure 4

11 740f02classsize18 Table 4

12 740f02classsize18 Table 5

13 740f02classsize18 Discussion “The results that we obtained are remarkably consistent.They indicate that, for some of the product metrics that we studied, the univariate analyses demonstrate that they are indeed associated with fault- proneness (namely, WMC, CBO, RFC, and NMA). After controlling for the size confounder, all of these associations disappear, and the differences in the product metrics' odds ratios indicates a strong and classic confounding effect. The remaining product metrics were not associated with fault-proneness from a univariate analysis, but this was predictable from previous work.” p642

14 740f02classsize18 Conclusion u Table 5 u What is the conclusion of the article?

15 740f02classsize18 Class Discussion u What kind of experiment should be used for these studies?