Detection and Prediction of Errors in EPC Business Process Models

Slides:



Advertisements
Similar presentations
A core course on Modeling kees van Overveld Week-by-week summary.
Advertisements

Evaluation of Abstraction Techniques. Uses for the complexity metrics in our framework Comparing the complexity of the reference model with the abstracted.
Towards Workflow Pattern Support of Event-Driven Process Chains (EPC) Jan Mendling, Gustaf Neumann Dept. of IS and New Media, WU Wien, Austria Markus Nüttgens.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
System integrity The term system integrity has the following meanings: That condition of a system where in its specified operational and technical parameters.
Outliers Split-sample Validation
Psychology 202b Advanced Psychological Statistics, II February 10, 2011.
Chapter 13 Additional Topics in Regression Analysis
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Additional Topics in Regression Analysis
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Stat 112: Lecture 8 Notes Homework 2: Due on Thursday Assessing Quality of Prediction (Chapter 3.5.3) Comparing Two Regression Models (Chapter 4.4) Prediction.
T-test.
Predictive Analysis in Marketing Research
Multiple Regression – Basic Relationships
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
A university for the world real R © 2009, Chapter 14 EPCs Jan Mendling.
Measurement and Data Quality
CSCI 347 / CS 4206: Data Mining Module 06: Evaluation Topic 01: Training, Testing, and Tuning Datasets.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
Model of Prediction Error in Chaotic and Web Driven Business Environment Franjo Jović*, Alan Jović ** * Faculty of Electrical Engineering, University of.
Software Engineering Modern Approaches
Math 260 Final Project //name //major //picture (optional)
1 6.9 Exponential, Logarithmic & Logistic Models In this section, we will study the following topics: Classifying scatter plots Using the graphing calculator.
Correlation and Prediction Error The amount of prediction error is associated with the strength of the correlation between X and Y.
Mathcad Enabling of Engineering e-Content on Knovel ASEE 2009 Annual Conference Austin, TX Sasha Gurke SVP & Co-Founder, Knovel Corp.
Advanced Precalculus Notes 4.9 Building Exponential, Logarithmic, and Logistic Models.
Generating Correct EPCs from Configured C-EPCs* Jan MendlingVienna University of Economics Jan ReckerQueensland University of Technology Michael RosemannQueensland.
Today Ensemble Methods. Recap of the course. Classifier Fusion
8.1.4 Can it still be factored? Factoring Completely I can factor out a common factor.
Forecasting in CPT Simon Mason Seasonal Forecasting Using the Climate Predictability Tool Bangkok, Thailand, 12 – 16 January 2015.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Business Process Design by View Integration J. MendlingVienna University of Economics C. SimonUniversity of Koblenz.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Linear Discriminant Analysis and Logistic Regression.
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
5 September 2002AIAA STC Meeting, Santa Fe, NM1 Verification and Validation for Computational Solid Mechanics Presentation to AIAA Structures Technical.
Data Mining Practical Machine Learning Tools and Techniques By I. H. Witten, E. Frank and M. A. Hall Chapter 5: Credibility: Evaluating What’s Been Learned.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
A core Course on Modeling Introduction to Modeling 0LAB0 0LBB0 0LCB0 0LDB0 S.29.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
Learning: Neural Networks Artificial Intelligence CMSC February 3, 2005.
Logistic Regression: Regression with a Binary Dependent Variable.
A Discourse on Complexity of Process Models J. CardosoUniversidade da Madeira J. MendlingVienna University of Economics G. NeumannVienna University of.
Learning with Neural Networks Artificial Intelligence CMSC February 19, 2002.
1 Alternative Process Modeling langugues UML activity diagrams Event-driven process chains System-specific languages like Staffware will follow later...
Outlier Detection Identifying anomalous values in the real- world database is important both for improving the quality of original data and for reducing.
Regression Analysis Part D Model Building
Parts of a Lab Write-up.
Detection and Prediction of Errors in EPC Business Process Models
Broadband Number of connections at 6 monthly intervals from Jan 1st y (000s) x
Correlation and Regression
BA 275 Quantitative Business Methods
11/29/2018 Department of Software Engineering and IT Engineering
Types of Control I. Measurement Control II. Statistical Control
A graphing calculator is required for some problems or parts of problems 2000.
Analytics: Its More than Just Modeling
Standardized versus Unstandardized Regression Coefficients
IQA 2018 QA Basics September 11, 2018 Day 1 Workshop 1.
Section 5 Multiple Regression.
Product moment correlation
Time Series Forecasting Accelerator
Choosing a Model pages 563–566 Exercises 1. quadratic 2. linear 3.
Ch 4.1 & 4.2 Two dimensions concept
Faulty EPCs in the SAP Reference Model
Chapter 8, part C..
Presentation transcript:

Detection and Prediction of Errors in EPC Business Process Models Jan Mendling WU Wien Austria

The Business Process Management Lifecycle

Business Process Model Quality and Errors The need for appropriately qualified process modelers increases with the size of the initiative as it becomes important that adequate quality assurance procedures are inplace. It is not possible to control the different quality aspects after the models are designed, if on a single day 100+ hours are spent on designing new models. (Rosemann 2006) The cost of errors increases exponentially over the development lifecycle: it is more than 100 times more costly to correct a defect post-implementation than it is to correct it during requirements analysis. (Boehm 1981, Moody 2005)

Agenda EPC Business Process Models Verification of EPC Soundness Metrics for Business Process Models Prediction of Errors based on Metrics

Example EPC process model

EPC Functions and Events

EPC Connectors AND-Join AND-Split XOR-Split OR-Split OR-Join XOR-Join

EPC Behavior d w

Problems with Connector Mismatch

EPC Soundness Start and End Events

Reachability Graph Calculation State Explosion Problem: 348 = 79,766,443,076,872,509,863,361 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 41 43 47 34 27 28 19 35 42 44 48 20 37 21 45 46 38 29 36 24 22 39 26 25 30 40 31 32 33

Reduction Rules

Tool Support for Soundness Verification Reduction Rules with xoEPC Reachability Graph Calculation with ProM

Formal Errors

Verification Performance for SAP Ref.-Mod.

Verification Performance for SAP Ref.-Mod. II

Verification Performance for SAP Ref.-Mod. III

Model Metrics: Coefficient of Network Connectivity (CNC) = 1.043

Model Metrics: Connector Mismatch (MM) = 14

Model Metrics: Cyclicity (CYC) = 0

Model Metrics: Separability (Pi) = 0.455

Model Metrics: Structuredness (Phi) = 0.652

Model Metrics: Connector Heterogeneity (CH) = 0.873

Model Metrics: Diameter 1 2 3 = 14 4 5 6 7 8 9 10 11 12 13 14

Metrics and Errors for a Sample of 2003 EPCs rPhi,hasError= -0,36 rCH,hasError= 0,46

Logistic Regression Results

Results

Error Prediction = 0.811 > 0.5

Holdout Sample Results 113 EPCs from books by Scheer, Becker & Schütte, Staud, and Seidlmeier Accuracy interval for prediction function 81.15% to 96.77% with 95% confidence

Contributions Formalization of the OR-Join Verification of Process Models with OR-Joins and multiple Start- and End-Events Metrics for Business Process Models Validation of Metrics as Error Predictors

Discussion Importance of Verification Business Process Modeling Process Business Process Modeling Tools Teaching of Process Modeling