CONFORMANCE CHECKING IN THE LARGE: PARTITIONING AND TOPOLOGY Jorge Munoz-Gama, Josep Carmona and Wil M.P. van der Aalst.

Slides:



Advertisements
Similar presentations
Cuts, Trees, and Electrical Flows Aleksander Mądry.
Advertisements

From Local Patterns to Global Models: Towards Domain Driven Educational Process Mining Nikola Trčka Mykola Pechenizkiy.
Class 12: Communities Network Science: Communities Dr. Baruch Barzel.
Introduction to Algorithms Quicksort
Benchmarking traversal operations over graph databases Marek Ciglan 1, Alex Averbuch 2 and Ladialav Hluchý 1 1 Institute of Informatics, Slovak Academy.
Advanced Iso-Surfacing Algorithms Jian Huang, CS594, Spring 2002 This set of slides are developed and used by Prof. Han-Wei Shen at Ohio State University.
Jorge Muñoz-Gama Josep Carmona
Nanxi Kang Princeton University
/faculteit technologie management 1 Process Mining: Organizational and Conformance Mining Algorithms Ana Karla Alves de Medeiros Ana Karla Alves de Medeiros.
Software Design Process A Process is a set of related and (sequenced) tasks that transforms a set of input to a set of output. Inputs Outputs Design Process.
Process discovery: Inductive Miner
Aligning Event Logs and Process Models for Multi- perspective Conformance Checking: An Approach Based on ILP Massimiliano de Leoni Wil M. P. van der Aalst.
Models vs. Reality dr.ir. B.F. van Dongen Assistant Professor Eindhoven University of Technology
1 Chapter 7 Network Flow Slides by Kevin Wayne. Copyright © 2005 Pearson-Addison Wesley. All rights reserved.
Mining Declarative Models using Intervals Jan Martijn van der Werf Ronny Mans Wil van der Aalst.
Block-Structured Process Discovery: Filtering Infrequent Behaviour Sander Leemans Dirk Fahland Wil van der Aalst Eindhoven University of Technology.
Leting Wu Xiaowei Ying, Xintao Wu Dept. Software and Information Systems Univ. of N.C. – Charlotte Reconstruction from Randomized Graph via Low Rank Approximation.
Rectangle Visibility Graphs: Characterization, Construction, Compaction Ileana Streinu (Smith) Sue Whitesides (McGill U.)
1 Analysis of workflows a-priori and a-posteriori analysis Wil van der Aalst Eindhoven University of Technology Faculty of Technology Management Department.
A Schedulability-Preserving Transformation of BDF to Petri Nets Cong Liu EECS 290n Class Project December 10, 2004.
Process Mining: An iterative algorithm using the Theory of Regions Kristian Bisgaard Lassen Boudewijn van Dongen Wil van.
More Graph Algorithms Weiss ch Exercise: MST idea from yesterday Alternative minimum spanning tree algorithm idea Idea: Look at smallest edge not.
Winter Retreat Connecting the Dots: Using Runtime Paths for Macro Analysis Mike Chen, Emre Kıcıman, Anthony Accardi, Armando Fox, Eric Brewer
Tools for Planar Networks Grigorios Prasinos and Christos Zaroliagis CTI/University of Patras 3 rd Amore Research Seminar – Oegstgeest, The Netherlands,
1 Refined Search Tree Technique for Dominating Set on Planar Graphs Jochen Alber, Hongbing Fan, Michael R. Fellows, Henning Fernau, Rolf Niedermeier, Fran.
Unraveling Unstructured Process Models Marlon Dumas University of Tartu, Estonia Joint work with Artem Polyvyanyy and Luciano García-Bañuelos Invited Talk,
Software Design Processes and Management
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 FLUTE: Fast Lookup Table Based RSMT Algorithm.
Insuring Sensitive Processes through Process Mining Jorge Munoz-Gama Isao Echizen Jorge Munoz-Gama and Isao Echizen.
Jorge Munoz-Gama Advisor: Josep Carmona December 2014 CONFORMANCE CHECKING AND DIAGNOSIS IN PROCESS MINING.
HIERARCHICAL CONFORMANCE CHECKING OF PROCESS MODELS BASED ON EVENT LOGS Jorge Munoz-Gama, Josep Carmona and Wil M.P. van der Aalst.
Jorge Muñoz-Gama Universitat Politècnica de Catalunya (Barcelona, Spain) Algorithms for Process Conformance and Process Refinement.
An Introduction to Petri Nets Marjan Sirjani Formal Methods Laboratory University of Tehran.
EVENT-BASED REAL-TIME DECOMPOSED CONFORMANCE ANALYSIS Seppe vanden Broucke, Jorge Munoz-Gama, Josep Carmona, Bart Baesens, and Jan Vanthienen CoopIS 2014.
DECOMPOSED CONFORMANCE Jorge Munoz-Gama, Josep Carmona and W.M.P van der Aalst.
Chi-Cheng Lin, Winona State University CS 313 Introduction to Computer Networking & Telecommunication Chapter 5 Network Layer.
Pontificia Universidad Católica de Chile School of Engineering Department of Computer Science A feedback-based framework for process enhancement of causal.
Notes of Rational Related cyt. 2 Outline 3 Capturing business requirements using use cases Practical principles  Find the right boundaries for your.
Efficiency of Alignment-based algorithms B.F. van Dongen Laziness! (Gu)estimation! Implementation effort?
A deterministic near-linear time algorithm for finding minimum cuts in planar graphs Thank you, Steve, for presenting it for us!!! Parinya Chalermsook.
Decision Mining in Prom A. Rozinat and W.M.P. van der Aalst Joosung, Ko.
Data Structures and Algorithms in Parallel Computing Lecture 2.
Alignment-based Precision Checking A. Adriansyah 1, J. Munoz Gamma 2, J. Carmona 2, B.F. van Dongen 1, W.M.P. van der Aalst 1 Tallinn, 3 September 2012.
Decomposing Data-aware Conformance Checking Massimiliano de Leoni, Jorge Munoz-Gama, Josep Carmona, Wil van der Aalst PAGE 0.
Speeding Up Enumeration Algorithms with Amortized Analysis Takeaki Uno (National Institute of Informatics, JAPAN)
"Decomposing Alignment- based Conformance Checking of Data-aware Process Models" Massimiliano de Leoni, Jorge Muñoz-Gama, Josep Carmona, Wil van der Aalst.
BOĞAZİÇİ UNIVERSITY – COMPUTER ENGINEERING Mehmet Balman Computer Engineering, Boğaziçi University Parallel Tetrahedral Mesh Refinement.
Decomposing Replay Problems: A Case Study Eric Verbeek and Wil van der Aalst.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Towards comprehensive support for organizational mining Presenter : Yu-hui Huang Authors : Minseok Song,
Smith’s Aerospace © P. Bailey & K. Vander Linden, 2005 Procedural Activity Patrick Bailey Keith Vander Linden Calvin College.
R-customizers Goal: define relation between graph and its customizers, study domains of adaptive programs, merging of interface class graphs.
Parallelizing Functional Tests for Computer Systems Using Distributed Graph Exploration Alexey Demakov, Alexander Kamkin, and Alexander Sortov
Diagnostic Information for Control-Flow Analysis of Workflow Graphs (aka Free-Choice Workflow Nets) Cédric Favre(1,2), Hagen Völzer(1), Peter Müller(2)
Process Mining – Concepts and Algorithms Review of literature on process mining techniques for event log data.
Discovering Models for State-based Processes M.L. van Eck, N. Sidorova, W.M.P. van der Aalst.
Process discovery Sander Leemans.
Profiling based unstructured process logs
1 Queensland University of Technology, Brisbane, Australia
Exploring processes and deviations
Decomposed Process Mining: The ILP Case
Bart M. P. Jansen June 3rd 2016, Algorithms for Optimization Problems
Dept. of Computer Sciences University of Wisconsin-Madison
Petri nets refresher Prof.dr.ir. Wil van der Aalst
Sorting and recurrence analysis techniques
SPQR Tree.
An Introduction to Petri Nets
Sorting and Divide-and-Conquer
JORGE MUNOZ-GAMA and JOSEP CARMONA
FLUTE: Fast Lookup Table Based RSMT Algorithm for VLSI Design
Presentation transcript:

CONFORMANCE CHECKING IN THE LARGE: PARTITIONING AND TOPOLOGY Jorge Munoz-Gama, Josep Carmona and Wil M.P. van der Aalst

Motivation 2 THEORYREALITY REALITY REFLECTION LOGS

BPM 2013 (Tuesday) BPI Workshop 2013 Community Motivates Itself 3 Process Mining Conformance Checking Conformance Dimensions Alignment based Conformance Big Data Conformance Checking Applications Decomposition RPST BPM 2013 (Wednesday)

Conformance General Idea 4 model Log trace mismatches

Conformance in a Nutshell 5 Log Model A B C D E A B B C Alignment E FitnessPrecision How much behavior of the log is captured by the model? How accurate is the model describing the log? Conformance mismatch on the Log Conformance mismatch on the Model

Conformance in the Large How easy is to diagnose a conformance problem here? How much time it takes? 6

General Idea: Decomposition 7

The 4 Challenges Comprehensible Guaranties Fast Diagnosis 8 ?

SESE and RPST for decomposing 9 SESERPST Single Entry Single Exit components Refined Process Structure Tree * Artem Polyvyanyy: Structuring Process Models. PhD Thesis. University of Potsdam (Germany), January 2012 * Hopcroft, J., Tarjan, R.E.: Dividing a graph into triconnected components. SIAM J. Com- put. 2(3), 1973 Based on graph decomposition

Interior, Boundary, Entry, and Exit nodes Given a subgraph and a node of it: Interior node: connected only to nodes of the subgraph. Boundary node: not interior Entry node: boundary where no incoming edge in subgraph or all outgoing edges in Exit node: boundary where no outgoing edge in subgraph or all incoming edges in 10

Nodes examples 11

Structural Decomposition 12 transition place token A B D E F C

Example of SESE and RPST 13 SESE: set of edges which graph has a Single Entry node and a Single Exit node Refined Process Structure Tree (RPST) containing non overlapping SESEs Unique Modular Linear Time

Why SESE and RPST? Why SESE? Only one entry; only one exit Represent subprocesses within the process Intuitive for conformance diagnosis Why RPST? Partitioning over the RPST Any cut is a partitioning Algorithm to partitioning by size (k) 14

Why SESE and RPST? Why SESE? Only one entry; only one exit Represent subprocesses within the process Intuitive for conformance diagnosis Why RPST? Partitioning over the RPST Any cut is a partitioning Algorithm to partitioning by size (k) 15 K<

The 4 Challenges Comprehensible Guaranties Fast Diagnosis 16 ?

Properties of the Partitioning What about the guaranties in conformance checking? Decomposed Perfectly Fitting Checking: A model/log is perfectly fitting if and only if all the components are perfectly fitting 17 * W.M.P. van der Aalst : Decomposing Petri nets for process mining: A generic approach. Distributed and Parallel Databases, 2013

SESE and Decomposed Perfectly Fitting SESEs (per se) do not satisfy the Decomposed Perfectly Fitting Checking property 18 1 token in p => abcdef fits S but not S2 2 tokens in p => abdecf fits S1 and S2 but not S

Valid Decomposition The problem is in the boundary places No reflection on the log A partition with only transitions shared among components (no places neither arcs) Transitions have reflect on the log Use that reflection to sync the components This is known as a valid decomposition Details in: 19 * W.M.P. van der Aalst : Decomposing Petri nets for process mining: A generic approach. Distributed and Parallel Databases, 2013 * J. Munoz-Gama, J. Carmona, and W.M.P. van der Aalst : Conformance checking in the large: partitioning and topology. BPM 2013

SESE to Valid Decomposition Create a ‘bridge’ for each shared place 20

Results 21 SESEs and Bridges Few with non negligible size

SESE + Bridging Theorem Theorem: SESE decomposition with Bridging post- processing satisfies the Decomposed Perfectly Fitting Checking 22

The 4 Challenges Comprehensible Guaranties Fast Diagnosis 23 ?

Results 1 Net – 1h 15min 7 Subnets – 2min 24

Results 25 Better performance with large, concurrency, long traces and concentrated conformance problems Not always faster: short traces, fitting. Overhead of the decomposition Results even in cases that the original approach can handle

The 4 Challenges Comprehensible Guaranties Fast Diagnosis 26 ?

Results 27 Conformance problems spread Conformance problems concentrated

Topology 28 S1 S4 S2 S3 B1 B2 S7S8 S5 S6

Topology Enhancement 29 S1 S4 S2 S3 B1 B2 S7S8 S5 S6 t1,t3,t4,t5,t7,t7,t9

Topological Diagnosis Algorithms Non-Fitting (Weakly) Connected Components 30 S1 S4 S2 S3 B1 B2 S7S8 S5 S6 Non-Fitting Subnet

Topological Diagnosis in Large 31

Topological Results 32 From almost 700 nodes … … to 70

The 4 Challenges Comprehensible Guaranties Fast Diagnosis 33

Future Work Estimate fitness Not decisional but metric Divide-and-Conquer Alignment Algorithms Reconstruct the alignment New decompositions Less trivial components New conformance dimensions Precision, generalization, … 34

Conclusions Partitioning Technique for Conformance Checking Based on SESE and RPST May be faster, distributed, and help on the diagnosis Topology for diagnosis Implemented in ProM framework. 35

Thank You

Process Mining in a Nutshell 37 THEORYREALITY

Process Mining in a Nutshell 38 THEORYREALITY REALITY REFLECTION LOGS