Beyond Tasks and Gateways: Discovering BPMN Models with subprocesses, boundary events and activity markers Raffaele Conforti, Marcello La Rosa Queensland University of Technology Marlon Dumas, Luciano García-Bañuelos University of Tartu 1 BPM’2014 Conference, Eindhoven 11 September 2014
2 CIDTaskTime Stamp… 13219Enter Loan Application T 11:20: Retrieve Applicant Data T 11:22: Enter Loan Application T 11:22: Compute Installments T 11:22: Notify Eligibility T 11:23: Approve Simple Application T 11:24: Compute Installements T 11:24:35- ………… Automated Process Discovery
What’s the catch?
There you are!
Automated Process Discovery: Handling Complexity Filter Filter out “irrelevant” events (tasks) Filter out “irrelevant” traces Abstract Zoom into most frequent tasks or paths Extract subprocesses Divide Divide log by variants based on similarity (trace clustering) Discover multiple process models rather than one 5
Bose, Veerbeck & van det Aalst: Discovering Hierarchical Process Models using ProM Related Work: ProM two-phase miner
ProM Two-Phase Miner 7 Instead of… Produces this…
Before 8
After 9
What’s the catch?
Data! 11
Extracting the Process Hierarchy Extract event tables Find primary keys Find foreign keys Cluster event types Split log per cluster 12 flat log log hierarchy TimePOIDAtt2Att :121…… :532…… ………… ReceivePO TimeMOIDPOIDAtt :1311… :3122… …2…… CreateMO ShipPO TimePOIDAtt :11 1… …2… ………
… the rest Discover one model per (sub- )process Identify interrupting boundary events Identify interrupting timer events Identify event subprocesses Identify loop/multi- instance markers 13 -Heuristics miner -ILP -Inductive Miner -Fodina Heuristics
Evaluation Setup Four flat process discovery algos Heuristics, ILP, Inductive (H), Fodina Each algo with and without BPMN-Miner Quality measures Accuracy: Fitness, precision, F-score Understandability: Size, Control-Flow Complexity, … LogsTracesEventsEvent typesEvents/type IWT (FRIS) Insurance Order-to-cash
Evaluation - Results 15
Side-Effect: Correct Models 16
What’s Next Standalone tool implementation Currently in ProM nightly build Further evaluation Logs with larger number of event types Noise resilience Missing events can trick foreign key discovery Further enrichment Event-based gateways, more BPMN events… Adding data conditions, completion conditions, … 17