Neo4j for Process Mining

Slides:



Advertisements
Similar presentations
Practical Business Modeling in the Unified Process Tom Morgan Software Architect, Fidelity National Information Services
Advertisements

Creating Architectural Descriptions. Outline Standardizing architectural descriptions: The IEEE has published, “Recommended Practice for Architectural.
Department of Computer Science 1 CSS 496 Business Process Re-engineering for BS(CS)
A university for the world real R © 2009, Chapter 17 Process Mining and Simulation Moe Wynn Anne Rozinat Wil van der Aalst Arthur.
A university for the world real R © 2009, Chapter 23 Epilogue Wil van der Aalst Michael Adams Arthur ter Hofstede Nick Russell.
Computer System Analysis Chapter 10 Structuring System Requirements: Conceptual Data Modeling Dr. Sana’a Wafa Al-Sayegh 1 st quadmaster University of Palestine.
Understanding Data Analytics and Data Mining Introduction.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
BPMN By Hosein Bitaraf Software Engineering. Business Process Model and Notation (BPMN) is a graphical representation for specifying business processes.
Use Cases College of Alameda Copyright © 2007 Patrick McDermott.
Week 5: Business Processes and Process Modeling MIS 2101: Management Information Systems.
Information Systems in Organizations 2.1 Analyzing organizations as systems and processes.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
1 Chapter 5 Modeling System Requirements Finding the Use Cases Page
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
CSPC 464 Fall 2014 Son Nguyen.  Attendance/Roster  Introduction ◦ Instructor ◦ Students  Syllabus  Q & A.
ING XBRL Proof of Concept July 19, ©2005 page 2. Utilizing XBRL  ING Objectives  Benefits  Goals  Proof of Concept Plan  Stat  USGAAP  Pain.
Maikel Leemans Wil M.P. van der Aalst. Process Mining in Software Systems 2 System under Study (SUS) Functional perspective Focus: User requests Functional.
Data Mining Introduction to data mining concepts.
Information and Information Technology 1. Information and employment 2.
Information ITIL Technology Infrastructure Library ITIL.
Logic Models How to Integrate Data Collection into your Everyday Work.
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
Change Request Management
Auditing Concepts.
Information Systems in Organizations 2
Welcome to M301 P2 Software Systems & their Development
Software Quality Control and Quality Assurance: Introduction
Just What Are Processes, Anyway?
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
Profiling based unstructured process logs
Chapter 6: Database Project Management
Fundamentals of Information Systems, Sixth Edition
Introduction Characteristics Advantages Limitations
Object-Oriented Software Engineering Using UML, Patterns, and Java,
Introduction to Control Flow Patterns and BizAgi
David Shepherd, Zachary P. Fry, Emily Hill, Lori Pollock, and K
Coordinating & Scheduling
Week 10: Object Modeling (1)Use Case Model
Information Technology Project Management – Fifth Edition
ServiceNow Implementation Knowledge Management
OnContact CRM Customer Relationship Management
Data Warehouse.
MANAGEMENT INFORMATION SYSTEM MEHTAP PARLAK Industrial Engineering Department, Dokuz Eylul University, Turkey 1.
De-mystifying Big Data Testing using new generation tools / technology
Chapter 16 Nursing Informatics: Improving Workflow and Meaningful Use
Database Design Using the REA Data Model
Customer Relationship Management
Communicate the Impact of Poor Cost Information on a Decision
Communicate the Impact of Poor Cost Information on a Decision
Information Systems in Organizations 2
AUDIT AND VALIDATION TESTING FOR BIG DATA APPLICATIONS
Communicate the Impact of Poor Cost Information on a Decision
Data Quality By Suparna Kansakar.
Measuring Data Quality and Compilation of Metadata
Communicate the Impact of Poor Cost Information on a Decision
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
An Introduction to Data Warehousing
BPMN - Business Process Modeling Notations
Chapter 13 Quality Management
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Web Mining Department of Computer Science and Engg.
Tools for Implementation
Tools for Implementation
Object-Oriented Software Engineering
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
Information Systems in Organizations 2. 1
UML Design for an Automated Registration System
Lecture 10 Structuring System Requirements: Conceptual Data Modeling
Information Systems in Organizations 2
Presentation transcript:

Neo4j for Process Mining Big Data and Hadoop Users Group of West Michigan Gordon Food Service 3/9/2016

Agenda Introduction What is a Process What is a Process Model What is Process Mining Data Prospecting with Neo4j Tool Demo

Introduction In the domain of information systems it is very difficult to see a product let alone determine how efficiently it was developed. In most cases we are left with textual information to understand and make assessments of a process.

*Better ways to capture, visualize, and analyze processes* Introduction The purpose of this proof of concept is to explore the application of graph database techniques in conjunction with process mining to improve the assessment of software development processes. *Better ways to capture, visualize, and analyze processes*

What is a Process?

*There are as many definitions of process as there are people* What is a Process? A series of actions or steps taken in order to achieve a particular end. A collection of linked tasks which find their end in the delivery of a service or product to a client. A set of activities and tasks that, once completed, will accomplish an organizational goal. *There are as many definitions of process as there are people*

Definition for Presentation A process comprises an identified body of work that can be characterized as a set of related activities or a defined sequence of steps and decisions The process name is in action verb - noun form (usually) The name is singular The result must be discrete and countable The result is what the customer of the process wants The process is initiated by a triggering event that could be an action, time, or a condition Workflow Modeling: Tools for Process Improvement and Application Development, Sharp & McDermott

Process Examples Suggested Process What We Call It Why Not a Process Customer Relationship Management Process Area Doesn’t deliver a single, specific result; a set of related business processes meeting an overall objective Acquire New Customer Business Process Delivers a single, specific result, and meets all other criteria. An “end-to-end business process Assess Prospect Financial Status or Setup Customer Sub process Too small - both deliver specific results, but are intermediate results in an E2E process Calculate Customer Credit Limit or Create Customer Account Activity, Step, Task Much too small - a part of a sub process. Possibly described in a procedure or use case. Workflow Modeling: Tools for Process Improvement and Application Development, Sharp & McDermott

What is a Process Model?

BPMN Basic Elements Activities Gateways Events Data

BPMN Pools White Box Pools Black Box Pools Pool with Swimlanes

BPMN Pool Communication

What is Process Mining?

Process Mining Process Mining Discovery, Conformance, and Enhancement of Business Processes, Wil M. P. Van Der Aalst

Three types of Process Models

Event Logs A process consists of cases A case consists of events such that each event relates to one case Events within a case are ordered Events can have attributes Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS

Play-Out Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS Process Mining Discovery, Conformance, and Enhancement of Business Processes, Wil M. P. Van Der Aalst

Play-In Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS Process Mining Discovery, Conformance, and Enhancement of Business Processes, Wil M. P. Van Der Aalst

Play-In Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS Process Mining Discovery, Conformance, and Enhancement of Business Processes, Wil M. P. Van Der Aalst

Replay Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS Process Mining Discovery, Conformance, and Enhancement of Business Processes, Wil M. P. Van Der Aalst

Replay Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS Process Mining Discovery, Conformance, and Enhancement of Business Processes, Wil M. P. Van Der Aalst

Data Prospecting with Neo4j

General Approach Given that the artifacts from the SLDC are recorded and linked in JIRA (i.e. issue tracker) they can be viewed as a graph Find a promising project in this case Wholesale Order Capture (WOC) and load the issues with their associate links into a labeled graph (i.e. Neo4J) Use Cypher queries to assess the “Prospect” for developing event logs Remediate good prospects (i.e. if possible) Extract the event log from the graph database and load it into Prom

Traceability Network Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS

Graph Model Set the context / expectation for the presentation by discussing current level of analytics maturity at GFS

Challenges Compiling Event Logs Picking events and activities for the model Finding good the timestamps Assessing the relevancy and accuracy of the timestamps Relating and ordering events and activities across systems

Neo4j Demo

ProM Demo

Conclusions While the WOC project was very successful from a project context, it was not as successful in a process improvement context Controls for workflow and traceability between issues needs to be improved Neo4j is a very effective tool for prospecting and processing event logs Process mining of the SDLC is feasible but will take significant effort on the data side (no surprise here !)