Querying Business Processes Under Models of Uncertainty Daniel Deutch, Tova Milo Tel-Aviv University ERP HR System eComm CRM Logistics Customer Bank Supplier.

Slides:



Advertisements
Similar presentations
Database System Concepts and Architecture
Advertisements

Programming Types of Testing.
NP-complete and NP-hard problems Transitivity of polynomial-time many-one reductions Concept of Completeness and hardness for a complexity class Definition.
An Introduction to Variational Methods for Graphical Models.
Efficient Query Evaluation on Probabilistic Databases
Introduction of Probabilistic Reasoning and Bayesian Networks
A Visual Query Language for Business Processes Catriel Beeri Hebrew University Anat Eyal, Simon Kamenkovich, Tel Aviv University Tova Milo.
Paper by: A. Balmin, T. Eliaz, J. Hornibrook, L. Lim, G. M. Lohman, D. Simmen, M. Wang, C. Zhang Slides and Presentation By: Justin Weaver.
Querying Disjunctive Databases in Polynomial Time Lars Olson Masters Thesis Brigham Young University Supported by NSF Grant #
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
Academic Advisor: Prof. Ronen Brafman Team Members: Ran Isenberg Mirit Markovich Noa Aharon Alon Furman.
Application architectures
Reducing Human Interactions in Web Directory Searches ORI GERSTEL - Cisco SHAY KUTTEN - Technion EDUARDO SANY LABER - PUC-Rio RACHEL MATICHIN and DAVID.
Computational Complexity, Physical Mapping III + Perl CIS 667 March 4, 2004.
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
Expressing Giotto in xGiotto and related schedulability problems Class Project Presentation Concurrent Models of Computation for Embedded Software University.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Introduction & Overview CS4533 from Cooper & Torczon.
2  Problem Definition  Project Purpose – Building Obfuscator  Obfuscation Quality  Obfuscation Using Opaque Predicates  Future Planning.
Application architectures
Query Processing Presented by Aung S. Win.
Efficient Query Evaluation over Temporally Correlated Probabilistic Streams Bhargav Kanagal, Amol Deshpande ΗΥ-562 Advanced Topics on Databases Αλέκα Σεληνιωτάκη.
Interpreting the data: Parallel analysis with Sawzall LIN Wenbin 25 Mar 2014.
Implementation Yaodong Bi. Introduction to Implementation Purposes of Implementation – Plan the system integrations required in each iteration – Distribute.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
Estimating Entropy for Data Streams Khanh Do Ba, Dartmouth College Advisor: S. Muthu Muthukrishnan.
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
Finding Optimal Probabilistic Generators for XML Collections Serge Abiteboul, Yael Amsterdamer, Daniel Deutch, Tova Milo, Pierre Senellart BDA 2011.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
©Silberschatz, Korth and Sudarshan13.1Database System Concepts Chapter 13: Query Processing Overview Measures of Query Cost Selection Operation Sorting.
Join Synopses for Approximate Query Answering Swarup Achrya Philip B. Gibbons Viswanath Poosala Sridhar Ramaswamy Presented by Bhushan Pachpande.
Diversified Top-k Graph Pattern Matching 1 Yinghui Wu UC Santa Barbara Wenfei Fan University of Edinburgh Southwest Jiaotong University Xin Wang.
RESOURCES, TRADE-OFFS, AND LIMITATIONS Group 5 8/27/2014.
Query Optimization Chap. 19. Evaluation of SQL Conceptual order of evaluation – Cartesian product of all tables in from clause – Rows not satisfying where.
Swarup Acharya Phillip B. Gibbons Viswanath Poosala Sridhar Ramaswamy Presented By Vinay Hoskere.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.
Simultaneously Learning and Filtering Juan F. Mancilla-Caceres CS498EA - Fall 2011 Some slides from Connecting Learning and Logic, Eyal Amir 2006.
Monitoring Business Processes with Queries VLDB2007 CatrielBeeri, AnatEyal, Tova Milo, AlonPilberg Summarized by Gong GI Hyun, IDS Lab., Seoul.
Technology Mapping. 2 Technology mapping is the phase of logic synthesis when gates are selected from a technology library to implement the circuit. Technology.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.
Strings Basic data type in computational biology A string is an ordered succession of characters or symbols from a finite set called an alphabet Sequence.
A Brief Maximum Entropy Tutorial Presenter: Davidson Date: 2009/02/04 Original Author: Adam Berger, 1996/07/05
1 An infrastructure for context-awareness based on first order logic 송지수 ISI LAB.
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
TU/e Algorithms (2IL15) – Lecture 9 1 NP-Completeness NOT AND OR AND NOT AND.
Chapter 13: Query Processing
Written By: Presented By: Swarup Acharya,Amr Elkhatib Phillip B. Gibbons, Viswanath Poosala, Sridhar Ramaswamy Join Synopses for Approximate Query Answering.
General Architecture of Retrieval Systems 1Adrienn Skrop.
An Algorithm to Learn the Structure of a Bayesian Network Çiğdem Gündüz Olcay Taner Yıldız Ethem Alpaydın Computer Engineering Taner Bilgiç Industrial.
Efficient Evaluation of XQuery over Streaming Data
A paper on Join Synopses for Approximate Query Answering
Abstract Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for.
Patterns extraction from process executions
Probabilistic Data Management
Database System Concepts and Architecture
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Objective of This Course
Sungho Kang Yonsei University
Efficient Subgraph Similarity All-Matching
Resource Allocation for Distributed Streaming Applications
Computational Advertising and
Presentation transcript:

Querying Business Processes Under Models of Uncertainty Daniel Deutch, Tova Milo Tel-Aviv University ERP HR System eComm CRM Logistics Customer Bank Supplier

Querying BPs Under Models of Uncertainty2 Outline  Introduction & Motivation  External Events  Partial Tracing  Related and Future work

Querying BPs Under Models of Uncertainty3  Introduction & Motivation  External Events  Partial Tracing  Related and Future work Outline

Querying BPs Under Models of Uncertainty4  Web-based Business Processes are very popular  Querying possible \ likely execution flows of such applications  Allows for optimization, personalized ads, improved business logic,…  Queries over specification structure  Database approach to specification analysis Querying Business Process Introduction and Motivation

Querying BPs Under Models of Uncertainty5 Example BP Introduction and Motivation

Querying BPs Under Models of Uncertainty6  BP specifications are compiled to running code  Modeled as nested DAGs Each DAG corresponds to a function\web-page  Node pairs model activities activation and completion Edges mark flow relation Atomic/compound activities Nesting models implementation relations \ links Guarding formulas (on external events) model choices  Recursion is allowed BP Specifications Introduction and Motivation

Querying BPs Under Models of Uncertainty7 Example Specification Introduction and Motivation F1 F3 F2 chooseTravel F4 … $searchType = “flights only” Confirm $searchType = “flights + hotels” $searchType = “flights+hotels+cars” … … $airline=“BA” Login Flights Advertise Confirm $airline=“AF” Confirm Login Flights Advertise Confirm Hotels … $hotel=“Marriott ” $hotel=“Crown ePlaza” … $choice=“confirm” F1 $choice=“reset” … $airline=“AL” …

Querying BPs Under Models of Uncertainty8  A a specific run is called an EX-flow  Describes activities that occurred in practice + their flow relation  Obtained from a BP by choosing one implementation for each compound activity  Zoom-in edges connect implementations to activation and completion nodes of the corresponding compound activity  Execution trace (Ex-trace) is the log recorded for an EX-flow  Traces size for a given BP may be unbounded Execution Flow Introduction and Motivation

Querying BPs Under Models of Uncertainty9 Example Execution Flow Introduction and Motivation chooseTravel Login Flights $searchType=“flight only” $airline = “BA” Confirm 07:00 07:02 07:04 07:05 07:10 07:30 07:40 07:43 07:41 07:42 Flights Confirm Advertise... $choice = “confirm”...

Querying BPs Under Models of Uncertainty10  External Effects, such as user choices, server response time, etc.  Partial Tracing, due to lack of storage, confidentiality, etc. Thus the past is uncertain, and the future unknown Sources of Uncertainty Introduction and Motivation

Querying BPs Under Models of Uncertainty11  Introduction & Motivation  External Events  Partial Tracing  Related and Future work Outline

Querying BPs Under Models of Uncertainty12  Even when traces of past executions are known, future executions are hard to predict  The effect of external events is modeled by logical formulas, guarding implementations  At run-time, formulas’ truth values determines the chosen implementations Uncertainty External Events

Querying BPs Under Models of Uncertainty13  Example queries: What is the possible behavior (flow) for users that do not finalize their reservation? Which hotels can be reserved by British Airways fliers?  The set of all EX-flows conforming to a query may be large (possibly infinite)  But some are more interesting than others… TOP-K most likely EX-flows External Events

Querying BPs Under Models of Uncertainty14  We define likelihood of EX-flows  Then find the top-k most likely out of these conforming to a query (TOP-K-MATCHES) What is the typical behavior (flow) for users that do not finalize their reservation? Which hotels are preferred by British Airways fliers?  TOP-K answers reflect common usage patterns TOP-K likely EX-flows (cont.) External Events

Querying BPs Under Models of Uncertainty15  Execution patterns  Intuitive, similar in structure to execution traces  Seek for occurrences (homomorphism) of the pattern within (any) trace’s sub-graph  May contain transitive nodes and edges  May contain a projection part Query language External Events

Querying BPs Under Models of Uncertainty16 Example Query External Events chooseTravel Any Start $Airline = “BA” Flights Hotels Confirm chooseTravel

Querying BPs Under Models of Uncertainty17  We distinct three classes of distributions, according to their level of dependency  Memory-less (markovian): no dependencies between formulas.  Bounded-memory: dependency in(at most) B last values of each formula.  General Distribution Classes External Events

Querying BPs Under Models of Uncertainty18 Our Example Specification External events F1 F3 F2 chooseTravel F4 … $searchType = “flights only” Confirm $searchType = “flights + hotels” $searchType = “flights+hotels+cars” … … $airline=“BA” Login Flights Advertise Confirm $airline=“AF” Confirm Login Flights Advertise Confirm Hotels … $hotel=“Marriott ” $hotel=“Crown ePlaza” … $choice=“confirm” F1 $choice=“reset” … $airline=“AL” …

Querying BPs Under Models of Uncertainty19 Distribution External Events

Querying BPs Under Models of Uncertainty20  For memory-less distribution, we find the TOP-K matches in PTIME (data complexity) [compute a compact representation of output]  For bounded-memory distributions, NP-completeness in the data size, but we give powerful heuristics  In all settings, NP-completeness in the query size  For general distributions, we show undecidability Results External Events

Querying BPs Under Models of Uncertainty21  For memory-less distributions, Dynamic Programming algorithm  Gradually computes a table holding the i’th most probable trace rooted at any activity  Implemented with satisfactory performance Algorithms (intuition) External Events

Querying BPs Under Models of Uncertainty22  For bounded-memory, we build a memory-less BP where activity is a “state”, holding all relevant information for future computations.  This “explodes” exponentially the number of compound activity names.  Optimizations utilize conditional independencies between formulas.  Compute approximation of “actual memory”, and maintain only that. Algorithms (intuition, cont.) External Events

Querying BPs Under Models of Uncertainty23  Introduction & Motivation  External Events  Partial Tracing  Related and Future work Outline

Querying BPs Under Models of Uncertainty24  Partial Tracing, due to lack of storage, confidentiality,… Naïve tracing records all activities accurately Semi-Naïve tracing contains only partial information on the names of some activities Selective tracing may omit some activities occurrences  Tracing systems (called types) are represented by a renaming function and a deletion set Types of Partial Traces Partial Tracing

Querying BPs Under Models of Uncertainty25 Trip Search HotelFlight Print Credit1 HotelCredit1 Luxury Search LuxHotelLuxFlight Print Credit2 LuxHotelCredit2 Credit1 Example BP Partial Tracing

Querying BPs Under Models of Uncertainty26 Trip Search Hotel Flight Print Credit1 Trip Search LuxHotel Credit2 LuxFlight Print Credit2 Luxury Credit1 Naïve Traces Partial Tracing

Querying BPs Under Models of Uncertainty27 Trip Search Hotel Flight Print Credit Trip Search Hotel Credit Flight Print Credit Luxury Credit Semi-Naïve Traces Partial Tracing

Querying BPs Under Models of Uncertainty28 Trip Search Hotel Flight Print Credit Trip Search Hotel Credit Flight Print Credit Selective Traces Partial Tracing

Querying BPs Under Models of Uncertainty29 VLDB sneak preview…  Type inference: Given an input type and a query over its traces, infer a type representing exactly the qualifying traces  Type checking: Given also an output type, verify that the qualifying traces conform to it  Practical motivation stems from optimization of queries over execution traces repositories Type Inference & Checking Partial Tracing

Querying BPs Under Models of Uncertainty30  Type Inference Impossible with only naïve tracing Possible, but with exponential blowup for semi-naïve tracing systems In PTIME for selective tracing systems  Type Checking NP-hard and solvable in EXPTIME for naïve and semi-naïve tracing systems Undecidable for selective tracing systems Complexity Partial Tracing

Querying BPs Under Models of Uncertainty31  Selective tracing is “ideal” for type inference  Other methods should be considered for type checking Practical Implication Partial Tracing

Querying BPs Under Models of Uncertainty32  Given a partial trace, what is its most likely origin?  Or, more generally, given a pattern (query) of partial traces, what are the most likely origins of partial traces of this pattern ?  Good news: All query evaluation algorithms extend to this context (even without knowing the tracing system…) Lets talk about (top-k) queries Partial Tracing

Querying BPs Under Models of Uncertainty33  Introduction & Motivation  External Events  Partial Tracing  Related and Future work Outline

Querying BPs Under Models of Uncertainty34  (Probabilistic) Recursive State Machines with temporal logic as query language  Probabilistic Relational DBs  Probabilistic XML  Graph grammars with MSO (or FO) as query language  BP and Web applications mining Related work Related & Future Work

Querying BPs Under Models of Uncertainty35  Practical applications: Web-sites design On-line advertisements Improved business logic  Enriched Query Language Joins Data values Projection queries with further aggregation functions  Distribution  Optimization  Efficient Type checking under some restrictions  Inference of specifications/probability distributions Future work Related & Future Work

תודה ( = Thanks )