Simulation based approach Shang Zechao

Slides:



Advertisements
Similar presentations
CSE 211 Discrete Mathematics
Advertisements

Igor Balla REU DIMACS Metric Spaces
WSPD Applications.
Yinghui Wu, LFCS DB talk Database Group Meeting Talk Yinghui Wu 10/11/ Simulation Revised for Graph Pattern Matching.
Shuai Ma, Yang Cao, Wenfei Fan, Jinpeng Huai, Tianyu Wo Capturing Topology in Graph Pattern Matching University of Edinburgh.
New Models for Graph Pattern Matching Shuai Ma ( 马 帅 )
The IEEE International Conference on Big Data 2013 Arash Fard M. Usman Nisar Lakshmish Ramaswamy John A. Miller Matthew Saltz Computer Science Department.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Graph Isomorphism Algorithms and networks. Graph Isomorphism 2 Today Graph isomorphism: definition Complexity: isomorphism completeness The refinement.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 CHAPTER 4 - PART 2 GRAPHS 1.
Mining Graphs.
Based on “Cascading Behavior in Networks: Algorithmic and Economic Issues” in Algorithmic Game Theory (Jon Kleinberg, 2007) and Ch.16 and 19 of Networks,
Applied Discrete Mathematics Week 12: Trees
© 2006 Pearson Addison-Wesley. All rights reserved14 A-1 Chapter 14 Graphs.
On the Hardness of Graph Isomorphism Jacobo Tor á n SIAM J. Comput. Vol 33, p , Presenter: Qingwu Yang April, 2006.
CMPT-825 (Natural Language Processing) Presentation on Zipf’s Law & Edit distance with extensions Presented by: Kaustav Mukherjee School of Computing Science,
1 QSX: Querying Social Graphs Graph Pattern Matching Graph pattern matching via subgraph isomorphism Graph pattern matching via graph simulation Revisions.
Message Passing Systems A Formal Model. The System Topology – network (connected undirected graph) Processors (nodes) Communication channels (edges) Algorithm.
Yinghui Wu LFCS Lab Lunch Homomorphism and Simulation Revised for Graph Matching.
Message Passing Systems A Formal Model. The System Topology – network (connected undirected graph) Processors (nodes) Communication channels (edges) Algorithm.
Making Pattern Queries Bounded in Big Graphs 11 Yang Cao 1,2 Wenfei Fan 1,2 Jinpeng Huai 2 Ruizhe Huang 1 1 University of Edinburgh 2 Beihang University.
Querying Big Graphs within Bounded Resources 1 Yinghui Wu UC Santa Barbara Wenfei Fan University of Edinburgh Southwest Jiaotong University Xin Wang.
Yinghui Wu, SIGMOD 2012 Query Preserving Graph Compression Wenfei Fan 1,2 Jianzhong Li 2 Xin Wang 1 Yinghui Wu 1,3 1 University of Edinburgh 2 Harbin Institute.
GRAPH Learning Outcomes Students should be able to:
Performance Guarantees for Distributed Reachability Queries Wenfei Fan 1,2 Xin Wang 1 Yinghui Wu 1,3 1 University of Edinburgh 2 Harbin Institute of Technology.
Virtual Network Mapping: A Graph Pattern Matching Approach Yang Cao 1,2, Wenfei Fan 1,2, Shuai Ma University of Edinburgh 2 Beihang University.
MCS312: NP-completeness and Approximation Algorithms
Complexity Classes (Ch. 34) The class P: class of problems that can be solved in time that is polynomial in the size of the input, n. if input size is.
Research Directions for Big Data Graph Analytics John A. Miller, Lakshmish Ramaswamy, Krys J. Kochut and Arash Fard Department of Computer Science University.
Diversified Top-k Graph Pattern Matching 1 Yinghui Wu UC Santa Barbara Wenfei Fan University of Edinburgh Southwest Jiaotong University Xin Wang.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
Lecture 17 Trees CSCI – 1900 Mathematics for Computer Science Fall 2014 Bill Pine.
Association Rules with Graph Patterns Yinghui Wu Washington State University Wenfei Fan Jingbo Xu University of Edinburgh Southwest Jiaotong University.
Approximation Algorithms
Graph pattern matching Graph GGraph Qisomorphism f(a) = 1 f(b) = 6 f(c) = 8 f(d) = 3 f(g) = 5 f(h) = 2 f(i) = 4 f(j) = 7.
Module #19: Graph Theory: part II Rosen 5 th ed., chs. 8-9.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
Shuai Ma Graph Search & Social Networks. 2 Graphs are everywhere, and quite a few are huge graphs!
Yinghui Wu, ICDE Adding Regular Expressions to Graph Reachability and Pattern Queries Wenfei Fan Shuai Ma Nan Tang Yinghui Wu University of Edinburgh.
NP-COMPLETE PROBLEMS. Admin  Two more assignments…  No office hours on tomorrow.
Distributed Graph Simulation: Impossibility and Possibility 1 Yinghui Wu Washington State University Wenfei Fan University of Edinburgh Southwest Jiaotong.
Patterns around Gnutella Network Nodes Sui-Yu Wang.
1 Chapter 22 Elementary Graph Algorithms. 2 Introduction G=(V, E) –V = vertex set –E = edge set Graph representation –Adjacency list –Adjacency matrix.
Topics Paths and Circuits (11.2) A B C D E F G.
Locally constrained graph homomorphisms Jan Kratochvíl Jan Kratochvíl Charles University, Prague.
Computing Branchwidth via Efficient Triangulations and Blocks Authors: F.V. Fomin, F. Mazoit, I. Todinca Presented by: Elif Kolotoglu, ISE, Texas A&M University.
Research Directions for Big Data Graph Analytics John A. Miller, Lakshmish Ramaswamy, Krys J. Kochut and Arash Fard.
Graph Theory. undirected graph node: a, b, c, d, e, f edge: (a, b), (a, c), (b, c), (b, e), (c, d), (c, f), (d, e), (d, f), (e, f) subgraph.
Lecture 25 NP Class. P = ? NP = ? PSPACE They are central problems in computational complexity.
On the Hardness of Optimal Vertex Relabeling and Restricted Vertex Relabeling Amihood Amir Benny Porat.
Introduction to Graph Theory By: Arun Kumar (Asst. Professor) (Asst. Professor)
Complexity Classes Karl Lieberherr. Source From riptive_complexity.html.
CSE 421 Algorithms Richard Anderson Lecture 29 Complexity Theory NP-Complete P.
CPT-S Topics in Computer Science Big Data 1 1 Yinghui Wu EME 49.
Outline  Introduction  Subgraph Pattern Matching  Types of Subgraph Pattern Matching  Models of Computation  Distributed Algorithms  Performance.
Introduction to NP Instructor: Neelima Gupta 1.
Yinghui Wu, SIGMOD Incremental Graph Pattern Matching Wenfei Fan Xin Wang Yinghui Wu University of Edinburgh Jianzhong Li Jizhou Luo Harbin Institute.
The NP class. NP-completeness Lecture2. The NP-class The NP class is a class that contains all the problems that can be decided by a Non-Deterministic.
Introduction to Algorithms
The NP class. NP-completeness
Analysis of University Researcher Collaboration Network Using Co-authorship Jiadi Yao School of Electronic and Computer Science,
CPT-S 415 Big Data Yinghui Wu EME B45 1.
Algorithms and networks
Finding Fastest Paths on A Road Network with Speed Patterns
CS223 Advanced Data Structures and Algorithms
Algorithms and networks
Subtree Isomorphism in O(n2.5)
Efficient Subgraph Similarity All-Matching
Graph Homomorphism Revisited for Graph Matching
Approximate Graph Mining with Label Costs
Presentation transcript:

Simulation based approach Shang Zechao 1010161920 Graph Matching Simulation based approach Shang Zechao 1010161920

Introduction What is graph matching? When the one graph matches with another?

Introduction (cont.) Graph: G=(V, E). GQ = (VQ, EQ) Can be easily extended with labels. Exact matching: isomorphism Find a bijection function f between V and VQ (u, v) in E iff (f(u), f(v)) in EQ

Introduction (cont.) Graph isomorphism Sub-graph isomorphism Too hard! GI class Sub-graph isomorphism NP-Complete Too hard!

Simulation based approach [Henzinger95] Find a relation S: V x VQ (u, u’) in S if u and u’ has same labels for all children v’ of u’, there exists v V is child of u (v, v’) in S

Simulation based approach The major difference between graph simulation and graph isomorphism Isomorphism requires an bijection (one to one) function Graph simulation based on relation (many to many) Simulation is in polynomial time

An Example [Fan10] Drug dealer network B: Boss S: Secretary AM: Assistant manager FW: Field worker

An Example (cont.) In real world S and AM is same AM maps to multiple worker

Bounded Simulation [Fan10] Each edge in pattern graph has label Either a positive integer K Or * (infinite) The length of path connects these two nodes

The Example (cont.) AM should be able to reach FW within 3 hops.

Matching Algorithm Similar with the EffcientSimilarity algorithm in [Henzinger95]. Pre-compute the distance matrix between all pairs of node in G. Complexity O(|V||E| + |Ep||V|2 + |Vp||V|)

Strong Simulation [Ma12] Recall the condition that two nodes match: Have same label Children could be matched by simulation Two issues Parent information is not captured Matching size is not limited

An Example [Ma12] Bio can match to Bio1, Bio2, Bio3, Bio4 Actually only Bio4 makes sense

Strong Simulation two nodes match if: Have same label Children could be matched by simulation Parent could be matched by simulation The matched sub-graph should have same diameter as pattern graph

An Example (cont.) Bio only matches to Bio4 in strong simulation

Comparison of different approaches children topology parents topology connectivity cycle info simulation Y N with parent topology with diameter constrain isomorphism

Comparison of different approaches locality bounded matches bisimulation bounded cycle simulation N Y with parent topology with diameter constrain isomorphism

But Bounded cycle problem is intractable NP-hard Bisimilar problem is intractable coNP-hard

References [Henzinger95] M. R. Henzinger, T. A. Henzinger, and P. W. Kopke. 1995. Computing simulations on finite and infinite graphs. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science (FOCS '95). IEEE Computer Society, Washington, DC, USA, 453-. [Fan10] Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Yinghui Wu, and Yunpeng Wu. 2010. Graph pattern matching: from intractable to polynomial time. Proc. VLDB Endow. 3, 1-2 (September 2010), 264-275. [Ma12] Shuai Ma, Yang Cao, Wenfei Fan, Jinpeng Huai , Tianyu Wo. 2012. Capturing Topology in Graph Pattern Matching. PVLDB. To appear.