Graph Database Mining and Its Applications

Slides:



Advertisements
Similar presentations
Graph Mining Laks V.S. Lakshmanan
Advertisements

 Data mining has emerged as a critical tool for knowledge discovery in large data sets. It has been extensively used to analyze business, financial,
Mining for Tree-Query Associations in a Graph Jan Van den Bussche Hasselt University, Belgium joint work with Bart Goethals (U Antwerp, Belgium) and Eveline.
gSpan: Graph-based substructure pattern mining
NeMoFinder: Dissecting genome- wide protein-protein intractions with meso-scale network motifs Mike Yuan.
Introduction to Graph Mining
5/12/2015PhD seminar CS BGU Counting subgraphs Support measures for graphs Natalia Vanetik.
Connected Substructure Similarity Search Haichuan Shang The University of New South Wales & NICTA, Australia Joint Work: Xuemin Lin (The University of.
Mining Top-K Large Structural Patterns in a Massive Network Feida Zhu 1, Qiang Qu 2, David Lo 1, Xifeng Yan 3, Jiawei Han 4, and Philip S. Yu 5 1 Singapore.
GraphSig: Mining Significant Substructures in Compound Libraries 1.
Frequent Subgraph Pattern Mining on Uncertain Graph Data
Leiden University Efficient Frequent Query Discovery in F ARMER Siegfried Nijssen and Joost N. Kok ECML/PKDD-2003, Cavtat.
Databases and Data Mining Lecture 3: Descriptive Data Mining Peter van der Putten (putten_at_liacs.nl)
The UNIVERSITY of Kansas EECS 800 Research Seminar Mining Biological Data Instructor: Luke Huan Fall, 2006.
Mining Time-Series Databases Mohamed G. Elfeky. Introduction A Time-Series Database is a database that contains data for each point in time. Examples:
The UNIVERSITY of Kansas EECS 800 Research Seminar Mining Biological Data Instructor: Luke Huan Fall, 2006.
COM (Co-Occurrence Miner): Graph Classification Based on Pattern Co-occurrence Ning Jin, Calvin Young, Wei Wang University of North Carolina at Chapel.
Graph Mining - Motivation, Applications and Algorithms Graph mining seminar of Prof. Ehud Gudes Fall 2008/9.
A Multi-Level Parallel Implementation of a Program for Finding Frequent Patterns in a Large Sparse Graph Steve Reinhardt, Interactive Supercomputing
33 rd International Conference on Very Large Data Bases, Sep. 2007, Vienna Towards Graph Containment Search and Indexing Chen Chen 1, Xifeng Yan 2, Philip.
Mining Graphs with Constrains on Symmetry and Diameter Natalia Vanetik Deutsche Telecom Laboratories at Ben-Gurion University IWGD10 workshop July 14th,
Mining Scientific Data Sets Using Graphs George Karypis Department of Computer Science & Engineering University of Minnesota (Michihiro Kuramochi & Mukund.
FAST FREQUENT FREE TREE MINING IN GRAPH DATABASES Marko Lazić 3335/2011 Department of Computer Engineering and Computer Science,
Slides are modified from Jiawei Han & Micheline Kamber
Graph Indexing Techniques Seoul National University IDB Lab. Kisung Kim
Advanced Association Rule Mining and Beyond. Continuous and Categorical Attributes Example of Association Rule: {Number of Pages  [5,10)  (Browser=Mozilla)}
Storytelling and Clustering for Cellular Signaling Pathways M. Shahriar Hossain, Monika Akbar, Nicholas F. Polys Department of Computer Science, Virginia.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
On Graph Query Optimization in Large Networks Alice Leung ICS 624 4/14/2011.
An Efficient Algorithm for Discovering Frequent Subgraphs Michihiro Kuramochi and George Karypis ICDM, 2001 報告者:蔡明瑾.
SPIN: Mining Maximal Frequent Subgraphs from Graph Databases Jun Huan, Wei Wang, Jan Prins, Jiong Yang KDD 2004.
1 Frequent Subgraph Mining Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY June 12, 2010.
Graph Indexing: A Frequent Structure- based Approach Alicia Cosenza November 26 th, 2007.
Frequent Subgraph Discovery Michihiro Kuramochi and George Karypis ICDM 2001.
University at BuffaloThe State University of New York Lei Shi Department of Computer Science and Engineering State University of New York at Buffalo Frequent.
Mining Turbulence Data Ivan Marusic Department of Aerospace Engineering and Mechanics University of Minnesota Collaborators: Victoria Interrante, George.
Patterns around Gnutella Network Nodes Sui-Yu Wang.
Mining Graph Patterns Efficiently via Randomized Summaries Chen Chen, Cindy X. Lin, Matt Fredrikson, Mihai Christodorescu, Xifeng Yan, Jiawei Han VLDB’09.
1 Efficient Discovery of Frequent Approximate Sequential Patterns Feida Zhu, Xifeng Yan, Jiawei Han, Philip S. Yu ICDM 2007.
Frequent Structure Mining Robert Howe University of Vermont Spring 2014.
Graph Indexing From managing and mining graph data.
COMP53311 Association Rule Mining Prepared by Raymond Wong Presented by Raymond Wong
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Mining Complex Data COMP Seminar Spring 2011.
Data Mining: Principles and Algorithms Graph Pattern Mining Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign
Frequent Sub-Structure-Based Approaches for Classifying Chemical Compounds Mukund Deshpande, Michihiro Kuramochi, George Karypis University of Minnesota,
Subgraph Search Over Uncertain Graphs Erşan Demircioğlu.
1 Substructure Similarity Search in Graph Databases R 陳芃安.
Ning Jin, Wei Wang ICDE 2011 LTS: Discriminative Subgraph Mining by Learning from Search History.
Gspan: Graph-based Substructure Pattern Mining
Outline Introduction State-of-the-art solutions
Mining in Graphs and Complex Structures
Efficient Closed Pattern Mining in Strongly Accessible Set Systems
Mining Frequent Subgraphs
September 19, 2018.
Graph Search with Indexing
Data Mining: Concepts and Techniques — Chapter 9 — 9.1. Graph mining
Large Graph Mining: Power Tools and a Practitioner’s guide
Frequent Neighborhood Patterns: Mining Algorithms and Applications
Mining, Indexing and Searching Graphs in Biological Databases
Mining Frequent Subgraphs
Mining and Searching Graphs in Biological Databases
Discovering Larger Network Motifs
Efficient Subgraph Similarity All-Matching
Slides are modified from Jiawei Han & Micheline Kamber
FP-Growth Wenlong Zhang.
Effective Entity Recognition and Typing by Relation Phrase-Based Clustering
Mining Frequent Subgraphs
gApprox: Mining Frequent Approximate Patterns from a Massive Network
Approximate Graph Mining with Label Costs
Presentation transcript:

Graph Database Mining and Its Applications Luke Huan Feb 19, 2004

Presentation Overview Introduction Finding recurring subgraphs from graph databases. Algorithms Application Bioinformatics Social network Communication network Left: social network right protein structure 1L06 11/30/2018

An Example of Graph Database Mining Input: a database of labeled undirected graphs and a threshold 0 <   1 p2 p5 d c a x y (P) p1 p3 p4 b d c x y (Q) q1 q3 q2 d c x (S) s1 s3 s2  = 2/3 Output: All (connected) frequent subgraphs from the graph database. d c x d c d c x y d c x y 11/30/2018

FSG: an apriori like method 11/30/2018

Framework 11/30/2018

Candidate Generating 11/30/2018

How about using paths? Graphs can be decomposed into a set of paths 11/30/2018

Applications I: protein structure analysis ASP 102 ALA 55 SER 214 HIS 57 GLY 140 GLY 142 ASP 194 GLY 43 GLY 196 11/30/2018

Application II: Finding social contacts 11/30/2018

References X. Yan, J. Han, gSpan: Graph-Based Substructure Pattern Mining, ICDM 2002 J. Huan, W. Wang, J. Prins. Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism, ICDM 2003. M. Kuramochi and G. Karypis Finding Frequent Patterns in a Large Sparse Graph, SIAM 2004 N. Vanetik, E. Gudes Mining Frequent Labeled and Partially Labeled Graph Patterns, ICDE 2004 11/30/2018