Testing our ranking in different dataset Overview Centrality-based ranking for RDF nodes Alvaro Graves, Sibel Adali and Jim Hendler.

Slides:



Advertisements
Similar presentations
 Theorem 5.9: Let G be a simple graph with n vertices, where n>2. G has a Hamilton circuit if for any two vertices u and v of G that are not adjacent,
Advertisements

Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
Social network partition Presenter: Xiaofei Cao Partick Berg.
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Linked data: P redicting missing properties Klemen Simonic, Jan Rupnik, Primoz Skraba {klemen.simonic, jan.rupnik,
GRAIL: Scalable Reachability Index for Large Graphs VLDB2010 Vineet Chaoji Mohammed J. Zaki.
Graphs Chapter 12. Chapter Objectives  To become familiar with graph terminology and the different types of graphs  To study a Graph ADT and different.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 CHAPTER 4 - PART 2 GRAPHS 1.
Algorithm Strategies Nelson Padua-Perez Chau-Wen Tseng Department of Computer Science University of Maryland, College Park.
Graph & BFS.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Centrality and Prestige HCC Spring 2005 Wednesday, April 13, 2005 Aliseya Wright.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Fall, 2001 Lectures 3 Tuesday, 9/25/01 Graph Algorithms: Part 1 Shortest.
CS2420: Lecture 37 Vladimir Kulyukin Computer Science Department Utah State University.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Fall, 2001 Wednesday, 9/26/01 Graph Basics.
Graph & BFS Lecture 22 COMP171 Fall Graph & BFS / Slide 2 Graphs * Extremely useful tool in modeling problems * Consist of: n Vertices n Edges D.
Graphs Chapter 12. Chapter 12: Graphs2 Chapter Objectives To become familiar with graph terminology and the different types of graphs To study a Graph.
Search in a Small World JIN Xiaolong Based on [1].

UNIVERSITY OF JYVÄSKYLÄ Resource Discovery in Unstructured P2P Networks Distributed Systems Research Seminar on Mikko Vapa, research student.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Using Dijkstra’s Algorithm to Find a Shortest Path from a to z 1.
Directed graphs Definition. A directed graph (or digraph) is a pair (V, E), where V is a finite non-empty set of vertices, and E is a set of ordered pairs.
Lecture 12-2: Introduction to Computer Algorithms beyond Search & Sort.
Exposure In Wireless Ad-Hoc Sensor Networks Seapahn Meguerdichian Computer Science Department University of California, Los Angeles Farinaz Koushanfar.
1 Computing with Social Networks on the Web (2008 slide deck) Jennifer Golbeck University of Maryland, College Park Jim Hendler Rensselaer Polytechnic.
Graph Theory in Computer Science
Graph Algorithms. Definitions and Representation An undirected graph G is a pair (V,E), where V is a finite set of points called vertices and E is a finite.
Optimal Link Bombs are Uncoordinated Sibel Adali Tina Liu Malik Magdon-Ismail Rensselaer Polytechnic Institute.
Graph Algorithms Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar Adapted for 3030 To accompany the text ``Introduction to Parallel Computing'',
A New Hybrid Wireless Sensor Network Localization System Ahmed A. Ahmed, Hongchi Shi, and Yi Shang Department of Computer Science University of Missouri-Columbia.
Shortest Path Algorithms. Definitions Variants  Single-source shortest-paths problem: Given a graph, finding a shortest path from a given source.
CS223 Advanced Data Structures and Algorithms 1 Review for Final Neil Tang 04/27/2010.
Topics Paths and Circuits (11.2) A B C D E F G.
Graphs Chapter 12. Chapter 12: Graphs2 Chapter Objectives To become familiar with graph terminology and the different types of graphs To study a Graph.
David Stotts Computer Science Department UNC Chapel Hill.
Dijkstra animation. Dijksta’s Algorithm (Shortest Path Between 2 Nodes) 2 Phases:initialization;iteration Initialization: 1. Included:(Boolean) 2. Distance:(Weight)
Po-Lung Chen (Dont block me) d091: Urban Transport System 2010/03/26 (1) d091: Urban Transport System Po-Lung Chen Team Dont Block Me, National Taiwan.
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
Physics Section 18.2 Apply series and parallel circuits. Note: A simple circuit may contain only one load. A more complex circuit may contain numerous.
1 GRAPHS – Definitions A graph G = (V, E) consists of –a set of vertices, V, and –a set of edges, E, where each edge is a pair (v,w) s.t. v,w  V Vertices.
Graphs 1 Neil Ghani University of Strathclyde. Where are we …. We studied lists: * Searching and sorting a list Then we studied trees: * Efficient search.
1 New metrics for characterizing the significance of nodes in wireless networks via path-based neighborhood analysis Leandros A. Maglaras 1 Dimitrios Katsaros.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
CSE 373: Data Structures and Algorithms
Shortest Path Problem Under Triangular Fuzzy Neutrosophic Information
EECS 203 Lecture 19 Graphs.
Dynamic Programming 1 Neil Tang 4/20/2010
CSC317 Shortest path algorithms
Single-Source Shortest Paths
Network analysis.
Graph Theory.
Finding Fastest Paths on A Road Network with Speed Patterns
Visual analytics for discovering entity relationship on text data
Elementary graph algorithms Chapter 22
Fast Nearest Neighbor Search on Road Networks
Minimum Spanning Trees
Shortest-Paths Trees Kun-Mao Chao (趙坤茂)
Department of Computer Science University of York
CSE 373: Data Structures and Algorithms
Challenges in Visualizing Knowledge and Metadata
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
Sorting and Divide-and-Conquer
The Theory of NP-Completeness
Elementary graph algorithms Chapter 22
CSE 373: Data Structures and Algorithms
Review for Final Neil Tang 05/01/2008
A Variation of Minimum Latency Problem on Path, Tree and DAG
Presentation transcript:

Testing our ranking in different dataset Overview Centrality-based ranking for RDF nodes Alvaro Graves, Sibel Adali and Jim Hendler Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY 12180, USA TW funding from: DARPA, NSF, IARPA, ARL, Lockheed Martin, SRI, Fujitsu, IBM This work focuses on finding ways to sort nodes of an RDF graph in a “relevant” order. By relevant we mean some reasonable order, generally accepted by people with expertise in the information represented in the graph. We base our method on what is called the closeness centrality. Finding the centrality for every node in a graph is equivalent to solving the All-Pairs Shortest Path problem. Our current solution is based on Dijkstra's algorithm and has complexity O(nm log(n)) where n is the number of nodes and m is the number of edges. This approach allows us to solve the problem in a parallelized platform improving the speed. Definition of centralityWeights and distance functions How ranking works We define closeness centrality for a node i as where d is a distance function between nodes, A is the connected component that i belongs to, and r i is the reachability of the node i, the number of nodes in A. The distance d of a path is given by the sum of the edge weights along this path. We consider two different types of weights: Constant: Each edge has weight c (usually c = 1). This basic distance function does not take into account the semantics of the graph. Predicate-based weights: Edges weights are determined as a function of the predicate they contain. For example c = p(u, v)/|E| A simple example The simplest type of graph to rank is a taxonomy. Which are the most relevant nodes? CIA Factbook This dataset is an RDF version of the annual publication made by the CIA about the different countries in the world. It is interesting that the top ranked nodes are international organizations and some globally important issues, like “Climate Change” and “Biodiversity”. Scalability over bigger graphs Customizability using other weights. Stability with respect to the properties of the different data sets. Future work In degree top 10Centrality top 10 1rdf:Statementorg:OIC 2cia:Estimateorg:WHO 3cia:CommodityPercentorg:UN 4cia:CountryPercentorg:ITU 5cia:AirportBreakdownorg:UNCTAD 6cia:Portorg:UPU 7 cia:SexRationBreakdown org:ICAO 8cia:EthnicGroupPercentorg:UNESCO 9cia:ReligionPercentenv:Biodiversity 10cia:LanguagePercent env:ClimateChange Wine Ontology It contains 720 nodes and almost 2000 edges without literals. In this table we show the weighted and non weighted results. We can see that the weights favor the more specific instances of the ontologyover more general concepts. Weighted top 10Constant top 10 1WhitehallCabernetBlancWine 2MariettaOldVinesRedWinery 3CorbansOldDryRieslingRegion 4MountadamRieslingDry 5 StGenevieveTexasWhite WineGrape 6SelaksIceWineDessertWine 7 KathrynKennedyLateral LateHarvest 8 WhitehallLanePrimavera Moderate 9adjacentRegionWineColor 10ChiantiClassico EarlyHarvest Terrorists Ontology It was developed from Mindswap. It contains information about terrorists, events in nodes and edges.It contains noisy data ranging from mistyped instances to the inclusion of information not related with terrorirsm. It is focused in middle eastern terrorists. Weighted bottom 10Weighted top 10 1Pir FarouqAyman Al-Zawahiri 2Ricardo PalmeraMidhat Mursi 3Naim KassemOsama Bin Laden 4Rodney ColoradoMustafa Kamel 5Jamal Abu SamhadenaOmar al Rahman 6Qari Thair YuldashevMusab al Zarqawi 7 Shoko Asahar Ramzi Yousef 8Juan Jose MartinezKhalid Mohammed 9Erminso CuevasHafiz Saeed 10Tomas Molina Caracas Huthaifa Azzam