Bidirectional Query Planning Algorithm

Slides:



Advertisements
Similar presentations
Problem solving with graph search
Advertisements

University of Minnesota 1 Exploiting Page-Level Upper Bound (PLUB) for Multi-Type Nearest Neighbor (MTNN) Queries Xiaobin Ma Advisor: Shashi Shekhar Dec,
DIJKSTRA’s Algorithm. Definition fwd search Find the shortest paths from a given SOURCE node to ALL other nodes, by developing the paths in order of increasing.
Graph Searching (Graph Traversal) Algorithm Design and Analysis Week 8 Bibliography: [CLRS] – chap 22.2 –
Graphs-Part II Depth First Search (DFS). We Already Covered Breadth First Search(BFS) Traverses the graph one level at a time – Visit all outgoing edges.
Using Search in Problem Solving
Image Segmentation Today’s Readings Intelligent Scissors, Mortensen et. al, SIGGRAPH 1995Intelligent Scissors From Sandlot ScienceSandlot Science.
Web Projections Learning from Contextual Subgraphs of the Web Jure Leskovec, CMU Susan Dumais, MSR Eric Horvitz, MSR.
1 Internet Networking Spring 2002 Tutorial 6 Network Cost of Minimum Spanning Tree.
Announcements Mailing list: –you should have received messages Office hours onlineonline –start next week.
CISC220 Fall 2009 James Atlas Nov 13: Graphs, Line Intersections.
L o g o Jieyi Long, Hai Zhou, and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. An O(nlogn) Edge-Based Algorithm for Obstacle- Avoiding Rectilinear.
1 Keyword Search on External Memory Data Graphs Bhavana Dalvi* Meghana Kshirsagar # S. Sudarshan Indian Institute of Technology, Bombay *: Current affiliation:
Query Planning for Searching Inter- Dependent Deep-Web Databases Fan Wang 1, Gagan Agrawal 1, Ruoming Jin 2 1 Department of Computer.
Bidirectional Expansion for Keyword Search on Graph Databases Varun Kacholia Shashank Pandit Soumen Chakrabarti S. Sudarshan.
Fast Failover for Control Traffic in Software-defined Networks Globecom 2012 Neda B. & Ying Z. Presented by: Szu-Ping Wang.
Authors: Bhavana Bharat Dalvi, Meghana Kshirsagar, S. Sudarshan Presented By: Aruna Keyword Search on External Memory Data Graphs.
Using Dijkstra’s Algorithm to Find a Shortest Path from a to z 1.
Keyword Search on External Memory Data Graphs Bhavana Bharat Dalvi, Meghana Kshirsagar, S. Sudarshan PVLDB 2008 Reported by: Yiqi Lu.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Wolf Siberski1 What do you mean? – Determining the Intent of Keyword Queries on Structured Data.
Representing and Using Graphs
Harikrishnan Karunakaran Sulabha Balan CSE  Introduction  Database and Query Model ◦ Informal Model ◦ Formal Model ◦ Query and Answer Model 
P ROBLEM Write an algorithm that calculates the most efficient route between two points as quickly as possible.
EASE: An Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data Cuoliang Li, Beng Chin Ooi, Jianhua Feng, Jianyong.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
SEEDEEP: A System for Exploring and Querying Deep Web Data Sources Gagan Agrawal Fan Wang, Tantan Liu Ohio State University.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
Depth-First Search Lecture 21: Graph Traversals
Shahed University Dr. Shahriar Bijani May  A path is a sequence of vertices P = (v 0, v 1, …, v k ) such that, for 1 ≤ i ≤ k, edge (v i – 1, v.
Keyword Searching and Browsing in Databases using BANKS Charuta Nakhe, Arvind Hulgeri, Gaurav Bhalotia, Soumen Chakrabarti, S. Sudarshan Presented by Sushanth.
Learning to Create Data-Integration Queries Partha Pratim Talukdar, Marie Jacob, Muhammad Salman Mehmood, Koby Crammer, Zachary G. Ives, Fernando Pereira,
Dijkstra animation. Dijksta’s Algorithm (Shortest Path Between 2 Nodes) 2 Phases:initialization;iteration Initialization: 1. Included:(Boolean) 2. Distance:(Weight)
Construction of Optimal Data Aggregation Trees for Wireless Sensor Networks Deying Li, Jiannong Cao, Ming Liu, and Yuan Zheng Computer Communications and.
1 3/21/2016 MATH 224 – Discrete Mathematics First we determine if a graph is connected.
CSC 213 – Large Scale Programming Lecture 31: Graph Traversals.
Graphs – Breadth First Search
Graphs Motivation and Terminology Representations Traversals
Elementary Graph Algorithms
Graphs-Part II Depth First Search (DFS)
CSE 373 Topological Sort Graph Traversals
Announcements CS accounts Project 1 is out today
SEEDEEP: A System for Exploring and Querying Deep Web Data Sources
Problem Solving Agents
CS4670/5670: Image Scissors Noah Snavely Today’s Readings
Topological Sort Minimum Spanning Tree
Discrete Math 2 Weighted Graph Search Tree
I206: Lecture 15: Graphs Marti Hearst Spring 2012.
Unweighted Shortest Path Neil Tang 3/11/2010
Algorithms for Exploration
CS120 Graphs.
Dijkstra’s Algorithm We are given a directed weighted graph
Minimum Spanning Tree.
T. C. van Dijk1, J.-H. Haunert2, J. Oehrlein2 1University of Würzburg
Graphs Chapter 13.
Lecture 1B: Search.
Announcements Reader is in the BOOK STORE (not Comm Bldg.)
Keyword Searching and Browsing in Databases using BANKS
Graph Indexing for Shortest-Path Finding over Dynamic Sub-Graphs
CSE 421: Introduction to Algorithms
MCN: A New Semantics Towards Effective XML Keyword Search
Answering Cross-Source Keyword Queries Over Biological Data Sources
Announcements Project 1 is out today
Sorting and Divide-and-Conquer
Implementation of Dijkstra’s Algorithm
Announcements Project 1 is out today help session at the end of class.
Algorithms CSCI 235, Spring 2019 Lecture 33 Graphs II
Introduction to XML IR XML Group.
Presentation transcript:

Bidirectional Query Planning Algorithm Motivation Bidirectional Query Planning Algorithm Growth of size and popularity of biological deep web data sources Increasing need for querying these data sources Answering cross-source queries manually Identify relevant data sources Submit queries to numerous query forms Keep track of results Manually combine and summarize results Algorithm Overview Bidirectional Exploration Identify target nodes and starting nodes Backward exploration connecting target nodes with starting nodes Forward exploration connecting starting nodes with target nodes Backward and forward exploration queue Heuristic 1: Always explore the least cost node Forward explore: Predecessor u to Descendant v u is single node: edge exploration u is composite node: explore the unexplored nodes in u first Backward explore: Descendant v to Predecessor u Edge Exploration Build paths connecting target with starting nodes Heuristic 2: Always explore the shortest path Explore edge from node u to v u is single node: Dijkstra algorithm u is composite node u’s unshared ancestor: normal u’s shared ancestor: longest path from these ancestors to v Running Example Motivating Example Entity-Attribute Query Q1={ERCC6,SNPID,”ORTH BLAST”,HGNCID} Entity-Entity Relationship Query Q2={MSMB, RET} Experiment Results Conclusion Support cross-source queries in deep web Entity-attribute and entity entity relationship queries Propose a bidirectional query planning algorithm Our planning algorithm has good scalability Our plan outperforms plans from Steiner tree Our plan perform closely to the optimal plan Setup Speedup and Plan Quality 12 biological deep web data sources 20 queries related with SNP study Our algorithm, Exhaustive search (OPT), Steiner tree algorithm Problem Formulation Query Q={e1,…,em,a1,…ak} Entity keyword: initialize the query Attribute keyword: attribute of interest Scalability Data source Multi-source dependency Cost model Access cost Quality cost For 60% queries, our plans have speedup over Steiner tree’s For 80% queries, our plans have the same execution time as OPT’s For 90% queries, quality of our plans is close to that of OPT’s Formulation: Find the lowest cost subgraph SubG from the dependency graph, such that all search terms in query Q are covered by SubG OPT scales poorly, 9 hours for 75 sources Ours scales well, <1 seconds for 75 sources