Search Computing Engineering SeCo: Liquid Queries Marco Brambilla, Stefano Ceri SeCo workshop, Como, June 17th-19th, 2009.

Slides:



Advertisements
Similar presentations
Texas Digital Library Services Preservation Network.
Advertisements

University of Dublin Trinity College Adaptive Educational Games: Providing Non-invasive Personalised Learning Experiences Neil Peirce, Owen Conlan, Vincent.
Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
EA Demonstration Study : Dissemination Forum – 8 June EA Views and Sub-views Patrick Bardet EA Unit.
The ANSI/SPARC Architecture of a Database Environment
13-1 © Prentice Hall, 2004 Chapter 13: Designing the Human Interface (Adapted) Object-Oriented Systems Analysis and Design Joey F. George, Dinesh Batra,
School of Engineering & Technology Computer Architecture Pipeline.
Search Computing Stefano Ceri. Prof. Stefano Ceri Database Management Search Computing, an EU-funded Project  European Reseach Council (ERC) runs EU.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Interactive Systems Technical Design Seminar work: Web Services Janne Ojanaho.
Model Personalization (1) : Data Fusion Improve frame and answer (of persistent query) generation through Data Fusion (local fusion on personal and topical.
Statistical Methods in Computer Science Why? Ido Dagan.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
CS 290C: Formal Models for Web Software Lecture 6: Model Driven Development for Web Software with WebML Instructor: Tevfik Bultan.
10th TTCN-3 User Conference, 7-9 June 2011, Bled, Slovenia AUTOSAR Conformance Tests - Feedback on their development and utilization Alain Feudjio-Vouffo,
Data Mining Techniques
Persistent Identifiers Service WATER FOR A HEALTHY COUNTRY FLAGSHIP SISS Workshop v2.3 Pavel Golodoniuc | Computer scientist 7 May 2013.
Ontology-derived Activity Components for Composing Travel Web Services Matthias Flügge Diana Tourtchaninova
Conceptual Modeling Issues in Web Applications enhanced with Web services Sara Comai, Politecnico di Milano In collaboration with:
Oracle Application Express (Oracle APEX), formerly called HTML DB, is a Free rapid web application development tool for the Oracle database.
Multi-agent Research Tool (MART) A proposal for MSE project Madhukar Kumar.
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
1 A Static Analysis Approach for Automatically Generating Test Cases for Web Applications Presented by: Beverly Leung Fahim Rahman.
Kuali Enterprise Workflow Presented at ITANA October 2009 Eric Westfall – Kuali Rice Project Manager.
1 10/14/2015ã 2007, Spencer Rugaber The Waterfall Process Software plans and requirements Validation System feasibility Validation Product design Verification.
Dashboard for Training Metrics By: Gopi Patel Guided by: Dr. Meiliu Lu Department of Computer Science California State University, Sacramento.
EXist Indexing Using the right index for you data Date: 9/29/2008 Dan McCreary President Dan McCreary & Associates (952) M.
Towards Automatic Optimization of MapReduce Programs (Position Paper) Shivnath Babu Duke University.
FlexElink Winter presentation 26 February 2002 Flexible linking (and formatting) management software Hector Sanchez Universitat Jaume I Ing. Informatica.
Shannon Hastings Multiscale Computing Laboratory Department of Biomedical Informatics.
Automatic Report Generation for WLCG/EGEE D. D. Sonvane (Gridview Team) B.A.R.C.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
User interface design and human computer interaction Xiangming Mu.
Human-Machine Interaction in a CASE Environment Paulo Gomes CISUC – University of Coimbra Portugal IJCAI’03 Workshop on Mixed- Initiative Intelligent Systems.
Engineering Workshops Purposes of Neighbor Solicitation.
Center for E-Business Technology Seoul National University Seoul, Korea Optimization of Multi-Domain Queries on the Web Daniele Braga, Stefano Ceri, Florian.
Database Architecture Course Orientation & Context.
LaHave House Project 1 LaHave House Project Automated Architectural Design BML + ARC.
Department of Computing, School of Electrical Engineering and Computer Sciences, NUST - Islamabad KTH Applied Information Security Lab Secure Sharding.
Semi-Automatic Generation of Device-Adapted User Interfaces Stina Nylander Swedish Institute of Computer Science.
Engine Group Namiruddin Ahmed Ali Kamil. 2 XMLApe XMLApe Research Group Involved in research on a number of projects that are related to XML and inspired.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
HABIT Understanding the mechanisms of habit-forming and habit reliance ‘Provocations assignments’ for Day 1 of the NSF/EU/NIH Behavior Change Workshop.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
By Douglas Copas and Mark Perry.  Currently available small business based commercial inventory management systems are either prohibitively expensive.
ABOUT ME ADAPTIVE SOFTWARE | Samudra Kanankearachchi Senior Software Data Science Specialist NEXT GENERATION OF ADAPTIVE ENTERPRISE.
School of Computer Science Advanced Interfaces Group Extensive expertise in R&D of VR software systems and applications MAVERIK VR software downloaded.
System Analysis and Design Copyright © Genetic Computer School 2007 SAD14-1 CHAPTER OVERVIEW  Overview Of System Maintenance  Tasks Performed During.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Flight Simulator Overview Flight Compartment Host Computer Motion Control Cabinet Motion Platform 13/6/2016 Visual Display Visual Image Generator Interface.
June 3-6, 2003E-Society Lisbon Automatic Metadata Discovery from Non-cooperative Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science.
Improvement of Semantic Interoperability based on Metadata Registry(MDR) Doo-Kwon Baik Dept. of CSE Korea University.
OpenACS and.LRN Conference 2008 Automatic Limited-Choice and Completion Test Creation, Assessment and Feedback in modern Learning Processes Institute for.
Dynamic Query Forms for Database Queries. Abstract Modern scientific databases and web databases maintain large and heterogeneous data. These real-world.
Empowering the Knowledge Worker End-User Software Engineering in Knowledge Management Witold Staniszkis The 17th International.
Chrome Extension. Agenda  Overview  Our Work  Analysis  Architecture design  User Interface Design  Code Review  Demonstration.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
Why Intelligent Data Analysis? Joost N. Kok Leiden Institute of Advanced Computer Science Universiteit Leiden.
Dynamo: A Runtime Codesign Environment
Business Rule Based Configuration Management and Software System Implementation Using Decision Tables Olegas Vasilecas, Aidas Smaizys VGTU, Vilnius, Lithuania.
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Project Structure Overview
IT323 Project Phase#2.
Exploratory Search Framework for Web Data Sources
Chaitali Gupta, Madhusudhan Govindaraju
IBM C IBM Big Data Engineer. You want to train yourself to do better in exam or you want to test your preparation in either situation Dumpspedia’s.
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Presentation transcript:

Search Computing Engineering SeCo: Liquid Queries Marco Brambilla, Stefano Ceri SeCo workshop, Como, June 17th-19th, 2009

Brambilla, Ceri Search Computing: LIQUID QUERIES Agenda 1.Overview of the SeCo architecture –Development and experimentation roadmap 2.Application development approach: LIQUID QUERIES –Configurability of the interface, strong parameter typing, static mapping to services –Continuous query processes –Exploitation of user intelligence (interactive query process – user feedback), BPM –Automatic code generation of user interface and interaction steps –Adaptivity and customization of the query interaction 3.Support to the developer in the various design phases –Service marts specification –Query specification –User interface specification 4.SeCo extensions –High-level queries -- General almost-NL query NLP, wordnet, query splitting, and mapping to services 2

1. Overview and roadmap of the SeCo architecture SeCo workshop, Como, June 17th-19th, 2009

Brambilla, Ceri Search Computing: LIQUID QUERIES Search Computing architecture: overall view 4 Main Query flow relation High level query Where can I attend a DB scientific conference close to a beautiful beach reachable with cheap flights? Sub query 1 Where can I attend a DB scientific conference? Sub query 2 place close to a beautiful beach? Sub query 3 place reachable with cheap flight? Low level query 1 ConfSearch(DB,placeX,dateY) Low level query 2 TourSearch(Beach,PlaceX) Low level query 3 Flight(cost<200,PlaceX,DateY) Query plan Services invocations and operators execution Results Presented results MSVVEIS08 - Barcelona – Iberia LID08 – Rome - Alitalia RCIS08- Marrakech- AirFrance

Brambilla, Ceri Search Computing: LIQUID QUERIES Search Computing architecture: configurability of the implementation 5 Main Query flow relation

Brambilla, Ceri Search Computing: LIQUID QUERIES Search Computing architecture: development roadmap 6 Prototype 1: Core behaviour of the system. Engine-based execution of queries Domain repository Service repository Coarse result presentation relation Prototype 2: Planning Automatic optimized query planning Prototype 3: Mapping and presentation mapping to domains presentation of results Prototype 4: High level queries

2. Application development approach: LIQUID QUERIES SeCo workshop, Como, June 17th-19th, 2009

Brambilla, Ceri Search Computing: LIQUID QUERIES LIQUID QUERY A level above the optimization: –Forcing the query flow LIQUID QUERY: A query with flexible boundaries Control is –on the user –at query time –on the evolution Contextual/recommended direction could be proposed In line with current trends in search (and others!) 8 Forward-looking and (a little bit) far-fetched ideas Open to discussion

Brambilla, Ceri Search Computing: LIQUID QUERIES Microsoft Bing Contextual step-by-step evolution of the query 9

Brambilla, Ceri Search Computing: LIQUID QUERIES Google Squared Multi-content, resizable, reshapeable query 10

Brambilla, Ceri Search Computing: LIQUID QUERIES Not a search: Hunch Just a big decision tree Perceived as great value by today users 11

Brambilla, Ceri Search Computing: LIQUID QUERIES Yahoo! research Web of pages vs. Web of objects Understand the need behind the user query Exploiting user intelligence –Tags –Folksonomies Multi-step queries Multi-technology queries –Annotations –Content-based 12

Brambilla, Ceri Search Computing: LIQUID QUERIES LIQUID QUERY Moving from "one time query" to a process-based approach Continuation of queries based on exploitation of relations between service marts A query with flexible boundaries, that can be –Reshaped/refined: asking for different information on the results –Expanded: asking for additional information on the results adding new domains –Extended: asking for more results by the user at runtime Contextual/recommended direction could be proposed Relies on the SeCo query machine –Every user interaction could trigger recalculations 13

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query navigation Liquid- what? –Liquid data (BEA & San Diego) –Liquid publications and docs (Fabio & Trento) –(old-style) Liquid queries (Heer & Berkeley) Somehow similar to Google Squared, but: –Multi-domain – Multi-purpose –More flexible 14 Upon first query cycle, various options to the user: Refinement of the query Extension of the query results (give me more) Expansion of the query (add more domains) Choosing a different connection between services (i.e., changing the adopted access pattern) Clustering, re-ranking,... What does it imply at the query machine level?

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query navigation 15 Conference Photo Description Date Hotel Photo Description Address Services

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query: clustering/unclustering 16 At the query machine level? Probably nothing, just a presentation issue, if...

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query: ranking/reranking For unclustered data or cluster representatives 17 At the query machine level? If multi-ranking service available, recompute the query. If not, just re-sort the query result at presentation level.

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query: refinment Adding additional constraints –E.g., on this timeframe... More or less results Search again Refined search... At the query machine level? Rebuild the plan, possibly. And re-execute the query. If pieces can be reused... (caching)

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query: extend the query 19 At the query machine level? Run again the machine on further data Gimme more Asking for more results... More results...

Brambilla, Ceri Search Computing: LIQUID QUERIES Asking for more results on a specific service... More results... Liquid query: zooming in (service-wise) 20 At the query machine level? Run again the machine on that service. Or: Change the throughput of the machine Clock branches Gimme more

Brambilla, Ceri Search Computing: LIQUID QUERIES Asking for more results on a specific item... More results... Liquid query: zooming in 21 At the query machine level? Run again the machine on services joined to that item. Or: Change the throughput of the machine Clock branches Gimme more

Brambilla, Ceri Search Computing: LIQUID QUERIES Liquid query: expand (shrink) the query 22 At the query machine level? Changing the plan. If something can be reused... (caching) Additional subquery Asking for more columns (or remove existing ones)... Results... ?

Brambilla, Ceri Search Computing: LIQUID QUERIES Changing the used access paths Liquid query: change join conditions 23 At the query machine level? Changing the plan. If something can be reused... (caching)

Brambilla, Ceri Search Computing: LIQUID QUERIES Horizontal and vertical multi-domain search Structure of the interface automatically generated based on the structure of the access plan Additional feature: save the resulting inteface (for query and results) for canned vertical applications Apply a stylesheet for making the application real –Set of default stylesheets that can be painted upon the inteface –Possibility of defining custom stylesheets 24

3. Support to the designer. SETTING UP THE LIQUID QUERY ENVIRONMENT SeCo workshop, Como, June 17th-19th, 2009

Brambilla, Ceri Search Computing: LIQUID QUERIES Registration Time The role of the designer is at registration time! Low development cost Higher cost of registration –Description of services –Description of default interfaces for services inputs and results 26

Brambilla, Ceri Search Computing: LIQUID QUERIES The tools Strong parameter typing –UI fields are typed Static mapping to services –UI fields are directly mapped to search services BPM-like modeling of the user interaction and query processing steps Automatic generation of UI Adaptivity and customization of the query interaction 27

Brambilla, Ceri Search Computing: LIQUID QUERIES The hard task: Registration time Building access patterns Building binding Defining the (lightweight) semantics –Domains –Keywords Defining the (default) presentation –Forms –Results 28

4. SeCo Extensions: High level queries SeCo workshop, Como, June 17th-19th, 2009

Brambilla, Ceri Search Computing: LIQUID QUERIES High level queries Almost NL-specified queries –Conjunctive noun phrases Need to be decomposed and mapped to semantic domains need of domain repository Require NLP and semantization of phrase contents need of NL analysis

Brambilla, Ceri Search Computing: LIQUID QUERIES Domain repository Storage of –domain definitions taxonomy (e.g., Dewey classification) –mappings of NL words to domains –mappings of services to domains Shallow approach based on –Wordnet (synsets Sx) –Wordnet-Domains (domains Dx) 31 D1 D2 D3 S1 S2... S6 (.2,.8) Service Repository ss1 (.4,.6)

Brambilla, Ceri Search Computing: LIQUID QUERIES Domain Repository: API Three main interfaces: Domain query: used to extract a domain (or a list of domains), and their corresponding properties, that relate to a specific string Service extraction: used to extract the list of services associated to the domain Domain hierarchy update: used to update the domain hierarchy

Brambilla, Ceri Search Computing: LIQUID QUERIES Query Analyser Starts from almost- natural language specification of the user request tries to determine a decomposition in subqueries that can fit the problem of mapping on a domain E.g.: scientific conference reachable with a cheap flight, with a beautiful beach nearby Target splitting –q1=scientific conference ", –q2=reachable with a cheap flight ", and –q3=with a beautiful beach nearby ".

Brambilla, Ceri Search Computing: LIQUID QUERIES Query Analyser For NLP, we exploit an open source tool developed by the Stanford Natural Language Processing Group The outcome is a tree representation of the query Definition of euristics for query splitting To optimize the recognition of query-subquery relations: –iterative invocation of the NLP tool based on various arguments (feedback from user, feedfwd/back from other components,...); – exploitation of knowledge/services available in other components. E.g., knowledge –about the available services, domains, and so on; – syntax/logic analysis results on the sentence.

Brambilla, Ceri Search Computing: LIQUID QUERIES Back to the example - 2 scientific conference reachable with a cheap flight, with a beautiful beach nearby Very coarse euristics: –Subqueries = first level subtrees Obtained splitting –q1=scientific conference reachable", –q2=with a cheap flight ", and –q3=with a beautiful beach nearby ". 35

Brambilla, Ceri Search Computing: LIQUID QUERIES Back to the example - 3 Still not exact, but rather close (being the first shot:) Further information can be extracted –Association between words: e.g., cheap_flight –Meaning of phrase connectives And... –What about negation? –What about join attributes between phrases? –... 36

Brambilla, Ceri Search Computing: LIQUID QUERIES Query Analyser Tasklist –Extraction of a corpus of queries from Yahoo! Answers –Definition of concrete options for optimization of the extractor –Training? –Validation of the approach on the corpus –Mapping: currently could be trivial on keywords of domains

Q uestions? Search Computing