CIDR 2007, Asilomar California1 Predicate-Based Indexing of Enterprise Web Applications Cristian Duda, David Graf, Donald Kossmann ETH Zurich.

Slides:



Advertisements
Similar presentations
WWW Challenges : Supporting Users in Search and Navigation Natasa Milic-Frayling Microsoft Research, Cambridge UK SOFSEM 2004 January 28, 2004.
Advertisements

Previous Lectures: Planning of a Web site: Discussing the strategic issues of Web site engineering process –Models used for Web site planning –Compare.
DT228/3 Web Development Introduction to Java Server Pages (JSP)
1 Towards Automating Complex Associative Access to Multiple Bioinformatics Data Sources Ling Liu, Calton Pu David Buttler, Wei Han Henrique Paques, Dan.
G O B E Y O N D C O N V E N T I O N WORF: Developing DB2 UDB based Web Services on a Websphere Application Server Kris Van Thillo, ABIS Training & Consulting.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
Agenda: Solomon N’Jie Overview Microsoft Enterprise Search Solution
Server-Side vs. Client-Side Scripting Languages
Web Server Hardware and Software
2/11/2004 Internet Services Overview February 11, 2004.
Chapter Apache Installation in Linux- Mandrake. Acknowledgment The following information has been obtained directly from
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
Google App Engine and Java Application: Clustering Internet search results for a person Aleksandar Kartelj Faculty of Mathematics,
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
Java Pet Store Application. Outline Introduction Introduction Information Layer Information Layer Application Layer Application Layer Infrastructure Layer.
Midwest Documentum User Group Harley-Davidson Documentum WCM 10/10/2006.
Customer Forum OTech’s New Web Publishing Service Web Services Section – April 29, 2015.
 Cloud computing is one of the more recent technologies that many businesses, individuals and other industry organizations believe to by one of the keys.
February Semantion Privately owned, founded in 2000 First commercial implementation of OASIS ebXML Registry and Repository.
IT 210 The Internet & World Wide Web introduction.
J2EE Web Fundamentals Lesson 1 Introduction and Overview
ITIS 1210 Introduction to Web-Based Information Systems Chapter 24 How Websites Work with Databases How Websites Work with Databases.
Cloud Connect- Google Search Appliance. What is GSA? The Google Search Appliance (GSA) provides fast, relevant search for your intranet or website.
SednaSpace A software development platform for all delivers SOA and BPM.
Java Omar Rana University of South Asia. Course Overview JAVA  C/C++ and JAVA Comparison  OOP in JAVA  Exception Handling  Streams  Graphics User.
Web 2.0: Concepts and Applications 6 Linking Data.
Marcel Casado NCAR/RAP WEATHER WARNING TOOL NCAR.
WHAT IS A SEARCH ENGINE A search engine is not a physical engine, instead its an electronic code or a software programme that searches and indexes millions.
Universiti Utara Malaysia Chapter 3 Introduction to ASP.NET 3.5.
Introduction to ColdFusion Yu Fu 2003 MEC Candidate.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
SharePoint 2010 Search Architecture The Connector Framework Enhancing the Search User Interface Creating Custom Ranking Models.
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
Searching Business Data with MOSS 2007 Enterprise Search Presenter: Corey Roth Enterprise Consultant Stonebridge Blog:
Project Overview Graduate Selection Process Project Goal Automate the Selection Process.
Instant Information Access With Magnify Search Dr. Rado Kotorov Technical Director Strategic Product Mgt.
Search Engines.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
Copenhagen, 6 June 2006 EC CHM Multilinguality Anton Cupcea Finsiel Romania.
Application Development Framework (ADF) Introduction 5-Oct-2012.
May 6, 2015 Session 10:00am – 12 Noon. Agenda 2 TopicPresenter / FacilitatorTimeframe  Welcome Michael Chen5 min.  CA Web Publishing Richard Lehman55.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
WebFOCUS Magnify: Search Based Applications Dr. Rado Kotorov Technical Director of Strategic Product Management.
Advanced Search Solutions for SharePoint Christopher Even BA-Insight.
1. 2 Google Session 1.About MIT’s Google Search Appliance (GSA) 2.Adding Google search to your web site 3.Customizing search results 4.Tips on improving.
CS562 Advanced Java and Internet Application Introduction to the Computer Warehouse Web Application. Java Server Pages (JSP) Technology. By Team Alpha.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 JSP Application Models.
Project Management May 30th, Team Members Name Project Role Gint of Communications Sai
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
4 m 9K Copyright 2002 Forum 9000, LLC Slide 1 Forum 9000 Quality Systems for Quality Care.
Java Programming: Advanced Topics 1 Building Web Applications Chapter 13.
ColdFusion MX 7 “Blackstone” Macromedia, Inc. macromedia 2005 Living With Today’s Internet Chronic problems continue to exist for users and developers.
IPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment Sriram Srinivasan.
December 9, 2004 EC511 Java Pet Store Demo Chandra Donipati.
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
Building J2EE Applications Based on Design Patterns with Business Components for Java Olivier LE DIOURIS Principal Product Manager Oracle Corporation.
The Internet Salihu Ibrahim Dasuki (PhD) CSC102 INTRODUCTION TO COMPUTER SCIENCE.
June 30, 2005 Public Web Site Search Project Update: 6/30/2005 Linda Busdiecker & Andy Nguyen Department of Information Technology.
Slide 1 © 2016, Lera Technologies. All Rights Reserved. Oracle Business Intelligence By Lera Technologies.
Search can be Your Best Friend You just Need to Know How to Talk to it IW 306 Ágnes Molnár.
Predicate-based Indexing of Annotated Data Donald Kossmann ETH Zurich
Building Enterprise Applications Using Visual Studio®
Creating Novell Portal Services Gadgets: An Architectural Overview
Building Search Systems for Digital Library Collections
Prepared by Rao Umar Anwar For Detail information Visit my blog:
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
Presentation transcript:

CIDR 2007, Asilomar California1 Predicate-Based Indexing of Enterprise Web Applications Cristian Duda, David Graf, Donald Kossmann ETH Zurich

2 Enterprise Search: Possible Approaches “Do It Yourself” (e.g., SAP, Oracle) +App vendors know the semantics of their application - Everybody impements their own search engine - Cross Application Search is difficult “Google for Web Applications” (generic ESE) + generic (for all applications) + enables cross-application search - need to teach the semantics of the app to the search engine - nobody knows how to do it

3 Enterprise Search: Current Status Search up to 50,000 documents for just $1,995. Search up to 30 million documents New! Improved search results relevance, security and access to more content. The Google Mini delivers cost- effective, high-quality search for your public website, intranet, and file servers – and you can be up and running in less than an hour. Supports from 50,000 to 300,000 documents. Learn more.Google MiniLearn more The Google Search Appliance provides robust, scalable and secure search across virtually all the information in your company. Starts at $30,000 for search across 500,000 documents. Learn more.Google Search ApplianceLearn more

4 Enterprise Application Search Search up to 50,000 documents for just $1,995. Search up to 30 million documents New! Improved search results relevance, security and access to more content. The Google Mini delivers cost- effective, high-quality search for your public website, intranet, and file servers – and you can be up and running in less than an hour. Supports from 50,000 to 300,000 documents. Learn more.Google MiniLearn more The Google Search Appliance provides robust, scalable and secure search across virtually all the information in your company. Starts at $30,000 for search across 500,000 documents. Learn more.Google Search ApplianceLearn more

5 Enteprise Application Search JSP file idnametype 1parrotgreen 2 DatabaseProperty file title.english=PetStore XML Message Snake Data User View SAP,...

6 Enterprise Search Engine (ESE) Challenges: 1. Userview assembled in a non-trivial way (not WYSIWYG) 2. References to Web Pages are complex: URL function parameters context (workflow, security) This is not Google! 1. Google is WYSIWYG 2. Google references are simple URIs This is not Hidden Web! 1. The app developer collaborates and teaches the semantics of the app to the ESE 2. The ESE has full access to all data sources

7 Enterprise Search Engine: Rules and Patterns a handful of patterns are enough to describe the mapping from raw view to user view declaratively(semi-automatic) Crawl the data sources (automatic) Normalize the data (automatic) Predicate-based indexing (automatic) Predicate-based query processing (automatic)

8 Predicate-based Index Google... ESE Doc IdKeywordScorePredicate d1java7true d1pet1true d1store1true d1parrot1$catid=1 d1finch1$catid=1 d1iguana1$catid=2 d1rattlesnake1$catid=2 d2male1$itemid=1 d2female1$itemid=1

9 Demo! Indexing Query Processing Result Generation Use Case:  Sun’s Java Pet Store Application

10 The Application JSP Application developed by Sun Uses Dynamic JSP Pages + Database Sun uses it to showcase the capabilities of their J2EE platform

11 Indexing (using our GUI) JSP Files Rules from app. developer Index location Indexed files

12 Query Processing (using our GUI) The queried Index Query Results (URL+additional info)

13 Result presentation Dbl click on query result Web page (user view) is displayed in browser. 1 2 Query:java iguana

14 Result presentation java iguanaQuery: Only appears in the JSP file Only appears in the database Our ESE understood the combination between the two data sources ! The ESE combined the two data sources just as the application would have done

15 Something funny The application also has a search functionality, but…

16 Something funny No Results! The application’s search box is broken

17 Details: Contacts: Cristian Duda ETH Zurich, Switzerland cristian.duda at inf.ethz.ch Donald Kossmann ETH Zurich, Switzerland kossmann at inf.ethz.ch