Trustworthy Distributed Search and Retrieval over the Internet


Trustworthy Distributed Search and Retrieval over the Internet
Presented by Yung-Ting Chuang
Research conducted in collaboration with Isai Michel Lombera, Louise E. Moser, and P. M. Melliar-Smith
Supported in part by NSF Grant CNS 10-16193

Overview
- iTrust Introduction
- Implementation of iTrust
- Performance evaluation
- Conclusion and future work
WORLDCOMP’2011 iTrust Yung-Ting Chuang

iTrust vs. Centralized Search Engines
- Today we rely on centralized search engines (Google, Yahoo, Bing, etc.)
  - Benefits
  - Drawbacks
- iTrust distributes metadata and requests to random participating nodes

Distribution of Metadata
[Figure; label: Source of Information]

Distribution of a Request
[Figure; labels: Source of Information, Request Encounters Metadata, Requester of Information]

Retrieval of Information
[Figure; labels: Source of Information, Request Matched, Requester of Information]
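
The three phases shown in the slides above (metadata distribution, request distribution, retrieval) can be sketched in memory. This is a minimal illustration, not the prototype's code: the `Node` class, the function names, and the keyword-to-source dictionaries are all hypothetical, and real nodes would communicate over HTTP rather than share a Python process.

```python
import random


class Node:
    """A participating node (hypothetical in-memory stand-in for a web server)."""

    def __init__(self, name):
        self.name = name
        self.metadata = {}   # keyword -> names of source nodes that announced it
        self.resources = {}  # keyword -> document held by this node as a source


def distribute_metadata(source, keyword, members, m, rng):
    """Phase 1: the source sends its metadata to m randomly chosen nodes."""
    for node in rng.sample(members, m):
        node.metadata.setdefault(keyword, set()).add(source.name)


def distribute_request(keyword, members, r, rng):
    """Phase 2: the requester sends its request to r randomly chosen nodes.

    Returns the names of all sources reported by nodes where the request
    encountered matching metadata (empty if there was no encounter).
    """
    sources = set()
    for node in rng.sample(members, r):
        sources |= node.metadata.get(keyword, set())
    return sources


def retrieve(keyword, source_names, directory):
    """Phase 3: the requester fetches the information from each matched source."""
    return [directory[name].resources[keyword] for name in sorted(source_names)]


if __name__ == "__main__":
    rng = random.Random(42)
    members = [Node(f"node{i}") for i in range(100)]
    directory = {node.name: node for node in members}
    source = members[0]
    source.resources["itrust"] = "document about iTrust"
    distribute_metadata(source, "itrust", members, m=30, rng=rng)
    matched = distribute_request("itrust", members, r=30, rng=rng)
    print(retrieve("itrust", matched, directory) if matched else "no match")
```

Because both the metadata and the request go to randomly chosen nodes, a match is probabilistic; the sketch simply reports "no match" when no request node holds the metadata.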

iTrust Implementation

Web Server Foundation
- cURL: inter-node communication and resource-specific actions
  - Follows HTTP redirects
- SQLite: stores node, metadata, membership, and resources
  - Rapid deployment
  - Minimal local write privilege
  - Constraint enforcement
- Session module: tracks and distinguishes users
- Log module: for debugging and simulation
- PECL: inter-node search and requests
  - Sends POST statements
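
To make the SQLite role concrete, here is a minimal sketch of a node-local store with constraint enforcement. The table and column names are assumptions made for illustration; the slides do not give the prototype's schema. Python's standard `sqlite3` module is used for brevity rather than the prototype's own web-server stack.

```python
import sqlite3

# Hypothetical schema: membership (nodes), resources, and their keywords.
SCHEMA = """
CREATE TABLE nodes (
    id      INTEGER PRIMARY KEY,
    address TEXT NOT NULL UNIQUE          -- one membership entry per node
);
CREATE TABLE resources (
    id      INTEGER PRIMARY KEY,
    node_id INTEGER NOT NULL REFERENCES nodes(id),
    url     TEXT NOT NULL
);
CREATE TABLE keywords (
    resource_id INTEGER NOT NULL REFERENCES resources(id),
    keyword     TEXT NOT NULL,
    PRIMARY KEY (resource_id, keyword)
);
"""


def open_store(path=":memory:"):
    """Open the node-local database and create the (assumed) tables."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only when enabled
    conn.executescript(SCHEMA)
    return conn


def add_node(conn, address):
    """Insert a node into the membership; return False if it is already a member."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute("INSERT INTO nodes (address) VALUES (?)", (address,))
        return True
    except sqlite3.IntegrityError:
        return False
```

Letting the database reject duplicates and dangling references keeps the membership and metadata tables consistent without extra checks in application code.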

Application Infrastructure
- Metadata functions (XML)
  - Senders: scan the resources and create an XML list
  - Receivers: scan the list and insert sources into the current node
- Helper functions
  - Insert nodes into the membership
  - Insert keywords into the database
  - Upload or fetch resources
  - Node querying
- Java jar files
  - Apache Tika packages
  - Apache Lucene packages
  - WordNet dictionary
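
The sender and receiver metadata functions can be illustrated with a small round trip. The XML element and attribute names (`metadata`, `resource`, `keyword`, `source`, `url`) are hypothetical, since the slides do not show the actual list format; the sketch uses Python's standard `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET


def build_metadata_list(source_address, resources):
    """Sender side: turn {resource URL: [keywords]} into an XML metadata list."""
    root = ET.Element("metadata", {"source": source_address})
    for url, keywords in resources.items():
        resource = ET.SubElement(root, "resource", {"url": url})
        for keyword in keywords:
            ET.SubElement(resource, "keyword").text = keyword
    return ET.tostring(root, encoding="unicode")


def read_metadata_list(xml_text, index):
    """Receiver side: scan the list and record each keyword's source in `index`.

    `index` maps keyword -> set of (source address, resource URL) pairs.
    """
    root = ET.fromstring(xml_text)
    source = root.get("source")
    for resource in root.findall("resource"):
        for keyword in resource.findall("keyword"):
            index.setdefault(keyword.text, set()).add((source, resource.get("url")))
    return index
```

A receiver that later sees a request for one of these keywords can answer with the recorded source address, which is all the requester needs to fetch the resource.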

Public Interface
- Computer interfaces
  - Handle inter-node communication (e.g., queries, resource distribution, and metadata list distribution)
  - Use an inbox-style mechanism to distribute and read requests
- Human interfaces
  - Add nodes or metadata keywords
  - Generate requests
  - Provide feedback

Prototype Implementation of iTrust
[Figure]

Prototype Implementation of iTrust
[Figure]

Prototype Implementation of iTrust
[Figure]
Note: explain the meaning of a “hit” and the scoring.

Performance Evaluation
Assumptions:
- All nodes have the same membership set
- The Internet is reliable
- All nodes have enough memory to store sources
Variables:
- The membership contains n participating nodes
- Metadata are distributed to m nodes
- Requests are distributed to r nodes
- A proportion x of the participating nodes are operational

Probabilistic Analysis
Probabilities:
- The 1st node has the metadata and is operational: $\frac{mx}{n}$
- No match on the 1st trial: $1 - \frac{mx}{n} = \frac{n - mx}{n}$
- No match on the 2nd trial: $\frac{n - mx - 1}{n - 1}$
- No match on the $r$th trial: $\frac{n - mx - r + 1}{n - r + 1}$

Probabilistic Analysis
Probability q of no match on r trials:
$$q = \frac{n - mx}{n} \cdot \frac{n - mx - 1}{n - 1} \cdots \frac{n - mx - r + 1}{n - r + 1}$$
Probability p of a match on r trials:
$$p = 1 - \frac{n - mx}{n} \cdot \frac{n - mx - 1}{n - 1} \cdots \frac{n - mx - r + 1}{n - r + 1}, \quad \text{where } n \ge mx + r$$
If mx + r > n, then p = 1.
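
The formula for p is straightforward to evaluate numerically. The sketch below multiplies the r no-match factors and subtracts the product from 1; the function name `match_probability` and its argument checks are illustrative choices, not part of the original analysis.

```python
def match_probability(n, m, r, x=1.0):
    """Probability p of at least one match when a request goes to r of n nodes.

    n: participating nodes, m: nodes holding the metadata,
    r: nodes receiving the request, x: proportion of operational nodes.
    """
    if not (0 <= m <= n and 0 <= r <= n and 0.0 <= x <= 1.0):
        raise ValueError("require 0 <= m <= n, 0 <= r <= n, 0 <= x <= 1")
    mx = m * x
    if mx + r > n:
        return 1.0  # more request nodes than nodes without usable metadata
    q = 1.0
    for i in range(r):
        # i-th factor of q: (n - mx - i) / (n - i), for i = 0 .. r-1
        q *= (n - mx - i) / (n - i)
    return 1.0 - q


if __name__ == "__main__":
    # n = 4, m = 2, r = 2, x = 1: q = (2/4)(1/3) = 1/6, so p = 5/6
    print(f"{match_probability(4, 2, 2):.4f}")  # 0.8333
```

With x = 1 this is the complement of a hypergeometric "no overlap" probability, which gives a quick sanity check on small cases.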

Simulation Results
libcurl: URL transfer library for transferring data
Steps our program performed:
1. Clears the database
2. Adds nodes
3. Uploads a file and creates metadata
4. Distributes the metadata
5. Selects nodes for requests at random
6. Returns 1 if at least one node replies; otherwise returns 0
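
The steps above can be mimicked with a purely in-memory Monte Carlo sketch. It does not contact real nodes through libcurl; it only reproduces the random selection and the 1/0 outcome of each run. The function names are hypothetical, and treating each node as independently operational with probability x is an assumption of this sketch, so for x < 1 the estimate only approximates the analytical formula.

```python
import random


def simulate_once(n, m, r, x, rng):
    """One run: return 1 if at least one request node can reply, else 0."""
    nodes = range(n)
    holders = set(rng.sample(nodes, m))                  # metadata distributed to m nodes
    operational = {v for v in nodes if rng.random() < x} # assumed: independent failures
    requested = rng.sample(nodes, r)                     # request sent to r random nodes
    return int(any(v in holders and v in operational for v in requested))


def estimate_match_probability(n, m, r, x=1.0, trials=10000, seed=0):
    """Average the 1/0 outcomes over many independent runs."""
    rng = random.Random(seed)
    hits = sum(simulate_once(n, m, r, x, rng) for _ in range(trials))
    return hits / trials


if __name__ == "__main__":
    print(estimate_match_probability(n=100, m=20, r=20, x=1.0))
```

Averaging the returned 1s and 0s over many runs gives an empirical match probability that can be set beside the analytical value of p for the same n, m, r, and x.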

Performance Evaluation Results
[Figure]

Performance Evaluation Results
[Figure]

Conclusion and Future Work
- iTrust achieves a high probability of a match
- The match probability remains high even if some nodes are subverted
- We plan to evaluate effectiveness, efficiency, scalability, and reliability on PlanetLab
- We are investigating possible attacks on iTrust and countermeasures to such attacks
- We are investigating other implementations of iTrust based on SMS
- We plan to make the iTrust source code, tools, and documentation freely available

Questions? Comments?
Our iTrust website: http://itrust.ece.ucsb.edu
Contact information:
- Yung-Ting Chuang: ytchuang@ece.ucsb.edu
- Isai Michel Lombera: imichel@ece.ucsb.edu
Our project is supported by NSF grant CNS 10-16193