Yimam & Kobsa July 13, 2000TWIST 2000 Centralization vs. Decentralization Issues in Internet-based KMS: Experiences from Expertise Recommender Systems.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Centralize or Decentralize? A Requirements Engineering Perspective on Internet-Scale Architectures Eric Yu University of Toronto July 2000.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Information Retrieval in Practice
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Xyleme A Dynamic Warehouse for XML Data of the Web.
A Mobile World Wide Web Search Engine Wen-Chen Hu Department of Computer Science University of North Dakota Grand Forks, ND
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
Chapter 9: Moving to Design
University of Kansas Data Discovery on the Information Highway Susan Gauch University of Kansas.
Knowledge Portals and Knowledge Management Tools
Overview of Search Engines
What’s The Difference??  Subject Directory  Search Engine  Deep Web Search.
Microsoft ® Application Virtualization 4.5 Infrastructure Planning and Design Series.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
An Intelligent Broker Architecture for Context-Aware Systems A PhD. Dissertation Proposal in Computer Science at the University of Maryland Baltimore County.
Chapter 9 Elements of Systems Design
The Design Discipline.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
International Workshop on Web Engineering ACM Hypertext 2004 Santa Cruz, August 9-13 An Engineering Perspective on Structural Computing: Developing Component-Based.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Master Thesis Defense Jan Fiedler 04/17/98
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Xiaoying Sharon Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.
Distributed Information Retrieval Using a Multi-Agent System and The Role of Logic Programming.
1 XML Based Networking Method for Connecting Distributed Anthropometric Databases 24 October 2006 Huaining Cheng Dr. Kathleen M. Robinette Human Effectiveness.
Data Mining By Dave Maung.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 05. Review Software design methods Design Paradigms Typical Design Trade-offs.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Data Mining for Web Intelligence Presentation by Julia Erdman.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
A System for Automatic Personalized Tracking of Scientific Literature on the Web Tzachi Perlstein Yael Nir.
- How to draw a clear distinction between a client and a server(there is often no clear distinction) - A server may continuously act as a client - Distinction.
Semantic Web in Context Broker Architecture Presented by Harry Chen, Tim Finin, Anupan Joshi At PerCom ‘04 Summarized by Sungchan Park
IBM Express Runtime Quick Start Workshop © 2007 IBM Corporation Deploying a Solution.
9 Systems Analysis and Design in a Changing World, Fifth Edition.
Xiaoying Sharon Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Information Retrieval in Practice
Business System Development
Search Engine Architecture
Web Services and Application of Multi-Agent Paradigm for DL
CHAPTER 3 Architectures for Distributed Systems
Ch > 28.4.
Data Warehousing and Data Mining
Data Mining Chapter 6 Search Engines
Web Mining Department of Computer Science and Engg.
Web Mining Research: A Survey
Information Retrieval and Web Design
Presentation transcript:

Yimam & Kobsa July 13, 2000TWIST 2000 Centralization vs. Decentralization Issues in Internet-based KMS: Experiences from Expertise Recommender Systems Dawit Yimam, GMD-FIT.MMK & Alfred Kobsa, UCI, ICS

Yimam & Kobsa July 13, 2000TWIST 2000 Outline Background First centralized approach Alternatives - to centralize or decentralize ? DEMOIR Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Expert Recommenders/Finders Systems to help users in tracing human information and/or expertise sources in organizations part of knowledge management and knowledge sharing services. Traditionally done by manual construction and search of expertise descriptions of people, e.g., +Expert Databases (“knowledge directories”) +Personal web pages on the Web Automatically mining implicit sources of expertise evidence from electronic resources of an organization and its people. Background Alternatives First appr. DEMOIR Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Characterizing Expert Finders 1. Expertise evidence/indicator source recognition and gathering 2. Expertise modeling - Expertise indicator extraction - expertise model representation 3. Expertise model deployment - query mechanisms - matching operation - output delivery/presentations - adaptation and learning operations Background Alternatives DEMOIR Summary First appr.

Yimam & Kobsa July 13, 2000TWIST 2000 Query-time expertise modeling Web Site Indexing Web Documents Index Background Alternatives DEMOIR Summary Glimpse FIT Peoples’ and other Web Pages WebGlimpse First appr.

Yimam & Kobsa July 13, 2000TWIST 2000 Query-time expertise modeling Query (Boolean) Background Alternatives DEMOIR Summary Web Site Indexing Web Documents Index Glimpse FIT Peoples’ and other Web Pages WebGlimpse Expert Query Interface First appr.

Yimam & Kobsa July 13, 2000TWIST 2000 Query-time expertise modeling Expert Database (Name, URL) Search Ranked List of Experts Background Alternatives DEMOIR Query (Boolean) Web Site Indexing Web Documents Index Glimpse FIT Peoples’ and other Web Pages WebGlimpse Expert Query Interface Search Result (passages containing Keywords) Expertise Modeler & Tracer First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Query-time expertise modeling Shortcomings: +high latency in query processing +personal sources hard to include +non-document sources (e.g. recommendation from people, social relations, etc.) hard to include +full reliance on availability of some search engine +limited exploitation of info due to lack of persistent expertise models Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Building apps on text Indexes Existing Web indexing systems use centralized indexes of distributed resources/collections. Distributed Indexing needed to cope with ever growing information on Internet. But, currently centralized global indexes (though may be distributed in a tightly coupled manner) consistently outperform decentralized indexing and query approaches. This favors centralizing the applications to be built on them. Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Pre-generation of Expertise Models Alternative 1: Personal expert finding agents +Decentralized multi-agent system. +Expertise modeling as well as searching done by self-managing personal agents residing in experts’ computers (e.g. Vivacqua, 1999; Foner, 1997). Alternative 2: Aggregated expertise modeling +Based on centralized expertise models (that are either dynamically aggregated or linked to a pre-constructed ontology) (e.g. simple versions in Kautz & Selman, 1998; Krulwich & Burkey, 1996). +Can be distributed among tightly coupled cluster of machines. Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Personal expert finding agents Agent communication Agent ModelExpertise FindExpert Personal Expertise Model Agent ModelExpertise FindExpert Personal Expertise Model Agent n ModelExpertise FindExpert Personal Expertise Model Agent ModelExpertise FindExpert Personal Expertise Model Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Aggregated Expertise Modeling Expert Finding Server Aggregated Expertise Model Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Aggregated Expertise Modeling Expert Finding Server Aggregated Expertise Model Gateway (broker) local Expertise Model Server 1 local Expertise Model Server 1 local Expertise Model Server n LANLAN Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Aggregated Expertise Modeling Expert Finding Server Aggregated Expertise Model Gateway (broker) local Expertise Model Server 1 local Expertise Model Server 1 local Expertise Model Server n Gateway (broker) Server 1 Server n Central Expertise Model LANLAN LANLAN Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Analysis Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Analysis (contd.) Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Hybrid Approach Combine distributed agents with centralized expertise model server - “local-central” approach How ? 1. Decentralized + centralized Expertise modeling Lightweight personal agents for personal sources Configurable gatherers for organizational resources 2. Centralized (but “distributable”) expertise information server 3. Decentralized Exploitation of expertise information (through clients) Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 DEMOIR - A Hybrid Architecture Organizational Information Resources Expertise- indicator Source Gatherers Source Type Identifier Source Wrapper 2 Source Wrapper 1 Source Wrapper n... EISM Ontology, Organizational structure, etc. Aggregated Expertise Model Expert Models Remote Expert Details API Clients Fusers Expertise Information Space Background Alternatives DEMOIR First appr. Summary

Yimam & Kobsa July 13, 2000TWIST 2000 DEMOIR - A Hybrid Architecture Gathering (decentralize d Centralized) Modeling (decentralized/centralized) Exploitation (decentralized ) Background Alternatives DEMOIR First appr. Organizational Information Resources Expertise- indicator Source Gatherers Source Type Identifier Source Wrapper 2 Source Wrapper 1 Source Wrapper n... EISM Ontology, Organizational structure, etc. Aggregated Expertise Model Expert Models Remote Expert Details API Clients Fusers Expertise Information Space Summary

Yimam & Kobsa July 13, 2000TWIST 2000 Summary/Observation Centralized and decentralized options have their advantages and disadvantages. Many problem domains involve both “centralizable” and “decentralizable” tasks  Challenges: +isolating such tasks and identifying the tradeoffs b/n centralizing and decentralizing their operations +If both approaches are used, how to get them work together Background Alternatives DEMOIR Summary First appr.

Yimam & Kobsa July 13, 2000TWIST 2000 Summary/Observation Centralization/decentralization is only one dimension of a system’s architecture. Relate to: +size/complexity of system (e.g. number of different parts, dynamism of their interaction, etc.) +heterogeneity of data and their sources +accessibility (e.g. permissions/privacy constraints, manner of use) +communication patterns among components  keep these in mind and analyze how they affect centralization/decentralization decision. Background Alternatives DEMOIR Summary First appr.

Yimam & Kobsa July 13, 2000TWIST 2000 Summary/Observation What we did (in retrospect): 1. Identify system requirements/tasks 2. Identify and analyze centralized and decentralized alternatives of performing identified tasks  thereby identify and evaluate general centralization and decentralization factors in the problem domain. 3.Specify optimum system components as well as architecture (i.e. trying to achieve advantages and avoid disadvantages of alternatives)  aim at flexibility to allow varying degrees of centralization and/or decentralization to suit different deployment environments. Background Alternatives DEMOIR Summary First appr.