P2Pedia A Distributed Wiki Network Management and Artificial Intelligence Laboratory Carleton University Presented by: Alexander Craig May 9 th, 2011.

Slides:



Advertisements
Similar presentations
Distributed Data Processing
Advertisements

Chapter 21 Successfully Implementing The Information System
Spotlighting Decentralized P2P File Sharing Archie Kuo and Ethan Le Department of Computer Science San Jose State University.
Chapter 13 Physical Architecture Layer Design
Architecture & Performance Community Place case study Presented by u Jin Hyung, SEO.
XML –Query Languages, Extracting from Relational Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Tutorial 11: Connecting to External Data
Principles of Information Systems, Sixth Edition Organizing Data and Information Chapter 5.
Overview of Search Engines
Version Enterprise Architect Redefines Modeling in 2006 An Agile and Scalable modeling solution Provides Full Lifecycle.
Chapter One Overview of Database Objectives: -Introduction -DBMS architecture -Definitions -Data models -DB lifecycle.
Department of Computer Science, University of California, Irvine Site Visit for UC Irvine KD-D Project, April 21 st 2004 The Java Universal Network/Graph.
A Scalable Framework for the Collaborative Annotation of Live Data Streams Thesis Proposal Tao Huang
Web 2.0: Concepts and Applications 2 Publishing Online.
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
Evaluating Centralized, Hierarchical, and Networked Architectures for Rule Systems Benjamin Craig University of New Brunswick Faculty of Computer Science.
The Earth System CoG Collaboration Environment Sylvia Murphy and Cecelia DeLuca (NOAA/CIRES), and Luca Cinquini (NASA/JPL) AGU Ocean Sciences February.
1 10 THE INTERNET AND THE NEW INFORMATION TECHNOLOGY INFRASTRUCTURE.
1 Benjamin Perry, Venkata Kambhampaty, Kyle Brumsted, Lars Vilhuber, William Block Crowdsourcing DDI Development: New Features from the CED 2 AR Project.
Protocol Architectures. Simple Protocol Architecture Not an actual architecture, but a model for how they work Similar to “pseudocode,” used for teaching.
Windows 2003 Overview Lecture 1. Windows Networking Evolution Windows for Workgroups – peer-to-peer networking built into the OS Windows NT – separate.
11/16/2012ISC329 Isabelle Bichindaritz1 Web Database Application Development.
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
Mobile Topic Maps for e-Learning John McDonald & Darina Dicheva Intelligent Information Systems Group Computer Science Department Winston-Salem State University,
ISTE Standards for Teachers ISTE Standards for Students –Check the profiles, can you integrate these opportunities for your students?Check the profiles,
Master Thesis Defense Jan Fiedler 04/17/98
Evaluation of a Publish/Subscribe System for Collaboration and Mobile Working Collaborative Advertising over Internet with Agents Independent Study: Wireless.
CitiWiki Summer Workshop Day 1: Introduction Li Xu CS, UMass Lowell.
OEI’s Services Portfolio December 13, 2007 Draft / Working Concepts.
PSI Peer Search Infrastructure. Introduction What are P2P Networks? The term "peer-to-peer" refers to a class of systems and applications that employ.
Project Overview Graduate Selection Process Project Goal Automate the Selection Process.
MapReduce Kristof Bamps Wouter Deroey. Outline Problem overview MapReduce o overview o implementation o refinements o conclusion.
Enabling Peer-to-Peer SDP in an Agent Environment University of Maryland Baltimore County USA.
An Introduction to Wikis Bushra Faisal February 17, 2009.
For: CS590 Intelligent Systems Related Subject Areas: Artificial Intelligence, Graphs, Epistemology, Knowledge Management and Information Filtering Application.
Gurleen Ahluwalia Lecturer in Communication Skills BBSBEC, Fatehgarh Sahib Punjab.
Database Server Concepts and Possibilities Lee Lueking D0 Data Browser Workshop April 8, 2002.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
CSCI-235 Micro-Computers in Science The Internet and World Wide Web.
Windows SharePoint Services. Overview Windows SharePoint Services (WSS) Information Worker Infrastructure component delivered in Windows Server 2003 Enables.
Principles of Information Systems, Sixth Edition Organizing Data and Information Chapter 5.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Principles of Information Systems, Sixth Edition Organizing Data and Information Chapter 5.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Alex Chee Daniel LaBare Mike Oster John Spann Bryan Unbangluang Collaborative Document Sharing In Conjunction With.
What is a Wiki? A wiki is an online database that can be edited by anyone with access to it. “ Wiki ” is Hawaiian meaning ‘ fast ’ or ‘ quick ’
XML Extensible Markup Language
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
General Architecture of Retrieval Systems 1Adrienn Skrop.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Group collaborative Projects: TECHNOLOGICAL TOOLS TO ASSESS INDIVIDUAL CONTRIBUTION.
Novell iFolder Novell Academy QuickTrain. What is iFolder? Novell iFolder lets users’ files follow them anywhere A simple and secure way to access, organize.
CS 405G: Introduction to Database Systems
Project 1 Introduction to HTML.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
Software Design and Architecture
CHAPTER 3 Architectures for Distributed Systems
XML in Web Technologies
TYPES OF SERVER. TYPES OF SERVER What is a server.
Microsoft Office Illustrated
Content Management Systems
NoSQL Systems Overview (as of November 2011).
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
Presentation transcript:

P2Pedia A Distributed Wiki Network Management and Artificial Intelligence Laboratory Carleton University Presented by: Alexander Craig May 9 th, 2011

2 Overview What is a Wiki? Distributed vs. Centralized Wikis Article Versioning Model What is P2Pedia? Versioning Path Queries and Wiki Links Trust Indicators

3 What is a Wiki? “the simplest online database that could possibly work” - Ward Cunningham

4 What is a Wiki? Wikis are edited through a web based interface using a simple markup language

5 What is a Wiki?

6 Distributed vs. Centralized Wikis Traditional wiki systems, such as Wikipedia, use a centralized architecture where all content is stored on a central server.

7 Distributed vs. Centralized Wikis In a distributed wiki system, every user of the system acts as a host, and shares a portion of the wiki data set.

8 Advantages of Distributed Wikis Reliability - No centralized point of failure Scalability - Distributed bandwidth usage and costs No centralized authority to censor content Offline operation – Users can retain read and edit access to all locally stored data even in the case of a network failure

9 Versioning Model In a traditional, centralized wiki versioning is strictly linear.

10 Versioning Model In a distributed system, articles may be modified concurrently by separate authors on separate nodes.

11 Distributed Wikis – Existing Work Existing work largely aims to present a logically centralized service over a distributed architecture.

12 Versioning Model Distributed modifications result in a tree of article versions, rather than a linear sequence

13 Versioning Model

14 Versioning Model

15 Versioning Model: Future Work P2Pedia also aims to support user assisted merging of articles. In this case, a lattice of article versions may be formed.

16 Versioning Model This distributed versioning model reflects the underlying peer to peer infrastructure of the system, and may be better suited to some domains (ex: teaching materials).

17 What is P2Pedia? A distributed wiki system implemented using the community customization options provided by the Universal Peer to Peer framework Articles are stored in an XML format, and use the Creole wiki markup open standard CREOLE – A common wiki markup

18 P2Pedia

19 P2Pedia

20 P2Pedia

21 Versioning Queries Complex queries can be performed on the version graph using the provided graph query functionality of U-P2P.

22 Versioning Queries

23 Versioning Queries

24 Versioning Queries

25 Versioning Queries

26 Versioning Queries Ex: Find all second generation edits of article “B”

27 Versioning Queries Ex: Find all sibling articles (other edits of the same parent) of article “F”

28 Versioning Queries Ex: Find all edits (descendents) of article “B”

29 Wiki Links In a centralized wiki, the latest version of an article is always authoritative.

30 Wiki Links Because no authoritative version of an article exists, wiki links are implemented as a search for all descendents.

31 Trust Indicators If article revisions are not merged into a single version, how should a user select which version of an article to view? U-P2P provides a number of resource trust indicators alongside search results to provide users with additional information.

32 Trust Indicators

33 Trust Indicators: Document Popularity Document popularity = Number of hosts serving the document Ex: Document Popularity of “A” = 3

34 Trust Indicators: Network Distance Network Distance = Number of intermediate network hops between querying node and host Ex: Average Network Distance of “A” = 0.5

35 Trust Indicators: Peer Popularity Peer popularity = Number of incoming peer connections to a specified host Ex: Peer Popularity = 2 (Assuming queried connection is outgoing)

36 Trust Indicators: Similarity Similarity = (Number of hosted articles in common) / (Total number of hosted articles) Similarity = Size({A}) / Size({A, B, C}) = 1/3

37 Conclusion P2Pedia expands on existing work by supporting a collaboration model which supports the diversification of content. Versioning is realized using document links which are queried through the distributed graph query capabilities of U-P2P. General trust indicators are introduced to allow users to evaluate the quality of content.

38 P2Pedia Live Demo: Questions?

39 References [1] “Wikipedia,” Accessed May 4 th, [2] “Wiktionary,” Accessed May 5 th, [3] “Lyric Wiki,” Accessed May 5 th, [4] “UniWiki,” Accessed May 5 th, [5] “Wooki,” Accessed May 5 th, [6] “WikiCreole,” Accessed May 5 th,