Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.

Slides:



Advertisements
Similar presentations
Peer-to-Peer Infrastructure and Applications Andrew Herbert Microsoft Research, Cambridge
Advertisements

Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Distributed Data Processing
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. The Future is Now JeromeDL A Digital Library on Social Semantic.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
1 Planetary Network Testbed Larry Peterson Princeton University.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
June 22-23, 2005 Technology Infusion Team Committee1 High Performance Parallel Lucene search (for an OAI federation) K. Maly, and M. Zubair Department.
Gnutella 2 GNUTELLA A Summary Of The Protocol and it’s Purpose By
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
FRIENDS: File Retrieval In a dEcentralized Network Distribution System Steven Huang, Kevin Li Computer Science and Engineering University of California,
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
CS218 – Final Project A “Small-Scale” Application- Level Multicast Tree Protocol Jason Lee, Lih Chen & Prabash Nanayakkara Tutor: Li Lao.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
A Framework for Cost-Effective Peer-to- Peer Content Distribution Mohamed Hefeeda and Bharat Bhargava Department of Computer Sciences Purdue University.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Middleware for P2P architecture Jikai Yin, Shuai Zhang, Ziwen Zhang.
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
Week #10 Objectives: Remote Access and Mobile Computing Configure Mobile Computer and Device Settings Configure Remote Desktop and Remote Assistance for.
INTRODUCTION TO PEER TO PEER NETWORKS Z.M. Joseph CSE 6392 – DB Exploration Spring 2006 CSE, UT Arlington.
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Cli/Serv.: JXTA/151 Client/Server Distributed Systems v Objective –explain JXTA, a support environment for P2P services and applications ,
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
1 Telematica di Base Applicazioni P2P. 2 The Peer-to-Peer System Architecture  peer-to-peer is a network architecture where computer resources and services.
On P2P Collaboration Infrastructures Manfred Hauswirth, Ivana Podnar, Stefan Decker Infrastructure for Collaborative Enterprise, th IEEE International.
THE GITB TESTING FRAMEWORK Jacques Durand, Fujitsu America | December 1, 2011 GITB |
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Peer-to-Pee Computing HP Technical Report Chin-Yi Tsai.
Real World Case Study KM Summer Institute June Rano Joshi, Vorsite.
OCLC Online Computer Library Center Annual Report: New Enterprises & Development News Marty Withrow, Director Product Development Division oclc.org.
August 2003 At A Glance VMOC-CE is an application framework that facilitates real- time, remote cooperative work among geographically dispersed mission.
Super-peer Network. Motivation: Search in P2P Centralised (Napster) Flooding (Gnutella)  Essentially a breadth-first search using TTLs Distributed Hash.
PEERSPECTIVE.MPI-SWS.ORG ALAN MISLOVE KRISHNA P. GUMMADI PETER DRUSCHEL BY RAGHURAM KRISHNAMACHARI Exploiting Social Networks for Internet Search.
Semantic based P2P System for local e-Government Fernando Ortiz-Rodriguez 1, Raúl Palma de León 2 and Boris Villazón-Terrazas 2 1 1Universidad Tamaulipeca.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
6° of Darkness or Using Webs of Trust to Solve the Problem of Global Indexes.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
May 26-28ICNEE 2003 ARCHON: BUILDING LEARNING ENVIRONMENTS THROUGH EXTENDED DIGITAL LIBRARY SERVICES Hesham Anan, Kurt Maly, Mohammad Zubair,et al. Digital.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
ZLOT The Z Texas Implementation Component of the Library of Texas Dr. William E. Moen Principal Investigator.
Development of the West Virginia University Electronic Theses & Dissertations System Presented By Haritha Garapati at ETD the 7 th International.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
CMSC 691B Multi-Agent System A Scalable Architecture for Peer to Peer Agent by Naveen Srinivasan.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
GEOSS Common Infrastructure (GCI) The GEOSS Common Infrastructure allows Earth Observations users to search, access and use the data, information, tools.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
GCI Architecture GEOSS Information System Meeting 20 September 2013, ESA/ESRIN (Frascati, Italy) M.Albani (ESA), D.Nebert (USGS/FGDC), S.Nativi (CNR)
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
About me Civil engineer (not in IT) and self-taught developer
CHAPTER 3 Architectures for Distributed Systems
Early Measurements of a Cluster-based Architecture for P2P Systems
DHT Routing Geometries and Chord
Peer-to-Peer Information Systems Week 6: Performance
Presentation transcript:

Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University

Outline Introduction/Motivation P2P Based Digital Library Network Architecture Current Status Conclusion

Motivation Efficient Infrastructure for Dissemination – Digital libraries are the cornerstones of modern learning environment. They provide an infrastructure for publishing and managing quality content so it is discovered easily and effectively. Cost Effective – Digital libraries provide cost-effective access to a wealth of information for remote and geographically dispersed users, thus making them an effective tool for online learning environment

Problems Centralized Framework – Requires an organization in a domain/community to take a lead in not only providing the hardware and software infrastructure to support a digital library, but processes to develop and maintain the content Evolving Communities – With time a community ’ s interest seems to grow out and move to a different topic, however the processes and interfaces that have been put in place are not flexible to accommodate the changing interest of the community. Distributed Membership – Problem of building a community where members are distributed, do not know of other members, do not know of available interest areas, yet still want to come together in a community of common interest.

Approach Digital Library architecture (Freelib) based on P2P model that is scalable and supports communities and domains evolving from the bottom up. A P2P network is a distributed network with no central control and consists of nodes running identical software. Example P2P networks for file sharing: Gnutella, Napster, and Kazaa

Issues Keeping the network connected in the presence of frequent joining, re-joining, and leaving of participants Maintaining a low barrier for building communities with diverse interests Providing support for new and evolving communities with changing interests. The key in our vision of supporting evolving communities is to build a network with nodes clustered by access patterns. The major challenges in building such a library are:

Network Architecture The network architecture of Freelib consists of two overlaying networks: access network and support network. The access network topology is characterized by the user access pattern The support network topology is determined by an adaptation of the Symphony protocol. The Symphony protocol preserves the connectivity and small world properties of the network for supporting efficient search.

Network Architecture Figure 1: Network architecture, showing the access and the support network

Key Features: Evolving Communities Friends: every node keeps a list of preferred peers The friend list is ranked based on access patterns with higher ranks given to peers having higher access Nodes migrate on the ring so that nodes that share common interest occupy adjacent locations Note: by ‘ Access ’ we mean access of full-text document. The reason ‘ Search ’ is not considered is simple: user may get lots of search results that she may consider not interesting. On the other hand, download of full-text after seeing the metadata about the item is a better indication that the document is of interest to user

Key Features: Friend Ranking Ranking of friends N: is the total number of outgoing accesses to all nodes during a time interval t. n i : is the total number of outgoing accesses to i th friend in the list. P: is the total number of incoming accesses from all nodes during a time interval t. p i : is the total number of incoming accesses from i th friend in the list.  : is the weighting factor in the range 0 to 1 where =1 ignores incoming accesses in the ranking calculation; and  = 0 ignores outgoing accesses in the ranking calculation.

Key Features: Migration Migration enables evolution of communities When: if my top ranked friends are far away on the ring, it is time for me to join them In the steady state, most of the short contacts are identical to access friends

Key Features: more search options Search query: – Simple keyword search – Advanced search – Metadata based: author = ‘ John Smith ’ Search type: – Friend search: search only friend nodes (this one is suitable for nodes with community evolution done and is expected to return the most interesting results) – Local search: search friends and neighbor nodes (after migration, a node should use this mode to discover potential friends after migration is done) – Global search: search friends, neighbors, and distant links also (this is suitable for newcomers until they join their communities)

Freelib Client Architecture Search/Access Publish History/Log Collection MD + Full-text + Indexes Messenger Network Agent Friends & Contacts P2P network Main UI Log Manager Registry HTTP Collection Manager Figure 2: FreeLib client Architecture

Current Status - Implementation Phased Implementation: start with basic features, then each phase adds more features and enhances existing ones Phase 0: establish a small network and show the Search and Access protocols in motion Later phases: complete Network Protocol implementation, provide advanced search tools, etc.

Current Status - Phase 0 Implement publish, search, access, access log and other supporting modules such as Http, messenger, collection manager, registry, etc. Testbed Build a small p2p network (5 nodes: n1, n2, n3, n4, n5) Publish some documents Perform some searches (simple keyword search for now) Do some accesses (downloads of full-text) Show how the friend list is changing based on the accesses.

Figure 3: Node 1 and Node 2 of Phase 0 demo (node 1 on the left) n3 n5 n2 n3 n4 n1

Figure 4:Node 3 and Node 4 of Phase 0 demo (node 3 on the left) n5 n1 n2 n5 n2

Future work Complete implementation of network protocol and enhanced implementation of other modules Address scalability issues regarding collection size and number of requests. Need to have indexing in the collection manager Testbed and/or simulation to evaluate the protocol Support for OAI protocol to allow interaction with OAI service-/data- providers in the future

Conclusion High school teachers nation-wide can build their own community to publish and exchange school project ideas, class syllabuses, classroom materials. Faculty members of universities nation-wide can use such a system to publish and make available their own publications, project reports, course material and to search for publications of interest to them. Students can create collections of material related to their courses and also to their private life. Freelib will have potentially a broad impact on the education community, as there are many target communities that can benefit from such a system.