Alessandro Yoshi Polliotti 1 / 13 TERENA Networking Conference 2005 Biblioteca d'Alessandria: A Peer-to-peer Network for Scholar Knowledge Exchange Terena.

Slides:



Advertisements
Similar presentations
EPrints Web Configuratio n Management. SQL database Web server Scripts to configure repository activities Configuration files EPrints - the Administrator's.
Advertisements

IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Business Development Suit Presented by Thomas Mathews.
Geospatial One-Stop A Federal Gateway to Federal, State & Local Geographic Data
Project 1 Introduction to HTML.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Antonella De Robbio, Dario Maguolo Mathematics Library – University Library System University of Padova – ITALY Mathematics Subject Classification and.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
7DS Seven Degrees of Separation Suman Srinivasan IRT Lab Columbia University.
Middleware for P2P architecture Jikai Yin, Shuai Zhang, Ziwen Zhang.
1st Project Introduction to HTML.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
COMPUTER TERMS PART 1. COOKIE A cookie is a small amount of data generated by a website and saved by your web browser. Its purpose is to remember information.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Understanding and Managing WebSphere V5
Winter Consolidated Server Deployment Guide for Hosted Messaging and Collaboration version 3.5 Philippe Maurent Principal Consultant Microsoft.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
Open Archives for Library and Information Science: an international experience Antonella de Robbio and Paula Sequeiros IV EBIB Conference: Open Access.
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
BEN Architecture Isovera Consulting Feb Internet consulting for non-profits 2 BEN Architecture Diagram.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Navigating An Introductory Guide for Librarians Brought to you by:
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Searching Business Data with MOSS 2007 Enterprise Search Presenter: Corey Roth Enterprise Consultant Stonebridge Blog:
Navigating An Introductory Guide for Librarians Brought to you by:
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CYCLADES IST CYCLADES: A Personalised Collaborative Digital Library Environment Umberto Straccia I.S.T.I. - C.N.R. Pisa (ITALY)
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
interactive logbook Paul Kiddie, Mike Sharples et al. The Development of an Application to Enhance.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
DSpace - Digital Library Software
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Digitalcommons.unl.edu Archiving Department Records.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Chapter 1 Introduction to HTML.
An Overview of Data-PASS Shared Catalog
Project 1 Introduction to HTML.
Building Search Systems for Digital Library Collections
The New Face of Information Retrieval: The Ankara University Open Access Platform Prof. Dr. Sekine Karakaş Prof. Dr. Doğan.
RCSI institutional repository rcsi
AUC’s Role In Facilitating Access To Knowledge In The Arab World
Presentation transcript:

Alessandro Yoshi Polliotti 1 / 13 TERENA Networking Conference 2005 Biblioteca d'Alessandria: A Peer-to-peer Network for Scholar Knowledge Exchange Terena Networking Conference 2005 A.Y. Polliotti, A. Tugnoli, M. Simoncini, S. Mangiaracina

Alessandro Yoshi Polliotti 2 / 13 TERENA Networking Conference 2005 Knowledge flux Knowledge is not shared by researchers: it is a profitable business for publishers Estimates show that knowledge sharing is limited to about 10% of total scientific production The first bottleneck is a strong self censorship among researchers The second bottleneck is the chronic lack of funds The growing discomfort among researchers has brought to OAI (Open Archives Initiative) Started in 1999 and implemented in 2001 OAI is still mostly an unknown voluntary effort 100% P 2% OAI ~100% 10% 9% ??% R L ~90% of knowledge is lost Based on data from UNESCO Institute for Statistics and Ulrich’s Periodicals Directory

Alessandro Yoshi Polliotti 3 / 13 TERENA Networking Conference 2005 OAI Open Archives Initiative (OAI) develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Data Provider make their documents available to the public exposing their metadata through the OAI-PMH protocol (XML queries over HTTP) Harvester Harvest metadata from different Data Providers Service Provider use Harvesters to collect metadata from different Data Providers, then implements enhanced services over these metadata such as cross search, index, alert system, etc. BdA is both service provider and data provider OAI’ shortcomings the meaning of many metadata fields is not well defined archive fragmentation installation is above the skills of an average user

Alessandro Yoshi Polliotti 4 / 13 TERENA Networking Conference 2005 BdA Objectives Knowledge exchange To improve the accessibility To improve the speed of dissemination To increase the impact of papers and recognition of the authors To measure the excellence of a Research Institute through visibility of its scientific productivity (evaluation and branding) To stop certain bad habits! Strategy To provide Research Institutes with a tool, giving them a chance to handle and distribute their own content To promote an ethic of knowledge sharing between researchers to accelerate the transition to the new model To add a customizable tool to researchers' desktops, something useful for their daily work, not just for archiving purposes And to make it as user friendly as possible Installation, document submission, unified search archive and search engine

Alessandro Yoshi Polliotti 5 / 13 TERENA Networking Conference 2005 End user perspective Freescience (focus on single users) Allows to build simple personal digital archives Main features: Allows users to share, search and download full-text documents Direct communication via instant message Anyone can start using it with profit at once Archivemaker (focus on institutions, laboratories and power users) Allows to build structured, branded institutional archives Main features: Allows users to organize themselves in groups Allows groups to choose their sharing peers Allows groups to display their documents on web with a click Each group can be OAI-PMH compliant with a click

Alessandro Yoshi Polliotti 6 / 13 TERENA Networking Conference 2005 BdA architecture CC C C p2p network of Java based clients JBOSS application server TomcatBdA Enterprise Javabeans C C Web publishingOAI DPIndexerClient interface MySQL OAI Data Providers OAI Service Providers CC

Alessandro Yoshi Polliotti 7 / 13 TERENA Networking Conference 2005 BdA Architecture (continued) OAIPMH Search/Share … download OAI Search download

Alessandro Yoshi Polliotti 8 / 13 TERENA Networking Conference 2005 P2P File Sharing Detailed Metadata Every item in the system has a metadata Metadata contains: authors, abstract, keywords, doc. type, etc. Each metadata is timestamped with its insertion date Versioning support, collection support and more Each item is identified anywhere based on its unique SHA-1 hash P2P File Sharing based on metadata The BdA server acts like a BitTorrent tracker Current protocol allows multi source sequential download Future protocol will allow multi source random block download

Alessandro Yoshi Polliotti 9 / 13 TERENA Networking Conference 2005 P2P and firewalls Types of nodes in p2p network A) Nodes that have full access to the network B) Nodes that may only open outbound connections C) Nodes that may only open outbound connections on a limited range of ports, typically HTTP and FTP Node behaviour C nodes cannot connect to the BdA service unless the network administrator opens the necessary ports. If the number and quality of open ports is sufficient, they are equal to B nodes B nodes may connect to the BdA service however they may not download documents from each other Heuristics are being studied to allow B nodes to establish direct connections when possible A nodes may connect to the BdA service and transfer documents from and to nodes of type A and B Special A nodes may be setup by certain organizations to provide high availability and data replication

Alessandro Yoshi Polliotti 10 / 13 TERENA Networking Conference 2005 Search Engine Advanced Search based on metadata fields Permanent index is stored on BdA server database Boolean logic between fields one index for each of the most important metadata fields Handles complex search expressions boolean expressions with: parentheses boolean operator (AND,OR,XOR,NOT) wildcards (*, ?) and sentences “…” Transparently search on both BdA and OAI metadata

Alessandro Yoshi Polliotti 11 / 13 TERENA Networking Conference 2005 Groups and collaboration Groups A user may belong to one or more groups A user may publish an item in any one of his groups Groups are organized in a tree Groups may have sharing relationships among them Some users have administrative tasks for their groups Users 3 classes: User: may only submit new documents Editor: controls and validates visibility of metadata and documents Administrator: creates new groups, handles memberships, establishes sharing relationships with other groups

Alessandro Yoshi Polliotti 12 / 13 TERENA Networking Conference 2005 Web Publishing View document metadata from the web Based on groupware and sharing policies It shows all the metadata of a BdA Archive (i.e.: a group or a hierarchy of groups) Browse and Search functionality Two ways of using it Integrate it in existing web pages to display the metadata in any format (default is XML) directly on the Biblioteca d'Alessandria website (pages are in XHTML, obtained combining XML and an XSL style sheet)

Alessandro Yoshi Polliotti 13 / 13 TERENA Networking Conference 2005 Thank You! Web Site: Technical Staff: Information/Marketing: