Open Repositories 2008 The NCore Platform: An Open-Source Suite of Tools and Services for Implementing Digital Libraries Dean B. Krafft Cornell University.

Slides:



Advertisements
Similar presentations
Yammer Technical Solutions Overview
Advertisements

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
EXtensible Catalog David Lindahl University of Rochester.
JMS messaging service  All write-only Fedora operations are published to subscribed clients  Messaging system can be durable – if client/consumer/subscriber.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May Sandy Payette Co-Director.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Building a National Science Digital Library Dean Krafft, Cornell University
AgriDrupal - a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS; - the community of practice around.
NDR (resource references, metadata, collection data, etc.) NCS (& DDS) Expert Voices wiki.nsdl.org Harvest Manager OAI-PMH service (proai) NDR Search NCS.
LON-CAPA 1 The LearningOnline Network with Computer-Assisted Personalized Approach (LON-CAPA) Stuart Raeburn Michigan State University.
Digital Library Architecture and Technology
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Crystal Hoyer Program Manager IIS Team Preview of features that will be announced at MIX09 Please do not blog, take pictures or video of session.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
Tutorial – Semantic Digital Libraries, May 9, 2007 WWW 2007 Copyright , DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University.
Project Builder and MediaMatrix: Redefining Access in the Digital Age Dean Rehberger and Michael Fegan MERLOT August 7-10, 2006 New Orleans, LA.
Open Repositories 2008 The NCore Platform: An Open-Source Suite of Tools and Services for Implementing Digital Libraries Dean B. Krafft Cornell University.
Connecting Applications with the NSDL Technical Network Services.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Creating and Operating a Digital Library for Information and Learning– the GROW Project Muniram Budhu Department of Civil Engineering & Engineering Mechanics.
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
Building a National Science Digital Library on Fedora Dean Krafft, Cornell University
DLESE and NSDL: Digital Library Components of Cyberinfrastructure International Workshop of Cyberinfrastructure for Geosciences IWCG Beijing, China.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
© 2005 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice The China Digital Museum Project.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
Tooltime: Using NSDL 2.0 Dean Krafft, Cornell University
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
Core Integration Web Services Dean Krafft, Cornell University
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
1 The NSDL Program Stephen Griffin National Science Foundation.
NSDL 2.0: Creating a collaborative digital library Dean Krafft, Cornell University
Fedora Content Modeling for Improved Services for Research Databases Open Repositories 2009 Mikael Karstensen Elbæk Alfred Heller Gert Schmeltz Pedersen.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
The Technical Infrastructure of the NSDL Dean Krafft, Cornell University
NSDL 2.0: Building a Collaborative Digital Library Dean Krafft, Cornell University
The Technical Infrastructure of the NSDL Dean Krafft, Cornell University
NSDL Technical Platforms Diagrams. NSDL Collections Technical Platform NSDL.org Library search and browse UI Key Ingest services and tools Repository.
Building Tools and Services on the NDR Dean Krafft, Cornell University
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
A Training Program for Shareable Metadata Metadata for You & Me is a collaboration between the University of Illinois Library and Indiana University. This.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
EXPLORER project Elizabeth Lunt Project Manager De Montfort University.
Fedora Service Framework Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Modelling a team-based astronomy task using LAMS James Dalziel Professor of Learning Technology, and Director, Macquarie E-Learning Centre Of Excellence.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
Fedora, Fez, and the ADR an ePoster presented at Institutional Repositories: Disseminating, Promoting, and Preserving Scholarship Utah State University.
Data Sources & Using VIVO Data Visualizing Science VIVO provides network analysis and visualization tools to maximize the benefits afforded by the data.
Vision... “… a network of learning environments and resources for Science, Mathematics, Engineering and Technology education, will ultimately meet the.
Joseph JaJa, Mike Smorul, and Sangchul Song
Overview: Fedora Architecture and Software Features
Working with the NSDL 2.0 Data Repository
NSDL Data Repository (NDR)
The Fedora Project April 28-29, 2003 CNI, Washington DC
Presentation transcript:

Open Repositories 2008 The NCore Platform: An Open-Source Suite of Tools and Services for Implementing Digital Libraries Dean B. Krafft Cornell University April 1, 2008

Open Repositories 2008 What is NCore? Data Model and Architecture Services and Management Tools End User Tools Status and What’s Next Questions Presentation Overview

Open Repositories 2008 NCore: The Technical Vision Provide support for communities across a broad range of disciplines, education levels, and degrees of engagement Enable the library as a shared, collaborative, contributory space Support the creation and display of context around library resources to enhance discovery, use, and understanding Put the library in the path of the user, enabling them to easily and comfortably integrate the library into their normal workflow Release NCore as an open-source platform for developing digital library tools

Open Repositories 2008 Context and Collaboration

Open Repositories 2008 What does NCore provide NSDL? A digital library that contains: –References to STEM resources –Metadata that describe those resources –Ways to organize, interrelate, and annotate resources Back-end tools and services to support the creation, organization, and indexing of resource references and metadata in the library End-user tools that allow: –Discovering, creating, and organizing library resources and metadata –Creating context, relationships, and annotations for the materials in the library

Open Repositories 2008 NCore in Operation NSDL Data Repository contains 5 million digital objects: 3 million resources and 2.3 million metadata records 150,000 metadata/resource record updates are harvested from over 130 collections each month Production services run on 9 Dell RedHat Linux servers, including 3 repository servers with one leader and 2 real- time replicating followers Search indexing service regularly crawls and indexes 700,000 STEM resource pages Search service handles approximately 5 million queries/month Expert Voices blogosphere currently contains 46 blogs and 1524 posts with 749 registered users

Open Repositories 2008 The Stacks (Repository) The NSDL Data Repository (NDR), implemented as set of digital objects and relationships in a Fedora repository Resource Metadata Resource Related Metadata for Member of Metadata Provider Collection Member of Related Metadata for Agent Selector for Selector for

Open Repositories 2008 NCore specializes Fedora Multiple Object Types: –Resources (with local or remote content) –Metadata –Aggregations (collections) –Metadata Providers (branding) –Agents Relationships with arbitrary graph queries: –Structural (part of) –Annotation (relates to) Maintains Fedora’s independence of datastreams - allows other metadata and content to coexist with NCore

Open Repositories 2008 NCore: Production enhancements to Fedora Problem: Original Kowari RDF triple-store could not scale to 250 million dynamically updated triples Solution: MPTStore - easily handles hundreds of millions of dynamically updated triples for useful subset of RDF queries Problem: Recovery model of rebuilding from Fedora XML files (Foxml) did not scale to 5 million digital objects Solution: Transaction journalling - supports real-time updates to multiple replicas. Journals can accumulate for later playback. Supports static backup of “followers”

Open Repositories 2008 NCore implements an information network overlay...

Network Overlay View User View API/UI Repository View with Relations & Annotations Resources on the Web

Open Repositories 2008 Key aspects of this overlay Vision is to represent contextual knowledge around web resources Serves as a forum for independent parties to contribute, discover, use, and re-use this context at will Allows libraries to construct a cohesive and vetted view of the contents therein Allows all these independent parties to go about their business and not step on each other's toes NOTE: The subset of resources on the web can be an existing repository!

Open Repositories 2008 Back-end Services NDR API: A REST-based web services interface to the NDR Ingest: OAI-PMH metadata aggregator OAI-PMH server for library metadata NSDL Search Service Digital Discovery System

Open Repositories 2008 NCore: NDR API Uses REST calls for all interactions Specializes Fedora for NDR objects/relationships Disseminations allow combining metadata from multiple sources, or related content Authentication: Requests signed with private key associated with an agent Authorization: Agent can become a metadata provider or aggregator; can create resources Documented at

Open Repositories 2008 OAI-PMH Services Harvesting and Ingest –Automated process: Harvest trigger files created by scheduling system –Full logging with feedback to provider –Automated rescheduling Repository OAI-PMH serving –Uses Fedora proai service to index Dublin Core datastreams in metadata objects –Collections server as OAI-PMH sets –RDF relationships can be expressed and served as metadata

Open Repositories 2008 OAI-PMH Automated Harvesting Collections validate their OAI-PMH server CI registers collection (CRS) harvest schedule, baseURL, set information… Full harvest initiated Subsequent incremental harvests according to schedule automated s if problems

Open Repositories 2008 NSDL Search Service Based on Lucene/Nutch Service exposes full power of Lucene queries Indexes metadata records incrementally harvested from NDR Crawls resources on web, indexing full text of resource Scales easily to millions of resources

Open Repositories 2008 DDS Search Service Digital Discovery System Web Service (DDSWS), developed by DLESE/Digital Learning Sciences REST web service interface Supports detailed searching on specialized metadata fields (e.g. education level) Efficient for moderate sized collections (order 10,000 records) Documentation at 1/service_specification.jsp 1/service_specification.jsp

Open Repositories 2008 Management/Developer Tools NCore Collection System (NCS): full XML- driven metadata cataloging/management Strand Maps Web Service: concept/knowledge mapping and visualization Shibboleth Identity Management Clients: user authentication Planned: NCore Tool Kit (PHP, Java, and Javascript toolkit for NDR API access)

Open Repositories 2008 Strand Map Service Internationally-recognized science learning goals and progressions Enables teachers and learners to –Visualize and explore learning goals and their interconnections –Use learning goals to locate and assess resources and curriculum components –Enhance their science knowledge and pedagogical content knowledge –Adapt instructional materials while supporting recognized learning goals Institutions use SMS web service to create applications and interfaces

Open Repositories 2008 End-user tools Blogging with integrated NSDL search, resource linking, and publication, based on WordPress MultiUser Wiki with integrated NSDL search, resource linking, and publication, based on MediaWiki Content management system with workflow and NDR publication, based on Fez

Open Repositories 2008 NCore Model - Extending Existing Open Source Tools Create search service plugin –find resources to talk about –insert links to resources Create data repository interaction plugins –add new resources to the library –add referenced resources to the library –add metadata about resources Community sign-on (Shibboleth) Add administrative support Skins/themes Build on RSS

Open Repositories 2008 Create

Open Repositories 2008 Add References

Open Repositories 2008 Annotate

Open Repositories 2008 NDR API DescribeContribute

Open Repositories 2008 NDR API AnnotateRelateAggregate

Open Repositories 2008 Blog about it

Open Repositories 2008 NDR API Repository Relationships

Open Repositories 2008 OnRamp – OnFire Distribution System

Open Repositories 2008 Fez - Record

Open Repositories 2008

OnRamp - Integration with NDR

Open Repositories 2008 Repository Relationships NDR API

Open Repositories 2008 Status NSDL.org and OAI server/ingest in production since 2002 NDR/NDR API in production since January 2007; v1.1 released on SourceForge in December 2007 NDR search service in production since January 2007; SourceForge beta release March 2008 Expert Voices in production since early 2007; plugins released on SourceForge in March 2008 NSDL Wiki in production now; plugins release on SourceForge in March 2008 OnRamp in production since January 2008 NCS in production now; released on SourceForge in March 2008

Open Repositories 2008 What’s Next in 2008? SourceForge release of NSDL search code, MediaWiki extensions, Wordpress extensions (done as of April 1) Registering RSS feeds to support bookmarking, folksonomic tagging systems such as del.icio.us and Nature Publishing’s Connotea Extensions to the Moodle Course Management System to support searching for and linking to NSDL resources Tool to create NSDL “personal magazine” – user-level selection, organization, annotation and presentation of NSDL resources

… Protocol: OAI-PMH HTTP REST NDR API STEM Collections Archive Service NCS NCore: The Technical Ecosystem

Open Repositories 2008 Summary - Why NCore? It is designed to flexibly support a range of digital library and repository applications It can layer over existing repositories It can scale up to large libraries - certainly tens of millions of resources It comes with a significant existing suite of tools and services It has been demonstrated to work in a large production library It is available in open source from SourceForge under the Educational Community License (ECL 1.0) Funding for NCore (as part of NSDL Technical Network Services) now guaranteed through 2012

Open Repositories 2008 OnRamp Expert Voices NSDL Wiki For more information on NCore: Collaborative Tools:

Open Repositories 2008 Acknowledgements NSF EHR/DUE - Lee Zia, Program Officer NSDL Core Integration Team –UCAR: Kaye Howe, PI and Executive Director –Cornell: Dean Krafft, PI –Columbia: Kate Wittenberg, PI Fedora Development Team –Cornell: Sandy Payette & Carl Lagoze –Univ. of Virginia: Thornton Staples This material is based upon work is supported by the National Science Foundation under Grants No. DUE , , , and Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Open Repositories 2008 Contact Information Dean B. Krafft Cornell Information Science 301 College Ave. Ithaca, NY USA This work is licensed under the Creative Commons Attribution-ShareAlike 2.5 License. To view a copy of this license, visit or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA. When separated from this work, some images may be covered by separate copyright or license terms.