The Technical Infrastructure of the NSDL Dean Krafft, Cornell University

Slides:



Advertisements
Similar presentations
EXtensible Catalog David Lindahl University of Rochester.
Advertisements

Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
1 NSDL The National Science Foundation's National Digital Library for Science, Mathematics, Engineering and Technology Education [a.k.a. Smete, NSDL, Learns,...]
Sally Rumsey ORA Service & Development Manager Why ORA? Why Fedora?
Introducing Symposia : “ The digital repository that thinks like a librarian”
Open Repositories 2008 The NCore Platform: An Open-Source Suite of Tools and Services for Implementing Digital Libraries Dean B. Krafft Cornell University.
Building a National Science Digital Library Dean Krafft, Cornell University
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
NDR (resource references, metadata, collection data, etc.) NCS (& DDS) Expert Voices wiki.nsdl.org Harvest Manager OAI-PMH service (proai) NDR Search NCS.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
NSDL – A Tool for Teaching and Learning Eileen McIlvain Pathways Liaison NSDL Core Integration BEN Scholars Workshop December 8-10, 2006.
Digital Library Architecture and Technology
Digital Libraries: New Tools for ScienceTeaching and Learning.
Making the Most of Digital Learning Resources for STEM with NSDL 2010 Robert Noyce Teacher Scholarship Program Conference Washington DC July 8-9, 2010.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
Tutorial – Semantic Digital Libraries, May 9, 2007 WWW 2007 Copyright , DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
Open Repositories 2008 The NCore Platform: An Open-Source Suite of Tools and Services for Implementing Digital Libraries Dean B. Krafft Cornell University.
Welcome! Carol Minton Morris Communications Director NSDL Core Integration Cornell Mike Luby NSDL Publisher Relations NSDL Core Integration Columbia.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Creating and Operating a Digital Library for Information and Learning– the GROW Project Muniram Budhu Department of Civil Engineering & Engineering Mechanics.
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
Building a National Science Digital Library on Fedora Dean Krafft, Cornell University
DLESE and NSDL: Digital Library Components of Cyberinfrastructure International Workshop of Cyberinfrastructure for Geosciences IWCG Beijing, China.
Developing a Concept Extraction Technique with Ensemble Pathway Prat Tanapaisankit (NJIT), Min Song (NJIT), and Edward A. Fox (Virginia Tech) Abstract.
GPO’s Federal Digital System August 17, 2010 U.S. Government Printing Office.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
The National Science Digital Library & Shibboleth.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Tooltime: Using NSDL 2.0 Dean Krafft, Cornell University
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
Blogging and Publishing in the NSDL Dean Krafft, Carol Minton Morris (Cornell) Blythe Bennett (Syracuse)
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
Core Integration Web Services Dean Krafft, Cornell University
1 The NSDL Program Stephen Griffin National Science Foundation.
“A Library outranks any other one thing a community can do to benefit its people.” --Andrew Carnegie.
A Training Program for Shareable Metadata Metadata for You & Me is a collaboration between the University of Illinois Library and Indiana University. This.
NSDL 2.0: Creating a collaborative digital library Dean Krafft, Cornell University
Fedora Content Modeling for Improved Services for Research Databases Open Repositories 2009 Mikael Karstensen Elbæk Alfred Heller Gert Schmeltz Pedersen.
NSDL 2.0: Building a Collaborative Digital Library Dean Krafft, Cornell University
The Technical Infrastructure of the NSDL Dean Krafft, Cornell University
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Building Tools and Services on the NDR Dean Krafft, Cornell University
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
A Training Program for Shareable Metadata Metadata for You & Me is a collaboration between the University of Illinois Library and Indiana University. This.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
A centre of expertise in digital information management 10 minute practical guide to the JISC Information Environment (for publishers!)
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
NSDL: A New Tool for Teaching and Learning.
NSDL: OAI and a large-scale digital library
An Architecture for Complex Objects and their Relationships
Working with the NSDL 2.0 Data Repository
NSDL Data Repository (NDR)
Fedora Filling the “Sweet Spot” in the Information Landscape
The National Science Digital Library (NSDL)
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
The Fedora Project April 28-29, 2003 CNI, Washington DC
Presentation transcript:

The Technical Infrastructure of the NSDL Dean Krafft, Cornell University

NSDL Technical Overview Structure of the talk:  NSDL 1.0 Overview  The Fedora-based NSDL Data Repository (NDR) and NSDL 2.0  Inspiring Contribution and Collaboration – ExpertVoices  Other NSDL 2.0 Services and Tools  Q&A

What is the NSDL?  An NSF-funded $20 million/year program in Science, Technology, Engineering and Mathematics (STEM) education  A digital library describing nearly two million carefully selected online STEM resources from well over 100 collections (at  A core integration team (Cornell, UCAR, Columbia) working with 9 “pathways” portals and over 200 NSF grantees  A large community of researchers, librarians, content providers, developers, students, and teachers

NSDL 1.0  Create a “union catalog” of Dublin Core metadata records for STEM resources  Harvest those records from collections using OAI-PMH (openarchives.org)  Store records in an Oracle DB and re-serve qualified DC through OAI-PMH  Build a search index using metadata plus full- text of available content pages  Create a web portal at nsdl.org for K-gray access to NSDL resources

Infrastructure overview: NSDL 1.0 STEM Collections on the Web Central Metadata Repository Search Service Archive Service Collection Registration System NSDL.org Portal Protocol: OAI-PMH HTTP REST SQL

NSDL 1.0 Lessons  Metadata Repository was quick to implement using known technologies, but  Limited model  Metadata-centric orientation  No content – only metadata  Limited relationships – collection/item  Limits on context, structure, and access  Severe limits on contribution and collaboration  One-way data flow: NSDL → Users  Rather than one portal for everyone, support communities with common interests: Pathways now provide discipline and area-specific portals

NSDL 2.0  Create an NSDL that guides not just resource discovery, but resource selection, use, organization, annotation and contribution  Supports creating “context” for resources  Presents resources in context: linked to related concepts; with user ratings; with codes and data  Supports creating a permanent archive of resources  Enables community tools for structuring, evaluation, annotation, contribution, collaboration  Provides two-way data flow: NSDL ↔ users  Goal: Create a dynamic, living library

Creating the NSDL Data Repository  Supports storing both content and metadata  Allows arbitrary relationships among resource and metadata objects: organization, annotation, citation  Accessible through web service architecture of remixable data sources and transformations

Fedora: the NDR middleware  A Flexible, Extensible Digital Object Repository Architecture (  Open source project with $2.2 million in Mellon funding  Collaboration of Cornell and Univ. of Virginia  Key funded users include:  eSciDoc project (collaboration of the Max Planck Society and FIZ Karlsruhe)  Public Library of Science (Topaz Foundation)  VTLS Corp., Harris Corp., Library of Congress  Australian Research Repositories Online to the World  Royal Library Denmark, National Library, and DTU

What is Fedora?  An architecture, toolkit, and implementation: middleware, not a vertical application  DSpace in contrast: a vertical application with a fixed workflow targeted at users  Stores arbitrary internal and external digital objects, disseminations (transformations and combinations), relationships among objects  Entirely SOAP/REST based, disseminations are URLs  XML data store; RDBMS cache; RDF triplestore supports relationship queries

NSDL Data Repository (NDR)  References to roughly 2 million selected STEM resources on the web  Sourced metadata statements about those resources  A REST API to allow authenticated access by Pathways and providers  Support for annotation, aggregation, and other relationships

Sample NDR Objects & Relationships Publication Resource Data Set Metadata Publication Metadata Data Set Resource Code Resource Cites Metadata for Member of Metadata Provider MatForge Collection Soft Matter Collection Member of Cites Metadata for Cornell CCMR MatDL Pathway Selector for Selector for

An Information Network Overlay  Think of the NDR as a lens for viewing science content on the net  Content can be:  Local: stored directly in the NDR  Remote: accessed through a URL  Computed: derived from a database or web service  Archived: an older version stored at SDSC  It all has a repository-based URL

Network Overlay View User View API/UI Repository View with Relations & Annotations Resources on the Web

How should we use the NDR?  The NDR provides powerful capabilities for:  Creating context around resources  Enabling the NSDL community to directly contribute resources and context  Representing a web of relationships among science resources and information about those resources  How do we use it? Here’s one specific example …

Soft Matter Wiki: Planned NDR Integration  Community of approved contributors (e.g. teachers, librarians, materials scientists) are granted edit access to Soft Matter wiki  New resources and metadata are created as wiki pages and reflected into the NDR  Relevant non-wiki-based NDR resources and metadata are displayed as read-only wiki pages, subject to comment and linking  User and project pages organize NDR resources  Will work with MatDL on integrating these capabilities into Soft Matter Wiki

NDR Entry for Soft Matter Wiki Wiki Entry New Metadata New Audience MD Referenced New Resource 1 Referenced Existing Resource 2 Annotates Metadata for Member of Metadata Provider Metadata Provider Existing Collection Soft Matter Wiki Member of Inferred relationship between resources

But an NDR-integrated wiki is just the beginning …

Expert Voices  A system using blogging technology to:  Support STEM conversations among scientists, teachers and students  Tie NSDL resources to real-world science news  Create context for resources to enhance discovery, selection and use  Enable NSDL community members to become NSDL contributors: of resources, questions, reviews, annotations, and metadata  Expert Voices ≠ LiveJournal  Contributors are carefully selected, contributions are about science, the process of science, and education

Expert Voices Implementation  Open source multi-user blogging system  Published entries become NSDL resources  Owner controls publication of entries and visibility of comments  Entries can contain linked references to NSDL resources, references to URLs that should become resources, and new resource metadata  Integrated with NSDL Shibboleth-based community sign-on

MyNSDL: NDR-integrated tagging, bookmarking, and recommendation  Based on Connotea open-source folksonomic tagging/bookmarking system  Tags and bookmarking structure are reflected back into the NDR  Authorized users can “automatically” recommend new NSDL resources simply by tagging them  Gives user a personal view of NSDL resources

Other proposed applications  iVia-based Expert-Guided crawl: Tool for Pathways and others to turn websites into resource collections (in development at UC Riverside)  Moodle Course Management System – courses integrated with NSDL resources  Electronic lab notebook – integrating lab notes with code, data sets, and reference materials within the library archival framework

… NSDL 2.0 Ecosystem Protocol: OAI-PMH HTTP REST NDR API STEM Collections Search Service Archive Service Fedora- based NDR

What does this mean for the user?  NSDL 2.0 applications situate resources in context, aiding both discovery and use  Users become contributors, adding new resources, ratings, annotations, and organizational structure – frequently as a side effect of using the library  Specialized portals, tagging, and powerful relationship queries and filtering support user- specific “views” into the library

Summary  NSDL 1.0 created a large, production digital library of STEM resources for education.  NSDL 2.0 and its tools allow scientists, mathematicians, teachers, engineers, librarians, and students to create a unique web of context, contribution, and collaboration around the high-quality STEM education resources at the core of the NSDL.

Acknowledgements  NSDL NSF Program Officers  Lee Zia  David McArthur  NSDL Core Integration Team  UCAR: Kaye Howe, PI and Executive Director  Cornell: Dean Krafft, PI  Columbia: Kate Wittenberg, PI  Fedora Development Team  Cornell: Sandy Payette & Carl Lagoze  Univ. of Virginia: Thornton Staples

Questions?

Contact Information Dean B. Krafft Cornell Information Science 301 College Ave. Ithaca, NY USA This work is licensed under the Creative Commons Attribution-NoDerivs 2.5 License. To view a copy of this license, visit or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.