New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace.

Slides:



Advertisements
Similar presentations
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Advertisements

DuraSpace, Fedora and DuraCloud Triangle Research Libraries Network September, 2009.
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Introducing Progress Arcade Roy Ellis
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Repositories: Disruptive Technology or Disrupted Technology? Sandy Payette, Executive Director DORSDL Workshop at ECDL 2008 September 2008.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
1 The Australian Partnership for Sustainable Repositories Margaret Henty Digital Futures Industry Briefing November 8, 2006.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
© 2009 IBM Corporation ® IBM Software Group Introduction to Cloud Computing Vivek C Agarwal IBM India Software Labs.
Preservation In The Cloud Markus Wust NCSU Libraries.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
Plan Introduction What is Cloud Computing?
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Banking Clouds V International Youth Banking Forum.
“Grandpa’s up there somewhere.”. Making your IT skills virtual What it takes to move your services to the cloud Erik Mitchell | Kevin Gilbertson | Jean-Paul.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
“ Does Cloud Computing Offer a Viable Option for the Control of Statistical Data: How Safe Are Clouds” Federal Committee for Statistical Methodology (FCSM)
DuraSpace, Fedora and DuraCloud Thorny Staples Director, Community Strategy and Alliances ESIP Meeting, July 8, 2009.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace.
CLOUD COMPUTING. What is cloud computing ? History Virtualization Cloud Computing hardware Cloud Computing services Cloud Architecture Advantages & Disadvantages.
The DSpace Course Module – An introduction to DSpace.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
An Introduction to DuraCloud Michele Kimpton, Project Director Carissa Smith, Partner Specialist DuraSpace Webinar  Sept 2011.
Fedora Commons Overview and Future Plans Sandy Payette, Executive Director Cornell University Library Metadata Working Group June 13, 2008.
Software Architecture
Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, SCAPE Scalable Preservation Environments.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
DuraCloud pilot program Michele Kimpton, CEO DuraSpace Richard Rodger, Dept Head Software development, M.I.T. Libraries Claire Stewart Dept Head Digital.
The Global Video Grid: DigitalWell Update & Plan For SRB Integration Myke Smith, Manager Streaming Media Technologies University of Washington / ResearchChannel.
1 © 2009 Cisco Systems, Inc. All rights reserved.Cisco Confidential Cloud Computing – The Value Proposition Wayne Clark Architect, Intelligent Network.
Cloud Computing John Engates CTO, Rackspace Presented: Rackspace Customer Conference, 2008 October 29, 2008.
DuraCloud Enabling services for managing data in the cloud Michele Kimpton, CBO DuraSpace Bill Branan, Senior Developer DuraSpace.
EPrints 10 Years of Digital Preservation. What is EPrints For?  EPrints offers a safe, open and useful place to store, share and manage material in the.
IT and IM: Promises and Pitfalls Greta Lowe August 15, 2011.
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Enterprise Cloud Computing
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
Archiving and Preservation Michele Kimpton CEO, DuraSpace Bryan Beecher Director, ICPSR DuraSpace Webinar November 2, 2011.
0 National Geospatial Platform Jerry Johnston Department of the Interior January 6, 2016.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Web Technologies Lecture 13 Introduction to cloud computing.
Information Systems in Organizations 5.2 Cloud Computing.
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Store, Manage, and Archive Content in the Cloud Michele Kimpton, DuraSpace CEO DuraSpace Nate Klingenstein, Internet 2 Internet 2 meeting, April 2013.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
MICROSOFT AZURE APP BUILDER PROFILE: RAVERUS LTD. Raverus is a customer-driven company engaged in providing software applications designed to improve and.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
Preservation support and data management services built on cloud infrastructure Features:  Ingest once, and make multiple copies to multiple storage.
Agenda  What is Cloud Computing?  Milestone of Cloud Computing  Common Attributes of Cloud Computing  Cloud Service Layers  Cloud Implementation.
Avenues International Inc.
? What is Institutional Repository for Rutgers University
Joseph JaJa, Mike Smorul, and Sangchul Song
Implementing an Institutional Repository: Part II
Media365 Portal by Ctrl365 is Powered by Azure and Enables Easy and Seamless Dissemination of Video for Enhanced B2C and B2B Communication MICROSOFT AZURE.
Office 365 and Microsoft Project Integrations for HULAK Project Management Software Enable Teams to Remain Productive and Within Budget OFFICE 365 APP.
Fedora Filling the “Sweet Spot” in the Information Landscape
Michele Kimpton Project Director, DuraCloud NDIPP Partner meeting
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Archiving and preservation services in the cloud
Presentation transcript:

New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace

Social and Technical Forces (2000-present)  Waves of Repository-Enabled Applications Institutional Repositories Digital Collections Digital Libraries Collaborative Spaces and “Web 2.0” Scholarly and Scientific Infrastructure E-Research Data (archiving, linking, sharing)

Implications for our future work more distributed more collaborative more web - oriented more open more interoperable

Emergence of Infrastructure Source: Understanding Infrastructure: Lessons for New Scientific Infrastructure, Systems Integrate components Central control Dedicated/specialized gateways More closed More preconceived Integrate systems Distributed control Generic gateways More open More reconfigurable Networks

Source: Francine Berman, Got Data? A Guide to Data Preservation in the Information Age, pp December 2008 page 55 page 53

History: DSpace and Fedora Two open source repository systems –DSpace: End-user application and repository Turn key system providing easy out-of-box –Fedora: Web services (repository and supporting services) Flexible, modular, and scalable Enabling technology supporting… –scholarship, science, culture, education –open access –preservation and archiving

DSpace and Fedora Installations Largest share of open repositories worldwide … over 700 institutions tracked in our registries Universities Research Centers Libraries Archives Cultural Heritage Government More…

DSpace Foundation and Fedora Commons 501(c)(3) non-profit organizations Common toolsInteroperabilityNew tools and services Web APIs Storage Abstraction Architecture Strategy SWORD Deposit MS Word Plug-In DuraSpace Future Joint Offerings Business Strategy Communication/Outreach Progression of Partnership

Goals of Strategic Partnership Stewardship: – Support and align open source development communities for DSpace and Fedora –Keepers of the cause (durability + access) Innovation: –Think beyond existing platforms –New strategic directions for repositories –New products and services Sustainability: –Devise business models that fit our sector –Services that generate revenue for non-profits

What About the Cloud? An emerging architecture in which data and applications reside in cyberspace, allowing users to access via the internet (Pew Internet 9/08) A style of computing where massively scalable IT-related capabilities are provided “as a service” using Internet technologies to multiple external customers. (Gartner, 6/08).

Types of Cloud Services Software as a Service (SAAS) –e.g., Google Apps Cloud Computing –e.g., Amazon Elastic Compute Cloud (EC2) Cloud Storage –e.g., Amazon Simple Storage Service (S3)

Cloud Services

Vision: Federated Repositories and Cyberinfrastructure DuraSpace Heaven

DuraSpace Proposition Trust and durability in the cloud

What have we learned from our users? Focus Groups Site Visits Forums

Problems Tools and processes unproven Limited IT support Capital expenditures limited Task can be overwhelming ( replication, migration, emulation ect.) Preservation important but difficult to implement

Problems Systems not interoperable Heterogeneous applications/platforms Lack of commons standards Inelastic compute capability Barriers to making content more accessible and useful to researchers

Advantages – Cloud Services Flexibility Scalability Pay for use Easy to implement Cost

Public cloud providers drive cost down through scale, location and virtualization technology Large Data centers(50k+) can achieve 5 to 7 times costs savings over Medium Data Centers(1,000) *Hamilton, J Internet-Scale Service Efficiency (Sept 08) Technology*Cost Med DCCost Large DC Network$95 per Mbit/sec/mo$13 per Mbit/sec/mo Storage$2.20 per Gbyte/mo$.40 per Gbyte/mo Admin140 servers/admin>1000 servers/admin

Issues Security Transparency Data lock in SLA’s Trust

DuraSpace Trusted management of and access to durable digital assets in the cloud DuraSpace Mediating Service

DuraSpace- Notional Architecture

Architectural view

Core services-Preservation based Replicate to multiple storage providers Replicate to multiple geographic areas Be able to manage content and services through web based “Dashboard” Includes integrity checking and monitoring “Pay for use” for services and storage

Technology Services Build and run services on top of content stored in the cloud –Search –Aggregation –Streaming –Migration –Hosting Enable others to build services/apps on top of content

Use Cases: DuraSpace with Cloud Storage Online backup for text, images, datasets, video, audio Preservation-Multiple copies, geographies, administrations Temporary or permanent project storage

Use cases: DuraSpace with Cloud Compute Streaming service for video JPEG2000 image engine Indexing and other processing heavy jobs Staging area for repository ingest Repositories in cloud Data and text mining over open data Aggregation and web 2.0 tools on open content and collections

DuraSpace software Open source - apache license Open core Run Your Own: Private clouds, University consortia Extensible: Research partners

Critical success factors Ease of use- simplicity Trusted partner for end user Cost effective Scalable/Flexible Can establish key partnerships with service providers Can build community of developers and users

Timeline Identified initial cloud partners Identified initial pilot partners Defined initial requirements Initial open source release -Q Begin pilot- Fall 2009 Extensions available for repository platforms- Q Roll out to Repository community-Q Launch production service Q2 2010

Initial capabilities Replication, up to three providers (including local store) Web based “Dashboard” Data integrity checking and monitoring Can push content from DSpace/Fedora repository platform Integrated billing Compute capability A few initial compute services TBD

Listen… Sandy and Michele’s DuraSpace webinar