New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace.


1 New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace

2 Social and Technical Forces (2000-present)  Waves of Repository-Enabled Applications Institutional Repositories Digital Collections Digital Libraries Collaborative Spaces and “Web 2.0” Scholarly and Scientific Infrastructure E-Research Data (archiving, linking, sharing)

3 Implications for our future work more distributed more collaborative more web - oriented more open more interoperable

4 Emergence of Infrastructure Systems: integrate components; central control; dedicated/specialized gateways; more closed; more preconceived. Networks: integrate systems; distributed control; generic gateways; more open; more reconfigurable. Source: Understanding Infrastructure: Lessons for New Scientific Infrastructure, http://deepblue.lib.umich.edu/handle/2027.42/49353

5 Source: Francine Berman, Got Data? A Guide to Data Preservation in the Information Age, pp 50-56 December 2008 page 55 page 53

6 History: DSpace and Fedora Two open source repository systems –DSpace: End-user application and repository Turn key system providing easy out-of-box –Fedora: Web services (repository and supporting services) Flexible, modular, and scalable Enabling technology supporting… –scholarship, science, culture, education –open access –preservation and archiving

7 DSpace and Fedora Installations Largest share of open repositories worldwide … over 700 institutions tracked in our registries Universities Research Centers Libraries Archives Cultural Heritage Government More…

8 DSpace Foundation and Fedora Commons 501(c)(3) non-profit organizations Common tools Interoperability New tools and services Web APIs Storage Abstraction Architecture Strategy SWORD Deposit MS Word Plug-In DuraSpace Future Joint Offerings Business Strategy Communication/Outreach Progression of Partnership

9 http://blogs.the451group.com/opensource/

10 Goals of Strategic Partnership Stewardship: – Support and align open source development communities for DSpace and Fedora –Keepers of the cause (durability + access) Innovation: –Think beyond existing platforms –New strategic directions for repositories –New products and services Sustainability: –Devise business models that fit our sector –Services that generate revenue for non-profits

11 What About the Cloud? An emerging architecture in which data and applications reside in cyberspace, allowing users to access via the internet (Pew Internet 9/08) A style of computing where massively scalable IT-related capabilities are provided “as a service” using Internet technologies to multiple external customers. (Gartner, 6/08).

12 Types of Cloud Services Software as a Service (SaaS) –e.g., Google Apps Cloud Computing –e.g., Amazon Elastic Compute Cloud (EC2) Cloud Storage –e.g., Amazon Simple Storage Service (S3)

13 Cloud Services

14 Vision: Federated Repositories and Cyberinfrastructure DuraSpace Heaven

15 DuraSpace Proposition Trust and durability in the cloud

16 What have we learned from our users? Focus Groups Site Visits Forums

17 Problems Tools and processes unproven Limited IT support Capital expenditures limited Task can be overwhelming (replication, migration, emulation, etc.) Preservation important but difficult to implement

18 Problems Systems not interoperable Heterogeneous applications/platforms Lack of common standards Inelastic compute capability Barriers to making content more accessible and useful to researchers

19 Advantages – Cloud Services Flexibility Scalability Pay for use Easy to implement Cost

20 Public cloud providers drive cost down through scale, location, and virtualization technology. Large data centers (50k+) can achieve 5 to 7 times cost savings over medium data centers (1,000).*

Technology | Cost, medium DC | Cost, large DC
Network | $95 per Mbit/sec/mo | $13 per Mbit/sec/mo
Storage | $2.20 per Gbyte/mo | $0.40 per Gbyte/mo
Admin | 140 servers/admin | >1000 servers/admin

*Hamilton, J., Internet-Scale Service Efficiency (Sept 08)
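
The "5 to 7 times" claim can be checked against the table figures with a few lines of arithmetic. The sketch below is only an illustration: the variable and function names are made up here, and the admin figure for large data centers is a lower bound (">1000"), so that ratio is a minimum.

```python
# Figures copied from the slide's table (Hamilton, Sept 08).
# Each tuple is (medium data center, large data center).
COSTS = {
    "network_usd_per_mbit_sec_month": (95.0, 13.0),
    "storage_usd_per_gbyte_month": (2.20, 0.40),
}

# Servers managed per administrator; the large-DC value is a lower bound.
SERVERS_PER_ADMIN = (140, 1000)


def savings_factor(medium_cost: float, large_cost: float) -> float:
    """How many times cheaper the large data center is for one cost line."""
    return medium_cost / large_cost


# Cost lines: a lower large-DC cost means a higher savings factor.
ratios = {
    name: round(savings_factor(medium, large), 1)
    for name, (medium, large) in COSTS.items()
}

# Admin efficiency: more servers per admin is better, so the ratio is inverted.
ratios["admin_servers_per_admin"] = round(
    SERVERS_PER_ADMIN[1] / SERVERS_PER_ADMIN[0], 1
)

for name, factor in ratios.items():
    print(f"{name}: about {factor}x in favor of the large data center")
```

The three factors come out at roughly 7.3 (network), 5.5 (storage), and at least 7.1 (administration), which matches the range quoted on the slide.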

21 Issues Security Transparency Data lock-in SLAs Trust

22 DuraSpace Trusted management of and access to durable digital assets in the cloud DuraSpace Mediating Service

23 DuraSpace- Notional Architecture

24 Architectural view

25 Core services-Preservation based Replicate to multiple storage providers Replicate to multiple geographic areas Be able to manage content and services through web based “Dashboard” Includes integrity checking and monitoring “Pay for use” for services and storage
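
The two preservation ideas on this slide, replicating to several storage providers and checking integrity afterwards, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `InMemoryStore`, `replicate`, and `check_integrity` are hypothetical names, the stores are plain dictionaries standing in for real cloud providers, and SHA-256 is chosen only as a common checksum. It is not the DuraSpace implementation or API.

```python
import hashlib


class InMemoryStore:
    """Hypothetical stand-in for one storage provider (not a real API)."""

    def __init__(self, name: str):
        self.name = name
        self.objects = {}

    def put(self, key: str, data: bytes) -> None:
        self.objects[key] = data

    def get(self, key: str) -> bytes:
        return self.objects[key]


def checksum(data: bytes) -> str:
    """SHA-256 digest used as the fixity value (an assumed choice)."""
    return hashlib.sha256(data).hexdigest()


def replicate(key: str, data: bytes, stores: list) -> str:
    """Write the same bytes to every store and return the reference checksum."""
    for store in stores:
        store.put(key, data)
    return checksum(data)


def check_integrity(key: str, expected: str, stores: list) -> list:
    """Return the names of stores whose copy is missing or does not match."""
    failed = []
    for store in stores:
        try:
            ok = checksum(store.get(key)) == expected
        except KeyError:
            ok = False  # the copy is missing from this store
        if not ok:
            failed.append(store.name)
    return failed


# Usage: three providers, one of which silently corrupts its copy.
stores = [InMemoryStore("local"), InMemoryStore("cloud-a"), InMemoryStore("cloud-b")]
reference = replicate("thesis.pdf", b"example content", stores)
healthy_report = check_integrity("thesis.pdf", reference, stores)

stores[1].put("thesis.pdf", b"corrupted content")
damaged_report = check_integrity("thesis.pdf", reference, stores)
print(healthy_report, damaged_report)
```

A monitoring job would run the integrity check on a schedule and repair a failed copy from one that still matches the reference checksum.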

26 Technology Services Build and run services on top of content stored in the cloud –Search –Aggregation –Streaming –Migration –Hosting Enable others to build services/apps on top of content

27 Use Cases: DuraSpace with Cloud Storage Online backup for text, images, datasets, video, audio Preservation-Multiple copies, geographies, administrations Temporary or permanent project storage

28 Use cases: DuraSpace with Cloud Compute Streaming service for video JPEG2000 image engine Indexing and other processing heavy jobs Staging area for repository ingest Repositories in cloud Data and text mining over open data Aggregation and web 2.0 tools on open content and collections

29 DuraSpace software Open source - Apache license Open core Run Your Own: Private clouds, University consortia Extensible: Research partners

30 Critical success factors Ease of use- simplicity Trusted partner for end user Cost effective Scalable/Flexible Can establish key partnerships with service providers Can build community of developers and users

31 Timeline Identified initial cloud partners Identified initial pilot partners Defined initial requirements Initial open source release - Q3 2009 Begin pilot - Fall 2009 Extensions available for repository platforms - Q1 2010 Roll out to repository community - Q1 2010 Launch production service - Q2 2010

32 Initial capabilities Replication, up to three providers (including local store) Web based “Dashboard” Data integrity checking and monitoring Can push content from DSpace/Fedora repository platform Integrated billing Compute capability A few initial compute services TBD

33 Listen… Sandy and Michele’s DuraSpace webinar http://www.education-webevents.com/

