Ontario Library Research Cloud: Building A Province-Wide Research Cloud for Ontario’s Academic Libraries Pascal V. Calarco, University of Waterloo IGeLU.

Slides:



Advertisements
Similar presentations
What is HathiTrust and How Can it Make a Difference? Sourcing and Scaling brought to the collective collection.
Advertisements

The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Lorcan Dempsey OCLC Big Heads – Heads of Technical Services of Large Research Libraries ALA 2013 Chicago 28 June things about
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
Columbia University Libraries / Information Services Digital Asset Management Digital Preservation Digital Publishing Stephen Davis, October 28, 2010.
University of Sydney – Academic Forum – 13 April 2005 John Shipp University Librarian THE FUTURE OF THE UNIVERSITY LIBRARY CHANGES IN SCHOLARLY COMMUNICATION.
THE JOKOMO / YAMADA LIBRARY DIGITAL LIBRARY PROJECT.
CTS PRIVATE CLOUD Quarterly Customer Meeting October 23, 2013 Kay Metsker.
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Ontario University Library Consortia Activity Ontario University Library Consortia Activity Gwendolyn Ebbett Dean of the Library University of Windsor.
Meeting of CAUL/CONZUL and CREPUQ Sub-Committee of Libraries Montréal, Québec, October 10, 2001 October 10, 2001 A Research Digital Library : a Proposal.
NHPRC ELECTRONIC RECORDS RESEARCH FELLOWSHIP SYMPOSIUM Nov. 19, 2004 Rebecca Schulte University of Kansas Project Title: Testing Boundaries—An Exploration.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Texas State University Libraries Faculty Digitization Services Overview Ray Uzwyshyn, Ph.D. MBA MLIS Director, Collections and Digital Services.
OU Digital Library development project Liz Mallett – Project Manager James Alexander – Project Developer 25 January 2012.
Preservation In The Cloud Markus Wust NCSU Libraries.
Shared October 13, 2010 Shelf Michael Roy, Dean of Library and Information Services, Middlebury College A Networked Image Platform Jeremy Stynes, Head.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
The University of Texas Research Data Repository : “Corral” A Geographically Replicated Repository for Research Data Chris Jordan.
SYNAT - the Polish National Research Content Infrastructure Wojtek Sylwestrzak, ICM Tomasz Rosiek, ICM Tomasz Krassowski, ICM Tartu, Estonia June 27, 2012.
Teaching and Learning with Technology  Allyn and Bacon 2002 Administrative Software Chapter 5 Teaching and Learning with Technology.
Digital Library Architecture and Technology
New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace.
Scholars Portal Project Ontario Council of University Libraries Scholars Portal in 2007 A Progress Report Leslie Weir Université d’Ottawa - University.
#watitis2014 ONTARIO LIBRARY RESEARCH CLOUD: BUILDING A PROVINCE-WIDE RESEARCH CLOUD FOR ONTARIO’S ACADEMIC LIBRARIES.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace.
Ymchwil Research Ymchwil Research RESAW Ioan Isaac-Richards Ingest Processes Manager Head of Web Archiving
The Fundamentals of Preserving Knowledge Assets Pacific Neighborhood Consortium 2010 Catherine Quinlan, Dean of the USC Libraries USC's Dual Approach.
Edward M. Corrado: ELAG | |
Librarian Perceptions of the Function of the Academic Library: Summer-Fall 2006 Kevin Guthrie Roger C. Schonfeld December 4, 2006.
A brief overview… “The Obama Administration is committed to the proposition that citizens deserve easy access to the results of scientific research their.
Texas Digital Library CENTRAL TEXAS AND SAN ANTONIO-AREA REGIONAL MEETING SEPTEMBER 5, 2013.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
Collaborative Markup of Library and Research Data Examples from Ontario Council of University Libraries (OCUL)
DuraCloud Enabling services for managing data in the cloud Michele Kimpton, CBO DuraSpace Bill Branan, Senior Developer DuraSpace.
Collection Management Strategies in a Digital Environment Cecily Johns CMI Project Director August 2001.
Challenges and Opportunities for Academic Libraries Collaborative Imperatives to Support Collections, Digital Initiatives, and New Services for a Changing.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Presented by Kristen J. Nyitray Head, Special Collections & University Archives University Archivist Special Collections & University Archives Stony Brook.
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
Developing strong data roots in fertile library soil: e-science in canadian libraries Geoff Harder Canadian eResearch Community, CLA, 2013.
IT and IM: Promises and Pitfalls Greta Lowe August 15, 2011.
VIVO and Scholarly Repositories: Synergistic Opportunities.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
UK LOCKSS Alliance: Investigation into Private LOCKSS Networks Adam Rusbridge EDINA, University of Edinburgh.
ScholarSpace & Open UH Mānoa March 2013 Beth Tillinghast Web Support Librarian ScholarSpace & eVols Project Manager UHM Library.
Producción de Sistemas de Información Agosto-Diciembre 2007 Sesión # 7.
1 Engineering Faculty Council Library Service Trends Mark Haslett University Librarian University of Waterloo “Day 20”
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Digital Collections Forum Doug Moncur AIATSIS September 2004.
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Storage Why is storage an issue? Space requirements Persistence Accessibility Needs depend on purpose of storage Capture/encoding Access/delivery Preservation.
A Technical Overview Bill Branan DuraCloud Technical Lead.
1 Pioneer Investments Legal and Compliance System Assessment Weekly Status Update June 23, 2005.
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Making the Case for Curation: The Practical Experiment of DSpace Managing Digital Assets February 5-6, 2005 Charleston, SC Ann J. Wolpert, Director of.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
Institutional Repository for Milligan College. Introduction.
CMU Libraries’ Digital Assets Preservation Strategy Presenter Gabrielle V. Michalek Principal Archivist and Head, Archives/Digital Library Initiatives.
Rebecca L. Mugridge LFO Research Colloquium March 19, 2008.
Dataverse at Scholars Portal Alan Darnell Director, Scholars Portal.
VI-SEEM Data Repository
Print & Digital Preservation: Current Status, Future Opportunity
DPubS: An Open Source Electronic Publishing System
Research data preservation in Canada
Presentation transcript:

Ontario Library Research Cloud: Building A Province-Wide Research Cloud for Ontario’s Academic Libraries Pascal V. Calarco, University of Waterloo IGeLU 2015 September 3, 2015

Agenda OCUL Overview Problem we’re trying to solve Funding and project plan Technology overview Some likely use cases Next steps Q&A

Ontario Council of University Libraries 21 member libraries 420,000 students Collaboration in: –Shared electronic collections –Planning & assessment –Digital library services & infrastructure

Libraries’ Growing Storage Needs Digitized physical materials: books, journals, film, audio –Reformatting to conserve original eg. Acidic paper such as newspapers –Reformatting to increase access eg. Rare materials –Format migration to preserve content eg. 16mm film

Libraries Growing Storage Needs Born digital scholarly content for long term stewardship: –E-Theses and supplemental material –Scholarship: Working papers, Pre-prints, Open Access –Research data: numeric, geospatial, image, audio –Websites and digital ephemera of academic interest –Donated electronic materials for Special Collections John English’s hard drives of personal correspondence, drafts and other materials

OCUL Storage Survey (2013) 10 of 21 institutions responded; six >10k FTE, 4 smaller than 10k Preservation & Access Needs: –80%: digitized print content –80%: faculty publications –60%: donated digital content –50%: research data –50%: GIS data –40%: purchased digital resources –20%: corporate records –20%: E-Theses

OCUL Survey: Storage Needs Current storage requirements: 100GB- 30TB; total of respondents: 58.5 TB Expected storage needs, next 2-3 years: –20% 100TB+ –40% 10TB-100TB –20% >10TB –250TB total for all 10 institutions

OCUL Survey: Storage Provisioning 80% partner with campus IT often/mostly 60% provision in-house often/mostly 40% provision with other partner libraries often/mostly 30% provision with commercial services often/mostly

OCUL Storage Survey: Top Features (2013) Large storage on demand Low cost Canadian-based hosting Transparent pricing Archival quality storage

Storage Architectures and Cost Tiers

Cloud storage options Amazon S3/Glacier: $500k/year for current 250TB SP content –$2000/TB per year, recurring DuraCloud: Amazon reseller, adding preservation & mgmt. tools –$1000-$1500/TB per year, recurring Private Cloud: OpenStack –$280-$350/TB per year, amortized over three years

MTCU Proposal and PIF funding 2013/2014: OCUL was awarded $1.2 million Productivity and Innovation Fund (PIF) funding for OLRC startup 50TB per founding partner institution Triplestore preservation: content copies at three different co-located nodes for redundancy, error correction Text mining portal for stored ScholarsPortal content

Hardware configuration Dell selected as hardware vendor. Head units: Dell Power Edge R720xd server populated with two 2.8GHz Xeon processors, 256GB of RAM, and two 200GB SSD drives which will be used to run the operating system and the OpenStack software. Each head unit also contains twelve 4TB SAS drives for an internal storage capacity of 48TB. Storage shelves: Dell PowerVault MD 1200 storage shelves, directly attached to the server, with each shelf containing twelve 4TB SAS drives, with a total capacity per shelf of 48TB. Total initial capacity 3.6 PB raw, triple-redundant, 1.2 PB net

OpenStack An open source cloud computing platform, primarily deployed as an Infrastructure-as-a-Service (IaaS) platform Swift – OpenStack object store, store and retrieve data via API Integrate OpenStack/Swift to Digital Repository architectures Develop Dropbox-like cloud storage web interface

Use Cases Digital Preservation Institutional and Personal Storage Repositories Research Data Management Text mining large volumes of digital textual content for research purposes

Digital Curation

Fedora Commons Open source digital object repository, that is the underlying architecture behind Islandora, Hydra, and other digital asset management systems.

DSpace An open source turnkey institutional repository software for building open access repositories for scholarly and published digital content.

Archivematica & ICAtoM An open source digital preservation system designed to maintain standards-based, long term access to collections of digital objects.

Dataverse An open source web application for publishing, citing, analyzing and preserving research data. Research data management focus Access not preservation

Text Mining Potential uses by researchers in Digital Humanities: –Entity recognition –Parts of speech analysis –Topic modeling –Network analysis –Visualization

Canadian Text Archive Centre Phase 2 development –Leverage OCUL ScholarsPortal text corpus of books and journals for academic research –CTAC Advisory Committee being formed –Tools and service development for students and researchers to create worksets of documents from content in the OLRC –Bring “analysis to the data” –June 2015 – May 2016

Current Status & Milestones October 2014: integration with Archivematica December 2014: integration with DataVerse Q1 2015: Storage Nodes finalized; installation of Waterloo/Guelph/Laurier node March 2015: integration with Fedora Commons May 2015: Third Hackfest, Text Mining Portal June 2015: integration with DSpace Fall 2015: Canadian Text Archive Centre Advisory Committee

Thanks! Questions? Pascal Calarco, University of Waterloo Library