Dataverse at Scholars Portal Alan Darnell Director, Scholars Portal.


1 Dataverse at Scholars Portal
Alan Darnell, Director, Scholars Portal

2 Ontario Council of University Libraries (OCUL)
Scholars Portal is the technology support service of OCUL …
Tackle problems that are too big for any one institution
21 libraries
450,000 FTE

3 Numeric Data Published data, highly curated

4 Geospatial Data Published data, highly curated

5 SP Research Data Repository
Thank you, IQSS!
dataverse.scholarsportal.info

6 Dataverse 3
Open to any researcher
– 77 published datasets
– 472 studies
– 6,357 files
Slow but growing uptake from libraries
– 12 institutional dataverses
Wide range of file formats
– WARC files, Twitter feeds, spreadsheets, documents, historical census data, survey files, image files, weather data, etc.

7 Dataverse 4
Stronger institutional focus
DataCite DOIs
Shibboleth
Canadian Access Federation
Internationalization (coming soon)
September 2016

8 A wish list

9 Ontario Library Research Cloud
Utilize existing network and data center facilities in Ontario universities to build a PB-scale distributed storage network using OpenStack object storage (Swift) and commodity storage hardware
Cost-effective long-term storage for digital assets
5 nodes / 370 TB
Ottawa, Queen's, York, Toronto, Guelph

10 Wish 1: Big Data
Support for in-place ingestion of files stored in the cloud
Storage model that supports block and object services
– OpenStack Swift & S3
– ownCloud and Dropbox
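One way to picture "in-place ingestion" is a repository record that points at an object already sitting in cloud storage instead of copying its bytes. The sketch below is illustrative only: `RemoteFileRef`, `register_in_place`, and the `swift://container/object` URI convention are hypothetical names chosen for this example, not part of Dataverse, Swift, or S3.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Assumption: objects are addressed as scheme://container/object-key.
SUPPORTED_SCHEMES = {"swift", "s3"}


@dataclass(frozen=True)
class RemoteFileRef:
    """Hypothetical record describing a file that stays where it is stored."""
    scheme: str
    container: str
    object_key: str
    size_bytes: int
    checksum: str


def register_in_place(uri: str, size_bytes: int, checksum: str) -> RemoteFileRef:
    """Validate a storage URI and record it without transferring any data."""
    parts = urlparse(uri)
    if parts.scheme not in SUPPORTED_SCHEMES:
        raise ValueError(f"unsupported storage scheme: {parts.scheme!r}")
    key = parts.path.lstrip("/")
    if not parts.netloc or not key:
        raise ValueError(f"expected scheme://container/object, got {uri!r}")
    return RemoteFileRef(parts.scheme, parts.netloc, key, size_bytes, checksum)


if __name__ == "__main__":
    ref = register_in_place("swift://research-data/census/1901.warc", 1024, "md5:abc")
    print(ref)
```

Only metadata (location, size, checksum) enters the repository here; a real implementation would also need credentials and a way to verify that the remote object still matches its checksum.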

11 Dataverse > Archivematica > Swift
[Diagram labels: Storage Service, Dashboard, OLRC]
Image credit: Julie Allinson, University of York
https://wiki.archivematica.org/Dataverse

12 Wish 2: Digital Preservation
PREMIS
– Standard vocabulary to record preservation actions like ingest, transformation
PRONOM
– Enhanced file identification
– DROID, Siegfried, FIDO
METS
– Structural representation of complex digital objects
Native XML export
– Concern about JSON as a preservation format
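To make the PREMIS item concrete, here is a minimal sketch of recording a preservation action such as ingest as an XML event. The element names follow my reading of the PREMIS 3 event entity, and the output is not validated against the schema; the helper name `premis_event`, the identifier value, and the date are hypothetical.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone

PREMIS_NS = "http://www.loc.gov/premis/v3"
ET.register_namespace("premis", PREMIS_NS)


def _q(tag: str) -> str:
    """Qualify a tag name with the PREMIS namespace."""
    return f"{{{PREMIS_NS}}}{tag}"


def premis_event(event_id: str, event_type: str, when: datetime,
                 detail: str = "", id_type: str = "local") -> ET.Element:
    """Build a premis:event element (hypothetical helper, not schema-validated)."""
    event = ET.Element(_q("event"))
    ident = ET.SubElement(event, _q("eventIdentifier"))
    ET.SubElement(ident, _q("eventIdentifierType")).text = id_type
    ET.SubElement(ident, _q("eventIdentifierValue")).text = event_id
    ET.SubElement(event, _q("eventType")).text = event_type
    ET.SubElement(event, _q("eventDateTime")).text = when.isoformat()
    if detail:
        info = ET.SubElement(event, _q("eventDetailInformation"))
        ET.SubElement(info, _q("eventDetail")).text = detail
    return event


if __name__ == "__main__":
    evt = premis_event("evt-001", "ingestion",
                       datetime(2016, 9, 1, tzinfo=timezone.utc),
                       detail="Dataset files received from repository")
    print(ET.tostring(evt, encoding="unicode"))
```

In a fuller workflow, events like this would typically be embedded in a METS document alongside file identification results.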

13 Wish 3: Plugin Architecture
Allow domain specialists to extend file support through a plugin architecture
– Encourage and enable community contributions
Methods
– Describe
– Thumbnail
– View
– Download
– Explore
– Transform
[Diagram label: My New File Format]
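The six methods on this slide suggest a small plugin contract. What follows is a sketch of how such a contract might look, assuming plugins are selected by file extension. All names (`FileFormatPlugin`, `register`, `plugin_for`, the `.mnf` extension) are hypothetical and do not correspond to any existing Dataverse API.

```python
from abc import ABC, abstractmethod
from pathlib import PurePosixPath
from typing import Dict, Optional, Type

_REGISTRY: Dict[str, Type["FileFormatPlugin"]] = {}


def register(cls):
    """Class decorator: map each declared extension to the plugin class."""
    for ext in cls.extensions:
        _REGISTRY[ext.lower()] = cls
    return cls


class FileFormatPlugin(ABC):
    """Hypothetical contract mirroring the slide's method list."""
    extensions: tuple = ()

    @abstractmethod
    def describe(self, data: bytes) -> dict:
        """Return descriptive metadata for the file."""

    def thumbnail(self, data: bytes) -> Optional[bytes]:
        return None  # optional: no preview image by default

    def view(self, data: bytes) -> Optional[str]:
        return None  # optional: no inline rendering by default

    def download(self, data: bytes) -> bytes:
        return data  # default: hand back the original bytes

    def explore(self, data: bytes) -> Optional[str]:
        return None  # optional: e.g. a link to an external exploration tool

    def transform(self, data: bytes, target: str) -> bytes:
        raise NotImplementedError(f"no transform to {target!r}")


def plugin_for(filename: str) -> Optional[FileFormatPlugin]:
    """Look up a plugin instance by file extension (case-insensitive)."""
    cls = _REGISTRY.get(PurePosixPath(filename).suffix.lower())
    return cls() if cls else None


@register
class MyNewFileFormat(FileFormatPlugin):
    """Toy plugin for an imaginary line-oriented text format."""
    extensions = (".mnf",)

    def describe(self, data: bytes) -> dict:
        lines = data.decode("utf-8").splitlines()
        return {"format": "My New File Format", "lines": len(lines)}

    def view(self, data: bytes) -> str:
        return data.decode("utf-8")


if __name__ == "__main__":
    plugin = plugin_for("sample.mnf")
    print(plugin.describe(b"first\nsecond\n"))
```

Only `describe` is mandatory in this sketch; the other methods default to "not supported", so a contributor can start with a minimal plugin and add capabilities later.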

14 Wish 4: Tools for Analysis
Jupyter and Zeppelin are interactive web-based tools used for analysis of a wide range of data formats
Use of Apache Spark as a processing engine for big data
http://jupyter.org
https://zeppelin.apache.org

