Published by Brianna Smith. Modified over 8 years ago.
1
TOWARDS AN ARCHITECTURE FOR NATIONAL DATA SERVICES
Ian Foster
Director, Computation Institute
Argonne National Laboratory & The University of Chicago
@ianfoster
ianfoster.org
2
Architecture?
3
Principle 1: Reduce data friction
– Make simple things easy
– Make hard things possible
– For example, cloud-hosted software-as-a-service
– For example, publish-then-filter
4
Principle 2: Small pieces, loosely joined
– Storage systems
– Content management
– Analysis systems
– Registries
– Identity management
– Data movers
– … and many more …
REST interfaces:
– Open
– Simple
– Composable
– Extensible
– Versioned
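One way to picture "small pieces, loosely joined" is the minimal Python sketch below. It is an illustration only, not anything shown in the talk: the registry, the service names (`storage`, `registry`), the `v1` version label, and the `store://` and `id:` formats are all hypothetical, and plain in-process functions stand in for what would be REST endpoints in a real deployment. The point is that each piece has a narrow, versioned interface and that larger workflows are composed from the pieces rather than built into any one of them.

```python
from typing import Callable, Dict, Tuple

# A handler takes a request dict and returns an enriched copy of it.
Handler = Callable[[dict], dict]

# Hypothetical registry: (service name, interface version) -> handler.
REGISTRY: Dict[Tuple[str, str], Handler] = {}


def register(name: str, version: str) -> Callable[[Handler], Handler]:
    """Register a handler under an explicit name and interface version."""
    def wrap(fn: Handler) -> Handler:
        REGISTRY[(name, version)] = fn
        return fn
    return wrap


@register("storage", "v1")
def store(request: dict) -> dict:
    # Stand-in for a storage service: record where the data would live.
    return {**request, "location": f"store://{request['name']}"}


@register("registry", "v1")
def assign_identifier(request: dict) -> dict:
    # Stand-in for a registry service: attach an identifier.
    return {**request, "identifier": f"id:{request['name']}"}


def pipeline(*steps: Tuple[str, str]) -> Handler:
    """Compose registered services into one callable, in the order given."""
    handlers = [REGISTRY[step] for step in steps]

    def run(request: dict) -> dict:
        for handler in handlers:
            request = handler(request)
        return request

    return run


# Compose two small pieces into a larger workflow.
publish = pipeline(("storage", "v1"), ("registry", "v1"))
result = publish({"name": "survey-2014"})
print(result)
```

Because callers name the version they depend on, a `v2` of either service could be registered alongside `v1` without breaking existing pipelines.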
6
Principle 3: Insist on stories
For example:
– “I need to store/backup/archive my data”
– “I need to transfer/mirror my data”
– “I need to share my data”
– “I need to publish my data”
– “I need to discover published data”
– “I need to analyze my data”
Good stories are detailed, urgent, popular
7
We have much to build on and/or integrate with
Agave, Brown Dog, DataCite, DataONE, Dataverse, Earth System Grid, Globus, Globus Connect, InCommon, iPlant, ORCID, SEAD, XSEDE, Zenodo
Many more
Many many more!
8
Globus cloud-hosted software-as-a-service for:
– Data transfer, sync, and sharing
– Identity and group management
– Data publication and discovery
Globus Connect software to integrate resources and institutions
Globus demonstration
9
What does it mean to publish?
Data is:
– Identified
– Described
– Curated
– Verifiable
– Accessible
– Preserved
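The six properties on this slide can be read as a checklist that a dataset record has to satisfy before it counts as published. The sketch below is a hypothetical illustration of that reading, not the data model of any system named in the talk: the `PublishedDataset` class, its field names, and the example values (including the `example.org` URL) are assumptions made for the example.

```python
import hashlib
from dataclasses import dataclass
from typing import List


@dataclass
class PublishedDataset:
    """Hypothetical record with one field per property on the slide."""
    identifier: str = ""        # Identified: a persistent identifier
    description: str = ""       # Described: descriptive metadata
    curated_by: str = ""        # Curated: who reviewed the submission
    sha256: str = ""            # Verifiable: checksum of the content
    access_url: str = ""        # Accessible: where the data can be fetched
    archive_location: str = ""  # Preserved: where the long-term copy lives


def missing_properties(record: PublishedDataset) -> List[str]:
    """Return the slide's properties that the record does not yet satisfy."""
    checks = {
        "Identified": record.identifier,
        "Described": record.description,
        "Curated": record.curated_by,
        "Verifiable": record.sha256,
        "Accessible": record.access_url,
        "Preserved": record.archive_location,
    }
    return [name for name, value in checks.items() if not value]


# A draft that has not yet been curated or archived.
draft = PublishedDataset(
    identifier="id:survey-2014",
    description="Example survey responses",
    sha256=hashlib.sha256(b"example data").hexdigest(),
    access_url="https://example.org/datasets/survey-2014",
)
missing = missing_properties(draft)
print(missing)  # ['Curated', 'Preserved']
```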
10
What does it mean to discover?
I can:
– Search
– Browse
– Access the data
11
Data publication and discovery
[Diagram. Labels: Community, Collection, Dataset, Data, Metadata, Policies, Access Control, License, Storage, Curation Workflow]
12
Takeaway messages
Three principles:
– Reduce data friction
– Small pieces, loosely joined
– Insist on stories
We have strong components to start with
We’d like from you:
– Stories to inform and prioritize
– Volunteers to deploy and explore
13
The following are backup slides in case network failure prevents the live demonstration
14
Publish dashboard
15
Start a new submission
16
Describe submission: 1) Dublin Core
17
Describe submission: 2) Science metadata
18
Assemble the dataset
19
Transfer files to submission endpoint
20
Check dataset is assembled correctly
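The slide does not say how the assembled dataset is checked. One common approach, sketched here as an assumption and not as the method used in the demonstration, is to compare the files at the submission location against a manifest of expected SHA-256 checksums. The function names, file names, and manifest layout below are hypothetical.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Dict, List


def sha256_of(path: Path) -> str:
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_assembly(root: Path, manifest: Dict[str, str]) -> Dict[str, List[str]]:
    """Compare the files under root with a {relative path: sha256} manifest."""
    present = {
        p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file()
    }
    expected = set(manifest)
    return {
        "missing": sorted(expected - present),
        "unexpected": sorted(present - expected),
        "mismatched": sorted(
            name for name in expected & present
            if sha256_of(root / name) != manifest[name]
        ),
    }


# Build a small example: one correct file, one altered file, one absent file.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "data.csv").write_bytes(b"a,b\n1,2\n")
    (root / "readme.txt").write_bytes(b"draft")
    manifest = {
        "data.csv": sha256_of(root / "data.csv"),
        "readme.txt": hashlib.sha256(b"final").hexdigest(),
        "methods.pdf": hashlib.sha256(b"").hexdigest(),
    }
    report = check_assembly(root, manifest)

print(report)
```

An empty list in all three categories would indicate that the dataset matches its manifest; here the report flags `methods.pdf` as missing and `readme.txt` as mismatched.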
21
Submission now in curation workflow
22
Search published datasets
23
Search across collections
24
Discover a published dataset
25
Select a published dataset
26
View downloaded dataset