Download presentation
Presentation is loading. Please wait.
Published byCharlotte Barber Modified over 9 years ago
1
Arizona Astronomical Data Hub AAS 227: Dark/Orphaned Data P. Bryan Heidorn ORCID: 0000-0002-4601-8180 University of Arizona @AAS 227 6 January 2016
2
Thesis Large projects have well planned data stores Large amounts of data remain uncurated Orphan Data Much of that data is currently largely invisible – Dark Data This data should be curated professionally in collaboration with scientists Need for long-lived institutions
3
f(x)=ax k +o(x k ) Power Law of Science Data f(x)=ax k +o(x k )| X<.20 Data Volume Science Projects and Initiatives
4
Does NSF’s Data Follow the Power Law? I do not know but if $1 = X bytes…..
5
Dark data is the data that we know is/was there but we can’t see it. Hubble Space Telescope composite image "ring" of dark matter in the galaxy cluster Cl 0024+17
6
Software Infrastructure for Sustained Innovation Christine Borgman, UCLA Ian Foster, University of Chicago Bryan Heidorn, University of Arizona Tom Howe, University of Washington Carl Kesselman, University of Southern California
7
Cyberinfrastructure Vision “The anticipated growth in both the production and repurposing of digital data raises complex issues not only of scale and heterogeneity, but also of stewardship, curation and long-term access. ” NSF Cyberinfrastructure Vision for 21st Century Discovery, Chapter 3
8
Recognition of need for data curation “Recommendation 6: The NSF, working in partnership with collection managers and the community at large, should act to develop and mature the career path for data scientists and to ensure that the research enterprise includes a sufficient number of high- quality data scientists.” Long-Lived Digital Data Collections: Enabling Research and Education in the 21 st Century, Recommendations
9
Recognition of the importance of Information Recognition of the need for education New work roles within traditional institutions Interagency Working Group on Digital Data
10
AADH Workshop July 2015 28 Astronomers, software developers, librarians, AAS, VPR and School of Information
11
Accelerate for Success Partnership School of Information Department of Astronomy and Steward Observatory iPlant Collaborative Library AAS
12
AADH Broad Objectives Refine mission, science and education use cases Prevalence of Orphaned Data Take advantage of iPlant/CyVerse, Library and School of Information infrastructure and longevity Obtain community buy-in and manage expectations Establish short- and long-term funding
13
Develop a science advisory board to help guide and assist the project staff Collect data from AAS publication by University of Arizona researchers between 2005 and 2015 2500 articles in AAS Journals from 2005- 2015 1086 papers with author affiliation of the National Optical Astronomy Observatory 343 journal articles from Arizona State University authors AADH Y1 Goals
14
Develop data/software catalog Adopt (meta-)data formats Write policy documents curators and authors Ingest selected data sets Develop discovery tool (eg. WWT) Create educational material Hold follow-on data/software carpentry workshops
15
The iPlant CyVerse Collaborative Discovery Environment Use hundreds of Apps and manage data in a simple web interface Bisque Image Analysis Environment Atmosphere custom cloud-based scientific analysis platform or use a ready-made one for your area of scientific interest Data Store Store, manage, access, and share all the data related to your research
16
Overcoming Barriers Reduce pain of metadata Reduce pain of data format Discourage bad behavior Reward good behavior
17
From repositories to collaborative space
18
Also… We are hiring a faculty member in Data Science also Astronomy Postdoc http://si.arizona.edu/news/wanted- assistant-professor-data-science- tenure-eligible or at http://si.arizona.edu/news/wanted- assistant-professor-data-science- tenure-eligible https://uacareers.com/postings/7832 https://uacareers.com/postings/7832
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.