Arizona Astronomical Data Hub AAS 227: Dark/Orphaned Data P. Bryan Heidorn ORCID: 0000-0002-4601-8180 University of 227 6 January 2016.

Slides:



Advertisements
Similar presentations
The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Advertisements

Lorcan Dempsey OCLC Big Heads – Heads of Technical Services of Large Research Libraries ALA 2013 Chicago 28 June things about
The Changing Research Data Paradigm One agency’s response Changes to Implementation of NSF’s Data Sharing Policy NOAA’s second annual Environmental Data.
Workforce Demand and Career Opportunities in University and Research Libraries NAS Symposium on Digital Curation Anne R. Kenney July 19, 2012.
The "Earth Cube” Towards a National Data Infrastructure for Earth System Science Presentation at WebEx Meeting July 11, 2011.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Introducing Hellenic Conference of Academic Libraries Eduardo Ramos Account Development Manager Southern Europe & Israel
International Dimensions of Digital Science and Scholarship Address to the American Association of Research Libraries and the Canadian Association of Research.
Workshop Purpose Paul Hertz April 25, The Importance of Astronomy Science Centers NASA Astronomy Science Centers provide functions for the community:
A Roadmap to Service Excellence Information Technology Strategic Plan University of Wisconsin-Madison A report to the ITC
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
Presenter: Karla Strieb Assistant Executive Director Transforming Research Libraries June 3, 2010 Supporting E-science: Progress at Research Institutions.
Library as Partner in Creating Curriculum for Sustainability Bonnie J. Smith University of Florida Libraries Maria A. Jankowska UCLA Research Library.
NSF-funded Research Collaborations with SubSaharan Africa Presented at a Workshop on “Enhancing Research and Education Network Connectivity to and within.
Data Infrastructures Opportunities for the European Scientific Information Space Carlos Morais Pires European Commission Paris, 5 March 2012 "The views.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
13 September 2012 The Libraries’ Role in Research Data Management: A Case Study from the University of Minnesota Meghan Lafferty, Chemistry, Chemical Engineering,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Informal Learning, Cyberlearning and Innovative Education Diana G. Oblinger, Ph.D.
An Environmental Scan for Data Services Trends that are shaping today’s environment for data services.
Dr. Fran Berman, RPI Feedback from BRDI Sponsor Forum 11/11 January 29, 2012 Fran Berman.
The University Library in the Campus Strategic Goals, Initiatives and Metrics Fall 2013.
Data Sharing and Archiving: A Professional Society View Clifford S. Duke Ecological Society of America September 9, 2010.
Publish and Disseminate Your Earth Science Activities on the Web The Digital Library for Earth System Education and The Geological Society of America.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
The Role of Academic Libraries in the Digital Data Universe Break-Out Session: New Partnership Models Bob Hanisch and Brian Schottlaender Co-Leaders ARL.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Research Information Management: Continuity, Change and Impact Michael Jubb Research Information Network UUK Workshop 5 December 2007.
NDSR Boston webinar: Digital Preservation Introduction Presenter: Nancy Y McGovern October 2015.
“Help, we started a journal!” Adventures in supporting open access publishing using Open Journal Systems Anna Craft Metadata Cataloger The University of.
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Marilyn S. Billings Presented by Lenka Němečková.
Research Data Management Library and Campus Collaboration to Support E-Research Sandra De Groote, MLIS Abigail Goben, MLS Robert J. Sandusky, PhD on behalf.
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
California Digital Library Managing and Federating e-Print Repositories: UC’s eScholarship Initiatives CNI Fall Task Force Meeting December 1999 John Ober.
1 LSST Town Hall 227 th meeting of the AAS 1/7/2016 Pat Eliason, LSSTC Executive Office Pat Osmer, LSSTC Senior Advisor.
STATE TECHNOLOGY PLAN DRAFT GOAL DEVELOPMENT. The five goals of the 2010 Plan are: 1. Teaching for Learning: Michigan students will have meaningful technology-enabled.
GT Research Data Project Team Original Charge: to investigate, evaluate, assess, and communicate Georgia Tech researchers’ data practices, processes, and.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
The R EPOSITORY AS P UBLISHER OPPORTUNITIES AND CHALLENGES IN A DUAL ROLE BEN HOCKENBERRY SYSTEMS LIBRARIAN | ST. JOHN FISHER COLLEGE.
UMass Libraries 2009 Maxine Schmidt Integrated Sciences and Engineering Library Head University of Massachusetts Amherst, MA 01003
Digital Data Collections in Biology Collaborative Expedition Workshop November 8, 2005 Arlington, Virginia Chris Greer Program Director National Science.
Publication: One Immunization Against Dark Data P. Bryan Heidorn ORCID: University of Assembly of Society Officers 624.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
A Shared Commitment to Digital Preservation and Access.
SCHOLARLY COMMUNICATION SARAH NORRIS AND LILY FLICK JUNE 16, 2016.
Data Sources & Using VIVO Data Visualizing Science VIVO provides network analysis and visualization tools to maximize the benefits afforded by the data.
ICPSR Data Fair November 8, 2010 Katherine McNeill, MIT Libraries
Research Data management university of Oklahoma university Libraries
The Astrolabe Project: Identifying and Curating Astronomical ‘Dark Data’ through Development of Cyberinfrastructure Resources Gretchen Stahlman, PhD Candidate.
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Gretchen Stahlman, PhD Candidate, University of Arizona
Data Practices and Perspectives of Atmospheric and Engineering Faculty
SSarah The Value of Scholarly Communications Programming: Perspectives from Three Settings Sarah Beaubien • Scholarly Communications.
Briefing to ARL Membership
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Bird of Feather Session
Successful Data Curation for Large Data Archives
Presentation transcript:

Arizona Astronomical Data Hub AAS 227: Dark/Orphaned Data P. Bryan Heidorn ORCID: University of January 2016

Thesis  Large projects have well planned data stores  Large amounts of data remain uncurated  Orphan Data  Much of that data is currently largely invisible – Dark Data  This data should be curated professionally in collaboration with scientists  Need for long-lived institutions

f(x)=ax k +o(x k ) Power Law of Science Data f(x)=ax k +o(x k )| X<.20 Data Volume Science Projects and Initiatives

Does NSF’s Data Follow the Power Law? I do not know but if $1 = X bytes…..

Dark data is the data that we know is/was there but we can’t see it. Hubble Space Telescope composite image "ring" of dark matter in the galaxy cluster Cl

Software Infrastructure for Sustained Innovation Christine Borgman, UCLA Ian Foster, University of Chicago Bryan Heidorn, University of Arizona Tom Howe, University of Washington Carl Kesselman, University of Southern California

Cyberinfrastructure Vision “The anticipated growth in both the production and repurposing of digital data raises complex issues not only of scale and heterogeneity, but also of stewardship, curation and long-term access. ” NSF Cyberinfrastructure Vision for 21st Century Discovery, Chapter 3

Recognition of need for data curation “Recommendation 6: The NSF, working in partnership with collection managers and the community at large, should act to develop and mature the career path for data scientists and to ensure that the research enterprise includes a sufficient number of high- quality data scientists.” Long-Lived Digital Data Collections: Enabling Research and Education in the 21 st Century, Recommendations

 Recognition of the importance of Information  Recognition of the need for education  New work roles within traditional institutions Interagency Working Group on Digital Data

AADH Workshop July 2015  28 Astronomers, software developers, librarians, AAS, VPR and School of Information

Accelerate for Success Partnership  School of Information  Department of Astronomy and Steward Observatory  iPlant Collaborative  Library  AAS

AADH Broad Objectives  Refine mission, science and education use cases  Prevalence of Orphaned Data  Take advantage of iPlant/CyVerse, Library and School of Information infrastructure and longevity  Obtain community buy-in and manage expectations  Establish short- and long-term funding

 Develop a science advisory board to help guide and assist the project staff  Collect data from AAS publication by University of Arizona researchers between 2005 and 2015  2500 articles in AAS Journals from  1086 papers with author affiliation of the National Optical Astronomy Observatory  343 journal articles from Arizona State University authors AADH Y1 Goals

 Develop data/software catalog  Adopt (meta-)data formats  Write policy documents curators and authors  Ingest selected data sets  Develop discovery tool (eg. WWT)  Create educational material  Hold follow-on data/software carpentry workshops

The iPlant CyVerse Collaborative  Discovery Environment  Use hundreds of Apps and manage data in a simple web interface  Bisque Image Analysis Environment  Atmosphere  custom cloud-based scientific analysis platform or use a ready-made one for your area of scientific interest  Data Store  Store, manage, access, and share all the data related to your research

Overcoming Barriers  Reduce pain of metadata  Reduce pain of data format  Discourage bad behavior  Reward good behavior

From repositories to collaborative space

Also…  We are hiring a faculty member in Data Science also Astronomy Postdoc  assistant-professor-data-science- tenure-eligible or at assistant-professor-data-science- tenure-eligible 