Enabling Access to Scientific & Technical Data-sets in e-Science: a role for Library Science James L. Mullins, PhD Dean of Libraries & Professor of Library.

Presentation transcript:

Enabling Access to Scientific & Technical Data-sets in e-Science: a role for Library Science James L. Mullins, PhD Dean of Libraries & Professor of Library Science Purdue University March 24, 2008 University of Illinois Mortenson Center South African Librarian Program

e-Science. What is meant by e-Science? Large-scale science increasingly carried out through distributed global collaborations enabled by the Internet. Such collaborative scientific enterprise requires access to massive data collections, large-scale computing resources, and high-performance visualization back to the individual user scientists. Requires large-scale storage, retrieval, and transfer.

Innovative Research Concepts. Data Authors – benefit from their own work, broadly disseminated, safely archived. Data Managers – collaborate by ensuring successful retention and dissemination through technical infrastructure. Data Scientists – conduct creative inquiry and analysis, enhance the research of data authors. Source: National Science Board, Long-lived digital data collections: Enabling research and education in the 21st century, p. 27.

Innovative Research Concepts. National Science Board, Long-lived digital data collections: Enabling research and education in the 21st century, p. 27. Data Scientists: … crucial to the successful management of a digital data collection – lie in having their contributions fully recognized

National Science Foundation: Recognition of the Challenge for Data Curation. Dr. Christopher Greer, Former Program Director, Office of Cyberinfrastructure, NSF

To Stand the Test of Time: Long Term Stewardship of Digital Data Sets in Science and Engineering. A report to the National Science Foundation from the ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe. Supported by NSF, September 26-27, 2006. Attendees: NSF program directors; disciplinary researchers; information technologists; computer scientists; and librarians.

To Stand the Test of Time: Long Term Stewardship of Digital Data Sets in Science and Engineering – Overarching Recommendation. NSF should facilitate the establishment of a sustainable framework for the long-term stewardship of data. This framework should involve multiple stakeholders by supporting: research to understand, model, & prototype data stewardship; training and educational programs to develop new workforce; efforts to effect change in the research enterprise regarding the importance of the stewardship of digital data produced.

To Stand the Test of Time: Long Term Stewardship of Digital Data Sets in Science and Engineering – Specific Recommendations. How can Libraries respond? How can Libraries prepare?

[Diagram labels: Domain Science; Computer Science; Cyberinfrastructure; Archival Sciences; Lib/Info Sciences; I Center. Conceptualization by Chris Greer, NSF]

Scholarly Communication (diagram). Labels: published research traditional “published” research non-traditional unpublished research traditional “published” data/datasets secondary tertiary resources analyzed data/datasets processed data/datasets “raw” data/datasets “traditional” research publication. Annotations: currently many attempts to data mine to uncover data…; metadata; curation profiles for data allow forward/backward movement through scholarly communication process; in the past, libraries involved at this end. Source: D. Scott Brandt, Purdue University

One Example

Purdue University

Founded 1869 by gift from John Purdue. Premier programs: engineering (astronautics: alumnus Neil Armstrong); agriculture; hospitality and tourism; business; computer science; communications. 39,102 students in 2007/08. Third largest international student enrollment in U.S. – 4,994 for 2007/08 (over 2,000 from India, China and Korea combined).

Purdue University. Nine Colleges: Agriculture, Consumer & Family Sciences, Education, Engineering, Liberal Arts, Management, Pharmacy/Nursing/Health Sciences, Technology, Vet Medicine. 73 Departments, several cross-disciplinary, e.g. Agricultural & Biological Engineering.

Interdisciplinary collaboration. Discovery Park: a 44-acre site of interdisciplinary centers designed to facilitate and promote leading-edge research. Centers shown: Cyber Infrastructure, Oncology, Manufacturing, Energy, Nanotechnology, Bioscience, Entrepreneurship, e-Enterprise, Environment, Learning Center.

Envisioning New Interdisciplinary Collaborations. Associate Dean for Research, D. Scott Brandt, Professor of Library Science: facilitates individual and interdisciplinary research efforts of the fifty Libraries faculty.

Purdue University Libraries. Since 2004, an initiative for Libraries faculty to collaborate with other faculty across campus, applying library science knowledge and expertise to research problems: collect, organize, describe, curate, archive, disseminate data/information.

Determine need for collaboration. Hypothesized that researchers have data management needs and that librarians can help meet them. Employed top-down and bottom-up investigation for data collection. Verified: PU researchers said they need help in collecting, organizing and providing access to their data.

Outside of the library. Attended research seminars, call-outs, etc., to identify collaboration and funding opportunities. Built relationships: found researchers who understood that collecting, organizing and providing access to data and information are not only important, but critical. Found problems to solve, then collaborated on solutions. Talked about what we know: organizing data and information (different meanings to different groups). Brought something to the table. Had to be prepared to demonstrate something tangible (initially a proof-of-concept or a prototype).

Current areas of collaboration: Discovery Learning Center; Earth & Atmospheric Sciences; English; IT at Purdue; Mechanical Engineering Technology; Regenstrief Center; Graduate School; Oncological Sciences; Agronomy; Biology; Cancer Center; Center for the Environment; Chemical Engineering; Chemistry; Civil Engineering; Cyber Center.

Motivation (library participants). Directly related to work, and makes something difficult easier. It’s an extension of “our everyday job”. Something new and exciting to do. Breaking new ground, want to contribute to interdisciplinary initiative. Force the issue of how it gets done (i.e., more people added to help out).

Motivation (non-participants). Articulation of what is expected by the Dean. Partly determined on a case-by-case basis. Has to be “interesting to me”. Something that uses “the skills I can bring to it”. Need to get credit for it (recognition, reward). Important to allow individual to define what interdisciplinary research is. Should be opportunities to “stick your toe in the water” before making big commitment. Need time to do it, and to do the “things I want to do”.

Distributed Data Curation Center – D2C2. Sustainability for data curation repositories. Ontological and taxonomic organization of disciplinary datasets. Metadata to facilitate access to data collections. Data curation profiles for archiving and preserving datasets.

Recap… 22 Libraries faculty involved in 32 grants since April of 2006. New positions: Data Research Scientist to support research; Computer Science/Libraries discussion on joint appointment. Sun Microsystems gift: Sun StorageTek 5800, 32 terabytes, for D2C2 research.

“100 conversations lead to 20 discussions, lead to 5 grants, lead to 1 award”

Web 2.0 for e-Science: nanoHUB. From a PowerPoint presentation by Mark Lundstrom, Gerhard Klimeck, Michael McLennan, Purdue University, Network for Computational Nanotechnology.

nanoHUB, Purdue University

National Science Foundation (NSF) DataNet. e-Science data stewardship, five $20 million grants. Competition to build a data network. Replicate for print resources for data. Proposals being led by librarians; collaborators are computer scientists, information technologists, domain scientists; sociologists, information scientists, computer engineers, museum and K-12 educators; metadata and ontology specialists; data visualization specialists. International collaboration with UK, Australia, and China. Submission deadline March 21st, announcement summer 2008.

Thank you! Questions and Answers? James L. Mullins – Purdue University, USA