Grappling with Data Management Plans Diane Oerly, Division of IT and Office of Research University of Missouri Panel Presentation for.

Presentation transcript:

Grappling with Data Management Plans
Diane Oerly, Division of IT and Office of Research, University of Missouri
Panel Presentation for GPN Annual Meeting, June 2, 2011
Note: these slides were partially prepared to support the panel discussion but were not used. Please contact me if they are unclear or if you would like to know what I intended to say. Diane Oerly

Background and Culture of the University of Missouri
• Enrollment (FS2010): 24,900 undergrad, 32,415 total (7,515 out of state, 1,699 intl)
• Land-grant as well as major research university complexity
• 345 buildings on 1,250 acres on main campus (19,524 acres statewide)
• 311 degrees & certificates offered (93 bachelors, 72 doctoral)
• Schools of Med & Nursing and large health sciences complex on campus
• History of investigators helping fund centrally shared resources (with exceptions)
• Significant emphasis on involving undergrad students in research

Significant security and “effective use” implications of health care data.

Selected Areas of Research (from MRI Proposal, January 2011)

High-Throughput Sequence Assembly and Analysis
• Generates ~186 GB of sequence per week, and the instrument is committed 4 weeks in advance.
• The total need is difficult to assess; however, each experiment generates at least ~1 TB of data and requires ~280 CPU hours of processing time every 2.5 days.
• Third-tier analyses significantly increase these processing and storage requirements. For example, on the current aging clusters, a linkage analysis with 50,000 SNPs scored in 32,000 individuals consumed over 1.9 million CPU hours of processing.

Structural Bioinformatics: Prediction, Retrievals, and Interactions
• The large-scale annotation of the structures and interactions of the proteins in plant genomes will require about 20 TB of disk space for both regular storage and backup.
• Will use computational and data storage equipment on a daily basis.

Large-Scale and High-Throughput Plant Phenotype Analysis
• Tens of thousands of images.
• Archiving these high-resolution images (avg. 8 MB) requires >80 GB per day.
• Each image analysis takes ~1 Tflop.
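The per-week and per-day figures above can be turned into rough annual totals for planning purposes. The following is a minimal sketch, not part of the MRI proposal. It assumes decimal units (1 TB = 1000 GB, 1 GB = 1000 MB), a sequencer running 52 weeks per year, and imaging 365 days per year. All function and variable names are illustrative.

```python
# Rough annualization of the slide's figures. Every constant below other than
# the slide's own numbers is an assumption made for illustration.
GB_PER_TB = 1000      # assumption: decimal units
MB_PER_GB = 1000      # assumption: decimal units
WEEKS_PER_YEAR = 52   # assumption: instrument runs every week
DAYS_PER_YEAR = 365   # assumption: imaging runs every day


def annual_tb(gb_per_period: float, periods_per_year: float) -> float:
    """Convert a per-period volume in GB into TB per year."""
    return gb_per_period * periods_per_year / GB_PER_TB


def images_per_day(gb_per_day: float, mb_per_image: float) -> float:
    """Number of images implied by a daily volume and an average image size."""
    return gb_per_day * MB_PER_GB / mb_per_image


def annual_cpu_hours(cpu_hours_per_run: float, days_per_run: float) -> float:
    """CPU hours per year if one run completes every `days_per_run` days."""
    return cpu_hours_per_run * DAYS_PER_YEAR / days_per_run


sequencing_tb = annual_tb(186, WEEKS_PER_YEAR)   # ~186 GB of sequence per week
imaging_tb = annual_tb(80, DAYS_PER_YEAR)        # >80 GB of images per day (lower bound)
daily_images = images_per_day(80, 8)             # avg. 8 MB per image
experiment_cpu = annual_cpu_hours(280, 2.5)      # ~280 CPU hours every 2.5 days

print(f"Raw sequence:      {sequencing_tb:.1f} TB/year")
print(f"Phenotype images:  {imaging_tb:.1f} TB/year (lower bound)")
print(f"Images archived:   {daily_images:.0f} per day")
print(f"Experiment CPU:    {experiment_cpu:.0f} CPU hours/year")
```

Under these assumptions, the image archive alone is roughly three times the raw sequence volume, and the 80 GB per day figure implies about ten thousand images per day, which is consistent with the "tens of thousands of images" on the slide.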

Selected Areas of Research (continued)

Molecular Mimicry in Inter-Species Interactions
• Processing of a host-pathogen pair is expected to take up to 100 Petaflops and GB of storage.
• It is estimated that >100 host-pathogen pairs must be analyzed.

Visualization and Parallelism of Informatics Data
• The clustering of a typical genome, say of about 37,000 genes, requires ~20 GB of RAM and ~4 Tflops for processing.
• High-res MRI-based brain structure analysis takes ~1 Tflop for processing and about 40 GB of storage.

Geospatial Informatics for Biology, Ecology, and Environmental Research
• Typical imagery data, 0.5 m GSD (ground sample distance), requires ~10 MB of storage per square km of image coverage.
• Intermediate processing increases the data stored roughly 800%.
• Multi-date change detection requires ~1.7 Tflop per square km.
• Area and object processing for content-based retrieval from single-date imagery requires ~1.6 Tflops per square km.
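The geospatial figures scale linearly with area, so a small calculation gives a feel for a given study region. This is a minimal sketch under stated assumptions: "increases the data stored roughly 800%" is read as growth by 800% on top of the raw imagery (nine times the raw size in total), units are decimal (1 GB = 1000 MB), and the 1,000 square km study area is a hypothetical example that does not come from the slides.

```python
# Geospatial storage and compute estimate from the per-square-km figures.
# The interpretation of "800%" and the example area are assumptions.
MB_PER_KM2 = 10.0              # ~10 MB of imagery per square km at 0.5 m GSD
INTERMEDIATE_GROWTH_PCT = 800  # assumption: growth *by* 800% over raw size
CHANGE_DETECTION_TFLOP = 1.7   # per square km, multi-date change detection
RETRIEVAL_TFLOP = 1.6          # per square km, content-based retrieval


def imagery_storage_gb(area_km2: float) -> tuple[float, float]:
    """Return (raw_gb, total_gb_after_intermediate_processing)."""
    raw_gb = area_km2 * MB_PER_KM2 / 1000  # assumption: 1 GB = 1000 MB
    total_gb = raw_gb * (1 + INTERMEDIATE_GROWTH_PCT / 100)
    return raw_gb, total_gb


def processing_tflop(area_km2: float) -> tuple[float, float]:
    """Return (change_detection_tflop, retrieval_tflop) for the area."""
    return area_km2 * CHANGE_DETECTION_TFLOP, area_km2 * RETRIEVAL_TFLOP


study_area_km2 = 1000  # hypothetical study region
raw_gb, total_gb = imagery_storage_gb(study_area_km2)
change_tflop, retrieval_tflop = processing_tflop(study_area_km2)

print(f"Raw imagery:        {raw_gb:.0f} GB")
print(f"With intermediates: {total_gb:.0f} GB")
print(f"Change detection:   {change_tflop:.0f} Tflop")
print(f"Content retrieval:  {retrieval_tflop:.0f} Tflop")
```

If the 800% figure instead means growth to eight times the raw size, replace `1 + INTERMEDIATE_GROWTH_PCT / 100` with `INTERMEDIATE_GROWTH_PCT / 100`.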

Grappling with Data Management Plans  Implementation of open-source DSpace, went live August 2008, recently upgraded to Version URL: mospace.umsystem.edumospace.umsystem.edu  Content includes summits and UM-System wide endeavors  Student content as well as faculty and staff - includes thesis and dissertations  Emphasis on content not available elsewhere  Collection currently holds 9,425 unique items

Grappling with Data Management Plans  Collaborative effort between Division of IT, MU Libraries and the University of Missouri Library Systems.  Libraries' crucial role in preserving research and scholarship in all forms and making knowledge accessible for future scholars.  Part of international open access movement.  Helping authors not sign away their intellectual property and fulfill their public dissemination obligations.

MOspace is, of course, not the ultimate solution, but we are working to collect and store research data sets that accompany publications in MOspace.
See the relevant Library guides for additional information: search for Data Management and for MOspace.
Local Resources at MU: s.pdf
Recommended Elements of NSF Data Management Plan: http://research.missouri.edu/funding/files/nsf_data_management_recommended_elements.pdf