Reproducibility in Large-Scale Computing: A Dissident View from the Trenches Doug James Texas Advanced Computing Center

Slides:



Advertisements
Similar presentations
Welcome to CODE SPREAD Simple Concepts of Coding | Programming.
Advertisements

By: Edith Leticia Cerda
Intra-University Collaborations or How to Win Friends, Influence People and Get Resources? Hal Stern Department of Statistics University of California,
4. May 2007 Workshop on Dynamic Service Level AgreementsPage 1 Dynamic SLA Negotiation in BREIN Bastian Koller High Performance Computing Center Stuttgart.
Importing Data from Banner into the NCAA CAi System
CS4026 Formal Models of Computation Running Haskell Programs – power.
Maurice Hendrix, A3H AH2008, 29/07/2008 A meta level for LAG Adaptation Language.
© 2014 Systems and Proposal Engineering Company. All Rights Reserved Using Natural Language Parsing (NLP) for Automated Requirements Quality Analysis Chris.
Care homes and the NHS Standard Contract Alastair Hill Senior Lead, NHS Standard Contract NHS England RCPA Seminar 20 November 2013.
HELP GUIDE: SUBMITTING YOUR TIMESHEET AND NON-LOCAL TRAVEL VOUCHER FOR THE PITC PQ 2014 ADVANCED TRAINING.
SAIL: Documenting data content and quality, letting the computer take the strain Caroline Brooks Senior Research Analyst, College of Medicine, Swansea.
Home Based Care Re-tender SCA Update – February 2014.
1 LAUSD Mathematics Periodic Benchmark Assessments Using Data to Inform Instruction.
© 2014 Fair Isaac Corporation. Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac.
Beijing, October 2006 Metadata development & deployment: What software business practices apply? William L. Anderson Praxis101 20th International CODATA.
Proposal Preparation and Submission. We will: Prepare budgets according to: Federal/sponsor regulations TAMUS policies Coordinate subcontract and collaborative.
A CONSTRUCTION SAFETY COMPETENCY FRAMEWORK: DEVELOPMENT, INDUSTRY APPLICATION, AND FUTURE BENEFITS Herbert Biggs PhD.
TACC’s mission is to enable discoveries that advance science and society through the application of advanced computing technologies. Texas Advanced Computing.
How to represent numbers using “quick hundreds” “quick tens” and “quick ones” Unit 3 Math Expressions.
Perl Practical Extraction and Report Language Senior Projects II Jeff Wilson.
How NOT to Have a Successful PhD on Middleware Research The Ten Commandments (to carefully avoid) Paolo Costa Vrije Universiteit Amsterdam
National Science Foundation: Transforming Undergraduate Education in Science, Technology, Engineering, and Mathematics (TUES)
Nathan Bauer, 3 rd Period What did I learn?  I learned that Manufacturing machines are not magical or witchcraft, but that It relies on that coordinate.
“Harnessing new ICT’s as a means to ease the job inclusion of visually impaired youth” EBU Youth Seminar Bratislava, Slovak Republic 7th – 8th November.
1.Outline of the opportunity; 2.What this means in practice; 3.Quantifying the financial benefits; 4.Outsourced administration; and 5.Compliance aspects.
Data Analysis in the Water Industry: A Good-Practice Guide with application to SW Deborah Gee, Efthalia Anagnostou Water Statistics User Group - Scottish.
Information provided by LISD Assessment Office.  STAAR stands for: › State of Texas Assessments of Academic Readiness  Implemented in for school year.
XSEDE14 Reproducibility Workshop: Reproducibility in Large Scale Computing – Where do we stand Mark R. Fahey, NICS Robert McLay, TACC XSEDE14 - Reproducibility.
WI Head of Office Meeting April 2011 in Steenwijk.
Directorate for Social, Behavioral, and Economic Sciences Amber L. Story Deputy Division Director Directorate for Social, Behavioral, and Economic Sciences.
MAGIC Gabrielle Allen, Richard Carlson, Grant Miller September 13, 2011 Large Scale Networking (LSN) FY12 Annual Planning Meeting.
Inventory Item Pricing 2011 National Conference. Item Information Option 1 will let you do all item price updates except global price update. Option 2.
1 Special Programming Workshop CSIT-120 Fall 2000 Workshop Targets Solving problems on computer Programming in C++ Writing and Running Programs Programming.
Local Wellness Policy for RCCIs: A Team Approach.
AUTOMATING ACADEMIC STANDINGS Kelly Lyons North Idaho College July 31, 2015 Enrollment & Student Services Coeur d’Alene, Idaho.
Mission Formation. m The Mission Statement should define the broad purposes the college is aiming to achieve, describe the community the program is designed.
Using a MATLAB/Photoshop Interface to Enhance Image Processing in the Interpretation of Radar Imagery The Center for Remote Sensing of Ice Sheets (CReSIS)
Welcome to the January 16, 2013 CPI meeting Today’s agenda includes the following items: Update on University Research Initiatives– Dr. R. Bowen Loftin,
NSF – HSI Workshop 1 Introduction & NSF Overview NSF Workshop for Sponsored Project Administrators at Hispanic Serving Institutions April 13, Miami,
Associate Professor Ian Clark 5 July 2010 University of South Australia Building Innovation in Teaching & Learning (BuILT) Workshop Series.
John Porter Sheng Shan Lu M. Gastil Gastil-Buhl With special thanks to Chau-Chin Lin and Chi-Wen Hsaio.
ISTeC Research Computing Open Forum: Using NSF or National Laboratory Resources for High Performance Computing Bhavesh Khemka.
How To Use The NACM Core Competency Curriculum Guidelines National Association for Court Management 1 How To Use The NACM Core Competency Curriculum Guidelines.
European Research Council │ 1 ERC Advanced Grant Call 2 EPSS Changes / Improvements Section A1: Proposal and PI Information ► Linking of the 1st Keyword.
Robust Requirements Tracing Via Internet Tech:Improving an IV&V Technique SAS 2004July 20, 2004 Alex Dekhtyar Jane Hayes Senthil Sundaram Ganapathy Chidambaram.
1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.
Data Management Lawrence Pratt Tulane University.
Marketing Is All Around Us Chapter 1.1. The Reality…  You have been marketed to since you were very young.  You probably know a lot about marketing.
Shalini Urs, June 23, 2003 Indo-US Workshop on Open Digital Libraries, June ,2003 WelcomeTo.
Module 3.3 Reading and Representing Three-Digit Numbers.
How Good is Your SDTM Data? Perspectives from JumpStart Mary Doi, M.D., M.S. Office of Computational Science Office of Translational Sciences Center for.
Lecture 1 Data Structures Aamir Zia. Introduction Course outline Rules and regulations Course contents Good Programming Practices Data Types and Data.
Dr. Fernando Paniagua Associate Professor CSIT Department
Unit 3: Lesson 1 - The Need for Programming Languages
McKinney-Vento American Recovery and Reinvestment Act
Project & Syllabus.
Microlabs in Computer Science
Persistent Identifiers Implementation in EOSDIS
PresQT - Preservation Quality Tool
Structure of the Code – Phase 2 TF Comments and Proposals
FUSION Software FUSION Data interface LDV Data viewer Sample options
SmartArt Graphics Guided Lesson.
Henry Neeman, University of Oklahoma
Open on the student drive
CMNS October 2011.
U3L1 The Need For Programming
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
Metamorphic Exploration of an Unsupervised Clustering Program
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Expenses Confirmation Session
Presentation transcript:

Reproducibility in Large-Scale Computing: A Dissident View from the Trenches Doug James Texas Advanced Computing Center An XSEDE14 Workshop July 14, 2014 Ver 1.0 as of 2 Jul 2014

Where Do I Stand?* “No, thank you...” Run your code Duplicate your environment Bitwise reproducibility “Yes, please...” Implement your algorithm Portable results Understand the differences reproducibility > re-enactment *your mileage may vary

What Do I Want to See Next?* Promote the value of fundamentals a la "Ten Simple Rules" – Especially automation, change management, traceability – No magic bullets imposed from above; let the marketplace decide Expect clarity – Readable, well-documented source code – Datasets: provenance, formats, meaning of fields Encourage discussion and reflection – e.g. NSF Grant Proposal Guide Ch II.C.2.j (Data Management Plan): "Plans for data management...may include..." [How 'bout a sixth item that says something like...] 6. mechanisms and processes that enhance opportunities for reproducibility." *your mileage may vary