Make the World a Better Place through Reproducible Research Roger D. Peng Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Wall.

Slides:



Advertisements
Similar presentations
Archives and RELs: A Janus-Faced Approach Peter Hirtle CUL Intellectual Property Officer And IRIS Technology Strategist Licensed under the Creative Commons.
Advertisements

Characterization and Management of Multiple Components of Cost and Risk in Disclosure Protection for Establishment Surveys Discussion of Advances in Disclosure.
Intellectual Property Rights (IPR)
29e CONFÉRENCE INTERNATIONALE DES COMMISSAIRES À LA PROTECTION DES DONNÉES ET DE LA VIE PRIVÉE 29 th INTERNATIONAL DATA PROTECTION AND PRIVACY COMMISSIONERS.
Principal Patent Analyst
Ethical and Social Issues in Information Systems
EuroCRIS Conference Brussels Legal Issues Heather Weaver Business & Information Technology Department Open Access – disentangling the legal conundrum Heather.
Laboratory Notebook FITT (Fostering Interregional Exchange in ICT Technology Transfer)
CS CS 5150: Software Engineering Lecture 5 Legal Aspects of Software Engineering 1.
Intellectual Property in the Digital Age Series “Don’t I Own My Own Work?” Negotiating to Keep Your Copyright Intellectual Property in the Digital Age:
An Introduction to Social Simulation Andy Turner Presentation as part of Social Simulation Tutorial at the.
Chapter 14 Legal Aspects of Sport Marketing
9-1 Supporting Management and Decision Making 9-2 The Managers and Decision Making The Manager’s job Manager decisions and computerized support Modeling.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
IST4Balt Workshop - April 7, New approach for protection of author rights in knowledge society 1 New approach for protection of author rights in.
WORLD BANK Publications The reference of choice on development The Promise, and Challenge, of Implementing Open Access at the World Bank Carlos Rossel.
Elizabeth Newbold and Samantha Tillett GL8 New Orleans, December 2006
 area of law that deals with protecting the rights of those who create original works  Also called as confidential information.  It is called “intellectual”
Thinking the unthinkable: a library without a catalogue Reconsidering the future of our discovery tools.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
A centre of expertise in data curation and preservation Digital Curation Centre/ Edinburgh eScience Collaborative Workshop – 12th June 2008 Funded by:
Development of a Hydro/Meteorological Data Management System For improved water management Objectives Establish a hydrological Data Management System (DMS)
Lesson Objectives To know and understand what R&D is To understand why businesses spend money on R&D to invent and innovate. To know the difference between.
Data Governance Understanding the Issues and Rights Associated With Your Research Data Scholarly Communications Brown Bag Series 25 April 2012 Geneva Henry.
Contextual framework for research. Purpose of contextual framework To provide a shared language to underpin the PHEA E-learning proposals, initiatives.
Author’s Rights And the Role of Copyright Slides produced by the Copyright Education & Consultation Program.
Think Tank and University Relationships: Finding the Synergies Dr. Ifediora C. Amobi Executive Director African Heritage Institution Enugu, NIGERIA Wednesday,
Year 6 Students.  What is Copyright?  ‘How Copyright Works’ by John Gibbs  Examples of Copyright  When do I need Permission? What can I do Without.
Intellectual Property (Quinn Chapter 4) CS4001 Kristin Marsicano.
Data: legal issues 6 October 2014 Hugo Besemer. We all have our ideas about legal issues. Let’s test them by discussing a case  Who is the owner of the.
3 rd and 4 th. Learning Outcomes Students should be able to identify, describe and summarize the steps in the research process.
1 Ethics of Computing MONT 113G, Spring 2012 Session 32 Software as Intellectual Property.
The HMO Research Network (HMORN) is a well established alliance of 18 research departments in the United States and Israel. Since 1994, the HMORN has conducted.
Monitoring and Evaluation of Electronic Resource Use Unit 1.0: Introduction.
Impact of Air Pollution on Public Health: Transportability of Risk Estimates Jonathan M. Samet, MD, MS NERAM V October 16, 2006 Vancouver, B.C. Department.
Systematic literature searching Information skills for PhD students: 2 Jane Falconer Improving health worldwidewww.lshtm.ac.uk.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Open access and subscription journals: implications for low- and middle-income countries Moderated by Subhasree Raghavan Presented by Emma Veitch and Paul.
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
RESEARCH An Overview A tutorial PowerPoint presentation by: Ramesh Adhikari.
Benefits of Open Access Sabina Leonelli Department of Sociology and Philosophy, Egenis
Copyright Issues in Data Management CHRISTINE FRUIN / SCHOLARLY COMMUNICATIONS LIBRARIAN.
Your rights to your published work: a workshop addressing these questions: 1. “Can I post my publications in full text on… my web site my departmental.
Institutional Repositories July 2007 Intellectual property management : the DISA experience Dr D Peters DISA: Digital Innovation South Africa.
ENGINEERING DESIGN PROCESS. OBJECTIVES IDENTIFY THE STEPS OF THE ENGINEERING DESIGN PROCESS. DETERMINE CRITERIA FOR THE DEVELOPMENT OF A NEW TECHNOLOGY.
Issues in RDM This work is licensed under a Creative Commons Attribution 4.0 International LicenseCreative Commons Attribution 4.0 International License.
Filling institutional repositories: considering copyright issues Susan Veldsman eIFL Content Manager
Data: legal issues 15 February, 2016 Marijn Post and Hugo Besemer.
Metadata Driven Survey Research Jeremy Iverson. Open Standards.
RDA-WDS Publishing Data IG Data Bibliometrics Working Group.
Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.
Evaluating Efforts to Support Collaborative Research: Lessons Learned from the AHRQ MCC Research Network Jessie Gerteis, MPH Abt Associates, Inc. 27 th.
WIPO’s work on E-waste An introduction to the Patent Landscape Reports Project and WIPO’s Cooperation with UNEP, BRS Secretariat COP 2013 Geneva 6 May.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
Web bugs are tiny graphic files embedded in messages and Web pages that are designed to monitor who is reading the message or Web page and.
Kathleen Shearer Data management: The new frontier for libraries.
The Reproducible Research Advantage Why + how to make your research more reproducible Presentation for the Center for Open Science June 17, 2015 April.
Computer Science Standards Review
Tim Friede Department of Medical Statistics
Our Digital Showcase Scholars’ Mine Annual Report from July 2015 – June 2016 Providing global access to the digital, scholarly and cultural resources.
CARER Proposal Writing Workshop November 2004
W. Christopher Lenhardt
Trevor Taylor, Director, Member Services, Asia and the Americas,
Research data finder Etsin
University & Industry Collaborative IP Development
Horizon 2020: Open data pilots and lessons learnt
Sabrina Iavarone Senior User Services Officer
PlainLanguage.gov success story
Global Grid Forum (GGF) Orientation
Presentation transcript:

Make the World a Better Place through Reproducible Research Roger D. Peng Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Wall of Wonder

Trends in Scientific Research Signal-to-noise in many investigations is getting smaller Smaller relative risks –e.g. relative risk of mortality is per 10 ppb of ozone High-throughput measurement technologies Powerful computers

Trends in Computing: Then...

...And Now

The Result? Large databases for investigating subtle associations Interactive computing with advanced statistical algorithms Sophisticated searches across models and variables to identify important risks Bigger and better studies

Replication: The Standard Scientific evidence is strengthened when important findings are replicated by independent investigators, data, methods, laboratories, instruments, etc. Replication is often not possible because of time, funding constraints Policy decisions must often be made with evidence at hand

Reproducible Research: A Minimum Standard Published research where the following are made available: Analytic data Computer code implementing methods Documentation about code/data All are distributed using standard means

Benefits of Reproducible Research Published findings can be verified Alternative analyses conducted Challenge uninformed criticisms (“put up or shut up”) Expedite exchange of ideas among investigators

Challenges to RR “If I give away my data, others will publish results and scoop me” “I own my data and ideas, other people don’t necessarily have any rights to them” Why should I just give away my intellectual property?

Ya see, it’s what I call the “ownership society”

Property [Automatt][JRodrigues] [james.thompson] [nervsappy]

“Intellectual Property” “the intangible value created by human creativity and invention” –from JHSPH Office of Technology Transfer (emphasis added) How can something that is intangible be property?

There’s No Such Thing as “Intellectual Property” If I copy your book, you still have your book If I use your idea, you still have your idea If I copy your data, you still have your data If I use your statistical model, you still have your statistical model If I implement your algorithm, you still have your algorithm etc.

Research done by you regardless of sharing What are the Potential Gains and Losses from Sharing Data? Research done by you regardless of sharing Data Research done by others Research you would have done if you hadn’t shared data Don’t share Share = Y(1) = Y(0) (a)D = 0 (b)D < 0 (c)D > 0 D = Y(1) - Y(0)

What is a Dataset? Represents already published findings and ideas Contains potential findings and ideas yet to be discovered and exploited Datasets do not fit well into the framework of copyrights and patents

What Do We Need? Infrastructure –Tools for researchers, developers –Repositories for datasets –Rights framework for datasets Privacy preservation Handle computer language Babel Structured research modularity

“WWKD”

“WWKD” What Would Karl Do?

Models for Reproducibility

An Example

Partial Rights for Data? A First Cut Full access: the data can be used for any purpose Attribution: the data can be use for any purpose with a specific citation Share-alike: the data can be used for any purpose but any “improvements” must be made available under the same license Reproduction-only: the data can only be used for reproducing published results and commenting via a letter to the editor

Thank you!