How to be a successful computational scientist.

Slides:



Advertisements
Similar presentations
The meaning of Reliability and Validity in psychological research
Advertisements

Useful tricks in studying reading papers doing research writing papers publishing papers English e-manuscripts.
What does science mean to you??? This power point and the copyrighted images within it may not be reproduced or distributed except for normal classroom.
Source: G. Stylianou - Writing for Computer Science, Justin Zobel Ethics.
Writing a Research Paper
Basic Scientific Writing in English Lecture 3 Professor Ralph Kirby Faculty of Life Sciences Extension 7323 Room B322.
The Scientific Method.
This program is provided with compliments from By Doug Steward.
Copyright © Wondershare Software Essays Admissions, School, SAT.
AP English Language & Composition Exam Review
SCIENCE FAIR 2009.
Constructing a Reasoned Argument argument.ppt
©2015 Paul Read 5.5 Writing Opinion Essays in Part Two /sizes/z/in/photostream/
The Scientific Method A Way to Solve a Problem.
Version Control.
Where did plants and animals come from? How did I come to be?
Science Focus Lesson SC.5.N.2.2 Practice of Science/ Replication
SCIENCE FAIR 2010.
June REU 2003 How to Conduct Research Some Rules of Thumb.
HOW TO REVISE AND EDIT EFFECTIVELY. REVISION VS. EDITING  Revision is content-focused. Revision is a time to identify holes in an argument, information.
Laboratory Investigations Each lab group will submit a single input. All members of the group will get THE SAME grade UNLESS... You are observed goofing.
Pitfalls of your first paper Shu Cai Institute of Computing Technology, Chinese Academy of Sciences
Thesis Statement Advanced Placement World History New Manchester High School Mrs. Jackson.
Sept 17, 2007C.Watters 1 Reviewing Published Articles.
The Research Paper English 12. Argumentative Research Papers  Present a strong claim to a possibly resistant audience  You will gather evidence by looking.
How to ask the best questions.   This is “the action or process of observing something or someone carefully or in order to gain information.”  The.
Obtain and review client feedback. Creating evaluation or feedback tools Importance of client feedback  The use of client feedback is very useful to.
How to Develop a Science Fair Project
What is a Gold Roth IRA?.
Development Environment
Research Methods Purpose: Become familiar with the methods and importance of scientific research in psychology (and everyday life)
AP CSP: Finding a Data Story
What is a CAT? What is a CAT?.
Thinking Like a Scientist
Unit 5: Hypothesis Testing
Testing Hypotheses about Proportions
CHAPTER 9 Testing a Claim
Reviewing your Program
Testing Hypotheses About Proportions
Presentation Notes and Session Plans
The Structure of an Essay
LESSON 12 - Loops and Simulations
How to Write a Position Argument
The Nature of Science What is Science..?.
The Structure of an Essay
Science Fair Project.
Experimental Psychology PSY 433
Testing Hypotheses about Proportions
ESL Exit 2017 Spring Semester.
Check Your Paragraph.
Peer Review ENGL 106: FYC Beth Towle.
Experimental Plan? Who needs a plan anyway??.
Unit 6: Application Development
The Scientific Method By Torrie Epperson TLA Secondary Session 4
Voice Persuasive Essay.
Let’s Write a Memoir.
Significance Tests: The Basics
HINT: It is not the subject of your essay
Chapter One: An Introduction to Programming and Visual Basic
Programming.
Significance Tests: The Basics
Testing Hypotheses About Proportions
Lesson 2 Get Started with Python – Post-Installation – Use the GUI.
Abstract An abstract is an abbreviated version of your science fair project final report. For most science fairs it is limited to a maximum of 250 words.
Biology Writing a Lab Report
Editing and Revising Lesson #12.
The Scientific Process
Lecture 3: Communicate in Writing
CS249: Neural Language Model
Presentation transcript:

How to be a successful computational scientist.

Best Practice #1: never show them your data. No one can disagree with your results if you don’t give them access to your data. Be sure not to back up your data, as this makes it difficult to claim that you’ve lost it. Only keep the intermediate stages of your data around. That way you have plausible deniability: “Oh, that’s not the final data set. We smoothed out that inconsistency.” When people leave the lab, urge them to take their data with them. Then lose their e-mail addresses.

Best Practice #2: do not, under any circumstances, communicate clearly. Do not document your code or data. If you must, use a language other than English. This minimizes the chance of someone discovering a flaw in your logic. Make sure that no one author is responsible for the contents of a paper. This will make it impossible for people to ask detailed questions of the authors. Keep as much information as possible in an opaque format like Word, that cannot be searched or opened by everyone. Never write details in e-mails. Assume that someone will obtain all your e-mail someday.

Best Practice #3: never release your source code. Even if your logic is sound, your code may not be. This way, no one will ever find out. Use multiple programming languages, even (or especially) if you don’t know them well. The only people who know more than one programming language don’t know enough science to argue with your results. If no one has your source code, no one can use it to do their own science and scoop you.

Best Practice #4: judge computational science by results, not quality. As long as you’re producing good results, who cares if they’re right? Grants are based on your results, not on replicated or reproduced results. Best Practices #1, #2, and #3 ensure that no one will ever be able to tell if your results are correct.

Best Practice #5: use as much data as possible. This is self-evident: the more data you have, the more correct your hypothesis must be. It costs money. You have to use it somehow! And, after all, much of that data is noisy. You’re heroic for pulling any signal out of it! What signal you do see has to be correct.

…but seriously… It’s hard to do good computational science. (I hope we’ve conveyed that.) But it can be a lot of fun. It’s just another way to do science, after all. Just think about what could go wrong and plan accordingly!

The Trouble with Replication Most research is never replicated exactly, but rather reproduced (qual/quant similar results on different data). That’s OK. However, on occasions when people have tried, most research (estimates range to over 88%) cannot be replicated. That’s not OK. Consider: that last statement is about cancer research… You cannot build on a shaky foundation.

Towards responsible conduct of computational research: some proposed principles Computation is part & parcel of the research. All inputs to the computational process need to be recorded & relayed as part of publication, incl data (in as raw a state as possible) and analysis parameters. The goal is to allow replication by reviewers and readers.

It’s easy to be nihilistic about computing “correctness” Kim & Speed, 2013

…but just because it’s not entirely right doesn’t mean it’s not useful. Document what you did computationally. Make sure that your data and analysis support your actual conclusions. Run it by some computational colleagues before submitting.

What next? In person training: Data Carpentry Software Carpentry Continued learning: Find an online community! Twitter, BioStars, SeqAnswers… Come visit Matt in NH and Meg in KY!!

Terminate your instances!! (Last but not least)