© 2003, Carla Ellis Experimentation in Computer Systems Research Why: “It doesn’t matter how beautiful your theory is, it doesn’t matter how smart you.

Slides:



Advertisements
Similar presentations
Chapter 2 The Process of Experimentation
Advertisements

Critical Reading Strategies: Overview of Research Process
The Robert Gordon University School of Engineering Dr. Mohamed Amish
Animal, Plant & Soil Science
What is Science?.
G. Alonso, D. Kossmann Systems Group
Overarching Goal: Understand that computer models require the merging of mathematics and science. 1.Understand how computational reasoning can be infused.
The Scientific Method.
Statistical Methods in Computer Science Hypothesis Life-cycle Ido Dagan.
Workloads Experimental environment prototype real sys exec- driven sim trace- driven sim stochastic sim Live workload Benchmark applications Micro- benchmark.
ITEC 451 Network Design and Analysis. 2 You will Learn: (1) Specifying performance requirements Evaluating design alternatives Comparing two or more systems.
Chapter 14: Usability testing and field studies. 2 FJK User-Centered Design and Development Instructor: Franz J. Kurfess Computer Science Dept.
Project 4 U-Pick – A Project of Your Own Design Proposal Due: April 14 th (earlier ok) Project Due: April 25 th.
Chapter 14: Usability testing and field studies. Usability Testing Emphasizes the property of being usable Key Components –User Pre-Test –User Test –User.
1 PERFORMANCE EVALUATION H Often one needs to design and conduct an experiment in order to: – demonstrate that a new technique or concept is feasible –demonstrate.
1 PERFORMANCE EVALUATION H Often in Computer Science you need to: – demonstrate that a new concept, technique, or algorithm is feasible –demonstrate that.
Aim: How to test a hypothesis and design an experiment Do Now: 1) What do you need to know in order to test your hypothesis? 2)Take out your homework.
Chapter One: The Science of Psychology
Science and Engineering Practices
Y. Rong June 2008 Modified in Feb  Industrial leaders  Initiation of a project (any project)  Innovative way to do: NABC ◦ Need analysis ◦ Approach.
Section 2: Science as a Process
Chapter 14: Usability testing and field studies
CSD 5100 Introduction to Research Methods in CSD First Day Opening Stretch Course Requirements/Syllabus What is Science? What is Research? The Scientific.
Unit 1 Lesson 3 What Are Some Types of Investigations?
Chapter One: The Science of Psychology. Ways to Acquire Knowledge Tenacity Tenacity Refers to the continued presentation of a particular bit of information.
TEA Science Workshop #3 October 1, 2012 Kim Lott Utah State University.
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
What is Science? Science is a system of knowledge based on facts and principles.
Assessing the Frequency of Empirical Evaluation in Software Modeling Research Workshop on Experiences and Empirical Studies in Software Modelling (EESSMod)
LEVEL 3 I can identify differences and similarities or changes in different scientific ideas. I can suggest solutions to problems and build models to.
An Examination of Science. What is Science Is a systematic approach for analyzing and organizing knowledge. Used by all scientists regardless of the field.
Quantitative and Qualitative Approaches
ICOM 6115: Computer Systems Performance Measurement and Evaluation August 11, 2006.
Nature of Science 11/8/2015 Nature of Science 1. 11/8/2015 Nature of Science The investigation and exploration of natural events A way to answer questions.
Course Information Andy Wang CIS 5930 Computer Systems Performance Analysis.
Review of Research Methods. Overview of the Research Process I. Develop a research question II. Develop a hypothesis III. Choose a research design IV.
Chapter 10 Verification and Validation of Simulation Models
Simulation Techniques Overview Simulation environments emulation/ exec- driven event- driven sim trace- driven sim stochastic sim Workload parameters System.
Vague idea “groping around” experiences Hypothesis Model Initial observations Experiment Data, analysis, interpretation Results & final Presentation Experimental.
Science Science is  The process of trying to understand the world  A way of knowing, thinking and learning  Based on observation and experimentation.
THE NATURE OF SCIENCE. What Science Is and Is Not.
© 2003, Carla Ellis Vague idea “groping around” experiences Hypothesis Model Initial observations Experiment Data, analysis, interpretation Results & final.
The Psychologist as Detective, 4e by Smith/Davis © 2007 Pearson Education Chapter One: The Science of Psychology.
URBDP 591 A Lecture 16: Research Validity and Replication Objectives Guidelines for Writing Final Paper Statistical Conclusion Validity Montecarlo Simulation/Randomization.
FYP 446 /4 Final Year Project 2 Dr. Khairul Farihan Kasim FYP Coordinator Bioprocess Engineering Program Universiti Malaysia Perls.
Vague idea “groping around” experiences Hypothesis Model Initial observations Experiment Data, analysis, interpretation Results & final Presentation Experimental.
Helpful hints for planning your Wednesday investigation.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 1 Research: An Overview.
Science Latin root which means “knowing” Latin root which means “knowing” Way of learning about the natural world and the knowledge gained through the.
FYP 446 /4 Final Year Project 2 Dr. Khairul Farihan Kasim FYP Coordinator Bioprocess Engineering Program Universiti Malaysia Perls.
Applying Principles of Learning Diane Ebert-May Department of Plant Biology Michigan State University Assessment.
Jette hannibal Internal assessment Experimental research.
Biology, Chapter 1.1 What is Science?.
© 2003, Carla Ellis Model Vague idea “groping around” experiences Hypothesis Initial observations Experiment Data, analysis, interpretation Results & final.
Course Information Andy Wang CIS 5930 Computer Systems Performance Analysis.
Role of Experimentation and Experimental Planning Research Methods CPE 401 / 6002 / 6003 Professor Will Zimmerman.
Unit 1 Lesson 3 What Are Some Types of Investigations?
Paper Airplanes & Scientific Methods
Lecture 01: A Brief Summary
ITEC 451 Network Design and Analysis
Chapter 10 Verification and Validation of Simulation Models
Unit 1 Lesson 3 What Are Some Types of Investigations?
Software Engineering Experimentation
Identifying Inquiry and Stating the Problem
Computer Systems Performance Evaluation
Science Chapter 1.
Computer Systems Performance Evaluation
Scientific Inquiry Take out some note cards, a pencil, and your note card holder Write the following terms on one note card each: Take a textbook from.
Biological Science Applications in Agriculture
Presentation transcript:

© 2003, Carla Ellis Experimentation in Computer Systems Research Why: “It doesn’t matter how beautiful your theory is, it doesn’t matter how smart you are – if it doesn’t agree with the experiment, it’s wrong.” R. Feynman

© 2003, Carla Ellis Why? W. Tichy in “Should Computer Scientists Experiment More?” argues for experimentation. Model / theory testing As Feynman’s quotation suggests, an experiment can identify flaws in a theory (e.g. underlying assumptions that are violated by reality) Exploration Go where no system has gone before (i.e., open whole new areas for investigation) Demonstrate / identify the importance of a potential new research problem

© 2003, Carla Ellis Evidence of real problem, justification, opportunity, feasibility, understanding Boundary of system under test, workload & system parameters that affect behavior. Questions that test the model. metrics to answer questions, factors to vary, levels of factors. Vague idea “groping around” experiences Hypothesis Model Initial observations Experiment Data, analysis, interpretation Results & final Presentation Experimental Lifecycle

© 2003, Carla Ellis Vague idea ModelExperiment Data, analysis, interpretation Results & final Presentation Common Practice No iteration No preliminary investigation No hypothesis precisely articulated

© 2003, Carla Ellis Lots of Ways to Attack Experimentation Not general – only applies to the “system under test”. Not forward-looking – motivations and observations based on the past not the future. Lack of representative workloads – inadequate benchmarks. No culture of independent replication of other’s experiments. Real data can be messy. Learn to do it “right”

© 2003, Carla Ellis A Systematic Approach 1.Understand the problem, frame the questions, articulate the goals. A problem well-stated is half-solved. Must remain objective Be able to answer “why” as well as “what” 2.Select metrics that will help answer the questions. 3.Identify the parameters that affect behavior System parameters (e.g., HW config) Workload parameters (e.g., user request patterns) 4.Decide which parameters to study (vary).

© 2003, Carla Ellis 5.Select technique: Measurement of prototype implementation How invasive? Can we quantify interference of monitoring? Can we directly measure what we want? Simulation – how detailed? Validated against what? Repeatability 6.Select workload Representative? Community acceptance Availability A Systematic Approach

© 2003, Carla Ellis 7.Run experiments How many trials? How many combinations of parameter settings? Sensitivity analysis on other parameter values. 8.Analyze and interpret data Statistics, dealing with variability, outliers 9.Data presentation 10.Where does it lead us next? New hypotheses, new questions, a new round of experiments A Systematic Approach

© 2003, Carla Ellis Summary ofTopics Choosing measurement or simulation techniques Metrics Workload selection –Standard benchmark suites –Micro benchmarks –Synthetic benchmarks –Representativeness Monitoring –Instrumentation techniques –Timing issues –Data collection –Intrusiveness Data analysis / statistics –Misleading with data Workload characterization –Workload generators Experimental design Data presentation Different kinds of simulators –Event driven –Trace driven –Execution driven –Validation

© 2003, Carla Ellis Range of Experimental Systems Research at Duke In support of –Benchmarking (e.g., Fstress) –Workload characterization / tools (e.g., Cprof, Trickle-down) –Simulators (e.g., Modelnet) –Testbeds (e.g., PlanetLab) Used in evaluation of systems –Almost everything else we do here.

© 2003, Carla Ellis How Course Will Work Webpage // Approximately 40 pages per week in Jain Reading list – papers from the literature (coming soon) Course project (leverage whatever experimental projects you have to do for thesis, 2 nd year project, coursework). –Systematic approach, experimental design decisions made explicitly and justified –Mini-conference during exam week. 2 exams over readings Class sessions: 40 minutes lecture, discussion –Assigned expertise

© 2003, Carla Ellis Discussion Introduce yourself. What are your current research interests? –What conference represents your research community? Do you have an on-going project? What are your reasons for taking the course? –e.g. my advisor made me do it Do you have prior expertise we can benefit from?