Empirically Assessing End User Software Engineering Techniques Gregg Rothermel Department of Computer Science and Engineering University of Nebraska --

Slides:



Advertisements
Similar presentations
CS527: Advanced Topics in Software Engineering (Software Testing and Analysis) Darko Marinov September 18, 2008.
Advertisements

Alvin Kwan Division of Information & Technology Studies
Department of Industrial Management Engineering 1.Introduction ○Usability evaluation primarily summative ○Informal intuitive evaluations by designers even.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
CJ 526 Statistical Analysis Research methods and statistics.
MSc Applied Psychology PYM403 Research Methods Validity and Reliability in Research.
Research and Diversity
RESEARCH METHODS Lecture 19
Personality, 9e Jerry M. Burger
Sabine Mendes Lima Moura Issues in Research Methodology PUC – November 2014.
Introduction to Communication Research
Knowledge is Power Marketing Information System (MIS) determines what information managers need and then gathers, sorts, analyzes, stores, and distributes.
Formulating the research design
Psychology and Scientific Research. Experimental Science Definition: inquiry in seeking facts and the search for truth through testing of theories and.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 14.
I want to test a wound treatment or educational program but I have no funding or resources, How do I do it? Implementing & evaluating wound research conducted.
Descriptive and Causal Research Designs
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
McGraw-Hill © 2006 The McGraw-Hill Companies, Inc. All rights reserved. The Nature of Research Chapter One.
Chapter 9 Marketing Research And Information Systems
Research Methods Key Points What is empirical research? What is the scientific method? How do psychologists conduct research? What are some important.
Research Strategies, Part 2
History & Research Research Methods Unit 1 / Learning Goal 2.
IIT BOMBAYIDP in Educational Technology * Paper Planning Template Resource – Paper-Planning-Template(SPT)Version 1.0, Dec 2013 Download from:
Descriptive and Causal Research Designs
Exploratory Research Design Week 02
Human Computer Interaction
Chapter 2 Research in Abnormal Psychology. Slide 2 Research in Abnormal Psychology  Clinical researchers face certain challenges that make their investigations.
Chapter 2: The Research Enterprise in Psychology.
The Scientific Method in Psychology.  Descriptive Studies: naturalistic observations; case studies. Individuals observed in their environment.  Correlational.
Copyright ©2008 by Pearson Education, Inc. Pearson Prentice Hall Upper Saddle River, NJ Foundations of Nursing Research, 5e By Rose Marie Nieswiadomy.
The Research Enterprise in Psychology
Psychology Liudexiang
Assessing the Frequency of Empirical Evaluation in Software Modeling Research Workshop on Experiences and Empirical Studies in Software Modelling (EESSMod)
Experimentation in Computer Science (Part 1). Outline  Empirical Strategies  Measurement  Experiment Process.
HOLT, RINEHART AND WINSTON P SYCHOLOGY PRINCIPLES IN PRACTICE 1 Chapter 2 PSYCHOLOGICAL METHODS Section 1: Conducting ResearchConducting Research Section.
Notes on Research Design You have decided –What the problem is –What the study goals are –Why it is important for you to do the study Now you will construct.
Where did plants and animals come from? How did I come to be?
What is Computer Science?  Three paradigms (CACM 1/89) Theory (math): definitions, theorems, proofs, interpretations Abstraction (science): hypothesize,
What is Psychology?. Psychology is… The science that studies behavior and the physiological and cognitive processes that underlie behavior It is empirical.
Why is Research Important?. Basic Research Pure science or research Research for the sake of finding new information and expanding the knowledge base.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
Experimental Method. METHODS IN PSYCHOLOGY 1.Experimental Method 2.Observation Method 3.Clinical Method.
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Assessing Peer Support and Usability of Blogging Technology Yao Jen Chang Department of Electronic Engineering Chung-Yuan Christian University, Taiwan.
Question paper 1997.
©2010 John Wiley and Sons Chapter 2 Research Methods in Human-Computer Interaction Chapter 2- Experimental Research.
Research Design ED 592A Fall Research Concepts 1. Quantitative vs. Qualitative & Mixed Methods 2. Sampling 3. Instrumentation 4. Validity and Reliability.
Experimentation in Computer Science (Part 2). Experimentation in Software Engineering --- Outline  Empirical Strategies  Measurement  Experiment Process.
WERST – Methodology Group
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Opener Imagine that all the birds around the school are dying. How might you investigate the problem? Write a brief description of your investigation.
Chapter Eight: Quantitative Methods
Chapter 6 - Standardized Measurement and Assessment
Lesson 4.5 – Conducting a Survey to Collect Two-Variable Data.
Lesson Overview Lesson Overview What Is Science?.
WHAT IS RESEARCH? According to Redman and Morry,
A. Strategies The general approach taken into an enquiry.
Formulating the Research Design Faisal Abbas, PhD Lecture 8 th.
Sociology. Sociology is a science because it uses the same techniques as other sciences Explaining social phenomena is what sociological theory is all.
McGraw-Hill © 2007 The McGraw-Hill Companies, Inc. All rights reserved. Slide 1 Sociological Research SOCIOLOGY Richard T. Schaefer 2.
Research Methods & Design Outline
HOLT, RINEHART AND WINSTON P SYCHOLOGY PRINCIPLES IN PRACTICE 1 Chapter 2 PSYCHOLOGICAL METHODS Section 1: Conducting ResearchConducting Research Section.
Research Design. How do we know what we know? The way we make reasoning Deductive logic Begins with one or more premises, reasoning then proceeds logically.
Understanding Results
Statistical Data Analysis
Statistical Data Analysis
RESEARCH METHODS Lecture 19
By Hyunsook Do, Sebastian Elbaum, Gregg Rothermel
Presentation transcript:

Empirically Assessing End User Software Engineering Techniques Gregg Rothermel Department of Computer Science and Engineering University of Nebraska -- Lincoln

Questions Addressed How can we use empirical studies to better understand issues/approaches in end user SE? What are some of the problems empiricists working on end-user SE face? What are some of the opportunities for software engineering researchers working in this area?

Outline Background on empirical approaches Empiricism in the end-user SE context Problems for empiricism in end-user SE Conclusion

Outline Background on empirical approaches Empiricism in the end-user SE context Problems for empiricism in end-user SE Conclusion

Empirical Approaches: Types Survey – interviews or questionnaires Controlled Experiment - in the laboratory, involves manipulation of variables Case Study - observational, often in-situ

Empirical Approaches: Surveys Pose questions via interviews or questionnaires Process: select variables and choose sample, frame questions that relate to variables, collect data, analyze and generalize from data Uses: descriptive (assert characteristics), explanatory (assess why), exploratory (pre-study) Resource: E. Babbie, Survey Research Methods, Wadsworth, 1990

Empirical Approaches: Controlled Experiments Manipulate independent variables and measure effects on dependent variables Requires randomization over subjects and objects (partial exception: quasi-experiments) Relies on controlled environment (fix or sample over factors not being manipulated) Often involves a baseline (control group) Supports use of statistical analyses Resource: Wohlin et al., Experimentation in Software Engineering, Kluwer, 2000

Empirical Approaches: Case Studies Study a phenomenon (process, technique, device) in a specific setting Can involve comparisons between projects Less control, randomization, and replicability Easier to plan than controlled experiments Uses include larger investigations such as longitudinal or industrial Resource: R. K. Yin, Case Study Research Design and Methods, Sage Publications, 1994

Empirical Approaches: Comparison

Outline Background on empirical studies Empiricism in the end-user SE context Problems for empiricism Conclusion

Three Aspects of Empiricism 1.Studies of EUSE (and SE) have two focal points –The ability of end users to use devices/processes –The devices and processes themselves 2.Evaluation and design of devices and processes are intertwined: –Summative evaluation helps us assess them –Formative evaluation helps us design them 3.We need families of empirical studies: –To generalize results –Studies inform and motivate further studies

Domain Analyses Think-Aloud, Formative Case Studies, Surveys Controlled Experiments Controlled Experiments Summative Case Studies Exploratory, Theory Dev. Hypothesis Testing Generalization Building Empirical Knowledge through Families of Studies user environment, device

Domain Analyses Think-Aloud, Formative Case Studies, Surveys Controlled Experiments Controlled Experiments Summative Case Studies Exploratory, Theory Dev. Hypothesis Testing Generalization Building Empirical Knowledge through Families of Studies user environment, device

Empirical Studies in WEUSE Papers Surveys -Scaffidi et al.: usage of abstraction, programming practices -Miller et al.: how users generate names for form fields -Segal: needs/characteristics of professional end user developers -Sutcliffe: costs/benefits perceived by users of a web- based content mgmt. system Domain analysis –Elbaum et al.: fault types in Matlab programs Controlled experiments –Fisher et al.: infrastructure support for spreadsheet studies

Cell turns more blue (more “tested”). Testing also flows upstream, marking other affected cells too. Example: What You See is What You Test (WYSIWYT) At any time, user can check off correct value.

Domain Analyses Think-Aloud, Formative Case Studies, Surveys Controlled Experiments Controlled Experiments Summative Case Studies Exploratory, Theory Dev. Hypothesis Testing Generalization Building Empirical Knowledge of End User SE through Families of Studies user environment, device

Study 1: Effectiveness of DU- adequate test suites (TOSEM 1/01) RQ: Can DU-adequate test suites detect faults more effectively than other types of test suites? Compared DU-adequate vs randomly generated suites of the same size, for ability to detect various seeded faults, across 8 spreadsheets Result: DU-adequate suites were significantly better than random at detecting faults

Domain Analyses Think-Aloud, Formative Case Studies, Surveys Controlled Experiments Controlled Experiments Summative Case Studies Exploratory, Theory Dev. Hypothesis Testing Generalization Building Empirical Knowledge of End User SE through Families of Studies user environment, device

RQs: Are WYSIWYT users more (effective, efficient) than Ad-Hoc? Compared two groups of users, one using WYSIWYT, one not, each on two spreadsheet validation tasks Participants drawn from Undergraduate Computer Science classes Participants using WYSIWYT were significantly better at creating DU-adequate suites, with less redundancy in testing Study 2: Usefulness of WYSIWYT (ICSE 6/00)

Domain Analyses Think-Aloud, Formative Case Studies, Surveys Controlled Experiments Controlled Experiments Summative Case Studies Exploratory, Theory Dev. Hypothesis Testing Generalization Building Empirical Knowledge of End User SE through Families of Studies user environment, device

Study 3: Usefulness of WYSIWYT with End Users (ICSM 11/01) RQs: Are WYSIWYT users more (accurate, active at testing) than Ad-Hoc? Compared two groups of users, one using WYSIWYT, one not, each on two spreadsheet modification tasks Participants drawn from Undergraduate Business classes Participants using WYSIWYT were more accurate in making modifications, and did more testing

User can enter assertions System can figure out more assertions User can enter assertions Study 4: Using Assertions (ICSE 5/03)

Domain Analyses Think-Aloud, Formative Case Studies, Surveys Controlled Experiments Controlled Experiments Summative Case Studies Exploratory, Theory Dev. Hypothesis Testing Generalization Building Empirical Knowledge of End User SE through Families of Studies user environment, device

RQs: will end users use assertions and do they understand the devices Observed persons as they worked with Forms/3 spreadsheets with assertion facilities provided Study 4: Using Assertions (ICSE 5/03)

There’s got to be something wrong with the formula! Study 4: Using Assertions (ICSE 5/03)

Outline Background on empirical studies Empiricism in the end-user SE context Problems for empiricism in end-user SE Conclusion

Problems for Empiricism in EUSE Threats to validity – factors that limit our ability to draw valid conclusions –External: ability to generalize –Internal: ability to correctly infer connections between dependent and independent variables –Construct: ability of dependent variable to capture the effect being measured –Conclusion: ability to apply statistical tests

External Validity Subjects (participants) aren’t representative Programs (objects) aren’t representative Environments aren’t representative Problems are trivial or atypical

Internal Validity Learning effects, expectation bias, … Non-homogeneity among groups (different in experience, training, motivation) Devices or measurement tools faulty Timings are affected by external events The act of observing can change behavior (of users, certainly, but also of artifacts)

Construct Validity Lines of code may not adequately represent amount of work done Test coverage may not be a valid surrogate for fault detection ability Successful generation of values doesn’t guarantee successful use of values Self-grading may not provide an accurate measure of confidence

Conclusion Validity Small sample sizes Populations don’t meet requirements for use of statistical tests Data distributions don’t meet requirements for use of statistical tests

Other Problems Cost of experimentation Difficulty of finding suitable subjects Difficulty of finding suitable objects Difficulty of getting the design right

Outline Background on empirical studies Empiricism in the end-user SE context Problems for empiricism in end-user SE Conclusion

Questions Addressed How can we use empirical studies to better understand issues/approaches in end user SE? –Via families of appropriate studies, using feedback and replication What are some of the problems empiricists working on end-user SE face? –Threats to validity, many particular to this area –Costs, and issues for experiment design/setup What are some of the opportunities for software engineering researchers working in this area? –Myriad, given the range of study types applicable –Better still with collaboration

Empirically Assessing End User Software Engineering Techniques Gregg Rothermel Department of Computer Science and Engineering University of Nebraska -- Lincoln