Magne Jørgensen Myths, Over- simplifications and Unconfirmed Claims in Software Engineering Magne Jørgensen.

Slides:



Advertisements
Similar presentations
Quality management in corporate language training Lincq Workshop November 2013.
Advertisements

Action Research Not traditional educational research often research tests theory not practical Teacher research in classrooms and/or schools/districts.
Indiana State University Assessment of General Education Objectives Using Indicators From National Survey of Student Engagement (NSSE)
Chapter 3 Flashcards. obligation of an individual to other individuals based on a social or legal contract to justify his or her actions; the processes.
Advances in Human Resource Development and Management
Mapping Studies – Why and How Andy Burn. Resources The idea of employing evidence-based practices in software engineering was proposed in (Kitchenham.
Extended Project Research Skills 1 st Feb Aims of this session  Developing a clear focus of what you are trying to achieve in your Extended Project.
Realism in Assessment of Effort Estimation Uncertainty: It Matters How You Ask By Magne Jorgensen IEEE Transactions on Software Engineering Vol. 30, No.
Critical Appraisal Dr Samira Alsenany Dr SA 2012 Dr Samira alsenany.
Evidence-Based Medicine Week 3 - Prognosis Department of Medicine - Residency Training Program Tuesdays, 9:00 a.m. - 11:30 a.m., UW Health Sciences Library.
Teaching Evidence-Based Software Enginering to University Students Magne Jørgensen (based on a paper by Magne Jørgensen, Barbara Kitchenham, and Tore Dybå.
CSCD 555 Research Methods for Computer Science
Introduction to Research
Using the Biomedical Library & Its Resources: Becoming Efficient Information Managers Public Health & Epidemiology PHE 131 Winter 2010.
Program Evaluation Spero Manson PhD
Software Development Overview CPSC 315 – Programming Studio Spring 2009.
Thinking Processes By Marvi Matos. College of Engineering, UPR BS, Chem E My background.
A teachers’ project: “Towards learner autonomy”. A teachers’ project: towards learner autonomy §Rationale §What we wanted to achieve §The process §Problems.
1 Department of Social Work Roberts Wesleyan College Overview of Evidence-Based Practice.
Software Development Overview CPSC 315 – Programming Studio Spring 2008.
Introduction, Acquiring Knowledge, and the Scientific Method
Session 6: Writing from Sources Audience: 6-12 ELA & Content Area Teachers.
Chapter One: The Science of Psychology
WHAT DO WE KNOW ABOUT AGILE SOFTWARE DEVELOPMENT? Author : Tore Dyba and Torgeir Dingsoyr Reporter : 卓卿安.
丘偉廷. It can successfully occur within university administration, as I have personally experienced. The online educational team implemented and.
Using Situational awareness and decision making
Critical Analysis and Problem Solving Merging Critical Thought and Assessments in Modern Maritime Education IMLA 19 Conference 2011 Captain Gregory Hanchrow.
What should teachers do in order to maximize learning outcomes for their students?
Research Methods Key Points What is empirical research? What is the scientific method? How do psychologists conduct research? What are some important.
Chapter 4: Lecture Notes
CompSci 230 Software Design and Construction
RESPONDENT BACKGROUND DISTRIBUTION Data from 31 survey respondents Student Assessment of Their Learning Gains from Conducting Collaborative Research Projects.
Chapter One: The Science of Psychology. Ways to Acquire Knowledge Tenacity Tenacity Refers to the continued presentation of a particular bit of information.
Evaluation of software engineering. Software engineering research : Research in SE aims to achieve two main goals: 1) To increase the knowledge about.
What is E3BP? How do you integrate the findings from CAPs/CATs into everyday clinical practice? Elise Baker, Ph.D. The University of Sydney NSW SPEECH.
Semester 2: Lecture 9 Analyzing Qualitative Data: Evaluation Research Prepared by: Dr. Lloyd Waller ©
Research Utilization in Nursing Chapter 21
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 2 Evidence-Based Nursing: Translating Research Evidence Into Practice.
Overview of Chapter The issues of evidence-based medicine reflect the question of how to apply clinical research literature: Why do disease and injury.
1 f02kitchenham5 Preliminary Guidelines for Empirical Research in Software Engineering Barbara A. Kitchenham etal IEEE TSE Aug 02.
Database Analysis and the DreamHome Case Study
The Next Generation Science Standards: 4. Science and Engineering Practices Professor Michael Wysession Department of Earth and Planetary Sciences Washington.
Empirical Software Engineering in Industry Short Courses David Janzen, Clark Turner California Polytechnic State University San Luis Obispo, CA Hossein.
Software Development Overview CPSC 315 – Programming Studio Spring 2013.
Developing Academic Reading Skills Planning Research Chapter 2.
META-ANALYSIS, RESEARCH SYNTHESES AND SYSTEMATIC REVIEWS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Introduction to Research. Purpose of Research Evidence-based practice Validate clinical practice through scientific inquiry Scientific rational must exist.
REGIONAL EDUCATIONAL LAB ~ APPALACHIA The Effects of Hybrid Secondary School Courses in Algebra 1 on Teaching Practices, Classroom Quality and Adolescent.
Writing Exercise Try to write a short humor piece. It can be fictional or non-fictional. Essay by David Sedaris.
Continual Service Improvement Methods & Techniques.
Introduction Outcome-Informed Evidence-Based Practice (EBP)
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Research Design Quantitative. Quantitative Research Design Quantitative Research is the cornerstone of evidence-based practice It provides the knowledge.
Developing Smart objectives and literature review Zia-Ul-Ain Sabiha.
Chapter 2: Thinking and Reading Critically ENG 113: Composition I.
Research methods revision The next couple of lessons will be focused on recapping and practicing exam questions on the following parts of the specification:
Smith/Davis (c) 2005 Prentice Hall Chapter One The Science of Psychology PowerPoint Presentation created by Dr. Susan R. Burns Morningside College.
From myths and fashions to evidence-based software engineering Magne Jørgensen.
Practice focused learning:
Ten years with Evidence-Based Software Engineering What is it
Magne Jørgensen Simula Research Laboratory
Objectives of the Course and Preliminaries
MeOTa fall conference October 22, 2016
HLT 540 Competitive Success-- snaptutorial.com
HLT 540Competitive Success/tutorialrank.com
HLT 540 Education for Service-- snaptutorial.com
HLT 540 Education for Service-- tutorialrank.com.
HLT 540 Teaching Effectively-- snaptutorial.com
Introducing ISTQB Agile Foundation Extending the ISTQB Program’s Support Further Presented by Rex Black, CTAL Copyright © 2014 ASTQB 1.
Presentation transcript:

Magne Jørgensen Myths, Over- simplifications and Unconfirmed Claims in Software Engineering Magne Jørgensen

The paper clip was invented by a Norwegian

Short men are more aggressive (The Napoleon complex)

Most communication is non-verbal

There were/is a software crisis (page 13 of their 1994-report): “We then called and mailed a number of confidential surveys to a random sample of top IT executives, asking them to share failure stories.”

Difficult to remove a myth Years of critique may, however, have had an small effect (from the CHAOS Report – 2013):

45% of features of “traditional projects” are never used (source: The Standish Group, XP 2002) No-one seems to know (and the Standish Group does not tell) anything about this study! Why do so many believe (and use) this non-interpretable, non-validated claim? They benefit from it (agile community) + confirmation bias (we all know at least one instance that fit the claim)

14% Waterfall and 42% of Agile projects are successful (source: The Standish Group, The Chaos Manifesto 2012) Successful = “On time, on schedule and with specified functionality” Can you spot a serious error of this comparison?

There is an increase in cost of removing errors in later phases

The magic number 7 ± 2

The number one in the stink parade …

The ease of creating myths: Are risk-willing or risk-averse developers better? Study design: Research evidence + Self-generated argument. Question: Based on your experience, do you think that risk-willing programmers are better than risk-averse programmers? 1 (totally agree) – 5 (No difference) - 10 (totally disagree) Neutral group: Average 5.0 Group A: Group B: Initially Average 3.3 Debriefing Average 2: weeks later Average 3: 3.5 Initially Average 5.4 Debriefing Average 2: weeks later Average 3: 4.9

How to convince software engineers Context: Assume that a test course provider claims: ”The course will lead to substantial increase in test efficiency and quality for most participants.” How likely do you think this claim is true, given [reduced explanation of options]: A: No other information B: Supporting claims from reference clients C: Supporting study conducted by the course provider D: Convincing explanation (but no empirical evidence) E: Supporting experience from a colleague (It helped him) F: Supporting scientific study completed at a renowned university G: Own experience (It helped me)

Results not very promising for evidence-based software engineering A: No other information B: Support from reference clients C: Supporting study conducted by the course provider D: Convincing explanation (but no empirical evidence) E: Supporting experience from a colleague (It helped him) F: Supporting scientific study completed at a renowned university G: Own experience (It helped me)

“I see it when I believe it” vs “I believe it when I see it” Design: –Data sets with randomly set performance data comparing “traditional” and “agile” methods. –Survey of each developer’s belief in agile methods Question: How much do you, based on the data set, agree in: “Use of agile methods has caused a better performance when looking at the combination of productivity and user satisfaction.” Result: –Previous belief in agile determined what they saw in the random data

When making a decision or choice, the world is no more the same (Dan Gilbert)

Elements of more evidence-based practise (Myth busting in software engineering) 1.Find out what is meant by the claim –Is it possible to falsify evaluate the claim? –How general is it meant to be? 2.Prepare for evaluation of validity of claim and argumentation –Be aware of your confirmation bias tendency, especially when you agree –Consider what you would consider as valid evidence to support the claim. –Check for vested interest 3.Collect/create and evaluate evidence –Research-based evidence –Practice-based evidence –Locally created, experimental evidence 4.Synthesize evidence and conclude (if possible)

Evidence-based software engineering (EBSE) The main steps of EBSE are as follows: 1.Convert a relevant problem or need for information into an answerable question. 2.Search the literature and practice-based experience for the best available evidence to answer the question. 3.Critically appraise the evidence for its validity, impact, and applicability. 4.Integrate the appraised evidence with practical experience and the client's values and circumstances to make decisions about practice. 5.Evaluate performance in comparison with previous performance and seek ways to improve it. NB: EBSE is NOT the same as systematic literature review! Tore Dybå, Barbara Kitchenham and Magne Jørgensen, Evidence-based Software Engineering for Practitioners, IEEE Software, Vol. 22, No. 1, Jan-Feb 2005.

EXPERIENCE FROM TEACHING EVIDENCE-BASED SOFTWARE ENGINEERING

Background University courses with students and with software professionals –30 hours lecturing –Supervised project report writing with self-selected “problem formulation” Short courses/seminars with software professionals In total, I’v taught several hundreds of students and software professionals EBSE.

Example of what they learn: A structure of analyzing claims, argumentation and evidence (Toulmin’s model) Data Claim Backing Warrant Qualifier Reservation

Any positive impact from teaching Evidence-Based Software Engineering? Positive impact: –They get more critical towards claims made by gurus and in textbooks, especially when there are “vested interest” –The get better and asking what claims really mean –They discover the strength of collecting and evaluating evidence Challenges: –Evaluating evidence and argumentation is new to them. Only a few (20%?) develop good evaluation skills, although all seem to improve through the courses. –Unlike evidence-based medicine, EBSE cannot rely on research-based evidence only. They need to be trained in collecting practice-based evidence.

Challenge: Lack of evidence Empirical research is frequently sparse and not very relevant! –… and the quality is not impressive either. A need for training in collecting practice-based evidence –Very much the same method as for evaluating research A need for training in designing and collecting own evidence –This is more new ground, but the benefit from training in experimental method, use of representative “trials” etc. is very promising. –On the long term, this type of evidence (local, highly relevant, collected in the context it will be used) is perhaps the one with most potential of changing the software practices. NB: The problems with this kind of learning/improvement in other frameworks (CMM-I, PSP, PDCA, GQM, …) do not defend very much optimism. An advantage of EBSE may be the focus on posing a meaningful problems and the combination of different sources of evidence.

What I tried to say … The software engineering discipline is filled with myths, fashion, over- simplifications and unconfirmed claims. –Being aware of the mechanisms may make us more rational The software engineering industry waste much energy due to lack of collecting evidence on what works and what not. –Example: We change from one technology/method to the next without knowing much about whether it will improve anything As researchers/university employees we should try –Train software professionals in EBSE, including how to become myth busters –Provide (synthesized) evidence that is relevant and of good quality. –Write books and guidelines that are evidence-based

Coffee dehydrates your body

BONUS MATERIAL

IMPACT OF VESTED INTEREST IN SOFTWARE ENGINEERING STUDIES (A BRIEF DISTRACTION) 35 PUBLISHED COMPARISONS OF REGRESSION AND ANALOGY- BASED EFFORT ESTIMATION MODELS

Regression-based cost estimation model better Study Effect size = MMRE_analogy – MMRE_regression Analogy-based models better

Regression-based cost estimation model better Study Effect size = MMRE_analogy – MMRE_regression Development of own analogy-based model (vested interests) Regression models better