Learning from Mistakes: A Comprehensive Study on Real-World Concurrency Bug Characteristics Ben Shelton.

Slides:



Advertisements
Similar presentations
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science 1 Three Challenges for Transactional Computing J. Eliot B. Moss Associate Professor,
Advertisements

Concurrency Issues Motivation, Problems, Directions Dennis Kafura - CS Operating Systems1.
Understanding and Detecting Real-World Performance Bugs
An Case for an Interleaving Constrained Shared-Memory Multi-Processor Jie Yu and Satish Narayanasamy University of Michigan.
Guoliang Jin, Linhai Song, Wei Zhang, Shan Lu, and Ben Liblit University of Wisconsin–Madison Automated Atomicity- Violation Fixing.
Programming Types of Testing.
/ PSWLAB Concurrent Bug Patterns and How to Test Them by Eitan Farchi, Yarden Nir, Shmuel Ur published in the proceedings of IPDPS’03 (PADTAD2003)
Transactional Memory (TM) Evan Jolley EE 6633 December 7, 2012.
Submitted by: Omer & Ofer Kiselov Supevised by: Dmitri Perelman Networked Software Systems Lab Department of Electrical Engineering, Technion.
Concurrent Processes Lecture 5. Introduction Modern operating systems can handle more than one process at a time System scheduler manages processes and.
Microsoft Research Faculty Summit Yuanyuan(YY) Zhou Associate Professor University of Illinois, Urbana-Champaign.
Language Support for Lightweight transactions Tim Harris & Keir Fraser Presented by Narayanan Sundaram 04/28/2008.
1 New Architectures Need New Languages A triumph of optimism over experience! Ian Watson 3 rd July 2009.
Building Secure Software Chapter 9 Race Conditions.
1 Sharing Objects – Ch. 3 Visibility What is the source of the issue? Volatile Dekker’s algorithm Publication and Escape Thread Confinement Immutability.
CS510 Concurrent Systems Class 5 Threads Cannot Be Implemented As a Library.
Learning From Mistakes—A Comprehensive Study on Real World Concurrency Bug Characteristics Shan Lu, Soyeon Park, Eunsoo Seo and Yuanyuan Zhou Appeared.
Your Interactive Guide to the Digital World Discovering Computers 2012.
Exceptions and Mistakes CSE788 John Eisenlohr. Big Question How can we improve the quality of concurrent software?
1 I.Introduction to Algorithm and Programming Algoritma dan Pemrograman – Teknik Informatika UK Petra 2009.
Selecting and Implementing An Embedded Database System Presented by Jeff Webb March 2005 Article written by Michael Olson IEEE Software, 2000.
Multi-core Programming Thread Profiler. 2 Tuning Threaded Code: Intel® Thread Profiler for Explicit Threads Topics Look at Intel® Thread Profiler features.
What Change History Tells Us about Thread Synchronization RUI GU, GUOLIANG JIN, LINHAI SONG, LINJIE ZHU, SHAN LU UNIVERSITY OF WISCONSIN – MADISON, USA.
DEBUGGING. BUG A software bug is an error, flaw, failure, or fault in a computer program or system that causes it to produce an incorrect or unexpected.
COMP 111 Threads and concurrency Sept 28, Tufts University Computer Science2 Who is this guy? I am not Prof. Couch Obvious? Sam Guyer New assistant.
Colorama: Architectural Support for Data-Centric Synchronization Luis Ceze, Pablo Montesinos, Christoph von Praun, and Josep Torrellas, HPCA 2007 Shimin.
The Relational Model1 Transaction Processing Units of Work.
An Undergraduate Course on Software Bug Detection Tools and Techniques Eric Larson Seattle University March 3, 2006.
Design - programming Cmpe 450 Fall Dynamic Analysis Software quality Design carefully from the start Simple and clean Fewer errors Finding errors.
Concurrency unlocked Programming
Diagnosing and Fixing Concurrency Bugs Credits to Dr. Guoliang Jin, Computer Science, NC STATE Presented by Tao Wang.
HXY Debugging Made by Contents 目录 History of Java MT Sequential & Parallel Different types of bugs Debugging skills.
The C++11 Memory Model CDP Based on “C++ Concurrency In Action” by Anthony Williams, The C++11 Memory Model and GCCThe C++11 Memory Model and GCC Wiki.
Gauss Students’ Views on Multicore Processors Group members: Yu Yang (presenter), Xiaofang Chen, Subodh Sharma, Sarvani Vakkalanka, Anh Vo, Michael DeLisi,
Soyeon Park, Shan Lu, Yuanyuan Zhou UIUC Reading Group by Theo.
Testing Concurrent Programs Sri Teja Basava Arpit Sud CSCI 5535: Fundamentals of Programming Languages University of Colorado at Boulder Spring 2010.
Explicitly Parallel Programming with Shared-Memory is Insane: At Least Make it Deterministic! Joe Devietti, Brandon Lucia, Luis Ceze and Mark Oskin University.
Adaptive Software Lock Elision
Healing Data Races On-The-Fly
ME 142 Engineering Computation I
Types for Programs and Proofs
Testing and Debugging PPT By :Dr. R. Mall.
Processes and Threads Processes and their scheduling
CSC 591/791 Reliable Software Systems
Threads Cannot Be Implemented As a Library
Topics: jGRASP editor ideosyncrasies assert debugger.
Effective Data-Race Detection for the Kernel
Understand the Programming Process
Amir Kamil and Katherine Yelick
Threads and Memory Models Hal Perkins Autumn 2011
Intel® Parallel Studio and Advisor
Design and Programming
Performance Optimization for Embedded Software
Changing thread semantics
Liveness And Performance
Shared Memory Programming
Units with – James tedder
Understand the Programming Process
Programming.
Chapter 32 Concurrency Bugs
Design and Implementation Issues for Atomicity
Amir Kamil and Katherine Yelick
Lecture 2 The Art of Concurrency
Tools for the development of parallel applications
Foundations and Definitions
Review of Previous Lesson
Problems with Locks Andrew Whitaker CSE451.
CSE 332: Concurrency and Locks
Controlled Interleaving for Transactions
Pointer analysis John Rollinson & Kaiyuan Li
Presentation transcript:

Learning from Mistakes: A Comprehensive Study on Real-World Concurrency Bug Characteristics Ben Shelton

Motivation Concurrent programming is necessary Multicore revolution Concurrent programming is difficult, and programmers need tools to help them write more reliable concurrent programs Concurrency testing / static analysis tools Narrowing the search space Languages and features that make it easier for programmers to express concurrency Transactional Memory?

Methodology Four open-source applications: MySQL Apache Mozilla OpenOffice Search bug databases for keywords involving concurrency and randomly choose 500 bug reports Check that each bug is caused by concurrency error 105 concurrency bugs in final sample set

Main Findings 97% of non-deadlock bugs are atomicity-violation or order-violation. 66% of non-deadlock bugs involve concurrent accesses to only one variable. 97% of deadlock bugs involve two threads in contention for two or fewer resources. 73% of non-deadlock bugs were fixed by techniques other than adding/changing locks. Transactional memory could help in about one third of the bugs examined, but additional language features for concurrency are recommended.

Atomicity Violations

Order Violations

Order Violations, continued…

Low-Hanging Fruit 22% of deadlock bugs are caused by a thread trying to acquire a resource it already has Almost trivial to check for 66% of non-deadlock bugs involve concurrent accesses to only one variable This is the assumption that current concurrency testing software makes – turns out to be valid

Slightly-Higher-Hanging Fruit 96% of the bugs can be reproduced every time with a certain partial order between only two threads Pairwise testing (computationally tractable) will catch most of these bugs! 92% of the bugs can be reproduced every time with a certain partial order between no more than four memory accesses Cuts down search space from exponential to polynomial

Locks Are (Often) Not The Answer Adding or changing locks was only used to fix 20 out of 74 non-deadlock bugs. Instead: Condition check Change ordering of program statements Algorithm redesign Authors state: “Just telling programmers that certain conflicting accesses are not protected by the same lock is not enough to fix concurrency bugs.” What about Eraser and tools like it?

The Fix Isn’t Always Right The First Time ~30% of the Mozilla bugs had at least one incorrect patch posted before the final correct fix. When fixing deadlock bugs by allowing one thread to give up on acquiring a resource, programmers sometimes introduced non-deadlock bugs. Takeaways: This stuff is hard! Programmers need help Language support Better debugging and analysis tools

Transactional Memory This bit of code looks like a transaction! Programmer can rely on TM library to provide desired functionality rather than trying to reimplement it herself

Transactional Memory, continued… Not suitable for I/O operations HW-based TM may not work for large critical sections Still isn’t able to express order in many cases Even with extensions to TM, wouldn’t work for 20 out of 105 bugs in this study But 85 out of 105 is still pretty good!

Paper Conclusions By making simplifying assumptions, we can build practical concurrency testing tools that will catch the majority of expected concurrency bugs. Transactional memory is a useful tool, and would be even more useful with expansions/modifications. Programming languages need a better way to express order relationships between thread actions, and concurrency testing tools need to do a better job of illustrating bugs involving these relationships.

Is The Data Set Representative? Authors claim that the data set is representative of two main classes of concurrent programs: Server-based Client-based Data set isn’t evenly distributed between applications 57 bugs in Mozilla; only 8 bugs in OpenOffice

TM Enhancements (From Microsoft Research [5]) Authors support TM enhancements like watch/retry, retry/orElse

TM Enhancements, continued… Neither GCC’s new support for STM, nor Intel’s STM compiler for C++, nor the Deuce runtime for Java support these primitives STM is seeing wider adoption – reasonable to think they’ll be available soon

Improved Programming Tools Grace: Safe Multithreading – a potential solution for fork/join parallelism [4] Run each thread as a process Each thread executes optimistically but locally Check for collision at commit Threads run concurrently but commit in program order Programmer can reason as if program were serial Reduces the likelihood of order violations

Improved Debugging Tools CTrigger – follow-up work from the authors [2] Test improbable thread interleavings to do a faster job of detecting/characterizing atomicity violations Execution Synthesis [3] Tool gives the programmer a set of initial inputs and an execution order that is guaranteed to reproduce the bug Better than “watch this variable”

References [1] Shan Lu, Soyeon Park, Eunsoo Seo, and Yuanyuan Zhou. 2008. Learning from mistakes: a comprehensive study on real world concurrency bug characteristics. SIGARCH Comput. Archit. News 36, 1 (March 2008), 329-339. [2] Soyeon Park, Shan Lu, and Yuanyuan Zhou. 2009. CTrigger: exposing atomicity violation bugs from their hiding places. In Proceedings of the 14th international conference on Architectural support for programming languages and operating systems (ASPLOS '09). ACM, New York, NY, USA, 25-36. [3] Cristian Zamfir and George Candea. 2010. Execution synthesis: a technique for automated software debugging. In Proceedings of the 5th European conference on Computer systems(EuroSys '10). ACM, New York, NY, USA, 321-334. [4] Emery D. Berger, Ting Yang, Tongping Liu, and Gene Novark. 2009. Grace: safe multithreaded programming for C/C++. SIGPLAN Not. 44, 10 (October 2009), 81- 96. [5] Simon Peyton Jones (Microsoft Research Cambridge). Transactional Memory and Concurrency. OSCON ‘07. http://research.microsoft.com/en- us/um/people/simonpj/papers/stm/STM-OSCON.pdf