Ben Livshits Based in part of Stanford class slides from and slides from Ben Zorn’s talk slides.

Ben Livshits Based in part of Stanford class slides from http://www.stanford.edu/class/cs295/ and slides from Ben Zorn’s talk slides

Lecture 1: Introduction to static analysis Lecture 2: Introduction to runtime analysis Lecture 3: Applications of static and runtime analysis reliability & bug finding security performance 2

Purify for finding memory errors [1992] Detecting memory leaks with Purify and GC [1992] Detecting dangling pointers & buffer overruns with DieHard [2006] Detecting buffer overruns with StackGard [1997] 3

Dangling pointers: If the program mistakenly frees a live object, the allocator may overwrite its contents with a new object or heap metadata. Buffer overflows: Out-of-bound writes can corrupt the contents of live objects on the heap. Heap metadata overwrites: If heap metadata is stored near heap objects, an out-of-bound write can corrupt it. Uninitialized reads: Reading values from newly-allocated or unallocated memory leads to undefined behavior. Invalid frees: Passing illegal addresses to free can corrupt the heap or lead to undefined behavior. Double frees: Repeated calls to free of objects that have already been freed cause freelist-based allocators to fail. 4

FINDING MEMORY ERRORS IN C/C++ PROGRAMS 5

C/C++ are not memory-safe Neither the compiler nor the runtime system enforces type abstractions What is memory-safe vs. type-safe? Possible to read or write outside of your intended data structure Among other bad behaviors What else is possible that you can’t do in Java or Scheme or ML or F#? 6

Each byte of memory is in one of 3 states: Unallocated: cannot be read or written Allocated but uninitialized: cannot be read Allocated and initialized: anything goes 7

Check the state of each byte on each access Binary instrumentation Add code before each load and store Represent states as giant array 2 bits per byte of memory What is the memory overhead? 25%!! Catches byte-level errors Won’t catch bit-level errors 8

We can only detect bad accesses if they are to unallocated or uninitialized memory Try to make all bad accesses be of those two forms We can make this part of our custom memory allocator 9

Red Zones Leave buffer space between allocated objects that is never allocated Guarantees that walking off the end of an array accesses unallocated memory Aging Freed Memory When memory is freed, do not reallocate immediately Helps catch dangling pointer errors 10

One of the first commercially successful runtime tools Was an independent company that got bought by IBM Rational Overhead can vary from 25% to 40x 11

12 This is where buffer overruns come from! Why can’t we catch them? This is where buffer overruns come from! Why can’t we catch them?

PROGRESS & OPEN PROBLEMS 13

Memory leaks are at least as serious as memory corruption errors Also very difficult to find Manifest only over hours, days, weeks Often persist in production code Managed languages such as Java and C# don’t really help 14

We can find many memory leaks using techniques borrowed from garbage collection Any memory with no pointers to it is leaked There is no way to free this memory Run a garbage collector But don’t free any garbage Just detect the garbage Any inaccessible memory is leaked memory Can we do this in C/C++ at all? Sort of… 15

It is sometimes hard to tell what is accessible in a C/C++ program? Cases No pointers to a malloc’d block: definitely garbage No pointers to head of a malloc’d block: maybe garbage Pointers to the head of a malloc’d block: not garbage by usual definition 16

From time to time, run a garbage collector Use mark and sweep Report areas of memory that are definitely or probably garbage No type safety ==> no memory safety Is this as easy as in Java? Bookkeeping Need to report who malloc’d the blocks originally Store this information in the red zone between objects Used in Purify, but watch out for memory overhead 17

A Limitation Only finds leaks to unreachable objects Doesn’t cover leaks in languages with GC Retaining data structures longer than needed In practice, also a serious source of leaks, especially in Java... 18

Look for objects not accessed for a “long time”. For each object Track it from the moment it is allocated Record the time of the last access (read or write) Discard information when object is de-allocated Periodically Scan all objects Warn about objects unused for a “long time” 19

TOLERATING MEMORY ERRORS AT RUNTIME 20

Go beyond all-or-nothing guarantees Type checking, verification rarely a 100% solution C#, Java both call to C/C++ libraries Traditional engineering allows for errors by design Complement existing approaches Static analysis has scalability limits Managed code especially good for new projects DART, Fuzz testing effective for generating illegal test cases 21

Buffer overflow char *c = malloc(100); c[101] = ‘a’; Dangling reference char *p1 = malloc(100); char *p2 = p1; free(p1); p2[0] = ‘x’; a 22 c 0 99 p1 0 99 p2 x

Increase robustness of installed code base Potentially improve millions of lines of code Minimize effort – ideally no source mods, no recompilation Reduce requirement to patch Patches are expensive (detect, write, deploy) Patches may introduce new errors Trade resources for robustness E.g., more memory implies higher reliability Make deployment easy Change the allocator DLL, no changes to code needed Make existing programs more fault tolerant Define semantics of programs with errors Programs complete with correct result despite errors 23

Emery D. Berger and Benjamin G. Zorn, "DieHard: Probabilistic Memory Safety for Unsafe Languages", PLDI’06 DieHard: correct execution in face of errors with high probability Plug-compatible replacement for malloc/free in C lib Define “infinite heap semantics” Programs execute as if each object allocated with unbounded memory All frees ignored Approximating infinite heaps: 3 key ideas 1.Overprovisioning 2.Randomization 3.Replication Allows analytic/probabilistic reasoning about safety 24

25 Expand size requests by a factor of M (e.g., M=2) 1234512345 Randomize object placement 12345 Pr(write corrupts) = ½ ? Pr(write corrupts) = ½ !

26 Replicate process with different randomization seeds 12345 P2 12345 P3 input Broadcast input to all replicas Compare outputs of replicas, kill when replica disagrees 12345 P1 Voter

Allocation Segregate objects by size (log2), bitmap allocator Within size class, place objects randomly in address space Separate metadata from user data Fill objects with random values – for detecting uninitialized reads De-allocation Expansion factor => frees deferred Extra checks for illegal free 27

Synthetic: Tolerates high rate of synthetically injected errors in SPEC programs Spec benchmarks: Detected two previously unreported benign bugs (197.parser and espresso) Avoiding real errors: Successfully hides buffer overflow error in Squid web cache server (v 2.3s5) Avoids dangling pointer error in Mozilla DoS in glibc & Windows 29

AVOIDING SECURITY EXPLOITS 30

“Smashing the Stack for Fun and Profit” Aleph One (AKA Elias Levy), Phrack 49, August 1996 It is a cook book for how to create exploits for “stack smashing” attacks Prior to this paper, buffer overflow attacks were known, but not widely exploited “Validate all input parameters” is a security principle going back to the 1960s After this paper, attacks became rampant Stack smashing vulns are massively common, easy to discover, and easy to exploit 31

Buffer overflow: Program accepts string input, placing it in a buffer Program fails to correctly check the length of the input Attacker gets to overwrite adjacent state, corrupting it Stack Smash: Special case of a buffer overflow that corrupts the activation record 32

Return address Overflow changes it to point somewhere else “Shell Code” Point to exploit code that was encoded as CPU instructions in the attacker’s string That code does exec(“/bin/sh”) hence “shell code” 33

Why are we so vulnerable to something so trivial? Because C chose to represent strings as null terminated instead of (base, bound) tuples Because strings grow up and stacks grow down Because we use Von Neumann architectures that store code and data in the same memoryVon Neumann architectures But these things are hard to change … mostly Try to move away from Von Neumann architecture by making key regions of memory be non-executable Problem: x86 memory architecture does not distinguish between “readable” and “executable” per page 34

“Solar Designer” introduces the Linux non-executable stack patch Fun with x86 segmentation registers maps the stack differently from the heap and static data Results in a non-executable stack Effective against naïve Stack Smash attacks Bypassable: Inject your shell code into the heap (still executable) Point return address at your shell code in the heap 35

Compile in integrity checks for activation records Insert a “canary word” (after the Welsh miner’s canary) If the canary word is damaged, then your stack is corrupted Instead of jumping to attacker code, abort the program Log the intrusion attempt 36

Written in a few days by one intern Less than 100 lines of code patch to GCC Helped a lot that the GCC function preamble and function post amble code generator routines were nicely isolated First canary was hardcoded 0xDEADBEEF Easily spoofable, but worked for proof of concept 37

The random canary: Pull a random integer from the OS /dev/random at process startup time Simple in concept, but in practice it is very painful to make reading from /dev/random work while still inside crt0.o Made it work, but motivated us to seek something simpler “Terminator” canary: CR, LF, 00, -1: the symbols that terminate various string library functions Rationale: will cause all the standard string mashers to terminate while trying to write the canary  cannot spoof the canary and successfully write beyond it Still vulnerable to attacks against poorly used memcpy() code, but buffer overflows thought to be rare 38

1999, “Emsi” creates the frame pointer attack Frame pointer stored below the canary  corruptible Change FP to point to a fake activation record constructed on the heap Function return code will believe FP, interpret the fake activation record, and jump to shell code Bypasses both Terminator and Random Canaries XOR Random Canary XOR the correct return address with the random canary Integrity check must match both the random number, and the correct return address 39

Focus on malware detection and prevention Much harder to “fix” than stack-based buffer overruns Externally exploitable bugs Nozzle Runtime detector for heap spraying attacks False positive rates: 10 - 6 Overhead: 5-10% Zozzle Static/statistical detector False positive rates: 10 - 6 Overhead: very small 40

Ben Livshits Based in part of Stanford class slides from and slides from Ben Zorn’s talk slides.

Similar presentations

Presentation on theme: "Ben Livshits Based in part of Stanford class slides from and slides from Ben Zorn’s talk slides."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Ben Livshits Based in part of Stanford class slides from and slides from Ben Zorn’s talk slides.

Similar presentations

Presentation on theme: "Ben Livshits Based in part of Stanford class slides from and slides from Ben Zorn’s talk slides."— Presentation transcript:

Similar presentations

About project

Feedback