Download presentation
Presentation is loading. Please wait.
1
Static and dynamic analysis of binaries
Reverse Engineering 101 Static and dynamic analysis of binaries
2
The same ideas can be transferred over to Win PEs and Mach-O binaries.
Preamble This presentation will focus on reverse engineering x86 Linux ELF binaries. The same ideas can be transferred over to Win PEs and Mach-O binaries.
3
What is Reverse Engineering?
Traditionally it means to understand something by taking it apart and analysing it’s components. In terms of IT, we can take apart binaries to learn exactly how they work at an instruction level. There are 2 main techniques: Static and Dynamic Analysis Something amazing.
4
IDA looks pretty dope when you have it open.
Why learn it? It’s fun to learn new skills. It’s a problem solving skill that’ll help you in many other areas of Info-Sec research such as Binary Exploitation. Learning RE techniques can lead to gaining a greater appreciation for the complexities of computers. IDA looks pretty dope when you have it open. It is also a great skill to have.
5
This can lead to developing patches for software.
Why learn it? Continued You can analyse vulnerable programs to find out why exactly they’re vulnerable. This can lead to developing patches for software. You could also develop exploits tailored to a binary, if you really wanted to… 😏
6
What will I need to get started?
Tools you’ll need to start your journey into RE IDA (Demo Version Available) – GNU Binutils – Install via your *nix package manager GDB – As above A Text Editor – To take notes If you don’t want to use IDA you can use a similar tool like: Hopper or BinaryNinja
7
Okay, so let’s get started.
8
Static Analysis Part I
9
What is Static Analysis?
To analyse a binary statically is to do so without executing it. The binary is analysed as a file and not a running process. Static Analysis is done with disassemblers, decompilers, hex editor, and other similar tools. We’ll focus on disassemblers.
10
Disassemblers Disassemblers will take your binary as input and interpret the machine instructions it contains into a low level language such as assembly. For the x86 Linux ELFs we’ll see later it’ll be interpreted into x86 Intel Syntax Assembly.
11
Disassemblers Continued
Most modern disassemblers can: Find embedded symbols (basically, mnemonics for variables, functions, etc. ) Explicitly interpret specific parts of the binary as either data or text (code) [map the binaries regions]. Find Structs, Enums, Strings, Functions, Types, etc. Make a pretty graph showing the control flow of the program. Show Imports / Exports of libraries and external functions … and more!
12
Static Analysis of a Very Simple C Program
To introduce the functionality of static and dynamic analysis we’ll be using the following code. #include <stdio.h> int main() { int a = 1; int b = 2; a = a + b; return (0); } simple.elf compiled x86 machine instructions gcc … e5 83 ec 0c 31 c0 c7 45 fc c7 45 f c7 45 f b 4d f8 03 4d f4 89 4d f8 83 c4 0c 5d c3 … This C code is compiled into an ELF, which when represented by hex shows gcc –m32 -O0 -o simple.elf This doesn’t mean much, but you could translate this into assembly if you had an x86 opcode lookup table. You can see the hex by using a hex editor or objdump -d simple.elf
13
Static Analysis of a Very Simple C Program Continued I
push ebp mov ebp, esp sub esp, 0xc xor eax, eax mov DWORD PTR [ebp-0x4], 0x0 mov DWORD PTR [ebp-0x8], 0x1 mov DWORD PTR [ebp-0xc], 0x2 mov ecx, DWORD PTR [ebp-0x8] add ecx, DWORD PTR [ebp-0xc] mov DWORD PTR [ebp-0x8], ecx add esp, 0xc pop ebp ret objdump File: simple.elf Here’s what the output looks like from Objdump (from GNU binutils) using objdump -M intel -d simple.elf Make sure you include “-M intel” so it produces Intel Syntax Assembly. This is what we’ll be using. I did clean up the output, just to get the assembly code.
14
Static Analysis of a Very Simple C Program Continued II
Using IDA we can see it has given us some symbols (var_C, var_8, etc…) These are offsets that can be used with ebp to find local variables in our main function’s stack frame. In more complicated programs that use conditions, we’ll be able to see control flow. We’ll look at that later. simple.elf loaded into IDA
15
Static Analysis of a Very Simple C Program Continued III
push ebp ; Push previous stack frame. mov ebp, esp ; Move SP to EBP to set new stack frame. sub esp, 0xc ; Reserve 0xc bytes for local variables. xor eax, eax ; Clear eax (eax is returned from the function). mov DWORD PTR [ebp-0x4], 0x0 ; Move 0x0 into local variable ebp-0x4. mov DWORD PTR [ebp-0x8], 0x1 ; Move 0x1 into local variable ebp-0x8. mov DWORD PTR [ebp-0xc], 0x2 ; Move 0x2 into local variable ebp-0xc. mov ecx, DWORD PTR [ebp-0x8] ; Move local variable ebp-0x8 into ecx. add ecx, DWORD PTR [ebp-0xc] ; Add local variable ebp-0xc to ecx. mov DWORD PTR [ebp-0x8], ecx ; Move value of ecx into local variable ebp-0x8. add esp, 0xc ; Set SP back to location before. pop ebp ; Restore base pointer. ret ; Pop EIP. You should have some background in reading basic Intel Syntax Assembly. If not, that’s fine. Most instructions are self explanatory. If you want to know more about how registers, stack frames, and calling conventions work you can view the “Binary Exploitation” slides on the Wiki.
16
Dynamic Analysis Part 2
17
What is Dynamic Analysis?
Unlike Static Analysis, you analyse a binary by executing it and following it’s process of execution. You can perform all the same actions as if you were statically analysing, but with the advantage of running the code and seeing how it physically modifies registers and memory. This is often more quick. Two main tools that are used are Debuggers and Memory Editors. We’ll focus on Debuggers (though debuggers can edit memory).
18
Debuggers Debuggers will take your binary as input, create a running process, and attach itself to that process. The debugger can halt, step through, and modify all aspects of your binary’s running process. We’ll be using GDB.
19
Most modern debuggers can:
Debuggers Continued Most modern debuggers can: Do mostly all that a disassembler can do – and more. Disassemble the instructions in the program, see which instruction is going to run next, and then step through those instructions. Read / Write memory (heap, stack), map memory regions. Modify and inspect register values. Manipulate and tracking states.
20
Dynamic Analysis of a Very Simple C Program
gdb simple.elf -q Reading symbols from simple.elf...(no debugging symbols found)...done. (gdb) disassemble main Dump of assembler code for function main: 0x00001f80 <+0>: push ebp 0x00001f81 <+1>: mov ebp,esp 0x00001f83 <+3>: sub esp,0xc 0x00001f86 <+6>: xor eax,eax 0x00001f88 <+8>: mov DWORD PTR [ebp-0x4],0x0 0x00001f8f <+15>: mov DWORD PTR [ebp-0x8],0x1 0x00001f96 <+22>: mov DWORD PTR [ebp-0xc],0x2 0x00001f9d <+29>: mov ecx,DWORD PTR [ebp-0x8] 0x00001fa0 <+32>: add ecx,DWORD PTR [ebp-0xc] 0x00001fa3 <+35>: mov DWORD PTR [ebp-0x8],ecx 0x00001fa6 <+38>: add esp,0xc 0x00001fa9 <+41>: pop ebp 0x00001faa <+42>: ret End of assembler dump. Attaching GDB to simple.elf using gdb simple.elf and disassembling the main function
21
Dynamic Analysis of a Very Simple C Program Continued
(gdb) break *main Breakpoint 1 at 0x1f80 (gdb) run Starting program: /Users/nandayo/Desktop/simple.elf Breakpoint 1, 0x00001f80 in main () (gdb) disassemble Dump of assembler code for function main: => 0x00001f80 <+0>: push ebp 0x00001f81 <+1>: mov ebp,esp 0x00001f83 <+3>: sub esp,0xc 0x00001f86 <+6>: xor eax,eax 0x00001f88 <+8>: mov DWORD PTR [ebp-0x4],0x0 0x00001f8f <+15>: mov DWORD PTR [ebp-0x8],0x1 0x00001f96 <+22>: mov DWORD PTR [ebp-0xc],0x2 0x00001f9d <+29>: mov ecx,DWORD PTR [ebp-0x8] 0x00001fa0 <+32>: add ecx,DWORD PTR [ebp-0xc] 0x00001fa3 <+35>: mov DWORD PTR [ebp-0x8],ecx 0x00001fa6 <+38>: add esp,0xc 0x00001fa9 <+41>: pop ebp 0x00001faa <+42>: ret End of assembler dump. Setting a breakpoint in the main function, running the program, and disassembling to see which instruction we’ve landed on when it hits the breakpoint.
22
Dynamic Analysis of a Very Simple C Program Continued
What if we wanted to see the final result of a + b from our C program? Well, we know this is the line: Is where it moves the value of ecx (being our addition of a and b) back into memory location [ebp-0x8], which is our memory address of a. We can print this location after the instruction is executed. 0x00001fa3 <+35>: mov DWORD PTR [ebp-0x8],ecx (gdb) break *main+38 Breakpoint 2 at 0x1fa6 (gdb) continue Continuing. Breakpoint 2, 0x00001fa6 in main () (gdb) x/dwx $ebp-0x8 0xbffffae0: 0x THE RESULT! Set a breakpoint AFTER the instruction, so we know it has executed. Then we can examine 1 DWORD in hex at memory location [ebp-0x8].
23
Quick Static Analysis Test
push ebp mov ebp, esp sub esp, 0xc xor eax, eax mov DWORD PTR [ebp-0x8], 0x5 mov DWORD PTR [ebp-0xc], 0x4 mov ecx, DWORD PTR [ebp-0x8] sub ecx, DWORD PTR [ebp-0xc] mov DWORD PTR [ebp-0x8], ecx add esp, 0xc pop ebp ret What is the value inside the local variable [ebp-0x8]?
24
Try the “Firetruck” challenge from the C2C 2016 CTF Event
Try a real challenge! Try the “Firetruck” challenge from the C2C 2016 CTF Event You can find the challenge at: ctf.hacktheplanet.club/challenges#Firetruck Give it a go before you watch the solution: here You can solve this challenge through Static Analysis, however you can use what ever tool you would like to I used IDA.
25
GDB / Reversing Basics: https://www.youtube.com/watch?v=VroEiMOJPm8
Useful Resources IDA Basics: GDB / Reversing Basics: Assembly Basics:
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.