Download presentation
Presentation is loading. Please wait.
Published byAgatha Alexander Modified over 9 years ago
1
Embedded Lab. Park Yeongseong
2
Introduction State of the art Core values Design Experiment Discussion Conclusion Q&A
4
Identifying same or similar code is very important Previous works ◦ Static source code comparison – C1 ◦ Static excutable code comparison – C2 ◦ Dynamic control flow based methods – C3 ◦ Dynamic API based methods – C4
5
Three highly desired requirements ◦ R1 – Resiliency ◦ R2 - Ability to directly work on binary executables ◦ R3 – Platform independence BUT!!!! Not satisfy requirement ◦ Static source code comparison – C1 R1 R2 ◦ Static excutable code comparison – C2 R1 ◦ Dynamic control flow based methods – C3 R1 R3 ◦ Dynamic API based methods – C4 R3
6
Introduce new approach ◦ Core-values 5 optimization options (-O0 ~ -O3, -Os) 3 Compilers ( GCC, TCC, WCC ) KlassMaster, Thicket, Loco/Diablo Obfuscators
7
Code Obfuscation Techniques ◦ data obfuscation, control obfuscation, layout obfuscation and preventive transformations ◦ indirect branches, control-flow flattening, function- pointer aliasing Static Analysis Based Plagiarism Detection ◦ String-based ◦ AST-based ◦ Token-based ◦ PDG-based ◦ Birthmark-based
8
Dynamic Analysis Based Plagiarism Detection ◦ Whole program path based (WPP) ◦ Sequence of API function calls birthmark(EXESEQ) ◦ Frequency of API function calls birthmark(EXEFREQ) ◦ System call based birthmark
11
Not all values associated with the execution of a program are core-values ◦ Value-updating instruction ◦ Related to the program’s semantics
12
To refine value sequences ◦ Sequential refinement – reduction rate 16%~34% ◦ Optimization-based refinement – 5 optimization ◦ Address removal – exclude pointer values
14
Intel Quad-Core 2.00 GHz CPU 4GB RAM Linux machin QEMU 0.9.1 Questions 1.resilient 2.false accusation 3.credible
15
Obfuscation techniques ◦ SandMark, KlassMaster : Java bytecode obfuscators Test application : Jlex ◦ Lexical analyzer
16
Test Application ◦ 5 individual XML pasers:expat, libxml2, Parsifal, rxp,xercesc
17
Test application ◦ Bzip2, gzip, oggenc, 9 of 11 programs Result ◦ Similarity scores between 0 and 0.27 ◦ zip and gzip similarity scores are 1.0 Same compression algorithm : deflate ◦ zip and bzip2 similarity scores are 0.01 to 0.03 Different compression algorithm : block sorting
18
introduce a novel approach to dynamic characterization of executable programs. The value-based method successfully discriminates 34 plagiarisms by SandMark, KlassMaster, Thicket.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.