AFL

American Fuzzy Lop

Agenda

Why care?
Why it's better
"hello world" setup
Why it's faster
How to fuzz a server
Testcase generation
measuring coverage with afl-cov
Error detection
Compiler transformation for better coverage
Dictionaries
Test case minification aka fuzzer maintenance
"Corpus driven fuzzing"
Beyond crashes
Targeting

Why care?

http://lcamtuf.coredump.cx/afl/

Why it's better

"Pulling JPEGs out of thin air"

https://lcamtuf.blogspot.dk/2014/11/pulling-jpegs-out-of-thin-air.html

"Hello world" setup

# CC=~/tools/afl-2.35b/afl-clang-fast \
CXX=~/tools/afl-2.35b/afl-clang-fast++ make clean all
# ~/tools/afl-2.35b/afl-fuzz -i testcases/ \
-o output -- ./upx-3.91-src/src/upx.out -d @@

Demo

dude@dudebox:~/projects/demo$ tree output/

output/
├── crashes
│   ├── id:000000,sig:06,src:000000,op:flip1,pos:201
│   ├── id:000001,sig:06,src:000000,op:flip1,pos:205
│   ├── id:000002,sig:06,src:000000,op:flip1,pos:206
│   ├── id:000003,sig:06,src:000000,op:flip1,pos:206
│   ├── id:000004,sig:06,src:000000,op:flip1,pos:206
│   └── README.txt
├── fuzz_bitmap
├── fuzzer_stats
├── hangs
├── plot_data
└── queue
    ├── id:000000,orig:ls.compressed
    ├── id:000001,src:000000,op:flip1,pos:0,+cov
    ├── id:000002,src:000000,op:flip1,pos:4
    ├── id:000003,src:000000,op:flip1,pos:5,+cov
    ├── id:000004,src:000000,op:flip1,pos:6,+cov
    ...
    ├── id:000089,src:000000,op:flip1,pos:4101
    ├── id:000090,src:000000,op:flip1,pos:4384
    └── id:000091,src:000000,op:flip1,pos:4764
3 directories, 101 files

Performance optimisations

Fork server
Deferred initialisation
Persistent mode

Traditional fuzzing with execve() - how executables are normally started

read executable file from disk
parse executable file
init virtual memory
init stack
init heap
load shared libraries (.dll .so .dylib)
+++
call main()

Traditional fuzzing with execve() - how executables are normally started

read executable file from disk <- couldn't care less
parse executable file <- couldn't care less
init virtual memory <- couldn't care less
init stack <- couldn't care less
init heap <- couldn't care less
load shared libraries (.dll .so .dylib) <- couldn't care less
+++ <- couldn't care less
call main()

Fork server

read executable file from disk
parse executable file
init virtual memory
init stack
init heap
load shared libraries (.dll .so .dylib)
+++
call main()

Fork server

read executable file from disk
parse executable file
init virtual memory
init stack
init heap
load shared libraries (.dll .so .dylib)
+++
fork()
call main()

Fork server

fork()
main()

Works automatically!

But we want more

Fork server

fork()
main()
    parse cli args
    readConfig()
    initStuff()
    check for updates
    calculate more stuff
    +++
    readInput()
    parseInput()

Deferred initialisation

fork()
main()
    parse cli args
    readConfig()
    initStuff()
    check for updates
    calculate more stuff
    +++
    fork()
    readInput()
    parseInput()

Deferred initialisation

fork()
main()
    parse cli args
    readConfig()
    initStuff()
    check for updates
    calculate more stuff
    +++
    fork()
    readInput()
    parseInput()

#ifdef __AFL_HAVE_MANUAL_CONTROL
__AFL_INIT();
#endif
readInput()
parseInput()
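
A slightly fuller sketch of where the call goes. The helpers `read_config`, `parse_input` and `run_target` are hypothetical; the only AFL-specific parts are `__AFL_HAVE_MANUAL_CONTROL` and `__AFL_INIT()`, which exist only when building with afl-clang-fast. The call should sit after the expensive, input-independent setup but before the input is first touched.

```c
#include <stdio.h>

/* Hypothetical one-time setup that does not depend on the input. */
static int config_loaded = 0;
static void read_config(void) { config_loaded = 1; }

/* Hypothetical parser: counts newline characters. */
static int parse_input(const char *buf, size_t len)
{
    int lines = 0;
    for (size_t i = 0; i < len; i++)
        if (buf[i] == '\n')
            lines++;
    return lines;
}

/* Body of main(): returns the parser result, or -1 if setup was skipped. */
int run_target(FILE *in)
{
    char buf[4096];

    read_config();              /* expensive, input-independent */

#ifdef __AFL_HAVE_MANUAL_CONTROL
    __AFL_INIT();               /* fork server starts here */
#endif

    size_t len = fread(buf, 1, sizeof(buf), in);   /* first touch of input */
    return config_loaded ? parse_input(buf, len) : -1;
}
```

Built with a normal compiler the `#ifdef` block disappears, so the same source still works outside the fuzzer.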

But we want MORE

Persistent mode

Deferred initialisation

fork()
readInput()
parseInput()

Persistent mode

fork()
readInput()
parseInput()

Persistent mode

~~fork()~~
readInput()
parseInput()

init();
while (__AFL_LOOP(1000)) {
    readInput();
    parseInput();
}
exit(0);
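
The catch with persistent mode is that the process is reused, so any state left over from the previous input has to be reset by hand. A minimal sketch, assuming a hypothetical `parse_input` and a fallback definition of `__AFL_LOOP` so the code also builds without afl-clang-fast (the fallback runs the loop body once):

```c
#include <string.h>
#include <unistd.h>

/* Outside afl-clang-fast the macro does not exist; run the body once. */
#ifndef __AFL_LOOP
static int fallback_runs = 1;
#define __AFL_LOOP(n) (fallback_runs-- > 0)
#endif

/* State that survives between iterations and must be reset by hand. */
static char   input[1024];
static size_t seen_bytes;

/* Hypothetical parser: just records how much data it was given. */
static void parse_input(const char *buf, size_t len)
{
    (void)buf;
    seen_bytes += len;
}

/* Returns the number of bytes parsed in the last iteration. */
size_t persistent_loop(int fd)
{
    size_t last = 0;

    while (__AFL_LOOP(1000)) {
        memset(input, 0, sizeof(input));   /* reset per-run state */
        seen_bytes = 0;

        ssize_t len = read(fd, input, sizeof(input));
        if (len < 0)
            break;

        parse_input(input, (size_t)len);
        last = seen_bytes;
    }
    return last;
}
```

The argument to `__AFL_LOOP` is the number of inputs handled before the process is recycled, which limits the damage from slow leaks.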

Server fuzzing

init();
while (keep_running) {
    waitForData(); // blocking
    readInput();
    parseInput();
    housekeeping();
}
cleanup();

Server fuzzing

init();
//while (keep_running) {
    //waitForData(); // blocking
    //readInput();
    readFromStdin(); // or: readFromFile();
    parseInput();
    exit(0);
    housekeeping();
//}
cleanup();
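
In practice this works best when the per-packet code can be called with a plain buffer. A sketch of that shape, with a made-up `handle_packet` protocol handler and a `serve_once` helper that replaces the blocking receive loop (both names are assumptions for this example):

```c
#include <stddef.h>
#include <string.h>
#include <unistd.h>

#define MAX_PACKET 2048

/* Hypothetical protocol handler: the code the server runs per packet.
 * Returns 1 for a request starting with "PING", 0 otherwise. */
int handle_packet(const unsigned char *pkt, size_t len)
{
    return len >= 4 && memcmp(pkt, "PING", 4) == 0;
}

/* One-shot replacement for the receive loop: read a single "packet"
 * from fd (0 = stdin under afl-fuzz) and hand it to the handler.
 * Returns the handler result, or -1 if the read failed. */
int serve_once(int fd)
{
    unsigned char pkt[MAX_PACKET];
    ssize_t len = read(fd, pkt, sizeof(pkt));
    if (len < 0)
        return -1;
    return handle_packet(pkt, (size_t)len);
}
```

Calling `serve_once(0)` from main() and exiting afterwards gives the one-input-per-process behaviour shown on the slide.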

Server fuzzing + deferred initialisation

init();
//while (keep_running) {
    //waitForData(); // blocking
    //readInput();
#ifdef __AFL_HAVE_MANUAL_CONTROL
  __AFL_INIT();
#endif
    readFromStdin(); // or: readFromFile();
    parseInput();
    exit(0);
    housekeeping();
//}
cleanup();

Server fuzzing + persistent mode

init();
//while (keep_running) {
while (__AFL_LOOP(1000)) {
    //waitForData(); // blocking
    //readInput();
    readFromStdin(); // or: readFromFile();
    parseInput();
    housekeeping();
}
exit(0);
cleanup();

Lazy server fuzzing + persistent mode

init();
//while (keep_running) {
while (__AFL_LOOP(1000)) {
    input = readFromStdin(); // or: readFromFile();
    sendToItself(input);
    waitForData(); // blocking
    readInput();
    parseInput();
    housekeeping();
}
exit(0);
cleanup();

40 / 81

Fuzzing ntpd

network time protocol daemon

41 / 81

ntpd/ntpd.c

    for (;;) {
#if !defined(SIM) && defined(SIGDIE1)
        if (signalled)
            finish_safe(signo);
#endif        
        if (alarm_flag) {    /* alarmed? */
            was_alarmed = TRUE;
            alarm_flag = FALSE;
        }
        /* collect async name/addr results */
        if (!was_alarmed)
            harvest_blocking_responses();
        if (!was_alarmed && !has_full_recv_buffer()) {
            /*
             * Nothing to do.  Wait for something.
             */
            io_handler();
        }

42 / 81

ntpd/ntpd.c - modified

    #define BUFLEN 5120 
    struct sockaddr_in si_other;
    int s, slen=sizeof(si_other);   
    char buf[BUFLEN];
    if ((s=socket(AF_INET, SOCK_DGRAM, IPPROTO_UDP))==-1) {
      perror("socket");
      abort();
    }
    memset((char *) &si_other, 0, sizeof(si_other));
    si_other.sin_family = AF_INET;  
    si_other.sin_addr.s_addr = inet_addr("127.0.0.1");
    si_other.sin_port = htons(123); 
    while (__AFL_LOOP(1000)) {
    ...
        if (!was_alarmed && !has_full_recv_buffer()) {
          memset(buf, 0, BUFLEN);
          ssize_t insize = read(0, buf, BUFLEN); /* test case arrives on stdin */
          if (sendto(s, buf, insize, 0, (struct sockaddr *)&si_other, slen)==-1) {
            perror("sendto()");
            abort();
          }
          io_handler();      
        }
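
The "send it to yourself" trick in isolation, as a sketch that is independent of ntpd: one UDP socket plays the server, a second one forwards the test case to it over loopback. Unlike the ntpd patch above, this example binds to a kernel-chosen port instead of 123 so it runs without root; `open_loopback_server` and `send_to_self` are names invented here.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <string.h>
#include <sys/socket.h>
#include <unistd.h>

/* Bind a UDP socket on 127.0.0.1 with a kernel-chosen port and report
 * the bound address in *addr. Returns the socket, or -1 on error. */
int open_loopback_server(struct sockaddr_in *addr)
{
    socklen_t len = sizeof(*addr);
    int fd = socket(AF_INET, SOCK_DGRAM, IPPROTO_UDP);
    if (fd < 0)
        return -1;

    memset(addr, 0, sizeof(*addr));
    addr->sin_family = AF_INET;
    addr->sin_addr.s_addr = htonl(INADDR_LOOPBACK);
    addr->sin_port = 0;                       /* any free port */

    if (bind(fd, (struct sockaddr *)addr, sizeof(*addr)) < 0 ||
        getsockname(fd, (struct sockaddr *)addr, &len) < 0) {
        close(fd);
        return -1;
    }
    return fd;
}

/* Forward one test case to the server socket, as a real client would.
 * Returns the number of bytes sent, or -1 on error. */
ssize_t send_to_self(int client_fd, const struct sockaddr_in *server,
                     const void *data, size_t len)
{
    return sendto(client_fd, data, len, 0,
                  (const struct sockaddr *)server, sizeof(*server));
}
```

The server's normal receive path then picks the datagram up unchanged, which is why almost no target code has to be touched.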

Demo

afl-plot

Testcase generation

http://doc.ntp.org/4.1.0/ntpq.htm

Steal

dude@dudebox:~/projects/ntpd/run/in$ ls -l | wc -l
43
dude@dudebox:~/projects/ntpd/run/in$ ls -l
total 168
-rw-r--r-- 2 dude dude  68 Jun 18 11:57 decodenetnumtrigger1.raw
-rw-r--r-- 2 dude dude  12 Jun 18 11:57 ntpassociations
-rw-r--r-- 1 dude dude  36 Jun 18 11:57 ntpauth
-rw-r--r-- 4 dude dude  12 Jun 18 11:57 ntpbeginningstrange
-rw-r--r-- 2 dude dude  20 Jun 18 11:57 ntpclockvarassocid
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpclockvarassocidset
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpclockvarbadformat
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpclockvarbadformatset
-rw-r--r-- 2 dude dude  20 Jun 18 11:57 ntpclockvardevice
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpclockvardeviceclock
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpclockvardevicelocal
-rw-r--r-- 2 dude dude  32 Jun 18 11:57 ntpclockvardeviceundisciplined
-rw-r--r-- 2 dude dude  20 Jun 18 11:57 ntpclockvarflags
-rw-r--r-- 7 dude dude  20 Jun 18 11:57 ntpclockvarflagsset
-rw-r--r-- 8 dude dude  16 Jun 18 11:57 ntpclockvarpoll
-rw-r--r-- 4 dude dude  20 Jun 18 11:57 ntpclockvarpollset
-rw-r--r-- 7 dude dude  20 Jun 18 11:57 ntpclockvarstatus
-rw-r--r-- 7 dude dude  20 Jun 18 11:57 ntpclockvartimecode
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpclockvartimecodeset
-rw-r--r-- 7 dude dude 104 Jun 18 11:57 ntpmonstats
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulist
-rw-r--r-- 2 dude dude  68 Jun 18 11:57 ntpmrulistkod
-rw-r--r-- 2 dude dude  72 Jun 18 11:57 ntpmrulistladdrset
-rw-r--r-- 2 dude dude  68 Jun 18 11:57 ntpmrulistlimited
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulistmincount
-rw-r--r-- 2 dude dude  68 Jun 18 11:57 ntpmrulistmincountset
-rw-r--r-- 2 dude dude  68 Jun 18 11:57 ntpmrulistresallhexmask
-rw-r--r-- 4 dude dude  68 Jun 18 11:57 ntpmrulistresanyhexmask
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulistsortorderaddr
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulistsortorderavgint
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulistsortordercount
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulistsortorderlstint
-rw-r--r-- 2 dude dude  52 Jun 18 11:57 ntpmrulistsortorderlstintreverse
-rw-r--r-- 4 dude dude  12 Jun 18 11:57 ntppeers
-rw-r--r-- 4 dude dude  12 Jun 18 11:57 ntppeerschallengeresponse
-rw-r--r-- 2 dude dude  16 Jun 18 11:57 ntpreadvarpeer
-rw-r--r-- 2 dude dude  24 Jun 18 11:57 ntpreadvarprocessor
-rw-r--r-- 4 dude dude  20 Jun 18 11:57 ntpreadvarstatus
-rw-r--r-- 7 dude dude  20 Jun 18 11:57 ntpreadvaversion
-rw-r--r-- 1 dude dude  44 Jun 18 11:57 ntpwritevarpeer
-rw-r--r-- 1 dude dude  52 Jun 18 11:57 ntpwritevarrootdisp
-rw-r--r-- 2 dude dude  48 Jun 18 11:57 raw

Measuring coverage

"What code did I actually fuzz?"

afl-cov

It sucks to figure out that you have been fuzzing nothing but the checksum check for multiple weeks. Example: PNG
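
One common way around a checksum roadblock is to compile the comparison out in fuzzing builds. A sketch under assumptions: the checksum here is a toy additive sum (PNG itself uses CRC-32), `chunk_is_acceptable` is a made-up function, and the macro name follows the `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` convention suggested in the libFuzzer documentation.

```c
#include <stddef.h>
#include <stdint.h>

/* Toy additive checksum; only a stand-in for a real CRC. */
static uint32_t toy_checksum(const unsigned char *data, size_t len)
{
    uint32_t sum = 0;
    for (size_t i = 0; i < len; i++)
        sum += data[i];
    return sum;
}

/* Returns 1 if the chunk may be processed further, 0 if it is rejected. */
int chunk_is_acceptable(const unsigned char *data, size_t len,
                        uint32_t stored_checksum)
{
    uint32_t actual = toy_checksum(data, len);

#ifdef FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION
    /* Fuzzing build: ignore the checksum so mutated inputs reach the parser. */
    (void)actual;
    (void)stored_checksum;
    return len > 0;
#else
    return len > 0 && actual == stored_checksum;
#endif
}
```

Crashes found this way need a last step before reporting: recompute the checksum so the input is also accepted by a normal build.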

(more) Error detection

libdislocator
ASan - Address Sanitizer - libdislocator++ harder to use + insignificant slowdown
valgrind - ASan++, but HUGE slowdown
MSan - Memory Sanitizer
TSan - Thread Sanitizer
UBSan - Undefined Behavior Sanitizer
KASan - Kernel Address Sanitizer

libdislocator

Usage: AFL_LD_PRELOAD=/path/to/libdislocator.so ./afl-fuzz [...other params...]

https://github.com/mcarpenter/afl/tree/master/libdislocator

1) It allocates all buffers so that they are immediately adjacent to a 
subsequent PROT_NONE page, causing most off-by-one reads and writes to 
immediately segfault, 
2) It adds a canary immediately below the allocated buffer, to catch 
writes to negative offsets (won't catch reads, though), 
3) It sets the memory returned by malloc() to garbage values, 
improving the odds of crashing when the target accesses uninitialized 
data, 
4) It sets freed memory to PROT_NONE and does not actually reuse it, 
causing most use-after-free bugs to segfault right away, 
5) It forces all realloc() calls to return a new address - and sets 
PROT_NONE on the original block. This catches use-after-realloc bugs, 
6) It checks for calloc() overflows and can cause soft or hard 
failures of alloc requests past a configurable memory limit 
(AFL_LD_LIMIT_MB, AFL_LD_HARD_LIMIT). 
Basically, it is inspired by some of the non-default options available 
for the OpenBSD allocator. It is meant as a more lightweight and 
hassle-free alternative to fuzzing with ASAN / MSAN (although it's 
obviously not as comprehensive).

Address Sanitizer (ASan)

= libdislocator + read checks + more

# AFL_USE_ASAN=1 ...

ASan + 64-bit != </3

docs/notes_for_asan.txt

2) Long version
---------------
ASAN allocates a huge region of virtual address space for bookkeeping purposes.
Most of this is never actually accessed, so the OS never has to allocate any
real pages of memory for the process, and the VM grabbed by ASAN is essentially
"free" - but the mapping counts against the standard OS-enforced limit
(RLIMIT_AS, aka ulimit -v).
On our end, afl-fuzz tries to protect you from processes that go off-rails
and start consuming all the available memory in a vain attempt to parse a
malformed input file. This happens surprisingly often, so enforcing such a limit
is important for almost any fuzzer: the alternative is for the kernel OOM
handler to step in and start killing random processes to free up resources.
Needless to say, that's not a very nice prospect to live with.

"On 64-bit systems, the situation is more murky, because the ASAN allocation is completely outlandish - around 17.5 TB in older versions, and closer to 20 TB with newest ones."

afl/experimental/asan_cgroups/limit_memory.sh

enable cgroups in kernel (grub boot options)
# swapoff -a

# sudo ./limit_memory.sh -u dude -m 500 -- ./afl-fuzz..

ASan

Valgrind

Compiler transformation for more code coverage

https://lafintel.wordpress.com/2016/08/15/circumventing-fuzzing-roadblocks-with-compiler-transformations/
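
What the comparison-splitting transformation buys you, written out by hand. The real pass rewrites comparisons automatically at compile time; the two functions below are only an illustration of the before and after, with function names chosen for this example.

```c
#include <stdint.h>

/* Original form: one 32-bit comparison, a single all-or-nothing branch.
 * The fuzzer has to guess all four bytes at once to get new coverage. */
int check_magic(uint32_t value)
{
    return value == 0xABAD1DEAu;
}

/* Split form: four nested 8-bit comparisons, one branch per byte.
 * Every correctly guessed byte now shows up as new coverage. */
int check_magic_split(uint32_t value)
{
    if ((value >> 24) == 0xAB)
        if (((value >> 16) & 0xFF) == 0xAD)
            if (((value >> 8) & 0xFF) == 0x1D)
                if ((value & 0xFF) == 0xEA)
                    return 1;
    return 0;
}
```

Both functions accept exactly the same value; only the number of branches on the way there differs.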

Dictionaries

https://github.com/mcarpenter/afl/tree/master/dictionaries

Test case minification - Fuzzer maintenance

Test case minification

afl-cmin: https://github.com/mirrorer/afl/blob/master/afl-cmin

# This tool tries to find the smallest subset of files in the input directory
# that still trigger the full range of instrumentation data points seen in
# the starting corpus. This has two uses:
#
#   - Screening large corpora of input files before using them as a seed for
#     afl-fuzz. The tool will remove functionally redundant files and likely
#     leave you with a much smaller set.
#
#     (In this case, you probably also want to consider running afl-tmin on
#     the individual files later on to reduce their size.)
#
#   - Minimizing the corpus generated organically by afl-fuzz, perhaps when
#     planning to feed it to more resource-intensive tools. The tool achieves
#     this by removing all entries that used to trigger unique behaviors in the
#     past, but have been made obsolete by later finds.

Test case minification

afl-tmin: https://github.com/mirrorer/afl/blob/master/afl-tmin.c

   A simple test case minimizer that takes an input file and tries to remove
   as much data as possible while keeping the binary in a crashing state
   *or* producing consistent instrumentation output (the mode is auto-selected
   based on the initially observed behavior).
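
To get a feel for what "remove as much data as possible" means, here is a toy greedy minimiser. It is a deliberately simplified block-deletion loop, not afl-tmin's actual algorithm, and the predicate `has_bug_marker` is a made-up stand-in for "the binary still crashes".

```c
#include <stddef.h>
#include <string.h>

/* Predicate: does the input still show the behaviour we want to keep? */
typedef int (*still_interesting_fn)(const unsigned char *data, size_t len);

/*
 * Greedy minimiser: try to delete blocks of decreasing size and keep
 * every deletion after which the predicate still holds.
 * Works in place on buf and returns the new length.
 */
size_t minimize(unsigned char *buf, size_t len, still_interesting_fn keep)
{
    unsigned char trial[4096];
    if (len > sizeof(trial))
        return len;                      /* sketch only handles small inputs */

    for (size_t block = len / 2; block >= 1; block /= 2) {
        size_t pos = 0;
        while (pos + block <= len) {
            /* Build a candidate with buf[pos .. pos+block) removed. */
            memcpy(trial, buf, pos);
            memcpy(trial + pos, buf + pos + block, len - pos - block);

            if (keep(trial, len - block)) {
                len -= block;
                memcpy(buf, trial, len); /* accept the shorter input */
            } else {
                pos += block;            /* keep this block, move on */
            }
        }
    }
    return len;
}

/* Toy predicate: the "crash" needs the bytes 'B', 'U', 'G' in order. */
int has_bug_marker(const unsigned char *data, size_t len)
{
    const char *want = "BUG";
    size_t matched = 0;
    for (size_t i = 0; i < len && matched < 3; i++)
        if (data[i] == (unsigned char)want[matched])
            matched++;
    return matched == 3;
}
```

With the real tool the predicate is the target itself: each candidate is run through the instrumented binary and kept only if the crash or the instrumentation output is unchanged.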

Corpus driven fuzzing

term coined by Ben Nagy (@rantyben)? https://github.com/bnagy/slides/blob/master/fuzzing_without_pub.pdf

Corpus driven fuzzing

MS15-024 / MS15-029: found by lcamtuf in IE without ever fuzzing IE
Goal: build good corpora, NOT find crashes

e.g.

fuzz pdf parser A
take found corpora/testcases and fuzz pdf parser B
profit

Beyond crashes

A fuzzer is good at finding crashes, so let's convert whatever we want to find into a crash.

Beyond crashes

ABORT(3)                      Linux Programmer's Manual                     ABORT(3)
NAME
       abort - cause abnormal process termination
SYNOPSIS
       #include <stdlib.h>
       void abort(void);
DESCRIPTION
       The  abort()  first  unblocks the SIGABRT signal, and then raises that signal
       for the calling process.  This results in the  abnormal  termination  of  the
       process  unless  the SIGABRT signal is caught and the signal handler does not
       return (see longjmp(3)).
       If the abort() function causes process  termination,  all  open  streams  are
       closed and flushed.
       If  the  SIGABRT  signal is ignored, or caught by a handler that returns, the
       abort() function will still terminate the process.  It does this by restoring
       the  default disposition for SIGABRT and then raising the signal for a second
       time.

Logic flaws

input = readInput();
resultA = BigNumLibraryA_call(input);
resultB = BigNumLibraryB_call(input);
if (resultA != resultB) {
    abort();
}
exit(0);
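
The same pattern as compilable C, using two small population-count routines as stand-ins for the two bignum libraries (the function names are invented for this sketch). Any input on which they disagree is turned into a SIGABRT, which the fuzzer saves like any other crash.

```c
#include <stdint.h>
#include <stdlib.h>

/* Implementation A: straightforward bit-by-bit population count. */
static unsigned popcount_a(uint32_t x)
{
    unsigned n = 0;
    for (; x != 0; x >>= 1)
        n += x & 1u;
    return n;
}

/* Implementation B: clears the lowest set bit on every step. */
static unsigned popcount_b(uint32_t x)
{
    unsigned n = 0;
    for (; x != 0; x &= x - 1)
        n++;
    return n;
}

/* Returns the agreed result; turns any disagreement into a crash. */
unsigned checked_popcount(uint32_t input)
{
    unsigned a = popcount_a(input);
    unsigned b = popcount_b(input);
    if (a != b)
        abort();            /* logic flaw -> SIGABRT -> saved by the fuzzer */
    return a;
}
```

The harness does not need to know which implementation is right, only that they should never differ.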

Authentication bypass

...
if (authenticated == true) {
    abort();
} 
exit(0);

Be creative!

sandbox escape?
side channels?
degradation of service?
mess up audit trail?
escalation of privileges?
+++
insert crazy idea here

If it can be converted to a crash, then it can be fuzzed

Targeting

do something different
.. and be the first to do it

i.e. be creative and don't wait

Targeting

$$$
curiosity
both??

aka "why do you spend your evenings on infosec?"

references

slides: http://dumpco.re/afl
http://imgur.com/a/O7z5F
strategies: https://lcamtuf.blogspot.dk/2014/08/binary-fuzzing-strategies-what-works.html
fuzz upx https://asciinema.org/a/e7bpjng8jj33o53qmctkihka8
fuzz ntpd https://asciinema.org/a/1npswngnfah6m4m0et246e0lr
auxiliary tools by Ben Nagy (@rantyben) https://github.com/bnagy?tab=repositories
