A Survey on Fuzzing

Conceptually, a fuzzing test starts with generating a massive number of normal and abnormal inputs for the target applications, and then tries to detect exceptions by feeding the generated inputs to the target applications and monitoring their execution states. [1]

Tool: AFL

See also

Understanding the status screen: https://afl-1.readthedocs.io/en/latest/user_guide.html

  • Total crashes: the number of different inputs that cause a crash. “Unique” means the crash takes a different execution path through the program (unique crashes, not unique bugs).

  • Total paths: the number of unique execution paths exercised by all inputs so far.

  • Hangs: inputs that cause the target to exceed the execution timeout.

  • Total execs: the total number of executions (i.e., how many times the target program has been run).

  • Map coverage: AFL maintains a global map where the hashed value of an edge (i.e., the pair of the current basic block address and the next basic block address) is used as an indexing key (a sketch follows this list).

    • observed by the instrumentation embedded in the target binary

    • map density shows how many branch tuples have already been hit, in proportion to how many the bitmap can hold (reported both for the current input and for the entire input corpus).

    • count coverage: indicates the variability in tuple hit counts seen in the binary.

      • In essence, if every taken branch is always taken a fixed number of times for all the inputs we have tried, this will read “1.00”. As we manage to trigger other hit counts for every branch, the needle will start to move toward “8.00” (every bit in the 8-bit map hit), but will probably never reach that extreme.

  • Edge coverage: Inputs that cover new branch(es) will be added to the seed pool.

  • Test harness: wrapper code around the target functions to narrow down the fuzzing scope.
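
To make the map coverage idea concrete, here is a minimal sketch (in Python, not AFL's actual C implementation) of an AFL-style edge map: each basic block gets a (pseudo-)random ID, each executed edge is hashed from the previous and current block IDs, and the hash indexes a fixed-size table of hit counts. The class and method names are illustrative only.

```python
import random

MAP_SIZE = 1 << 16  # AFL's default bitmap holds 64K entries

class EdgeMap:
    """Illustrative sketch of an AFL-style edge coverage map."""

    def __init__(self):
        self.hit_counts = [0] * MAP_SIZE  # one saturating 8-bit counter per edge hash
        self.block_ids = {}               # random ID assigned per basic block
        self.prev_location = 0

    def _block_id(self, block_addr):
        # AFL assigns each basic block a random ID at instrumentation time;
        # here we simply memoize a random value per block address.
        if block_addr not in self.block_ids:
            self.block_ids[block_addr] = random.randrange(MAP_SIZE)
        return self.block_ids[block_addr]

    def on_basic_block(self, block_addr):
        """Called by the (hypothetical) instrumentation on every executed block."""
        cur = self._block_id(block_addr)
        index = cur ^ self.prev_location          # edge = (previous block, current block)
        self.hit_counts[index] = min(self.hit_counts[index] + 1, 255)
        self.prev_location = cur >> 1             # shift so A->B and B->A hash differently

    def map_density(self):
        """Share of map entries hit at least once (cf. the "map density" field)."""
        return sum(1 for c in self.hit_counts if c) / MAP_SIZE
```

AFL additionally buckets these hit counts (1, 2, 3, 4–7, 8–15, 16–31, 32–127, 128+), which is what the count coverage metric above reflects: differing hit counts, not only brand-new edges, can make an input interesting.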

Basic Notes

Black‐box fuzzing:

  • Treats the system as a black box during fuzzing;

  • Does not rely on knowledge of the implementation details;

  • Feeds the program random inputs and checks whether it crashes (a minimal sketch follows this list);
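
In its simplest form, a black-box fuzzer needs only an input generator and a crash monitor. The sketch below assumes a hypothetical local binary ./target that reads its input from stdin; on POSIX systems, a negative return code from subprocess means the process was killed by a signal (e.g., SIGSEGV), which we treat as a crash.

```python
import random
import subprocess

TARGET = "./target"  # hypothetical binary that reads input from stdin

def random_input(max_len=1024):
    """Generate a random byte string of random length."""
    return bytes(random.randrange(256) for _ in range(random.randrange(1, max_len)))

def run_once(data):
    """Run the target once; return its exit code, or None on a hang (timeout)."""
    try:
        proc = subprocess.run([TARGET], input=data, timeout=1,
                              stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
        return proc.returncode
    except subprocess.TimeoutExpired:
        return None

for i in range(10_000):
    data = random_input()
    rc = run_once(data)
    if rc is not None and rc < 0:                  # killed by a signal -> likely a crash
        with open(f"crash_{i}.bin", "wb") as f:    # keep the crashing input
            f.write(data)
```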

White‐box fuzzing:

  • Designs fuzzing based on the internals of the system;

  • Combines test generation with fuzzing (static analysis/symbolic execution);

  • Goal: given a sequential program with a set of input parameters, generate a set of inputs that maximizes code coverage (a symbolic-execution sketch follows this list).
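
As a toy illustration of the whitebox idea, the sketch below models a single branch condition symbolically and asks the Z3 SMT solver (the z3-solver Python package) for inputs that take each side of the branch, so both paths of this tiny program are covered by construction rather than by random guessing. It is a hand-written example, not a full symbolic executor.

```python
from z3 import BitVec, Solver, Not, sat

# Toy target: the interesting behavior hides behind one specific branch.
def target(x: int) -> None:
    if x * 3 + 7 == 2020:
        raise RuntimeError("bug reached")

# Symbolic model of the same branch condition over a 32-bit input.
x = BitVec("x", 32)
branch = x * 3 + 7 == 2020

# Solve for an input on each side of the branch to maximize coverage.
for path_constraint in (branch, Not(branch)):
    solver = Solver()
    solver.add(path_constraint)
    if solver.check() == sat:
        concrete = solver.model()[x].as_long()
        print("input covering this path:", concrete)
```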

Greybox Fuzzing

Grey‐box fuzzing is also called coverage‐based fuzzing. Instead of simply treating the program as a black box, it instruments the program to trace coverage (e.g., path, edge, or code coverage).

  1. New inputs are generated through mutation and crossover/splice on seeds (see the operator sketch after this list);

    • Only a few inputs from the seed pool will be scheduled to generate the next batch of inputs (due to the limited processing capability)

    • For example, a single fuzzer instance can only schedule one seed at a time

  2. The generated inputs are selected according to a fitness function;

  3. Selected inputs are then added back to the seed pool for further mutation;
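
Step 1 (mutation and crossover/splice) can be sketched with byte-level operators like the ones below; the operator set and the number of mutations are illustrative choices, not those of any particular fuzzer.

```python
import random

def mutate(seed: bytes, n_mutations: int = 8) -> bytes:
    """Apply a few random byte-level mutations: bit flip, byte overwrite, insert, delete."""
    data = bytearray(seed)
    for _ in range(n_mutations):
        if not data:                       # never mutate an empty buffer in place
            data.append(random.randrange(256))
            continue
        op = random.choice(("bitflip", "overwrite", "insert", "delete"))
        pos = random.randrange(len(data))
        if op == "bitflip":
            data[pos] ^= 1 << random.randrange(8)
        elif op == "overwrite":
            data[pos] = random.randrange(256)
        elif op == "insert":
            data.insert(pos, random.randrange(256))
        else:
            del data[pos]
    return bytes(data)

def splice(seed_a: bytes, seed_b: bytes) -> bytes:
    """Crossover: join a prefix of one seed with a suffix of another."""
    if not seed_a or not seed_b:
        return seed_a or seed_b
    cut_a = random.randrange(1, len(seed_a) + 1)
    cut_b = random.randrange(len(seed_b))
    return seed_a[:cut_a] + seed_b[cut_b:]
```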

The details of the genetic fuzzing process can be described as follows:

  1. Given a program \(P\) and a set of initial seeds \(S^0\); (Input)

  2. Each round starts with selecting the next seed for fuzzing according to the scheduling criteria; (Line 5)

  3. Assign a certain amount of power to the scheduled seed, determining how many new test cases will be generated; (Line 6)

  4. Test cases are generated through (random) mutation and crossover based on the scheduled seed; (Line 9)

  5. Compared to blackbox and whitebox fuzzing, the most distinctive step of greybox fuzzing is that, when executing a newly generated input \(I\), the fuzzer uses lightweight instrumentation to capture runtime features and exposes them to the fitness function, which measures the “quality” of the generated test case;

Note

  • Test cases with good quality will then be saved as new seeds into the seed pool; (Lines 13-14)

  • This step allows a greybox fuzzer to gradually evolve towards a target (e.g., more coverage). The sketch below puts these steps together.
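
Putting the steps above together, the loop below is a rough sketch of the whole process. `mutate` and `splice` are the operators sketched earlier; the instrumented executor (here a callable returning the set of covered edges), the uniform seed selection, and the random power assignment are stand-ins for the scheduling criteria, power schedule, and fitness function described above, and the comments refer to the numbered steps in the prose, not to the original algorithm's line numbers.

```python
import random
from typing import Callable, List, Set

def fuzz(run_instrumented: Callable[[bytes], Set[int]],
         initial_seeds: List[bytes], rounds: int = 1000) -> List[bytes]:
    """Greybox fuzzing loop over a program P and initial seeds S^0."""
    seed_pool = list(initial_seeds)
    global_coverage: Set[int] = set()

    for _ in range(rounds):
        # Step 2: select the next seed (here: uniformly at random).
        seed = random.choice(seed_pool)

        # Step 3: assign power, i.e. how many test cases to derive from this seed.
        power = random.randint(1, 32)

        for _ in range(power):
            # Step 4: generate a test case through mutation and crossover.
            candidate = mutate(splice(seed, random.choice(seed_pool)))

            # Step 5: execute with lightweight instrumentation and capture
            # runtime features (here: the set of covered edges).
            covered_edges = run_instrumented(candidate)

            # Fitness function: keep the input if it reaches new coverage,
            # saving it as a new seed so the pool evolves towards the target.
            if not covered_edges <= global_coverage:
                global_coverage |= covered_edges
                seed_pool.append(candidate)

    return seed_pool
```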