MC^2: Rigorous and Efficient Directed Greybox Fuzzing

08/30/2022
by   Abhishek Shah, et al.
0

Directed greybox fuzzing is a popular technique for targeted software testing that seeks to find inputs that reach a set of target sites in a program. Most existing directed greybox fuzzers do not provide any theoretical analysis of their performance or optimality. In this paper, we introduce a complexity-theoretic framework to pose directed greybox fuzzing as a oracle-guided search problem where some feedback about the input space (e.g., how close an input is to the target sites) is received by querying an oracle. Our framework assumes that each oracle query can return arbitrary content with a large but constant amount of information. Therefore, we use the number of oracle queries required by a fuzzing algorithm to find a target-reaching input as the performance metric. Using our framework, we design a randomized directed greybox fuzzing algorithm that makes a logarithmic (wrt. the number of all possible inputs) number of queries in expectation to find a target-reaching input. We further prove that the number of oracle queries required by our algorithm is optimal, i.e., no fuzzing algorithm can improve (i.e., minimize) the query count by more than a constant factor. We implement our approach in MC^2 and outperform state-of-the-art directed greybox fuzzers on challenging benchmarks (Magma and Fuzzer Test Suite) by up to two orders of magnitude (i.e., 134×) on average. MC^2 also found 15 previously undiscovered bugs that other state-of-the-art directed greybox fuzzers failed to find.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2020

TOFU:Target-Orienter FUzzer

Program fuzzing-providing randomly constructed inputs to a computer prog...
research
04/29/2020

TOFU: Target-Oriented FUzzer

Program fuzzing—providing randomly constructed inputs to a computer prog...
research
11/05/2018

Almost Optimal Distance Oracles for Planar Graphs

We present new tradeoffs between space and query-time for exact distance...
research
08/02/2018

Reconstructing Strings from Substrings: Optimal Randomized and Average-Case Algorithms

The problem called "String reconstruction from substrings" is a mathemat...
research
02/17/2023

Uniformity Testing over Hypergrids with Subcube Conditioning

We give an algorithm for testing uniformity of distributions supported o...
research
06/14/2022

Resolution Limits of Non-Adaptive 20 Questions Search for a Moving Target

Using the 20 questions estimation framework with query-dependent noise, ...
research
10/13/2019

AdaWISH: Faster Discrete Integration via Adaptive Quantiles

Discrete integration in a high dimensional space of n variables poses fu...

Please sign up or login with your details

Forgot password? Click here to reset