BenchPress: A Deep Active Benchmark Generator

08/13/2022
by   Foivos Tsimpourlas, et al.

We develop BenchPress, the first ML benchmark generator for compilers that is steerable within feature space representations of source code. BenchPress synthesizes compiling functions by adding new code in any part of an empty or existing sequence, jointly observing its left and right context, and achieves an excellent compilation rate. BenchPress steers benchmark generation towards desired target features that have been impossible for state-of-the-art synthesizers (or indeed humans) to reach. It targets the features of Rodinia benchmarks better in 3 different feature spaces than (a) CLgen, a state-of-the-art ML synthesizer, (b) the CLSmith fuzzer, (c) the SRCIROR mutator, or even (d) human-written code from GitHub. BenchPress is the first generator to search the feature space with active learning in order to generate benchmarks that will improve a downstream task. We show that, using BenchPress, the CPU vs GPU heuristic model of Grewe et al. obtains a higher speedup when trained on BenchPress's benchmarks than when trained on benchmarks from other techniques. BenchPress is a powerful code generator: its generated samples compile at a rate of 86% compared to CLgen's 2.33%, and it produces 10x more unique, compiling OpenCL benchmarks than CLgen, which are significantly larger and more feature diverse.
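To make the idea of steering towards target features concrete, the following is a minimal sketch, not the method from the paper. It assumes that each candidate benchmark is summarized by a numeric feature vector and that closeness to the target is measured by Euclidean distance; the abstract does not specify the metric. The kernel names, feature values, and helper functions are all hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def closest_to_target(candidates, target):
    """Return the (name, features) pair whose features are nearest to target.

    The distance metric is an assumption made for illustration only.
    """
    return min(candidates, key=lambda c: euclidean(c[1], target))


# Hypothetical feature vectors (for example, counts of certain instruction kinds).
candidates = [
    ("kernel_a", (10.0, 2.0, 0.0)),
    ("kernel_b", (4.0, 5.0, 1.0)),
    ("kernel_c", (7.0, 3.0, 2.0)),
]
target = (5.0, 5.0, 1.0)

best = closest_to_target(candidates, target)
print(best[0])
```

In a steerable generator, a selection step of this kind could be repeated: keep the candidates nearest to the target, generate new variants from them, and measure again.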

