CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Neural networks are essential components of learning-based software systems. However, their high compute, memory, and power requirements make them challenging to use in low-resource domains. For this reason, neural networks are often quantized before deployment. Existing quantization techniques, however, tend to degrade network accuracy. We propose Counter-Example Guided Neural Network Quantization Refinement (CEG4N). This technique combines search-based quantization and equivalence verification: the former minimizes the computational requirements, while the latter guarantees that the network's output does not change after quantization. We evaluate CEG4N on a diverse set of benchmarks, including large and small networks. Our technique successfully quantizes the networks in our evaluation while producing models with up to 72% better accuracy than state-of-the-art techniques.
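The interplay between search and verification described above can be illustrated with a toy counterexample-guided loop. This is a minimal sketch under stated assumptions, not the authors' implementation: a single linear layer stands in for the network, a linear scan over bit widths stands in for the search-based quantization, and an exhaustive check over a small finite grid stands in for the equivalence verifier. All function names (`quantize`, `predict`, `search_min_bits`, `verify`, `refine`) are hypothetical.

```python
from itertools import product


def quantize(weights, bits):
    """Uniform symmetric quantization of a weight matrix to `bits` bits (assumed scheme)."""
    levels = 2 ** (bits - 1) - 1
    scale = max(abs(w) for row in weights for w in row) / levels
    return [[round(w / scale) * scale for w in row] for row in weights]


def predict(weights, x):
    """Argmax class of a single linear layer (toy stand-in for a network)."""
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in weights]
    return scores.index(max(scores))


def search_min_bits(weights, counterexamples, max_bits=16):
    """Smallest bit width that preserves the prediction on every known counterexample."""
    for bits in range(2, max_bits + 1):
        q = quantize(weights, bits)
        if all(predict(q, x) == predict(weights, x) for x in counterexamples):
            return bits
    return None


def verify(weights, q, domain):
    """Return an input in `domain` where the two models disagree, or None if none exists."""
    for x in domain:
        if predict(q, x) != predict(weights, x):
            return x
    return None


def refine(weights, domain, max_bits=16):
    """Alternate search and verification until the candidate is equivalent on `domain`."""
    counterexamples = []
    while True:
        bits = search_min_bits(weights, counterexamples, max_bits)
        if bits is None:
            return None, counterexamples  # no bit width up to max_bits works
        cex = verify(weights, quantize(weights, bits), domain)
        if cex is None:
            return bits, counterexamples  # verified on the whole domain
        counterexamples.append(cex)  # refine the search with the new counterexample


# Hypothetical example data: a 2-class linear model and a small integer grid of inputs.
weights = [[0.9, -0.4], [0.35, 0.5]]
domain = list(product(range(-3, 4), repeat=2))
bits, cexs = refine(weights, domain)
print(bits, cexs)
```

Each counterexample rules out at least the bit width that produced it, so the loop runs for at most `max_bits - 1` iterations. In a realistic setting the exhaustive `verify` would be replaced by a formal equivalence checker over a continuous input domain, and the search would assign bit widths per layer rather than one width for the whole model.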

research
04/26/2023

Guaranteed Quantization Error Computation for Neural Network Model Compression

Neural network model compression techniques can address the computation ...
research
12/08/2021

Neural Network Quantization for Efficient Inference: A Survey

As neural networks have become more powerful, there has been a rising de...
research
08/01/2023

MRQ:Support Multiple Quantization Schemes through Model Re-Quantization

Despite the proliferation of diverse hardware accelerators (e.g., NPU, T...
research
06/23/2023

QNNRepair: Quantized Neural Network Repair

We present QNNRepair, the first method in the literature for repairing q...
research
06/15/2021

A White Paper on Neural Network Quantization

While neural networks have advanced the frontiers in many applications, ...
research
10/26/2021

Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes

Quantization is a popular technique that transforms the parameter repres...
research
11/13/2018

Iteratively Training Look-Up Tables for Network Quantization

Operating deep neural networks on devices with limited resources require...
