Ternary Singular Value Decomposition as a Better Parameterized Form in Linear Mapping

08/15/2023
by   Boyu Chen, et al.

We present a simple yet novel parameterized form of linear mapping that achieves remarkable network compression performance: a pseudo-SVD called Ternary SVD (TSVD). Unlike vanilla SVD, TSVD restricts the U and V matrices to ternary matrices with entries in {±1, 0}. As a result, computing U(·) and V(·) requires only addition instructions rather than expensive multiplication instructions. We provide direct and training-based transition algorithms for TSVD, analogous to post-training quantization and quantization-aware training, respectively. Additionally, we analyze the convergence of the direct transition algorithms in theory. In experiments, we demonstrate that TSVD can achieve state-of-the-art network compression performance across various types of networks and tasks, including current baseline models such as ConvNeXt, Swin, and BERT, and large language models such as OPT.
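To make the addition-only claim concrete, here is a minimal NumPy sketch of applying a TSVD-style factorization W ≈ U diag(s) V to a vector. It is an illustration under stated assumptions, not the paper's implementation: the names `ternary_matvec` and `tsvd_apply`, the random ternary factors, and the shapes are all hypothetical, and the sketch does not show how the factors are obtained from a trained weight matrix.

```python
import numpy as np

def ternary_matvec(T, x):
    """Compute T @ x for T with entries in {-1, 0, +1} using only adds and subtracts."""
    out = np.zeros(T.shape[0], dtype=x.dtype)
    for i, row in enumerate(T):
        # Add entries where the row is +1, subtract where it is -1, skip zeros.
        out[i] = x[row == 1].sum() - x[row == -1].sum()
    return out

def tsvd_apply(U, s, V, x):
    """Apply U @ diag(s) @ V to x; multiplications occur only in the diagonal scaling."""
    return ternary_matvec(U, s * ternary_matvec(V, x))

# Hypothetical example: random ternary factors, not derived from any real layer.
rng = np.random.default_rng(0)
m, n, k = 4, 6, 5
U = rng.integers(-1, 2, size=(m, k))   # entries drawn from {-1, 0, 1}
V = rng.integers(-1, 2, size=(k, n))
s = rng.standard_normal(k)             # diagonal scaling factors
x = rng.standard_normal(n)

y_fast = tsvd_apply(U, s, V, x)
y_ref = (U * s) @ V @ x                # dense reference: U @ diag(s) @ V @ x
print(np.allclose(y_fast, y_ref))
```

In this sketch the only multiplications are the k scalings by `s`; everything involving U and V reduces to signed sums, and zero entries contribute no work at all.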

research
04/18/2022

Fast optimization of common basis for matrix set through Common Singular Value Decomposition

SVD (singular value decomposition) is one of the basic tools of machine ...
research
12/07/2021

Enhancing the SVD Compression

Orthonormality is the foundation of matrix decomposition. For example, S...
research
06/30/2022

Language model compression with weighted low-rank factorization

Factorizing a large matrix into small matrices is a popular strategy for...
research
09/29/2020

What if Neural Networks had SVDs?

Various Neural Networks employ time-consuming matrix operations like mat...
research
09/02/2020

A Survey of Singular Value Decomposition Methods for Distributed Tall/Skinny Data

The Singular Value Decomposition (SVD) is one of the most important matr...
research
04/29/2021

High-Performance Partial Spectrum Computation for Symmetric eigenvalue problems and the SVD

Current dense symmetric eigenvalue (EIG) and singular value decompositio...
research
04/27/2022

SVD Perspectives for Augmenting DeepONet Flexibility and Interpretability

Deep operator networks (DeepONets) are powerful architectures for fast a...
