
TRUST: Triangle Counting Reloaded on GPUs
Triangle counting is a building block for a wide range of graph applicat...
read it

TAG: Transformer Attack from Gradient
Although federated learning has increasingly gained attention in terms o...
read it

Dancing along Battery: Enabling Transformer with Runtime Reconfigurability on Mobile Devices
A pruningbased AutoML framework for runtime reconfigurability, namely ...
read it

A Surrogate Lagrangian Relaxationbased Model Compression for Deep Neural Networks
Network pruning is a widely used technique to reduce computation cost an...
read it

Efficient Transformerbased Large Scale Language Representations using Hardwarefriendly Block Structured Pruning
Pretrained largescale language models have increasingly demonstrated hi...
read it

SAPAG: A SelfAdaptive Privacy Attack From Gradients
Distributed learning such as federated learning or collaborative learnin...
read it

ESMFL: Efficient and Secure Models for Federated Learning
Deep Neural Networks are widely applied to various domains. The successf...
read it

MCMIA: Model Compression Against Membership Inference Attack in Deep Neural Networks
Deep learning or deep neural networks (DNNs) have nowadays enabled high ...
read it

FTRANS: EnergyEfficient Acceleration of Transformers using FPGA
In natural language processing (NLP), the "Transformer" architecture was...
read it

A Unified DNN Weight Compression Framework Using Reweighted Optimization Methods
To address the large model size and intensive computation requirement of...
read it

Towards an Efficient and General Framework of Robust Training for Graph Neural Networks
Graph Neural Networks (GNNs) have made significant advances on several f...
read it

A SOTMRAMbased ProcessingInMemory Engine for Highly Compressed DNN Implementation
The computing wall and data movement challenges of deep neural networks ...
read it

Deep Compressed Pneumonia Detection for LowPower Embedded Devices
Deep neural networks (DNNs) have been expanded into medical fields and t...
read it

REQYOLO: A ResourceAware, Efficient Quantization Framework for Object Detection on FPGAs
Deep neural networks (DNNs), as the basis of object detection, will play...
read it

An UltraEfficient MemristorBased DNN Framework with Structured Weight Pruning and Quantization Using ADMM
The high computation and memory storage of large deep neural networks (D...
read it

Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation
The stateofart DNN structures involve intensive computation and high m...
read it

A StochasticComputing based Deep Learning Framework using Adiabatic QuantumFluxParametron SuperconductingTechnology
The Adiabatic QuantumFluxParametron (AQFP) superconducting technology ...
read it

ERNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Recurrent Neural Networks (RNNs) are becoming increasingly important for...
read it

Towards BudgetDriven Hardware Optimization for Deep Convolutional Neural Networks using Stochastic Computing
Recently, Deep Convolutional Neural Network (DCNN) has achieved tremendo...
read it

Learning Topics using Semantic Locality
The topic modeling discovers the latent topic probability of the given t...
read it

Structured Weight MatricesBased Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs
Both industry and academia have extensively investigated hardware accele...
read it

PredictionBased Fast Thermoelectric Generator Reconfiguration for Energy Harvesting from Vehicle Radiators
Thermoelectric generation (TEG) has increasingly drawn attention for bei...
read it

Efficient Recurrent Neural Networks using Structured Matrices in FPGAs
Recurrent Neural Networks (RNNs) are becoming increasingly important for...
read it

CLSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs
Recently, significant accuracy improvement has been achieved for acousti...
read it

Towards UltraHigh Performance and Energy Efficiency of Deep Learning Systems: An AlgorithmHardware CoOptimization Framework
Hardware accelerations of deep learning systems have been extensively in...
read it

VIBNN: Hardware Acceleration of Bayesian Neural Networks
Bayesian Neural Networks (BNNs) have been proposed to address the proble...
read it

FFTBased Deep Learning Deployment in Embedded Systems
Deep learning has delivered its powerfulness in many application domains...
read it

CirCNN: Accelerating and Compressing Deep Neural Networks Using BlockCirculantWeight Matrices
Largescale deep neural networks (DNNs) are both compute and memory inte...
read it

HardwareDriven Nonlinear Activation for Stochastic Computing Based Deep Convolutional Neural Networks
Recently, Deep Convolutional Neural Networks (DCNNs) have made unprecede...
read it

SCDCNN: HighlyScalable Deep Convolutional Neural Network using Stochastic Computing
With recent advancing of Internet of Things (IoTs), it becomes very attr...
read it
Caiwen Ding
verfied profile
Assistant professor at the University of Connecticut