Shuangfei Zhai

research

∙ 06/08/2023

BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping

Diffusion models have demonstrated excellent potential for generating di...

4 Jiatao Gu, et al. ∙

research

∙ 06/05/2023

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model

Autoregressive models for text sometimes generate repetitive and low-qua...

2 Yizhe Zhang, et al. ∙

research

∙ 04/13/2023

Learning Controllable 3D Diffusion Models from Single-view Images

Diffusion models have recently become the de-facto approach for generati...

35 Jiatao Gu, et al. ∙

research

∙ 03/11/2023

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

Training stability is of great importance to Transformers. In this work,...

5 Shuangfei Zhai, et al. ∙

research

∙ 03/07/2023

TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation

Denoising Diffusion models have demonstrated their proficiency for gener...

0 David Berthelot, et al. ∙

research

∙ 10/10/2022

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation

Diffusion models (DMs) have recently emerged as SoTA tools for generativ...

7 Jiatao Gu, et al. ∙

research

∙ 07/27/2022

GAUDI: A Neural Architect for Immersive 3D Scene Generation

We introduce GAUDI, a generative model capable of capturing the distribu...

5 Miguel Angel Bautista, et al. ∙

research

∙ 07/15/2022

Position Prediction as an Effective Pretraining Strategy

Transformers have gained increasing popularity in a wide range of applic...

1 Shuangfei Zhai, et al. ∙

research

∙ 06/10/2022

The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon

The grokking phenomenon as reported by Power et al. ( arXiv:2201.02177 )...

13 Vimal Thilak, et al. ∙

research

∙ 02/04/2022

Learning Representation from Neural Fisher Kernel with Low-rank Approximation

In this paper, we study the representation of neural networks from the v...

17 Ruixiang Zhang, et al. ∙

research

∙ 12/02/2021

Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

Modeling the world can benefit robot learning by providing a rich traini...

10 Nitish Srivastava, et al. ∙

research

∙ 09/16/2021

Regularized Training of Nearest Neighbor Language Models

Including memory banks in a natural language processing architecture inc...

0 Jean-Francois Ton, et al. ∙

research

∙ 07/01/2021

Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks

We analyze the learning dynamics of infinitely wide neural networks with...

4 Etai Littwin, et al. ∙

research

∙ 05/28/2021

An Attention Free Transformer

We introduce Attention Free Transformer (AFT), an efficient variant of T...

37 Shuangfei Zhai, et al. ∙

research

∙ 05/17/2021

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

Offline Reinforcement Learning promises to learn effective policies from...

18 Yue Wu, et al. ∙

research

∙ 04/21/2021

MetricOpt: Learning to Optimize Black-Box Evaluation Metrics

We study the problem of directly optimizing arbitrary non-differentiable...

0 Chen Huang, et al. ∙

research

∙ 06/27/2020

On the generalization of learning-based 3D reconstruction

State-of-the-art learning-based monocular 3D reconstruction methods lear...

0 Miguel Angel Bautista, et al. ∙

research

∙ 06/18/2020

Set Distribution Networks: a Generative Model for Sets of Images

Images with shared characteristics naturally form sets. For example, in ...

31 Shuangfei Zhai, et al. ∙

research

∙ 06/13/2020

Collegial Ensembles

Modern neural network performance typically improves as model size incre...

0 Etai Littwin, et al. ∙

research

∙ 10/29/2019

Adversarial Fisher Vectors for Unsupervised Representation Learning

We examine Generative Adversarial Networks (GANs) through the lens of de...

33 Shuangfei Zhai, et al. ∙

research

∙ 10/28/2019

Skip-Clip: Self-Supervised Spatiotemporal Representation Learning by Future Clip Order Ranking

Deep neural networks require collecting and annotating large amounts of ...

12 Alaaeldin El-Nouby, et al. ∙

research

∙ 05/15/2019

Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

In most machine learning training paradigms a fixed, often handcrafted, ...

5 Chen Huang, et al. ∙

research

∙ 11/14/2017

A Deep Learning Approach for Expert Identification in Question Answering Communities

In this paper, we describe an effective convolutional neural network fra...

0 Chen Zheng, et al. ∙

research

∙ 09/06/2017

Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records

The rapid growth of Electronic Health Records (EHRs), as well as the acc...

0 Zhengping Che, et al. ∙

research

∙ 11/26/2016

Structural Correspondence Learning for Cross-lingual Sentiment Classification with One-to-many Mappings

Structural correspondence learning (SCL) is an effective method for cros...

0 Nana Li, et al. ∙

research

∙ 11/16/2016

Fully-adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

Multi-task learning aims to improve generalization performance of multip...

0 Yongxi Lu, et al. ∙

research

∙ 11/16/2016

S3Pool: Pooling with Stochastic Spatial Sampling

Feature pooling layers (e.g., max pooling) in convolutional neural netwo...

0 Shuangfei Zhai, et al. ∙

research

∙ 05/25/2016

Deep Structured Energy Based Models for Anomaly Detection

In this paper, we attack the anomaly detection problem by directly model...

0 Shuangfei Zhai, et al. ∙

Shuangfei Zhai

Featured Co-authors

Sign in with Google

Consider DeepAI Pro