
Repulsive Attention: Rethinking Multi-head Attention as Bayesian Inference
The neural attention mechanism plays an important role in many natural l...
Unsupervised Abstractive Dialogue Summarization for Tete-a-Tetes
High-quality dialogue-summary paired data is expensive to produce and do...
Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems
We propose a novel framework for structured bandits, which we call an in...
When does MAML Work the Best? An Empirical Study on Model-Agnostic Meta-Learning in NLP Applications
Model-Agnostic Meta-Learning (MAML), a model-agnostic meta-learning meth...
Reward Constrained Interactive Recommendation with Natural Language Feedback
Text-based interactive recommendation provides richer user feedback and ...
Improving Adversarial Text Generation by Modeling the Distant Future
Autoregressive text generation models usually focus on local fluency, a...
Security Analysis of EOSIO Smart Contracts
The EOSIO blockchain, one of the representative Delegated Proof-of-Stake...
GenDICE: Generalized Offline Estimation of Stationary Values
An important problem that arises in reinforcement learning and Monte Car...
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Reinforcement learning (RL) has been widely studied for improving sequen...
Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions
Human-motion generation is a long-standing challenging task due to the r...
Collaborative Filtering with A Synthetic Feedback Loop
We propose a novel learning framework for recommendation systems, assist...
Improving Textual Network Learning with Variational Homophilic Embeddings
The performance of many network learning applications crucially hinges o...
Figure Captioning with Reasoning and Sequence-Level Training
Figures, such as bar charts, pie charts, and line plots, are widely used...
Topic-Guided Variational Autoencoders for Text Generation
We propose a topic-guided variational autoencoder (TGVAE) model for text...
Scalable Thompson Sampling via Optimal Transport
Thompson sampling (TS) is a class of algorithms for sequential decision...
Improving Sequence-to-Sequence Learning via Optimal Transport
Sequence-to-sequence models are commonly trained via maximum likelihood ...
Sequence Generation with Guider Network
Sequence generation with reinforcement learning (RL) has received signif...
Stochastic Particle-Optimization Sampling and the Non-Asymptotic Convergence Theory
Particle-optimization sampling (POS) is a recently developed technique t...
Policy Optimization as Wasserstein Gradient Flows
Policy optimization is a core component of reinforcement learning (RL), ...
Accelerated First-order Methods on the Wasserstein Space for Bayesian Inference
We consider doing Bayesian inference by minimizing the KL divergence on ...
A Unified Particle-Optimization Framework for Scalable Bayesian Sampling
There has been recent interest in developing scalable Bayesian sampling ...
Learning Structural Weight Uncertainty for Sequential Decision-Making
Learning probability distributions on the weights of neural networks (NN...
Particle Optimization in Stochastic Gradient MCMC
Stochastic gradient Markov chain Monte Carlo (SG-MCMC) has been increasi...
Ruiyi Zhang
